News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,978 datasets
Round-trip acoustic travel time and ambient bottom pressure data collected from July 2011 to October 2016 by bottom-mounted instruments along 34.5°S in the Southwest Atlantic, east of Uruguay. The dataset was produced by the NOAA Atlantic Oceanographic and Meteorological Laboratory for the Southwest Atlantic Meridional Overturning Circulation (SAM) project. Both processed, quality-controlled files and raw data files are available.
The JAFFE dataset contains images of Japanese female facial expressions. It was created by Michael J. Lyons of Ritsumeikan University for non-commercial scientific research. The dataset's background and use are documented in two arXiv papers from 2020 and 2021.
Six component projects collected data from Tomales Bay, California, over a nine-year period from March 1987 to September 1996. The data were gathered as part of the Land Margin Ecosystem Research and Biogeochemical Reactions in Estuaries projects, focusing on biotic variables, water composition, and sediment fluxes. It provides a multi-year record of estuarine processes.
Temperature and barometric pressure profiles were collected in the North Atlantic Ocean using expendable bathythermograph (XBT) casts from vessels including the GRAND BASSAM. The dataset covers January 1 to April 26, 1993. It is managed by NOAA's National Centers for Environmental Information (NCEI).
NOAA Ship Miller Freeman collected 54 days of continuous oceanographic data during the 1997 Fisheries-Oceanography Cooperative Investigations (FOCI). The dataset contains vertical profiles of water temperature, salinity, Photosynthetically Active Radiation (PAR), and sigma-theta from the North Pacific Ocean. These measurements provide a snapshot of ocean conditions relevant to fisheries research.
Temperature profile and pressure data were collected via bottle casts from the BARUNA JAYA I research vessel. The dataset covers measurements taken in the Banda Sea, Celebes Sea, Ceram Sea, and Java Sea. Data collection was managed by Lamont-Doherty Earth Observatory for the Arlindo Mixing project.
Top rated movie:2026 is a dataset published on Kaggle. Its title suggests it contains information about highly-rated films from the year 2026. The dataset's specific content, size, and origin are not detailed in the provided metadata.
Netlify Movies is a dataset hosted on Kaggle. The dataset's specific content and structure are not detailed in the provided metadata. Its origin and purpose are inferred from the title, which suggests a collection of movie-related information.
Agent03_full_adapte_review is a dataset published on Kaggle. The title suggests it contains data related to reviews, possibly adapted for or generated by an agent-based model. The dataset's specific content, size, and origin are not detailed in the provided metadata.
Hollywood Top Rated Movies is a dataset published on Kaggle. Its specific content, such as movie titles, ratings, genres, or release years, must be verified after download. The dataset likely contains information about films that have received high ratings.
A spatial bibliography of deep-water studies from 2000 to 2020, focusing on areas deeper than 200 meters within the U.S. Exclusive Economic Zone around Alaska. The National Oceanic and Atmospheric Administration compiled peer-reviewed literature and technical reports across six academic disciplines, including biology, chemistry, and geology. Seven geographic regions are covered, from the Gulf of Alaska to the Chukchi Sea.
Additional file 2 from the scoping review 'Intervening infant and young child feeding among Indian tribes' provides search strings adapted for different academic databases. The dataset, published under a CC-BY-4.0 license, supports the replication and extension of literature searches on infant nutrition and health promotion within Indigenous communities. Its structured format is likely designed for systematic review methodologies.
Indian tribal communities are the focus of this dataset, which compiles characteristics of studies from a scoping review on infant and young child feeding interventions. The data likely contains details on study design, population, and intervention types for analysis. It is published under an open license to support public health research.
This dataset comprises 1,537,359 data points from the official Facebook accounts of the European People's Party (EPP) political family. It contains both quantitative metrics and qualitative content from a three-year period. The dataset's cross-platform presence on figshare suggests it is a recognized resource for studying political communication.
SESAME Fluxgate Magnetometer Key Parameters contain the H, D, and Z components of Earth's magnetic field, measured as variation relative to an arbitrary baseline. The data are accurate to 1 nT, with 1-minute 'spot' values derived from 1 Hz sampling. The dataset originates from the National Aeronautics and Space Administration and is referenced in a 1995 Space Science Reviews publication.
Approximately 14.2 million measurements of surface water partial pressure of CO2 (pCO2) collected from the global oceans between 1957 and 2019. The data were processed into a uniform file by the Lamont-Doherty Earth Observatory (LDEO) and are available via NOAA's Ocean Carbon Data System (OCADS). Version 2019 adds 567,632 data points from 158 cruise segments, primarily from high-latitude research vessels.
Trendingmovies in 2017 to 2019 is a dataset published on Kaggle. The dataset likely contains information about movies that gained popularity over a three-year period. Specific details such as the number of records, features, and data collection method are not provided in the available metadata.
Kaggle hosts the KIIS Movie Recommender dataset. The dataset likely contains user-movie interaction data for building recommendation systems. Its author, organization, and specific details are unknown.
Records related to cultural communication and its audience impact. The dataset likely contains information about platform interactions and audience responses. It is published on Kaggle, but the author, organization, and specific details are unknown.
User reading and borrowing behavior records published on Kaggle. The dataset likely contains logs of user interactions with library materials. Its specific size, temporal coverage, and creator are unknown.