News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,957 datasets
SCIOPS collected CTD (conductivity, temperature, depth) and bottom pressure data from the Santa Barbara Channel and Santa Maria Basin. The dataset includes observations from multiple transects and cruises, with data organized by cruise code and instrument type. Data collection was funded by the Minerals Management Service and occurred primarily in the 1990s.
1992 CTD data from the JGOFS Equatorial Pacific Process Study, collected along 140°W longitude across five research cruises. Measurements include depth, pressure, temperature, conductivity, salinity, fluorescence, and photosynthetically available radiation. The dataset was compiled by the SCIOPS organization from the Joint Global Ocean Flux Study.
The U.S. JGOFS Arabian Sea Process Study collected conductivity, temperature, depth (CTD) data and mixed layer depths across eleven research cruises. Measurements were taken from October 1994 to January 1996, providing seasonal coverage of monsoon and inter-monsoon cycles southeast of Oman. The dataset includes temperature, conductivity, salinity, potential temperature, density, and derived mixed layer depths.
A survey dataset by AARP, last updated on 2026-04-23, focusing on the media consumption patterns of caregivers. It likely contains information on viewing devices, streaming services, hours spent watching TV, social media use, and perceptions of caregiver depiction in media.
Social media engagement records are provided for digital cultural communication analysis. The dataset is hosted on Kaggle, but its author, organization, and specific creation date are unknown. Its exact size, row count, and file formats are also unspecified.
Kaggle hosts a collection of top-rated movies, with data fetched through Rapid API. The dataset's author, organization, and specific size are unknown. Its last update date is also unspecified.
A dataset titled 'movies' is hosted on the Kaggle platform. The dataset's specific content, size, and origin are not detailed in the provided metadata. Further details such as the number of rows, columns, and the data's creator are unknown and require verification after download.
San Francisco Planning Department review time metrics for project applications, including total days to approval and milestone-specific durations. The dataset is generated daily from the Project and Permit Tracking System (PPTS) by the City of San Francisco. It includes both project-level and event-level metrics, with targets and outcomes for tracking review timelines.
United States public opinion survey data from a poll conducted in February 2026 by ABC News and The Washington Post. The dataset is hosted by the Roper Harvested Dataverse and was last updated in May 2026. It likely contains tabular survey responses on political and social topics.
Information presented by the Canadian Radio-television and Telecommunications Commission to Parliament regarding Bill C-10. The document details proposed amendments to the Broadcasting Act and related legislation. It was last updated on March 17, 2026.
A collection of Instagram comments potentially related to cyberbullying, sourced from the Kaggle platform. The dataset likely contains text data for analysis of online harassment. Specific details on volume, author, and collection date are unavailable from the provided metadata.
Kaggle hosts a dataset titled 'fake_news_data'. The dataset likely contains text samples labeled for veracity, intended for training machine learning models. Its author, size, and specific collection details are not provided in the available metadata.
PRISM is a dataset of movie posters and magazine covers, hosted on Kaggle. The collection likely contains images used in media and entertainment. The specific number of images, collection method, and creator are not detailed in the available metadata.
A dataset from Kaggle concerning global content engagement and reach related to Chinese culture. The raw description suggests it contains metrics on communication and audience interaction. Specific details on size, structure, and authorship are not provided.
Geoscience Australia Data provides a literature review and spatial analysis of the sedimentology and geomorphology of the Northwest Marine Region. The sedimentology information is based on consistent quantitative point assays of grainsize and carbonate content from the MARS database as of August 1, 2007. The dataset was last updated on March 25, —.
A list of top-rated movies from the year 2026. The dataset is hosted on Kaggle, but its specific source, size, and compilation method are not detailed in the provided metadata. The content likely includes movie titles and associated rating scores or rankings.
Electronic Records Express (ERE) management information data is provided by the Social Security Administration. The dataset contains information on evidence collected electronically through the ERE website, prepared for transition to the electronic Disability (eDib) system. The dataset was last updated on 2026-04-03.
Movielens1M is a widely used benchmark dataset containing 1 million movie ratings. The 'traitalign' prefix suggests this version may be aligned or processed for trait-based analysis. It is hosted on Kaggle, but the specific modifications and update date are unknown.
This database lists community solar projects identified as of December 2023. It includes updated low-income and low- and moderate-income provisions for projects based on program data collected through March 2024.
This dataset lists community solar projects identified as of December 2022, with updated low-income and low- and moderate-income provisions based on program data from July 2023. The data is maintained by the Department of Energy and is flagged as deprecated, with current data available from a separate source. Row and column counts are unknown.