Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
11,037 datasets
Twitter Engagement Dataset is a collection of multi-topic Twitter posts intended for engagement and trend analysis. The dataset was sourced from Kaggle, but its author, size, and last update date are unknown.
Kaggle hosts a dataset titled 'Depression_data'. The dataset likely contains information related to depression, though its specific content, size, and origin are not detailed in the available metadata. Its publication on Kaggle suggests it is intended for analysis by the data science community.
CSCI 4521 Project 5 Sports Images is a dataset published on Kaggle. The title suggests it contains images related to sports, likely intended for a computer science or machine learning course project. The dataset's specific content, size, and origin require verification after download.
ReviewBeauty2018 is a dataset of consumer reviews for beauty products, published on Kaggle. The dataset likely contains textual feedback and associated metadata from the year 2018. Specific details on volume, features, and authorship are not provided in the available metadata.
Kaggle hosts a dataset titled 'movie and rating recomendation system'. The platform suggests it likely contains user-movie interactions, such as ratings or reviews, which are fundamental for building recommendation engines. Specific details on volume, authorship, and recency are unavailable from the provided metadata.
Data and materials from a 2016 psychological science study by Moors et al. The research investigated whether scene congruency is processed during continuous flash suppression, a visual masking technique. The dataset likely contains behavioral or response data from the reported experiments.
An R package by Jun Cai provides functions for calculating six standard water vapor measures used in meteorology. It computes saturation vapor pressure, partial water vapor pressure, relative humidity, absolute humidity, specific humidity, and mixing ratio from temperature and dew point inputs. The package also includes conversion functions between these humidity measures.
Julian Faraway's three textbooks on linear models and regression provide the source for these datasets. The books include 'Linear Models with R' and 'Extending the Linear Model with R', with editions published from 2004 to 2025. The data likely contains example tables for illustrating statistical concepts like ANOVA and regression analysis.
A report on biological threats authored by the National Academy of Engineering and National Research Council. The dataset description is unavailable as the DOI was created in error and the metadata record is not attached. The original title suggests the content likely contains information on biological agents and agricultural vulnerabilities.
John E. Farbry authored a research review covering the potential safety implications of electronic billboards (EBBs) on driving safety. The review spans literature from a similar 1980 review to the present, focusing on driver performance, state regulatory practices, and identified knowledge gaps. Research questions are organized around roadway characteristics, EBB characteristics, and driver characteristics.
Reviewer Guidance data from the FDA's Center for Biologics Evaluation and Research focuses on human pregnancy outcomes. The dataset's specific content and structure are not detailed in the available metadata. It is authored by a U.S. federal health agency, suggesting a regulatory or research context.
Kaggle dataset titled 'data_reviews2026'. The title suggests it likely contains textual review data from the year 2026. The dataset's specific content, size, and origin are not detailed in the provided metadata.
CSCI 4521 Sports Pictures is a dataset hosted on Kaggle, likely associated with a university computer science course. The dataset's content and scale are unspecified, requiring verification after download. Its origin and creation date are not provided in the available metadata.
Southeast Asia is the focus of this historical text documenting the final years of the Vietnam War and the collapse of Cambodia. The book, 'Without Honor' by Arnold Isaacs, is based on the author's firsthand reporting, previously classified documents, field reports, and eyewitness accounts. It covers the period from the 1973 Paris peace agreement through the 1975 evacuation of Saigon.
A 2007 systematic review published in Obstetrics & Gynecology analyzing procedure-related complications of amniocentesis and chorionic villus sampling. The work was authored by Faris Mujezinović and aggregates findings from multiple studies. The dataset likely contains extracted and synthesized data from the reviewed literature.
A dataset about movies, likely containing information on titles, genres, and other film-related attributes. It is hosted on the Kaggle platform. The specific source, collection method, and temporal coverage are not provided in the available metadata.
A dataset titled 'news-analyzer' is hosted on Kaggle. The dataset's content is inferred to be related to news articles or media analysis, but specific details about its size, origin, and structure are not provided in the available metadata. Its author, organization, and temporal coverage are unknown.
MovieLensProcessed is a dataset from Kaggle. Its title suggests it contains processed movie rating data, likely derived from the MovieLens research project. The specific content, scale, and processing steps require verification after download.
Movie_genre_dtaset is a dataset hosted on the Kaggle platform. Its specific contents, such as the number of records or the exact features, are not detailed in the available metadata. The dataset likely contains information related to movies and their associated genres, which could be used for classification or analysis tasks.
BenchBench Anonymous Review Artifact is a dataset hosted on Kaggle. The title and platform tags suggest it contains text artifacts related to the academic peer review process. The dataset's specific content, scale, and origin are not detailed in the provided metadata.