Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,992 datasets
MARS Express MARSIS Active Ionospheric Sounder data includes full-resolution spectral information calibrated in spectral density units. The dataset covers the seventh extended mission of the Mars Express spacecraft, providing spectrogram overview plots and individual ionograms for each sounding activity. It was produced by the National Aeronautics and Space Administration and last updated in March 2026.
A dataset of 1503 records collected from a medical hospital via a Google form questionnaire. The data includes 15 attributes, with 10 selected for this release, including 9 features and 1 target attribute. The target attribute 'Feeling Anxious' was chosen as a predictor for postpartum depression.
World Ocean data from Air-Launched Autonomous Micro Observer (ALAMO) profiling floats, which measure temperature, salinity, and pressure. The dataset was developed by NOAA NCEI for deployment in challenging environments like tropical cyclones and around sea ice. It contains measurements from July 15, 2014, to November 11, 2018.
Eight oceanographic parameters, including water pressure, temperature, salinity, and dissolved oxygen, were collected via CTD from the Brazilian Navy research vessel Cruzeiro do Sul. Measurements were taken across the Southwest Atlantic Ocean during a 38-day expedition from April 1 to May 8, 2015. The dataset was archived by NOAA NCEI under accession number 0184862.
A dataset of Tamil memes annotated for troll classification, created to facilitate computational modeling of trolling in Indian languages. The dataset, named TamilMemes, contains meme images categorized as "troll" or "not_troll" and includes Latin transcriptions of the text from the memes. It was released by researchers associated with the University of Galway and presented at the WILDRE-5 workshop in 2020.
TamilMemes is a dataset for identifying offensive or trolling content in region-specific Tamil-language memes. The resource, created by researchers at the University of Galway and presented at WILDRE-5 in 2020, contains meme images categorized as 'troll' or 'not_troll' and includes Latin-transcribed text from the memes. A shared task evaluation found the top-performing system achieved a weighted average F1-score of 0.55.
Syrian conflict news articles labeled as fake or credible. The dataset was created by Fatima K. Abu Salem of the American University of Beirut. Credibility was determined by matching crowdsourced article information against the Syrian Violations Documentation Center database.
A dataset of human facial expressions intended for emotion detection tasks using deep learning models. The dataset is hosted on Kaggle, but specific details about its size, origin, and creation date are not provided.
World maps and movies of the Rate of TEC change (ROTI) index, a measure of ionospheric disturbance. The data is derived from Total Electron Content (TEC) measurements between GPS satellites and ground receiver stations, calculated at 15-minute intervals. The dataset is provided by the National Aeronautics and Space Administration and was last updated in March 2026.
Kaggle hosts this dataset focused on restaurant recommendations. The data likely contains user preferences, restaurant features, or interaction logs to support AI-driven recommendation systems. Its specific size, origin, and update history are not detailed in the provided metadata.
Amazon Reviews Dataset likely contains user-generated text reviews for products sold on the Amazon platform. The dataset is hosted on Kaggle, a popular platform for data science competitions and resources. Specific details regarding the number of reviews, time period covered, and the original data collector are not provided in the available metadata.
Kaggle hosts a dataset listing top-rated movie names. The dataset's specific contents, such as rating scores, genres, or release years, are not detailed in the provided metadata. Its size, origin, and the criteria for 'top-rated' are unknown.
newsSU is a text dataset published on Kaggle. The title suggests it contains news articles, but the specific source, size, and collection method are not provided. Further details about the content, such as the number of articles, time period, or language, require verification after download.
One short video summarizing Geoscience Australia's work on managing Australia's marine jurisdictions. The film is part of a series of six produced to communicate the agency's value to the nation. The video content is provided in HTML format.
Movie data likely tracking popularity or performance metrics over multiple years. The dataset is hosted on Kaggle, a platform for data science projects. Specific details regarding its creation, size, and authorship are not provided in the metadata.
AG News Cleaned is a dataset of news articles likely intended for text classification tasks, sourced from Kaggle. The title suggests it contains cleaned versions of articles from the AG News corpus, a common benchmark for topic classification. No details on the number of articles, cleaning methodology, or publication date are available in the provided metadata.
Podcast audio data likely intended for media production or machine learning tasks. The dataset is hosted on Kaggle, but its specific contents, size, and creation details are not described. Metadata is minimal; actual content requires verification after download.
The Mars Express MARSIS Active Ionospheric Sounder (AIS) full resolution data set includes all spectral information calibrated in units of spectral density for the entire Mars Express nominal mission. The data set consists of a transmit frequency followed by a time series of spectral density measurements of the received power. Browse products contain a spectrogram overview plot and individual ionograms for each sounding activity.
Mars Express MARSIS Active Ionospheric Sounder data includes calibrated spectral density measurements for the entire Mars Express nominal mission. The dataset consists of a transmit frequency followed by a time series of spectral density measurements of received power. Browse products contain spectrogram overview plots and individual ionograms for each sounding activity.
The sixth extended mission of Mars Express provides spectral density measurements of the Martian ionosphere. The dataset consists of calibrated spectral information, including a transmit frequency followed by a time series of received power measurements. Browse products include spectrogram overview plots and individual ionograms for each sounding activity.