Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,984 datasets
A collection of user reviews for mobile devices, sourced from the Kaggle platform. The dataset likely contains textual feedback and ratings. Specific details on volume, time period, and collection method are unavailable from the provided metadata.
Top rated movies dataset is a collection of movie information and ratings published on Kaggle. The dataset's specific contents, such as rating sources, time period, and number of entries, are not detailed in the provided metadata. Its author, organization, and exact creation date are unknown.
Giving access to line representations of designated Wild and Scenic River segments managed by the Bureau of Land Management in Oregon and Washington. The data is part of the National Wild and Scenic Rivers System, established by Congress in 1968 to preserve rivers with natural, cultural, and recreational values.
Polygon representations of Wild and Scenic River Corridors managed by the Bureau of Land Management in Oregon. It was created to preserve rivers with natural, cultural, and recreational values under the 1968 National Wild and Scenic Rivers System Act.
Decile dispersion ratios measure income inequality by comparing the average income of the richest 10% of a population to the poorest 10%. The dataset is produced by the LAC Equity Lab, part of the World Bank. It provides a focused metric for analyzing the extremes of income distribution across different economies.
LAC Equity Lab data quantifies income disparity by comparing the average income of the richest 25% to the poorest 25% across economies. This decile dispersion ratio provides a focused measure of inequality at the extremes of the income distribution. The dataset is produced by the World Bank's LAC Equity Lab.
Dataset_genrefilm is a dataset hosted on Kaggle. The title suggests it contains information related to movie genres. Metadata is minimal; actual content requires verification after download.
Political news dataset published on Kaggle. The dataset likely contains text articles or headlines related to political events. Specific details such as the number of rows, source, and time period are unknown.
10,000 movie records sourced from IMDb, a major online film database. The dataset is hosted on Kaggle, a popular platform for data science competitions and projects. Its specific contents and creation date are not detailed in the available metadata.
Fake-News is a dataset hosted on Kaggle. The dataset likely contains text articles or social media posts labeled as real or fake news. Metadata is minimal; actual content requires verification after download.
Kaggle hosts a dataset titled 'Movies_like_all'. The dataset likely contains information related to films and their attributes for generating recommendations. Metadata is minimal; actual content requires verification after download.
This research collection by Jorge Leal da Silva analyzes neo-Pentecostal gospel lyrics, videos, and official documents from 2011 to 2018 across Brazil, New York, Jerusalem, and India. It applies Critical Discourse Studies to compare Indian behavioral change programs like Mission LiFE with Latin American consumption patterns through the lens of musical cultural evidence.
SoloHI Level 1 data provides decompressed, uncalibrated image data from the Solar Orbiter Heliospheric Imager. The images are rectified so the right side corresponds to the spacecraft's sunward (+X) direction, with data units expressed in DN. The dataset is maintained by the National Aeronautics and Space Administration and was last updated on March 13, 2026.
SoloHI data cube files contain time-lapsed movies of solar imagery in IDL/SolarSoft MVI formats. The files are structured with a header, image headers, and byte-scaled image arrays, with all images sharing the same dimensions. The dataset is provided by the National Aeronautics and Space Administration.
PubTables-v2 is a dataset for full-page and multi-page table extraction, created by kensho and released in February 2026. It is described as large-scale and comes in 3 collections, each containing tables in a specific context.
Kaggle hosts a collection of 50,000 movie reviews intended for binary sentiment classification. The dataset is balanced, suggesting an equal number of positive and negative labels. The original author and specific collection details are unknown.
A 30-day record of water level measurements from a temporary pressure transducer deployed at Beaver Lake and the Stillwell Hills in the Australian Antarctic Territory. The data was collected by the Australian Antarctic Data Centre (AU_AADC) between December 1996 and January 1997. From these measurements, a value for Mean Sea Level (MSL) was derived for the location.
Data from the 1987 Kuroshio Current Study includes station data, XBT, STD, GEK, Acoustic Profiling Current Meter, and DBT measurements. The Japan Oceanographic Data Center submitted the data, which covers the East China Sea and other areas from January to November 1987. The dataset comprises 37 distinct reference numbers (L00797 to L00833), each representing different institutions, ships, and cruise dates.
Review checkpoints likely contain text data for analysis. Published on Kaggle, this dataset's specific content and scope require verification after download. The author and organization details are unknown.
June 1957 to December 1958 data provides Southern Hemisphere sea-level pressure and 500 millibar height measurements on a 5-degree latitude/longitude grid from 15S to the South Pole. The dataset was created by SCIOPS from original South African data, which was interpolated to fill missing grid points. It captures atmospheric conditions during the International Geophysical Year.