Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,998 datasets
SCIOPS collected water temperature and pressure data at 2-minute intervals throughout the year 2000 at a depth of 18 meters in McMurdo Sound, Antarctica, using a moored SB-39 Seabird temperature logger. The dataset captures daily and yearly variations, with temperature precision reported as plus or minus about 0.005°C. Pressure data is included to allow for the calculation of tidal excursions over the same period.
World Development Indicators provide data on the composition of service exports for countries, expressed as a percentage of total service exports. The data covers categories including communications, computer and information services, royalties, and other business services. The World Bank compiles this data from international trade and balance of payments statistics.
World Development Indicators data measures the share of communications, computer, and information services in total service imports for various countries. The indicator is expressed as a percentage of total service imports, covering transactions between residents and nonresidents. It is compiled by the World Bank from officially recognized international sources.
NOAA NCEI Accession 9400159 contains depth, temperature, salinity, and oxygen data collected for the World Ocean Circulation Experiment. The Chinese National Oceanographic Data Center provided the data, which was gathered from three research vessels between November 1992 and December 1993. Data submission was on two tapes and four diskettes.
TMDB movies dataset likely contains ratings, release dates, and overviews for top-rated films. The dataset originates from Kaggle, but its author, size, and last update date are unknown. Its description suggests it aggregates information from The Movie Database.
Kaggle hosts a dataset related to movies. The specific content, scale, and origin are not detailed in the provided metadata. Actual data characteristics require verification after download.
1.7 million rows of cleaned Indonesian user reviews for the Grab super-app, sourced from the Google Play Store. The dataset likely contains textual feedback and associated metadata from a major Southeast Asian market. Its origin and specific collection timeframe are not detailed in the provided metadata.
4.5 million cleaned user reviews for the DANA Wallet mobile application, sourced from the Google Play Store. The dataset is focused on Indonesian users and appears to have been collected for analysis. The original author, organization, and specific collection date are not provided.
Tamil text classification dataset for sentiment analysis and fake news detection. The dataset is hosted on Kaggle, but the author, organization, and creation details are unspecified. The number of rows, file formats, and license information are unknown.
Indonesian user reviews for the Mobile JKN application sourced from the Google Play Store. The dataset likely contains textual feedback and ratings from users of the Indonesian national health service app. The author, organization, and specific collection date are unknown.
Cleaned Indonesian user reviews for the Binance App from Google Play Store. The dataset is hosted on Kaggle, but the author, organization, and specific collection details are unknown.
Cleaned Indonesian user reviews for the Facebook application sourced from the Google Play Store. The dataset was uploaded to Kaggle, but the author, organization, and specific collection details are not provided. The number of reviews, date of collection, and other metadata are unknown.
English_Movie is a dataset hosted on Kaggle. The dataset's title suggests it contains information related to English-language films. Metadata is minimal; the specific content, scale, and origin require verification after download.
Kaggle hosts a dataset titled 'Movies_prediction'. The dataset likely contains features for predicting movie-related outcomes, such as box office performance or ratings. Its specific contents, size, and origin are not detailed in the available metadata.
MovieDetails is a dataset hosted on Kaggle. The dataset's specific contents, scale, and origin are not detailed in the provided metadata. Further inspection after download is required to confirm its structure and coverage.
2022 Changelog News Transcripts is a collection of complete transcripts from the 2022 episodes of the Changelog News podcast. The dataset was created by willtheorangeguy and is sourced from a GitHub repository. It was last updated on the platform in April 2026.
A dataset of 1,432 survey records from a study on life conditions in rural zones, published by Frankcc on Kaggle. It contains 23 columns including demographic, family, asset, income, and expense variables. The target variable is a binary indicator for depression status.
Harmonic Frontier Audio created a high-fidelity human vocal dataset designed for AI training and speech research. The preview version, v0.95, provides a compact reference set demonstrating the quality and formatting conventions of their Human Vocality Primitives series. The dataset was last updated on March 6, 2026.
Heilbronn, Germany's urban development plan for the 'Cäcilienwiese' children's playground and sports field. The dataset is provided as a WFS service transformed according to the INSPIRE directive, based on an XPlanung dataset in version 5.4. It is managed by the Bundesamt für Kartographie und Geodäsie and was last updated on March 12, 2026.
Schöningen, Germany, provides geospatial data for a specific development zone. The dataset describes a special area designated for accommodation and sports, formatted in XPlanGML Version 5.4. It is served via a Web Feature Service (WFS) and was last updated by the Bundesamt für Kartographie und Geodäsie on March 12, 2026.