News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,995 datasets
The sixth extended mission of Mars Express provides spectral density measurements of the Martian ionosphere. The dataset consists of calibrated spectral information, including a transmit frequency followed by a time series of received power measurements. Browse products include spectrogram overview plots and individual ionograms for each sounding activity.
This database covers over 2,000 brownfield sites across London, equivalent to more than 2% of the land in Greater London. Maintained by the Homes and Communities Agency, it provides polygon boundaries for sites down to 0.1 hectares and includes overlays for transport, deprivation, and environmental assets. It was last updated on 2026-03-25.
Eyak place names cover a vast territory along Alaska's North Gulf Coast from Prince William Sound to beyond Yakutat Bay. This prototype digital atlas draws from archival information preserved at the Alaska Native Language Archive, with names adapted into modern orthography and reviewed with linguist Michael Krauss. The dataset reflects the multicultural nature of the region, overlapping with places also named in Chugach and Tlingit.
Kaggle hosts a dataset titled 'review-chekpoints--2026 06 02--13272-13272'. The dataset likely contains text data related to reviews, possibly with checkpoints or versioning indicated by the title. Its author, size, and specific content are not detailed in the provided metadata.
Cross-Domain Review Dataset is an aspect-level sentiment dataset. The description indicates it includes attributes for retrieval and explainability. The dataset's author, organization, size, and last update date are unknown.
Customer_E-Commerce_Reviews is a text dataset published on Kaggle. Its platform tags indicate a focus on e-commerce and sentiment analysis. The dataset likely contains customer feedback text, though its specific content and scale require verification after download.
A dataset titled Movie_Database, published on Kaggle. Its specific contents, such as movie titles, genres, ratings, or cast information, are not detailed in the provided metadata. The dataset's size, structure, and temporal coverage are unknown.
2014 television program logs submitted by Canadian broadcasting undertakings. The data is provided by the Canadian Radio-television and Telecommunications Commission (CRTC). It includes program logs for that specific year, though the exact volume of records is unspecified.
Approximately 70,000 frames of high-altitude aerial photography cover Alaska. The collection includes 1:60,000 color-infrared and 1:120,000 black-and-white images acquired by the AHAP program. The data is held by the GeoData Center and was captured between 1978 and 1986.
Reddit text data intended for bipolar disorder research, sentiment analysis, and NLP tasks. The dataset was sourced from Kaggle, but its author, organization, and creation date are unknown. The number of posts, file formats, and license details are also unspecified.
Reddit-style post engagement data suitable for practicing exploratory data analysis (EDA). The dataset is hosted on Kaggle, but its author, organization, and specific temporal coverage are unknown. Its size, row count, and file formats are also unspecified.
This dataset of IMDb movie reviews likely contains user-generated text for sentiment classification. It is published on Kaggle, a platform for data science projects. Specific details such as the number of reviews, collection date, and original author are unknown.
A dataset for building recommendation systems, likely using Alternating Least Squares (ALS) and Item Response Theory (IRT) methods. It is hosted on Kaggle, but the specific source, size, and creation date are unknown. The dataset's content and structure must be verified after download.
Genetreta trilix, a Cambrian phosphatic brachiopod, is the subject of this three-dimensional musculoskeletal animation. The video illustrates simulated contraction and relaxation of the main muscle groups during valve opening, highlighting the geometry and spatial arrangement of the musculature relative to the shell. This model underpins analysis of microstructural biomechanics related to brachiopod diversification.
Extracted data from published studies enables effect size calculations for graded motor imagery interventions. This dataset contains sample sizes, outcome measures, and statistical values focused on orthopedic conditions. All source studies are cited in an associated systematic review and meta-analysis manuscript.
An interactive Excel dashboard for analyzing movie trends. The dataset likely contains tabular data on movies, enabling visual exploration of patterns. It was sourced from Kaggle, but specific details about its creation and update history are unknown.
Trending movie data published on Kaggle. The dataset likely contains information about films that are currently popular or gaining attention. Specific details such as the number of records, included features, and time period covered are not provided in the available metadata.
viet_culture is a dataset hosted on Kaggle. The dataset's title suggests it contains information related to Vietnamese cultural topics. No further metadata on its size, source, or specific content is available.
Snapshot, point-probe, and time-average data from a high-fidelity computational simulation of turbulent air flow over a low-pressure turbine blade. The simulation was performed using the open-source PyFR flow solver on over 5,000 NVIDIA K20X GPUs of the Titan supercomputer at Oak Ridge National Laboratory. The data can be used to understand complex three-dimensional unsteady air flow patterns in jet engines.
Golden Raspberry Awards data tracks the annual recognition of the worst in film. The dataset likely contains nominees and winners for categories like Worst Picture and Worst Actor. It is published on Kaggle, but its specific temporal coverage and completeness are unknown.