News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,956 datasets
BMR Research Newsletter 15 contains titles from Geoscience Australia publications covering petroleum geology topics. The newsletter lists 15 titles related to basin studies, seismic interpretation, and petroleum databases. It was last updated in March 2026.
A 1987 research newsletter from the Bureau of Mineral Resources compiles titles and summaries of 21 geoscience projects and publications. The collection documents Australian geological research on topics including basin tectonics, seismic surveys, and mineral deposits from that year.
Geoscience Australia's report compiles reviewed palynology data from 407 wells and includes new sampling results from 20 wells in the Great Artesian Basin. The study standardizes biostratigraphic zonation schemes and updates stratigraphic charts across the Surat, Eromanga, and Carpentaria basins. It was conducted by MGPalaeo to support a consistent geological framework for groundwater management.
A public dataset lists 500 top Instagram influencers from the Indian state of Uttar Pradesh. The data was posted on Kaggle, but the author, collection method, and last update date are unknown. It includes influencers from all niches.
Melaka Restaurant Review Dataset is a collection of restaurant reviews from the Malaysian state of Melaka, published on the Kaggle platform. The dataset's specific size, columns, and collection methodology are not detailed in the available metadata. Its content likely pertains to customer feedback on dining establishments in a popular tourist destination.
A collection of customer reviews for fashion products sold on Amazon. The dataset is hosted on Kaggle, but its specific size, time range, and author are unknown. Columns and sample data are unavailable, limiting immediate assessment of its content.
movie_genre is a dataset hosted on Kaggle. The title suggests it contains information for classifying or analyzing movie genres. No further metadata, such as column descriptions, size, or authorship, is available.
7.97 million live-chat messages from YouTube broadcasts of the BGMI esports scene, annotated with gold-standard sentiment labels. The data captures audience reactions in Hinglish, a blend of Hindi and English. The dataset's origin, author, and specific collection period are not detailed in the available metadata.
A collection of highly rated films, with June 2026 as the indicated time period. The dataset, sourced from Kaggle, likely contains a list of movies ranked by user or critic ratings. Its specific contents and creator details require verification after download.
An extract of 1 million comments from the pushshift.io archive of all Reddit comments from May 2019. The data includes 25,000 comments from each of the 40 most frequented subreddits for that month, with removed, deleted, and very short comments excluded. It was created by an individual contributor and is shared under a CC0 public domain license.
A dataset of top-rated movies, likely containing information about titles and ratings. It is hosted on Kaggle, but the specific source and creation date are unknown. The dataset's content and structure require verification after download.
NOAA NCEI Accession 9400167 contains Conductivity, Temperature, Depth (CTD) and barometric pressure data. The dataset includes 4,757 profiles assembled into 77 files by Russ Burgett of the Woods Hole Oceanographic Institution. Measurements were collected over two decades, from October 29, 1972, to July 26, 1992.
Pressure gauge data from fixed platforms in the Chukchi Sea and Gulf of Alaska provides time-series measurements of seawater pressure and temperature. The dataset covers a period from September 17, 1986, to October 1, 1987, and was processed by the National Oceanographic Data Center (NODC) into the standard F017 format. It is published by the National Oceanic and Atmospheric Administration.
Hydrostatic pressure, water temperature, instrument depth, and other data from a United States fixed platform provide time-series measurements from the North Pacific Ocean. The National Oceanic and Atmospheric Administration processed the data to the NODC standard Pressure Gauge (F017) format. Measurements were collected between January 19 and July 1, 1984.
Data from R/V Wecoma cruise W9605A off the Oregon coast, collected May 10-15, 1996. The dataset contains conventional CTD casts and SeaSoar sampling from six tows, providing hydrographic measurements along standard lines. It was collected by NOAA and covers a coastal region from 42.5°N to 43.4°N and from 124.5°W to 125.2°W.
NCEI Accession 8500049 contains hydrostatic pressure and water temperature measurements from the Cape Henlopen station in the North Atlantic Ocean. The National Oceanic and Atmospheric Administration processed the data into the NODC standard Pressure Gauge (F017) format. Measurements were collected from April 10, 1981, to November 1, 1982.
Uttarakhand Eco-Homestay Guest Reviews 2024 contains guest reviews from eco-homestays across three major cities in the Indian state of Uttarakhand. The dataset was uploaded to Kaggle, but details on the author, collection method, and exact size are unknown. The data appears to be from 2024.
NOAA NCEI Accession 0184338 contains surface underway measurements of partial pressure of carbon dioxide in the atmosphere and water, alongside sea surface salinity, temperature, and barometric pressure. Data were collected by the Antarctic Research and Supply Vessel Laurence M. Gould in the Southern Ocean, Drake Passage, and South Atlantic Ocean from February 2018 to February 2019 using a CO2 gas analyzer and equilibrator.
NASA's Magellan Compressed Thrice Mosaicked Image Data Records (C3-MIDRs) are derived from radar imagery of Venus. The dataset consists of mosaics generated by computing 3x3 pixel arithmetic moving averages from higher-resolution C2-MIDRs, with each mosaic in a sinusoidal equal-area projection. Each C1-MIDR source file has 7168 lines and 8192 samples, arranged on CD-ROM as 56 VICAR2-format files of 1024 x 1024 pixels each.
932 blood pressure measurements were collected in a community screening effort in Jordan. The dataset, created by Anas Khaleel, was last updated in March 2026 and is available under a CC BY 4.0 license. It is stored in a small 5.5 KB XLS file.