News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
11,001 datasets
A 1998 map layer portrays the shallowest principal aquifers of the conterminous United States, Hawaii, Puerto Rico, and the U.S. Virgin Islands as polygons. It was developed as part of the effort to produce the printed "Ground Water Atlas of the United States" series at a 1:2,500,000 scale. This dataset is a replacement for a previous July 1998 map layer.
Visible and Infrared satellite imagery from NOAA's NIMBUS, TIROS, ITOS, ESSA, and NOAA satellite campaigns, spanning August 1964 to March 1985. The collection includes quadrilateral-shaped images and composite maps in Mercator, Polar Stereographic, and Conic projections. Image files were created by scanning hard-copy prints, negatives, and microfilm from the NOAA archives.
Nitrogen dioxide volume mixing ratios are provided at 8 pressure levels between 24 and 40 km altitude. Data coverage spans from pole to pole at 5-degree increments and includes both orbit-track and monthly-averaged records. The dataset was produced by SCIOPS from the Solar Mesosphere Explorer satellite's visible spectrometer measurements between 1982 and 1986.
NOAA/WDS Paleoclimatology archives a tree-ring width dataset from Big Cypress State Park, Louisiana. The data provides a paleoclimate record covering the period from 750 to 37 calendar years before present. It was published by the NOAA National Centers for Environmental Information in 1987.
NOAA/WDS Paleoclimatology archives fire history data derived from tree ring analysis in New Mexico. The dataset covers a time period from 638 to -43 calendar years before present (BP). It is maintained by the NOAA National Centers for Environmental Information (NCEI) and was last updated in 1993.
NEMS/Nimbus-5 Level 2 Output Data contains surface reflectivity, water vapor, liquid water, layer thickness, temperature at standard pressure levels, surface brightness temperature, and surface type information. The dataset includes input antenna and brightness temperatures at five microwave channels (22.235, 31.4, 53.65, 54.9, and 58.8 GHz). Principal investigator David H. Staelin from MIT led the experiment, which collected data from December 17, 1972, to October 31, 1973.
Synthetic Aperture Radar data from the JERS-1 satellite launched by Japan's National Space Development Agency on February 11, 1992. The dataset consists of Standard Geocoded Images with a spatial resolution of 12.5 meters, processed with radiometric and geometric corrections. Data production concluded in October 1998.
IAI-Science-ISP-3-041 is a final report evaluating the feasibility of estimating El Niño-Southern Oscillation (ENSO) effects on sugar cane yields. The project used mechanistic-model simulations within a Geographic Information System (GIS) for experimental zones in several Latin American countries. The work was conducted by Principal Investigator Angel Utset and the organization SCIOPS, with a last update recorded in December 1999.
An 1899-1939 dataset of extratropical cyclone positions digitized from historical weather maps. It contains 12-hourly positions with direction and speed of movement for cyclones in the Northern Hemisphere. The data was archived by the National Climatic Data Center (NCDC) and is provided by NOAA NCEI.
Australian Antarctic Division archives contain logbooks detailing ice compression experiments conducted in 1977 and 1979-1982. The data consists of setup methods and results from tests using five 'Box Hill' rigs at controlled temperatures of -10°C and -30°C. The 1977 series ended prematurely due to cold room failure.
WBAN-31 Adiabatic Charts contain thousands of historical weather observations recorded by the Weather Bureau, Army, and Navy. The collection spans from the 1930s to the mid-1990s, covering stations worldwide. Parameters include pressure, height, temperature, dew point depression, wind direction, and wind speed.
Kaggle hosts a dataset of discrete tone images with distortions intended for compression research. The dataset's author, organization, size, and last update date are unknown.
The Books Price & Rating Dataset contains information on books scraped from the web using Python. The dataset likely contains price and rating data for various books. The author, organization, and specific details about the data's origin are unknown.
A dataset titled 'Movies' is hosted on the Kaggle platform. The dataset's specific content, size, and origin are not detailed in the available metadata. Further details about the columns, sample data, and license require verification after download.
Published on Hugging Face by alihxn on 2026-04-24. The dataset likely contains information related to game compression. Its specific content, size, and structure require verification after download.
A UKCCSRC Call 1 project paper presents a validated two-fluid transient flow model for simulating outflow from failed high-pressure CO2 pipelines. The model accounts for thermal and mechanical non-equilibrium effects using constitutive relations for inter-phase transfer and couples fluid dynamics with a transient heat conduction model. The work was conducted by the British Geological Survey and validated with experimental data from UK/EC funded projects.
IMDB top 100 movies is a dataset listing films ranked by the Internet Movie Database. It likely contains ratings, genres, and other details for each entry. The dataset's author, organization, and last update date are unknown.
AlaminI created a bilingual Hausa-English STEM reasoning dataset containing 2,640 high-quality question-answer pairs. The dataset was translated from the STEM-Reasoning-Complex dataset using a culturally-embedded framework called 'Shehin Malamin Kimiyya'. It was last updated on March 7, 2026.
Bangla_Fake_News is a dataset for detecting misinformation in the Bengali language, sourced from Kaggle. The dataset's specific size, collection method, and author are currently unknown. It likely contains text samples labeled for veracity, intended for natural language processing tasks.
Aya Nos is a revised and expanded red-teaming dataset for the Iberian linguistic sphere. It provides human-reviewed prompts in Spanish, Galician, and Portuguese, created by proxectonos and last updated in March 2026.