News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,950 datasets
The U.S. Department of Agriculture's Forest Service provides a geospatial layer depicting areas where silviculture reforestation activities were performed. The data originates from the Forest Service Activity Tracking System (FACTS) and is used for performance measures related to Forest Vegetation Improved. The layer was last updated on March 13, 2026.
The U.S. Department of Agriculture's Forest Service data portrays areas where silviculture activities, such as planting, seeding, and site preparation for natural regeneration, were accomplished. This geospatial feature layer is funded through budget allocation and reported via the Forest Service Activity Tracking System (FACTS) within the Natural Resource Manager suite. The data, last updated on March 13, 2026, is used as a performance measure for agency strategic goals.
The dataset portrays areas where U.S. Forest Service silviculture activities, such as thinning and pruning, were accomplished as part of a performance measure program. It is managed by the Department of Agriculture and was last updated on 2026-03-13. Data is reported through the Forest Service Activity Tracking System (FACTS) within the Natural Resource Manager suite of applications.
Noosa Shire Council provides stormwater infrastructure data for the Noosa region in Queensland, Australia. The dataset includes geospatial layers available in multiple formats, such as GeoJSON and Shapefile, and was last updated in March 2026.
NE Atlantic and other ocean data were collected using SOFAR, RAFOS, and ALACE subsurface floats from October 1984 to June 1993. The Woods Hole Oceanographic Institution gathered this information as part of the World Ocean Circulation Experiment. Measurements include East-West and North-South current components, water temperature, and pressure.
April 5th to 14th, 2006 hydrographic data from 17 full-depth casts along Line W in the Northwest Atlantic. The dataset includes calibrated CTD measurements of pressure, temperature, salinity, and dissolved oxygen, plus water sample analyses for dissolved chlorofluorocarbons (CFCs 11, 12, 113). It was collected by NOAA NCEI during a research cruise on the R/V OCEANUS.
48 current meters deployed on 8 moorings collected ocean current, temperature, and wind data in the equatorial Pacific. This dataset was gathered by a consortium including the University of Southern California and NOAA's Pacific Marine Environmental Laboratory. Measurements span a four-year period from April 1991 to May 1995.
Gulf of Mexico hydrographic data includes water temperature, salinity, and oxygen parameters collected via CTD from the R/V Pelican. The dataset supports calibration of moored current- and pressure-recording inverted echo sounders deployed during two research cruises. Data collection occurred between June 14 and October 3, 2019, and is managed by NOAA NCEI.
Geoscience Australia produced a short video summarizing the value of its work for managing Australia's marine jurisdictions. The video is part of a series of six films communicating the agency's value to the nation. Further information is available via the agency's website.
A dataset from Kaggle explores digital gaming culture and participation among youth. It likely contains records related to screen time and engagement with the game Elden Ring. The author, organization, and specific data scale are unknown.
Top rated movies is a dataset published on Kaggle. The dataset likely contains information about films and their user or critic ratings. Metadata is minimal; actual content requires verification after download.
Trained TCQ codebooks enable KV cache compression at 2-3 bits per parameter, as presented in the paper 'Closing the Gap: Trellis-Coded Quantization for KV Cache at 2-3 Bits'. The dataset was created by authors buun and Claude (Anthropic) and was last updated in April 2026. It includes the quantization codebooks, training scripts, and the associated research paper.
A text dataset titled 'News Reports' uploaded by author 'bobotsalos' to the Hugging Face platform. The dataset was last updated on 2026-05-21 10:48:59. Columns, sample data, and specific content are unknown.
Statistical contrasts between comparison durations, expressed as differences of log-scaled reaction time values. The dataset contains two-sided t-tests with Bonferroni correction, authored by Valeria Centanino and last updated in March 2026.
Statistical contrast estimates from a study of duration preferences across streams, presented as Bonferroni-corrected two-sided t-tests. The data contains difference values expressed on a square-root scale. The dataset is a small 8.5 KB CSV file authored by Valeria Centanino.
A 2026 dataset lists public open spaces in New York City, primarily managed by NYC Parks, including partner sites like Central Park. It provides the percentage and acreage of land designated for active versus passive recreation in each space. The data is maintained by the City of New York for environmental review purposes.
A collection of transcripts from the science fiction novel 'Project Hail Mary' by Andy Weir and its film adaptation. The dataset includes a full transcript and a separate version containing only dialogue between the characters Grace and Rocky. The source platform is Kaggle, but the original author and compilation method are unknown.
A list of movies considered among the best and latest, sourced from the Kaggle platform. The dataset likely contains titles and associated ranking or rating information. Specific details on the number of entries, columns, and compilation methodology are not provided in the available metadata.
News and Cultural Content Delivery Records from a 5G Vehicular Ad-hoc Network (VANET) smart city context. The dataset is hosted on Kaggle. The specific volume, creation date, and authorship details are not provided in the available metadata.
Aplication_Review_Labeled_Dataset is a labeled dataset published on Kaggle. Its title suggests it contains data related to application reviews, likely for classification tasks. The dataset's specific content, size, and authorship are unknown.