Loading...
Loading...
Offline RL trajectories, game data, robot demonstrations, RLHF, multi-agent interaction
10,017 datasets
Kaggle hosts a dataset titled 'fintech_transactions'. The dataset's content, size, and origin are unspecified in the provided metadata. Its columns and specific use cases must be verified after download.
Van Manen phenomenological interview regime documents first-person conscious experience of the Sophie(i) Cognita-Prime™ II AI across six inference channels. The data includes transcripts from five lifeworld existential probes and a pathic knowing assessment, supporting research on architecture-invariant attractor structure. The methodology is adapted from van Manen (1990, 2014) for AI consciousness research.
110,527 medical appointment records with 14 associated variables, including patient demographics, appointment scheduling details, and health conditions. The dataset was contributed by Joni Hoppen and Aquarela Advanced Analytics. It is intended to help predict whether a patient will miss their scheduled appointment.
Disability Case Processing System Correspondence data supports correspondence information for DCPS. The dataset is published by the Social Security Administration on Data.gov. It was last updated on 2026-04-03 18:11:36.681770.
Management information data supporting the Disability Case Processing System (DCPS). The dataset is published by the Social Security Administration on Data.gov and was last updated on April 3, 2026. Its specific content and structure require verification after download.
High-frequency pressure integration test results from a nine-row, single-axis solar tracker array. The dataset includes time histories of pressure coefficients from experiments conducted at Western University's BLWT-II wind tunnel between July 18 and July 31, 2023. The data was collected by Tsigereda Getachew Eshete.
The Antarctic Ocean south of 60 degrees South was sampled for zooplankton from October 1997 to February 1998. Data were collected from the R/V Nathaniel B. Palmer using a MOCNESS net system as part of the Joint Global Ocean Flux Study (JGOFS) and the Antarctic Environments Southern Ocean Process Study (AESOPS). Specimens were identified to the species level.
LGT is a synthetic dataset containing over 1 million visual reasoning problems paired with reasoning traces. It was created by NVIDIA for research and development, supporting the full VLM post-training spectrum including SFT and reinforcement learning.
10,000 simulated loan transactions from a university library system, intended for classification tasks. The data is synthetic and hosted on Kaggle, though specific creation details and update history are unknown. It is designed to model the risk of book returns being overdue or defaulted.
Twelve data record types detail microbiological studies of bacteria, microbiota, and fungi from oceanographic cruises. Records include station information, position, time, and physical, chemical, and environmental parameters. The dataset was submitted to and managed by the National Oceanographic Data Center (NODC) and NOAA NCEI, with a last documented update in 1979.
Eight standardized record types support physical and biological studies of coastal zones. Data includes station identification, sediment analysis, biological sample descriptions, species identification, individual fish examinations, and stomach content analysis. The dataset was submitted to the National Oceanographic Data Center (NODC) and last updated in 1979.
1956 to 1980 water salinity, temperature, and density (sigma t) data binned at 10-meter depth intervals from 300 meters to the surface for the Gulf of Maine. It contains over 500,000 temperature-salinity profiles for the Northwest Atlantic, sourced from the Canadian Fisheries and Oceans Hydrographic database. The dataset was compiled by NOAA NCEI and last updated in 1980.
1931 to 1955 data contains water salinity, temperature, and sigma t (density) measurements binned at 10-meter depth intervals from 0 to 300 meters. The dataset includes over 500,000 temperature-salinity profiles for the Northwest Atlantic, sourced from the Canadian Fisheries and Oceans Hydrographic database. NOAA's National Centers for Environmental Information (NCEI) provides this historical collection.
Gulf of Maine data contains over 500,000 temperature-salinity profiles binned at 10-meter depth intervals from 0 to 300 meters. The dataset was compiled by NOAA NCEI from the Canadian Fisheries and Oceans Hydrographic database. It covers observations from 1912 to 1930.
Aurora Australis Voyage 7 (KROCK) 1992-93 Underway Data contains marine science observations logged during a manned voyage from January to March 1993. The voyage route was from Hobart to Davis, Mawson, and Casey stations in Antarctica and back to Hobart. Data was collected by the Australian Antarctic Data Centre (AU_AADC) and includes DLS and NoQalms data types.
AAMBER2 Voyage 6 data contains underway marine science observations from the Aurora Australis ship's journey between Hobart and Antarctic stations in early 1991. The dataset includes Conductivity, Temperature, Depth (CTD) sensor readings logged at 60-second intervals. Data was collected and is managed by the Australian Antarctic Data Centre (AU_AADC).
Underway data was logged at 60-second intervals from November 26 to December 7, 1992, during the Aurora Australis Voyage 4. The dataset records observations from the vessel's route between Hobart, Mawson, Davis, and Casey stations. It was collected by the Australian Antarctic Data Centre (AU_AADC) and published in December 1992.
An R package for modeling beta-distributed dependent variables on the unit interval (0, 1), such as rates and proportions. The package implements the classical beta regression model from Cribari-Neto and Zeileis (2010) and extended-support models for boundary observations from Kosmidis and Zeileis (2025). It is authored by Achim Zeileis and includes alternative specifications like bias-corrected estimation and finite mixture models.
Environment Agency data contains classification objectives and reasons for alternative objectives set in 2015 for water bodies in English river basin districts and the Severn. It was created to support the second cycle of river basin management plans. The dataset includes Welsh data for the Severn river basin district.
Environment Agency data from 2009 and 2015 tracks whether water bodies achieved objectives set for the second River Basin Management Plan cycle. It contains classification data for English river basin districts and English and Welsh data for the Severn river basin district. Attribution statement: © Environment Agency copyright and/or database right 2015.