Loading...
Loading...
Telescope observations, star catalogs, exoplanet surveys, galaxy morphology, gravitational waves, spectroscopy
2,977 datasets
Approximately 8.5 million 512x512 pixel JPEG cutouts of galaxies centered on their source, sourced from the DESI Legacy Survey Data Release 8. The dataset, created by Smith42, includes a 98% training, 1% validation, and 1% test split and was last updated in September 2025. It also contains accompanying metadata with galaxy properties.
The ACCFELE project addresses a scarcity of applied studies and research materials in Spanish as a foreign language phonetics. This dataset contains a battery of phonetic errors made by learners of Spanish as a foreign language from nine different native languages, categorized using the ACCFELE taxonomy. The dataset was authored by Ana Blanco Canales and last updated on October 14, 2025.
NASA's International Halley Watch Comet Halley Archive contains infrared image data submitted by scientists. The Infrared Imaging subnetwork includes 66 images of Halley and 29 images of calibration stars. Observations span from March 11, 1985, to May 23, 1986.
Replication data and programs reproduce Tables 1-4 and Figures 3&4 from the paper "Welfare Enhancing Time Consistent Environmental Policies in fixed–numbers and free–entry Oligopolies". The dataset was authored by Petrakis, Basak and Xepapadeas and harvested into Borealis Dataverse. It was last updated on October 13, 2025.
Joe Skeens created this dataset for the Journal of Geodesy paper titled 'A unified model of feed rotation in radio telescopes and GNSS antennas.' The dataset was last updated on October 15, 2025, and is hosted by the Texas Data Repository via the Dataverse platform. It likely contains scripts and data supporting the unified mathematical model described in the paper.
IRSA curates calibrated science products from NASA's infrared and sub-millimeter missions, including five major large-area surveys. Its datasets are cited in about 10% of astronomical refereed papers. The archive provides access to data from missions like Spitzer, 2MASS, IRAS, and COSMOS.
NASA's WISE (now NEOWISE) mission data provides discovery statistics for near-Earth asteroids (NEAs), potentially hazardous asteroids (PHAs), and comets. The dataset includes daily updated counts and individual object records with orbital parameters.
Raw spectrum data files authored by Michael Lanzillotti and hosted by the Texas Data Repository. The dataset was last updated on October 15, 2025. The specific content and scale of the data require verification after download.
Thermo Orbitrap .raw spectrum files for top-down proteomics analysis. The dataset was contributed by Michael Lanzillotti and last updated on October 15, 2025. Its exact size and internal structure are not detailed in the available metadata.
Radio Science data from the Rosetta orbiter's PRELANDING phase, collected between 2014-01-21 and 2014-11-18. It provides Global Gravity measurements for comet 67P/Churyumov-Gerasimenko, specifically covering a 10.6-hour period on 2014-09-27. The dataset was produced by NASA as part of the International Rosetta Mission.
KAGUYA LUNAR SP DERIVED SPECTRA V1.0 is a survey of spectra from small, fresh lunar craters observed by the JAXA Kaguya Spectral Profiler. It provides crater latitude, longitude, FeO and OMAT values, and estimated mineralogical composition for each spectral observation. The dataset is produced by the National Aeronautics and Space Administration and was last updated in August 2025.
Numerically-generated gravitational waveforms for binary black hole systems. The catalog is provided by the National Aeronautics and Space Administration and was last updated in September 2025.
A collection of a cleaned collection of tool-calling and reasoning conversations derived from the hermes_reasoning_tool_use source. It is structured specifically for Axolotl fine-tuning using a chat-based format that organizes interactions into system, user, and assistant roles.
A collection of five datasets converted and tokenized for the Qwen3 large language model. The datasets focus on sequential tool use, single-step reasoning, multi-turn reasoning, and function calling with chain-of-thought. The collection was prepared by author jtl11 and was last updated in August 2025.
Fireball and bolide reports from U.S. Government sensors document exceptionally bright meteors. The data includes event date, time, geographic location, altitude, velocity, and calculated total impact energy. The dataset is maintained by the National Aeronautics and Space Administration and was last updated in May 2025.
Federal Communications Commission data provides daily transfers of ULS 3650 locations with submitted grandfathered wireless protection zone information. The dataset is maintained by the FCC and was last updated on July 9, 2025. Specific row and column counts are not provided in the input.
100,000 observations of celestial objects from the Sloan Digital Sky Survey (SDSS) Data Release 17. The dataset, authored by Allanatrix and last updated in June 2025, contains spectral characteristics for classifying stars, galaxies, and quasars.
Galaxy Zoo volunteer labels for telescope images of galaxies, curated by Mike Walmsley. The dataset includes images and labels for visible features like spiral arms and galaxy-galaxy collisions, intended for training foundation models.
Gelbooru 20250526 Add is a temporary internal dataset created by author NebulaeWis, last updated on 2025-05-27. The description suggests it contains images for re-processing, correction, or supplementing internal systems. Its specific size, structure, and license are not detailed in the provided metadata.
Eyes on the Solar System is a 3D environment developed by NASA JPL and Caltech using data from NASA's Navigation and Ancillary Information Facility. It provides real-time visualization of the solar system and NASA mission trajectories. The tool requires a one-time software download and installation to access its interactive modules.