Loading...
Loading...
Telescope observations, star catalogs, exoplanet surveys, galaxy morphology, gravitational waves, spectroscopy
2,977 datasets
Updated 2025, this dataset is a release from the Sloan Digital Sky Survey (SDSS) Data Release 19, likely containing astronomical objects classified as stars, galaxies, or quasars. The description mentions it includes features like petroR50, extinction, and proper motion, and is updated relative to DR17 with redshift data removed to prevent leakage. The dataset appears to be structured for classification tasks.
Data Management and Sharing Plan for a collaborative research project titled 'Constraining dark matter with globular clusters and stellar streams'. The plan describes the scientific data to be generated and/or used in the research and outlines a strategy for managing and sharing project data. It was authored by Carl Rodriguez and last updated in February 2026.
Presenting a Data Management and Sharing Plan for the PolyCAM research project, which focuses on polymorphic compute architecture for spectrum sensing. The plan describes the scientific data to be generated and outlines a strategy for managing and sharing project data. The author is James Anderson.
A dataset exploring the science, history, and cultural symbolism of prisms and rainbows. It is hosted on Kaggle and is tagged for categorical data and binary classification tasks. The specific source, size, and creation details are not provided.
Offering replication data for a 2026 study on governmental discrimination against religious minorities in Sub-Saharan Africa from 1990 to 2023. It was authored by Fox, Haynes, and Zellman and published in Africa Spectrum.
Kaggle hosts a dataset for classifying Autism Spectrum Disorder (ASD). The dataset likely contains screening results and other variables used for binary classification tasks. Its exact size, features, and origin are unknown.
A collection of scripts for fetching and reducing Hubble Space Telescope observations from the AGEL survey. The scripts were authored by Courtney Watson of Observational Data and last updated in January 2026. The specific data volume and features processed by these scripts are not detailed.
Machine-readable tables contain fiducial parametric model values for astronomical sources and scale height aspect ratios derived from parametric modeling, frank, and rave. The data was produced by Brianna Zawadzki as part of the ALMA survey to Resolve exoKuiper belt Substructures (ARKS). It was last updated in January 2026.
A dataset for research on adaptive spectrum allocation in 6G ultra-broadband networks using federated reinforcement learning. The data likely contains metrics related to network performance and resource allocation strategies. It was sourced from Kaggle and is categorized under the platform's 'Research' tag.
A dataset likely related to the No Language Left Behind (NLLB) project and the COMET metric for evaluating machine translation quality. The dataset is published on Kaggle, but its specific size, contents, and creation details are not provided in the metadata. Further verification is required to confirm the exact data types, volume, and temporal coverage.
Seismic detection data likely related to planetary bodies within the solar system. The dataset is hosted on Kaggle and includes platform tags for Geology and Signal Processing. Specific details on data volume, collection methodology, and temporal coverage are not provided in the available metadata.
SDSS Phase 1 inputs from Bahrain, likely containing astronomical observations or data for the Sloan Digital Sky Survey. The dataset's specific content, scale, and origin require verification after download. It is hosted on the Kaggle platform.
Bahrain's contribution to the Sloan Digital Sky Survey (SDSS) Phase 1, a major astronomical data collection project. The dataset likely contains spectroscopic or photometric data from the SDSS, a foundational resource for astronomy. It was published on Kaggle, but the specific author, organization, and data volume are unknown.
COMETKiwi WMT22 QE Model is a dataset for machine translation quality estimation, published on Kaggle. The dataset likely contains model outputs or training data related to the WMT22 conference. Specific details on size, format, and authorship are not provided in the available metadata.
COMETKiwi2 is a quality estimation model developed for the WMT22 conference. The dataset likely contains model outputs or training data for evaluating machine translation quality. It is hosted on Kaggle, but specific details about its size, format, and creation are not provided in the metadata.
Exoplanet candidate data likely contains observations of potential planets outside our solar system. The dataset is published on Kaggle, but details about its size, source, and specific features are unknown. Its content may include measurements from telescopes or surveys used to identify candidate exoplanets.
COSMIC GenomeScreen v103 is a catalog of somatic mutations in human cancer. The dataset is derived from the Catalogue of Somatic Mutations in Cancer (COSMIC) database. Its specific version number indicates it is a structured release of curated genomic variant data.
Kepler space telescope data contains observations of stellar brightness dips indicating potential exoplanet transits. The dataset is derived from NASA's Kepler mission, which operated from 2009 to 2018. It is a key resource for cataloging planets outside our solar system.
Nemotron-AIQ-Agentic-Safety-Dataset captures a range of novel safety and security contextual risks that can emerge within agentic systems. The dataset was created by NVIDIA and last updated on December 6, 2025. It is used to demonstrate the robustness of NVIDIA's open model, llama-3.3-nemotron-super-49b-v1, when deployed as a research assistant.
1.3 million records of asteroids and comets cataloged by the NASA Jet Propulsion Laboratory. It categorizes small bodies using orbital elements, physical dimensions, and hazard classifications derived from the Small-Body Database.