Particle physics, nuclear physics, condensed matter, plasma physics, optics, acoustics, quantum mechanics
6,342 datasets
Theprint created a dataset of 80,000 entries named after Python creator Guido van Rossum. It was assembled using DataMix to combine several highly-rated Python-centric datasets, providing a sampling from each. The dataset was last updated on Hugging Face in November 2024.
1,587,709 samples of chemical reactions, with a median of 7 molecules per sample. The dataset was created by IDEA-AI4S and was last updated on Hugging Face in October 2024. It is derived from USPTO patent data, likely containing interleaved representations of reaction sequences.
A Web Feature Service (WFS) for the original land-use plan BPlan 017-Osterscheps of the municipality of Edewecht, provided by the Bundesamt für Kartographie und Geodäsie. It is formatted in XPlanGML Version 5.1.2, a standard for exchanging spatial planning information in Germany. The dataset was last updated on November 7, 2024.
A Web Map Service (WMS) providing the original land-use plan BPlan 017-Osterscheps for the municipality of Edewecht. The data is formatted according to the INSPIRE PLU Version 4.0.1 standard and is published by the Bundesamt für Kartographie und Geodäsie. The dataset was last updated on November 7, 2024.
September 2024 data from the PACE-PAX airborne campaign over Southern and Central California. The Laser Imaging Nephelometer (LI Neph) was flown on the CIRPAS Twin Otter aircraft to collect in-situ atmospheric measurements. This dataset was produced by LARC_CLOUD for validating and refining data products from the PACE satellite mission.
Licensee Event Reports (LERs) document operational incidents at U.S. nuclear power plants, collected by the Nuclear Regulatory Commission. The reports cover events like transients and plant trips, searchable by plant name, event dates, characteristics, and keywords.
Nuclear Regulatory Commission data lists significant enforcement actions, known as 'escalated' actions, issued to licensees, individuals, and non-licensees for non-compliance. The dataset includes tags for action types such as Civil Penalty, Confirmatory Order, and Severity Level, and covers violations related to Reactor, Materials, and Fuel Cycle Facilities.
A lookup file mapping Lower layer Super Output Areas from December 2011 to their December 2021 counterparts and Local Authority Districts as of December 2022 in England and Wales. It includes a 'change indicator' field categorizing the relationship between 2011 and 2021 LSOAs as unchanged, split, merged, or complex. The dataset was last updated on August 8, 2024.
Map layer images of Sargassum density sourced from the University of South Florida Optical Oceanography Laboratory. The data is reprojected for web mapping and maintained by NOAA, with a last update in October 2024. Specific row counts and column features are not detailed in the provided metadata.
Homicide Alpaca is an Alpaca-styled export of the homicide-investigation dataset, created by 'theprint' and last updated on August 23, 2024. It contains 11,103 entries of question-answer pairs, with an average question length of 91 characters and an average answer length of 775 characters. The dataset includes confidence scores for the entries, with a top score of 0.74 and a median of 0.46.
CleverBoi aggregates several datasets focused on logic, inference, empathy, math, and coding into a unified Alpaca instruction format. The collection was created by user 'theprint' and last updated on August 26, 2024. Its constituent datasets include LogicInference_OA, Evol-Instruct-Python-26k, Open-Platypus, and python_code_instructions_18k_alpaca.
Ongoing land consolidation procedures in the German state of Rhineland-Palatinate, published by the Bundesamt für Kartographie und Geodäsie. The data is served via WMS and was last updated on June 30, 2024. These publications do not themselves provide grounds for appeal; appeals are based on the notices in the official municipal publication bodies.
A dataset of material samples digitized at two scales using a hemispherical light dome. The data includes raw captures and optimized parameters for geometric properties, anisotropic reflectance, and transmittance, released by author Elena Garces in May 2024. The digitization process leverages polarized directional lighting and neural networks to propagate microscopic details to a mesoscopic representation.
MATLAB code and micrographs for quantitative light optical microscopy analysis developed by Pilar Fernandez-Pison et al. (2021). The dataset supports research on the flow and fracture of austenitic stainless steels at cryogenic temperatures, specifically for AISI 304L tested at 4 K to a maximum strain of 16%. The work involved institutions in Spain (University Carlos III of Madrid) and Switzerland (CERN).
A 2021 dataset containing MATLAB code and supporting test data for determining elasto-plastic fracture toughness. It implements four JR-curve construction approaches described in the associated research paper. The data was authored by Pilar Fernández-Pisón and originates from institutions in Spain and Switzerland.
550 complete broadcast soccer games from major European leagues form the core of SoccerNet-V3, a large-scale dataset for video understanding tasks. It has evolved to support challenges in action spotting, camera calibration, and player re-identification. The dataset is maintained by Voxel51 and was last updated in May 2024.
Raw simulation data of the near plume of an SPT-100 Hall Effect thruster, obtained using the 3D code EP2PLUS. The data was generated by Filippo Cichocki and was last updated on May 5, 2024. The code uses a hybrid particle-in-cell/fluid formulation, and the simulations assess the 3D effects of neutralizer position on plasma properties.
Raw simulation data from the 3D code EP2PLUS, time-averaged over 0.2 ms periods at a final simulation time of 1.6 ms. The data models the near plume of an SPT-100 Hall Effect thruster with a central neutralizer to assess 3D effects on plasma properties. It was authored by Filippo Cichocki and last updated in May 2024.
A 3D simulation of the near plume of an SPT-100 Hall Effect thruster, created by Filippo Cichocki and harvested on 2024-05-05. The data was generated using the EP2PLUS code with a hybrid particle-in-cell/fluid formulation to assess the 3D effects of neutralizer position. Results are time-averaged over the final simulation period of 1.6 ms.
AERONET data from the Juan Carlos I base on Livingston Island provides columnar aerosol optical properties, water vapor, and microphysical parameters via sun-lunar photometry. The dataset includes aerosol optical depth at multiple wavelengths, Ångström exponent, single scattering albedo, and volume size distribution. It is part of a NASA and PHOTONS network project, with data from 2024 available at Level 1.0, 1.5, and quality-assured Level 2.0.