Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,496 datasets
New York's Integrated Energy Data Resource aggregates statewide energy information from electric, gas, and steam utilities. The platform, funded by NYSERDA, provides analytic tools for querying data types like feeder hosting capacity and customer billing. The dataset was last updated in April 2026.
Geochemical measurements from the Northern Denison Trough, Bowen Basin, Australia, assess sandstone reservoir suitability for CO2 storage. Mean maximum vitrinite reflectance ranges from 0.55% to 0.93%, and Rock-Eval Tmax ranges from 421°C to 447°C. The data was collected by the Australian Ocean Data Network as part of the Queensland Government's ZeroGen project.
Data compiled by the World Bank from the International Energy Agency and the Carbon Dioxide Information Analysis Center. It contains indicators on energy production, use, dependency, and efficiency for Belgium. The dataset was last updated on 2026-04-27.
A 2026 study by Hamit Hakan Bekir compares the effects of human umbilical cord-derived mesenchymal stem cells (HucMSCs) and their exosomes on wound healing in 24 Wistar rats with alkaline-induced skin burns. The dataset includes results showing 78.15% wound closure for HucMSC-treated rats versus 57.35% for controls, with histopathological evaluations over 21 days. Supporting files are in PNG, JPG, and DOCX formats, totaling 4.4 MB.
World Bank Group data on energy production, use, dependency, and efficiency for Burundi, compiled from the International Energy Agency and the Carbon Dioxide Information Analysis Center. The dataset is available in CSV format and was last updated on 2026-04-27. It addresses trends in energy use related to economic growth, living standards, and poverty reduction.
Thursday Island weather sensor data provides a time-series record of rainfall measurements starting from 08 February 2012. The dataset was collected by sensors deployed at the NERP Weather Station site and is managed by the Australian Ocean Data Network. It likely contains continuous environmental observations for the region.
World Bank data on energy production, use, dependency, and efficiency for Azerbaijan, compiled from the International Energy Agency and the Carbon Dioxide Information Analysis Center. The dataset is licensed under CC-BY-4.0 and was last updated on HDX in April 2026.
Alexis Comber provides data supporting a submission to the International Journal of Geographical Information Science (IJGIS). The dataset, 326.8 KB in size, includes simulated data and an empirical case study used to develop and test a novel multiscale space-time varying coefficient modeling approach using Generalized Additive Models (GAMs). The data was last updated on May 5, 2026, and is shared under a CC-BY-4.0 license.
World Bank data on Austria's energy production, use, dependency, and efficiency, compiled from the International Energy Agency and the Carbon Dioxide Information Analysis Center. The dataset addresses trends in energy consumption and sustainability challenges for economic growth. It was last updated on 2026-04-27 and is provided under a CC-BY-4.0 license.
A 95.9 KB PDF authored by Varrsan Dindukurthi, last updated on 2026-04-30, describes a graph-centric decision-support framework for nutrition. The framework uses a Neo4j knowledge graph modeling diseases, nutrients, foods, and demographic-specific Recommended Dietary Allowances (RDA). It was evaluated in case studies for anemia, hypertension, and diabetes using diverse user profiles.
Antigua and Barbuda energy and mining data compiled by the World Bank from the International Energy Agency and the Carbon Dioxide Information Analysis Center. The dataset covers topics such as energy production, use, dependency, and efficiency. It was last updated on 2026-04-27 and is provided under a CC-BY-4.0 license.
World Bank Group data on energy production, use, dependency, and efficiency for American Samoa, compiled from the International Energy Agency and the Carbon Dioxide Information Analysis Center. The dataset is available in CSV format and was last updated on 2026-04-27. It addresses trends in energy use related to economic growth, living standards, and poverty reduction.
A catalog of 3,561 blazars and candidate blazars, providing coordinates and multi-frequency data. The 5th edition, maintained by NASA HEASARC, includes 1,151 BZB, 1,909 BZQ, 274 BZG, and 227 BZU sources, with updates based on CDS archives. All sources have a confirmed radio band detection and, with noted exceptions, published spectroscopic information.
Isotope-based measurements from published studies quantify microbial metabolic responses to nitrogen addition across global terrestrial ecosystems. The dataset, compiled by Lei Zhang and shared on figshare in 2026, includes metrics for microbial carbon growth, respiration, nitrogen mineralization, and use efficiencies. It evaluates how ecosystem type, nitrogen addition rate, experimental duration, and soil properties regulate these context-dependent responses.
World Bank data on energy production, use, dependency, and efficiency for Armenia, compiled from the International Energy Agency and the Carbon Dioxide Information Analysis Center. The dataset was last updated on 2026-04-27.
Han Zhao's dataset on figshare contains measurements from 141 poplar saplings across different ontogenetic stages. It quantifies aboveground biomass, biomass allocation, hydraulic resistance partitioning, leaf gas exchange, and water potential. The data supports empirical scaling rules for large-scale vegetation models.
Leonharper's Naime Corpus V1 is a multilingual text dataset for language model pre-training, containing approximately 28.1 billion tokens across over 38 million documents. The data is tokenized using the Qwen3-8B tokenizer and formatted into sequences of length 4096. It was last updated on Hugging Face in May 2026.
Benthic recycling accounted for 63% and 72% of the annualized nitrogen and phosphorus input, respectively, to Port Phillip Bay. Measurements of oxygen, ammonium, nitrate, phosphate, silicate, and other solutes were taken using benthic chambers at various sites during the summers of 1994 and 1995. The data, from Geoscience Australia, distinguishes four bay regions and quantifies nutrient regeneration rates and denitrification efficiency.
Mount Meager, British Columbia, Canada, is the source of ash used in rheology experiments. The dataset contains shear rate sweep measurements for monodisperse ash grain sizes of 500 µm, 250 µm, 125 µm, and 63 µm, tested across a range of volumetric gas flow rates. Data were generated using an Anton Paar MCR302 rotational rheometer with a powder flow cell, funded by NERC Grant NE/W003767/1.
A corpus of 105 articles from The Guardian and The New York Times, compiled by Stela Lechpammer for a chapter in the Bloomsbury volume '30 Years of Pokémon'. The corpus focuses on two key periods, 2016–2017 and 2022–2025, with five additional articles from the early 2000s for historical context. Each record includes article title, publication date, media outlet, URL, and a unique ID for replicability.