Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
41,396 datasets
Greater London Historic Environment Record provides a report on all Historic Environment Records (HERs) for the London Borough of Barnet. HERs are dynamic resources documenting the historic environment of a defined geographic area for public benefit. The data is a snapshot from the Greater London Historic Environment Record (GLHER), intended for personal reference.
2018 emissions from Queensland's stationary energy sector were 77.64 million tonnes of carbon dioxide equivalent (MtCO2e), representing 45% of the state's total. Emissions increased by 20% between 2005 and 2018, driven by growth in mining, exports, population, and economic activity. The dataset is provided by the Queensland Department of Environment, Tourism, Science and Innovation.
Hourly in-situ soil moisture measurements from data loggers in organic soils at two locations: along the Sag River in Alaska and near Red Earth Creek in Alberta. The dataset includes soil moisture probe periods, temperature readings, and calibration coefficients for deriving volumetric moisture content. Provided by NASA, the data spans from July 2017 to July 2021 with some interruptions.
1.8 KB of data describes the discovery of TNG456, a brain-penetrant PRMT5 inhibitor for cancers with MTAP loss. Homozygous deletion of the MTAP gene occurs in 10–15% of all human cancers and up to 50% of high-grade gliomas. The dataset, authored by Kevin M. Cottrell and last updated on 2026-05-18, relates to a compound currently in Phase I/II clinical studies.
39 municipalities in the department of Norte de Santander, Colombia, are covered by this dataset detailing the School Feeding Program (PAE). It lists the number of student beneficiaries in rural and urban educational institutions, sourced from the Colombian open data portal www.datos.gov.co. The data was last updated on 2026-05-18.
NASA's 2006 NAMMA campaign deployed a DC-8 aircraft to capture detailed vertical profiles of pressure, temperature, humidity, and winds over West Africa. These high-resolution GPS-located dropsondes tracked the evolution of African Easterly Waves and Mesoscale Convective Systems from August 7 to September 12, 2006. The data provides a foundational record for studying monsoon dynamics and regional water and energy budgets.
Surface temperatures were measured at a 30-degree view zenith angle with an Everest infrared thermometer and at approximately a 60-degree angle with a Scheduler Plant Stress Monitor at 4 view azimuths. The dataset contains net radiation, incoming and reflected photosynthetically active radiation, shortwave radiation, and reflected and emitted longwave radiation. Measurements were taken using specific instruments including Eppley Precision pyranometers and an infrared radiometer.
Hydro Quebec provides annual data on direct greenhouse gas emission factors for energy sources in Quebec's import and export markets. The dataset covers North-East America, including Quebec, Ontario, New Brunswick, New York State, New England, MISO, and PJM, with values compiled from public sources like EPA eGrid and IESO. It was last updated on April 17, 2026.
Quarterly accountability reports detail the performance of Montreal's 311 Network. The data likely contains processed call and email volumes, online request counts, and performance standards for telephone and email services. Published by the Government and Municipalities of Québec, the latest metadata update was on April 17, 2026.
British Columbia's annual dataset tracks hunting license sales from 2005 to 2024. The Government of British Columbia collects this data through the BC Hunting Online licensing system, summarizing sales by residency type, product category, and specific license type. Each record corresponds to a license year, defined as April 1 to March 31.
Chicago foreclosed rental properties registered with the city's Department of Housing under the Keep Chicago Renting ordinance. The dataset tracks ownership and management details for properties in foreclosure, with property addresses always located within Chicago. It includes records of owner information, management and legal agents, and submission dates for registration.
50 geometric shapes (Triacontominoes) are analyzed with multiple quantitative descriptors. The dataset, created by Ruth Dalton and last updated in May 2026, contains symmetry and compactness measurements. It supports replication of analyses in an accompanying paper and enables further investigation of shape morphology.
Proteomic Insights into the Impact of Drought Stress on Barley Grain Composition and Malt Quality is a dataset from figshare, authored by Qingqing Qin and last updated in May 2026. It contains proteomic analysis results from field-based gradient water deficit experiments on three barley cultivars (Copeland, Synergy, Planet). The study identified 1,057 differentially abundant proteins and links drought stress to changes in grain protein content, kernel weight, and malt quality indicators.
1,057 differentially abundant proteins were identified in a proteomic analysis of three barley cultivars (Copeland, Synergy, Planet) subjected to field-based gradient water deficit experiments. The dataset, authored by Qingqing Qin and last updated in May 2026, links drought-induced changes in grain protein content and kernel weight to alterations in malt quality metrics like friability and β-glucan levels. An optimized malting process with extended steeping and germination was tested to partially alleviate these defects.
Qi Liang published this dataset on figshare in May 2026. It contains experimental data from a study investigating S-palmitoylation's role in nucleus pulposus cell senescence under hypoxic conditions, a model for intervertebral disc degeneration. The dataset is stored in an XLS file and is 5.5 KB in size.
Two prospective cohorts from the UK Biobank, comprising approximately 340,000 and 358,000 participants, were analyzed over a median follow-up of 13.7 years. The study, authored by figshare admin karger, investigates the association between a validated frailty phenotype and the risk of developing various gastrointestinal and hepatobiliary–pancreatic diseases. It was last updated on May 7, 2026, and includes hazard ratios for conditions like irritable bowel syndrome, cirrhosis, and inflammatory bowel disease.
2,414 dark web pages were collected via a custom crawler from the Torch search engine for a comparative analysis of text mining techniques. Jin Gyeong Kim published this dataset on figshare in May 2026, which contains the top 20 keywords extracted using the TF-IDF method. The work evaluates TF-IDF, Eigenvector Centrality, and Word2Vec for extracting investigative keywords related to child sexual abuse materials.
A list of 20 keywords extracted from child sexual abuse material (CSAM) related content on the dark web. The data was collected from 2,414 dark web pages indexed by the Torch search engine and processed using Word2Vec and other text mining techniques. The dataset was created by Jin Gyeong Kim and last updated on 2026-05-08.
A geospatial dataset identifies housing development potentials within Cologne's urban area. The data, produced by the city administration using an in-house algorithm, categorizes areas as building gaps, replenishment, and restructuring potentials. It serves as a technical initial assessment for residential construction feasibility, published under § 200 (3) of the German Building Code.
Marine field tests in the Bohai Sea demonstrated over 90% biofouling suppression after 180 days. The dataset, authored by Xin Guo and last updated in May 2026, describes a nanocellulose-based slippery liquid-infused porous surface (SLIPS) designed for synergistic drag reduction, corrosion protection, and antifouling. It reports performance metrics including up to 30% drag reduction and 95.7% corrosion inhibition in saline water.