Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
40,989 datasets
25.7 MB of code and documents implement the ACLB framework for automated transparency assessment. Authored by Hui Xia and released under CC-BY-4.0, the dataset includes Python code, documents, and spreadsheets related to an empirical study of five major Chinese social media platforms. It was last updated on May 16, 2026.
A 30-meter resolution Digital Elevation Model (DEM) of bathymetry for Northern Australia, compiled in 2018. The dataset was created by Geoscience Australia and the Australian Hydrographic Office, integrating multibeam surveys, LiDAR, satellite data, and an intertidal model. It covers a continental shelf over 400 km wide and approximately 1500 km long, including coral reefs, sand cays, and slope canyons.
GOES-11 satellite imagery at five-minute intervals was used to generate this wind dataset for the Tropical Cloud Systems and Processes (TCSP) mission. The dataset focuses on the 0.65 micrometer visible, 3.9 micrometer infrared, and 10.7 micrometer IR channels to study cyclogenesis. NASA produced this data during July 2005 to analyze the formation and intensification of tropical storms and hurricanes.
A 618.4 KB PDF contains qualitative interview data from four ex-conspiracy theorists. The study, authored by Chanais Matthias and uploaded to figshare in 2026, uses Reflective Thematic Analysis to identify four main themes and eight subthemes related to entering and leaving conspiracy communities.
25 exchange-correlation functionals were investigated for predicting frequency-dependent molecular properties. Dynamic polarizabilities were evaluated at five specific perturbation wavelengths: 632.99, 594.10, 543.52, 514.50, and 325.13 nm. This dataset, authored by Rodrigo A. Mendes and shared on figshare in 2026, contains the results of this computational chemistry study.
Áreas de Protección para la Producción de Alimentos (APPA) are zones designated for food production under Colombian law, constituting a higher-order territorial determinant. The dataset, referenced by Resolution 330 of October 6, 2025, from the Ministry of Agriculture and Rural Development, covers a total area of 3498.43 hectares. It was last updated on May 18, 2026, and originates from the national platform www.datos.gov.co.
Conservation Areas Documentation provides the authoritative source documents for designated areas of special architectural or historic interest in the UK. The dataset includes character appraisals and boundary maps, published to support the Open Digital Planning project. These documents provide the legal and historical context for conservation areas established under the Planning (Listed Buildings and Conservation Areas) Act 1990.
Alicia McDonough's 2026 study profiles renal transporter and channel abundance in 14-week-old male and female spontaneously hypertensive rats (SHR). The dataset, shared on figshare, compares these profiles to normotensive Sprague Dawley rats under high-salt diet and angiotensin II-induced hypertension conditions. It contains 1.6 MB of data, likely in tabular form, supporting integrative analysis of sex differences in hypertensive models.
Northeast China is the focus of this study on the intergenerational transmission of fertility levels. It uses data from the 2022 China Family Panel Studies (CFPS) to examine fertility intentions and behavior, authored by Xiaoxia Sun and last updated in June 2026. The analysis explores how parental attitudes and behaviors influence adult children's fertility decisions.
DuoWikiBias is a novel parallel corpus derived from Wikipedia for Spanish bias classification. The dataset was created by Karla Salas-Jimenez and last updated on 2026-05-13. It is used to evaluate Large Language Models and classical approaches for detecting framing, epistemological, and demographic biases.
Western Port, Australia, is the geographic scope for this dataset, which models the shoreline inundation extent for a 10% Average Exceedance Probability catchment-generated flood under current mean sea level. The data was produced by the Department of Energy, Environment and Climate Action as part of the Local Coastal Hazard Assessment and was last updated on 2026-04-09. It is derived from modelling that considers storm surge and catchment inflows for different sea level rise scenarios.
Land suitability maps for commercial banana cultivation in Colombia's Huila department, created under Agreement 283 of 2017 by UPRA. The dataset categorizes land into high, medium, low, technically unsuitable, and legally unsuitable zones based on physical, socio-ecosystem, and socio-economic criteria at a 1:100,000 scale. Each of the seven TUT maps includes statistics by suitability category for the entire department and by municipality.
Collection 4 Aura OMI data provides hyperspectral surface and underwater UV irradiance measurements from 290 to 399 nm. Each Level-2 netCDF file contains approximately 53 minutes of data from the sunlit portion of a satellite orbit, with about 14 such files generated daily. The dataset includes planar and scalar irradiances and diffuse attenuation coefficients over water, supporting analysis of UV penetration and its environmental impacts.
OMI/Aura Surface UV Irradiance Version 003 (OMUVB) is a Level-2 satellite product providing measurements of ultraviolet radiation reaching the Earth's surface. The dataset contains erythemally weighted daily dose and dose rate, spectral irradiances at specific wavelengths (305, 310, 324, and 380 nm), and ancillary data including cloud optical depth and total column ozone. Data is stored in HDF-EOS5 format, with files representing the sunlit portion of individual satellite orbits, approximately 14 per day.
Pegah Safari published a dataset on figshare in May 2026 detailing the distribution of slot tagsets in annotated data. The data supports research on profile extraction from Persian dialogue systems, a less-resourced language. The dataset is 5.5 KB in size and is available under a CC-BY-4.0 license.
Lina Yuan's 2026 figshare dataset presents simulation results for a joint power-time optimization framework in RFID-based wireless power transfer systems. The data, stored in a 5.5 KB XLS file, compares the proposed method against conventional PID control, reporting metrics like peak efficiency and power fluctuation reduction. The simulation covers heterogeneous IoT sensors operating over distances from 0.5 to 5 meters.
37.6% peak energy efficiency was achieved by the proposed Joint Power-Time Optimization framework for RFID-based wireless power transfer systems. The 5.5 KB XLS file, authored by Lina Yuan and last updated in May 2026, contains simulation results comparing the MPC-based method to conventional PID control. The data likely details performance metrics like power fluctuations and fairness indices for IoT tags operating over 0.5–5 meter distances.
5.5 KB of simulation results validating a joint power-time optimization framework for 915 MHz RFID-based wireless power transfer systems. The dataset, authored by Lina Yuan and last updated in May 2026, likely contains metrics comparing the proposed Model Predictive Control and ADMM-based algorithm against conventional PID controllers. Results described in the associated paper demonstrate a 37.6% peak efficiency and 61.9% reduction in power fluctuations for IoT tags over 0.5–5 meter distances.
Simulation results comparing the performance of a proposed Joint Power-Time Optimization framework against conventional PID control for RFID-based wireless power transfer. The dataset, authored by Lina Yuan and last updated in May 2026, likely contains metrics such as peak efficiency, power fluctuation reduction, and fairness indices derived from the described system models. The data is stored in a 5.5 KB XLS file.
Algorithm 2: Convex TDMA Slot Allocation is a dataset by Lina Yuan, last updated on 2026-05-04. It contains simulation results for a joint power-time resource allocation framework designed for long-range RFID-based wireless power transfer systems. The data, stored in an XLS file of 5.5 KB, validates the proposed optimization's performance metrics.