Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,668 datasets
Cong Hien Dinh's dataset compares cleaning time and energy consumption for different coverage strategies in aircraft cabin cleaning, specifically for a 100-seat scenario. The data is stored in an XLS file sized 5.5 KB and was last updated on May 28, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
A 5.5 KB Excel file compares cleaning time and energy consumption for different coverage strategies in aircraft cabin cleaning. Author Cong Hien Dinh published the dataset on figshare under a CC-BY-4.0 license, with a last update timestamp of 2026-05-28. The data focuses on a specific 50-seat aircraft scenario.
Treatment conditions for processing cockroach and prawn exoskeletons. The dataset is a 5.5 KB Excel file authored by Adebowale E. Aderogba and last updated on 2026-05-28. It is shared under a CC-BY-4.0 license on figshare.
Detailed parameters of a distributed generation grid-connected system are provided in this dataset. Wen Sun authored the dataset, which is available under a CC-BY-4.0 license. The dataset was last updated on 2026-05-28.
A 9.5 KB Excel data matrix for numerical taxonomy, contributed by Maad S. Ytemi. The characters used for classification are morphological, anatomical, and palynological. The dataset is licensed under CC-BY-4.0 and was last updated on May 28, 2026.
A 5.5 KB Excel file summarizes the morphological characteristics of Commicarpus species found in Saudi Arabia. The dataset was authored by Maad S. Ytemi and is available under a CC-BY-4.0 license. It was last updated on May 28, 2026.
13.5 KB of experimental and theoretical vibrational frequency data for the hydroxychloroquine molecule. The dataset includes potential energy distribution (PED) analysis and comparisons with existing literature. It was authored by Aliye Demet Demirag and is available as an XLS file.
A 9.5 KB Excel file summarizing fixed effects for a generalized linear mixed model analyzing relative strength and distance. The dataset was authored by Zongwei Chen and last updated on May 28, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
5.5 KB of statistical analysis results focusing on factors associated with contamination. The dataset, authored by Fumiya Miyako, is available as an Excel file under a CC-BY-4.0 license and was last updated on May 28, 2026. It likely contains the output of a univariable generalized linear mixed model analysis.
Decker's map work initiated state surveying for the Prussian General Staff. The collection includes 9 facsimile map sheets of the Berlin area, reproduced from originals held by the Staatsbibliothek zu Berlin. The total area extends from Kremmen and Oranienburg in the north to Mittenwalde and Storkow in the south.
Nine facsimile map sheets covering the area around Berlin, reduced from the original Prussian General Staff survey. The total area extends from Kremmen and Oranienburg in the north to Mittenwalde and Storkow in the south, and from Nauen and Ketzin in the west to Strausberg in the east. These originals from the holdings of the Staatsbibliothek zu Berlin were reproduced by the Landesvermessung und Geobasisinformation Brandenburg (LGB).
An in silico reconstruction of the theoretical peptide landscape from proteolytic cleavage of beta-synuclein. The dataset includes wild-type and disease-associated variants (V70M and P123H), with each fragment annotated with physicochemical properties. It was created by Axel Petzold and last updated on 2026-04-13.
Brandenburg province in Prussia was mapped between 1816 and 1821. The resulting 667 'square mile sheets' at 1:25,000 scale were later reduced to 1:50,000 under Carl von Decker, with 9 sheets covering the area around Berlin published as 'Umgebung von Berlin'. These facsimile prints are reproduced from originals held by the Staatsbibliothek zu Berlin.
Global Affairs Canada periodically conducts evaluations of its priorities, programs, and projects. Each evaluation generates a report intended as a management tool for reviewing performance and improving future program design and implementation. The reports are published in HTML format under the OGL-CA-2.0 license.
Queensland's threatened fauna species data shows a total of 228 species listed as vulnerable, endangered, or presumed extinct between 2007 and 2019. The dataset was published by the Queensland Department of Environment, Tourism, Science and Innovation. It indicates that one species has been listed as presumed extinct since 2017.
Evaluation reports from Global Affairs Canada assess the performance of its priorities, programs, and projects in the Latin American and Caribbean region. The reports serve as a practical management tool for reviewing program implementation and improving future initiatives. Each evaluation results in a published HTML report, with the collection last updated on 2026-05-21.
A 4.5-year seabed mapping project from Bellambi Point to Stanwell Park, NSW, captured detailed 5-meter resolution bathymetry and backscatter data using an R2Sonic 2022 multibeam sonar. This 32-bit floating point GeoTIFF dataset provides a baseline for seabed type distribution and was processed through a rigorous pipeline including Hypack, Qimera, and FMGT software. The survey was funded by the NSW government's SeabedNSW program as part of coastal reforms and habitat mapping initiatives.
Culturay Kk V1 is a Kazakh-language text dataset derived from the CulturaY multilingual corpus built from Internet Archive data. The subset underwent a multi-stage cleaning and quality-filtering pipeline. The dataset was prepared by author 'salyamq' and last updated on June 13, 2026.
26,000 line-kilometres of Total Magnetic Intensity (TMI) data were acquired for Geoscience Australia in 2008/2009. This processed data measures variations in the Earth's magnetic field to reveal sub-surface geological structure. The data underwent quality checks by GA geophysicists to ensure it is fit-for-purpose.
Experimental data from a study on the biocontrol efficacy of the endophytic fungus Fusarium lateritium strain A1 against apple fungal pathogens. The dataset likely contains results from in vitro assays, bioassays on detached plant material, field experiments, and transcriptome analyses. The dataset was uploaded by Xiuna Guo to figshare in April 2026.