Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,020 datasets
Australian Ocean Data Network collected this air temperature data set from weather sensors deployed on the NERP Weather Station site Bramble Cay. The data collection began on 16 July 2015. The dataset is hosted on the data.gov.au platform.
Hail data collected by weather sensors deployed on the NERP Weather Station site on Saibai Island. The dataset covers a period from April 27, 2016, to March 5, 2021, and was aggregated by the Australian Ocean Data Network. The data was last updated on June 16, 2026.
Australian wind data collected by weather sensors deployed on the NERP Weather Station site Badu. The data covers a period from 08 May 2018 to 03 Jun 2021 and is provided by the Australian Ocean Data Network.
Wind data collected by sensors deployed on the NERP Weather Station site Thursday Island. The dataset is provided by the Australian Ocean Data Network and was last updated in 2026. The specific temporal coverage likely begins on 08 February 2012.
Salinity data collected by weather sensors deployed on the NERP Weather Station site at Thursday Island. The dataset covers a period from 02 February 2012 to 19 July 2014. The data is aggregated by the Australian Ocean Data Network.
A seamless topographic color map service covering all of Australia, its outer islands, and external territories including the Australian Antarctic Territory. The map integrates data from Geoscience Australia, the Australian Antarctic Division, OpenStreetMap, and other sources, portraying cultural, hydrography, marine, transport, vegetation, and relief themes. This version of the base map does not include any text labels.
Birthelmer et al.'s dataset contains daily diary reports from 85 couples where one partner had experienced a stroke. Participants completed morning and evening reports of anger, sadness, and relationship satisfaction for up to 14 consecutive days, with post-stroke intimacy assessed at study exit. The data was used to analyze actor and partner associations between emotions and relationship satisfaction, published in 2026.
05_HGNC_Mapped_Data contains standardized molecular data files from the TCGA Lower Grade Glioma Python pipeline. Aaliah Aly published this dataset on figshare in May 2026. The files include HGNC-mapped gene expression, copy number alteration, and mutation datasets.
A 5.5 KB Excel file published by Wenfeng Huang on April 24, 2026. It contains inference rules for Burrows-Abadi-Needham (BAN) logic, used for formal security analysis of a blockchain-based authentication scheme for the Internet of Vehicles.
A small dataset containing notations and descriptions for Burrows-Abadi-Needham (BAN) logic formal analysis. It was created by Wenfeng Huang and uploaded to figshare on April 24, 2026. The dataset likely supports the security validation of a blockchain-based authentication scheme for the Internet of Vehicles.
A comparative analysis of blockchain-based authentication schemes for the Internet of Vehicles (IoV). The dataset likely contains performance metrics, such as computational cost, comparing a proposed scheme with existing ones. It was authored by Wenfeng Huang and uploaded to figshare in April 2026.
169 professional psychological questionnaires were used to fine-tune large language models for assessment tasks. The optimized Qwen-2.5 and GLM-4 models showed gains in text generation quality, logical consistency, and cultural adaptability. Zhitao Yuan published this 66.5 KB dataset under a CC-BY-4.0 license in April 2026.
A 2.3 MB ZIP file contains model and training configuration scripts from a study that reimagined psychology questionnaire development using LLM technology. The research fine-tuned the Qwen-2.5 and GLM-4 models on a corpus of 169 professional psychological questionnaires, integrating instruction fine-tuning with human feedback reinforcement. Author Zhitao Yuan uploaded this dataset to figshare under a CC-BY-4.0 license, with a last update timestamp of 2026-04-24.
169 professional psychological questionnaires used to fine-tune large language models for scale development. The dataset, created by Zhitao Yuan and last updated in April 2026, supports research into automating psychological assessment tools. It is a small 11.7 KB XLSX file containing raw data for evaluating model performance on text generation, scientific rigor, and cultural adaptability.
A 12.0 KB Excel file containing raw data for evaluating text generation metrics from a study that fine-tuned Qwen-2.5 and GLM-4 models on a corpus of 169 professional psychological questionnaires. The research, authored by Zhitao Yuan and uploaded to figshare in April 2026, aimed to overcome bias in traditional scale development using instruction fine-tuning and human feedback reinforcement. The optimized models demonstrated gains in BLEU-4, ROUGE-L, logical consistency, and cultural adaptability.
5.5 KB of normalized RHOA mRNA expression data from individual experiments in a regeneration model of L. corallorrhiza, as shown in a related research figure. The dataset, authored by Kseniia V. Skorentseva, includes mean and standard deviation values and was last updated on 2026-05-21. It is provided in XLS format under a CC-BY-4.0 license.
An archive of bacterial artificial chromosome full-length sequences for the giant panda. The sequences were assembled by Flye from ultra-long reads generated by a QiTan Nanopore sequencer. The dataset is 1.1 MB in size and was last updated on June 4, 2026.
Molecular dynamics simulation data investigates the binding of RNA-binding motif protein 45 (RBM45) to unmodified and m6A-modified RNA motifs. The dataset, created by Raeyeon Park and last updated in May 2026, includes simulations of RNA complexes with individual RRM domains and with RRM3 in the context of the full-length protein. It provides biophysical insights into how RBM45 preferentially recognizes m6A-modified motifs through domain synergy.
Molecular dynamics simulations investigate the binding of RNA-binding motif protein 45 (RBM45) to unmodified and N6-methyladenosine (m6A)-modified RNA motifs. The dataset, created by Raeyeon Park and last updated in May 2026, contains 17.3 MB of simulation data in PDB format. It provides biophysical insights into how RRM domains, particularly RRM3, synergize with other protein regions for preferential m6A binding.
Molecular dynamics simulation data for RNA-binding motif protein 45 (RBM45) interacting with RNA sequences. The dataset, created by Raeyeon Park and last updated in May 2026, provides structural insights into how RBM45 preferentially binds N6-methyladenosine (m6A)-modified RNA motifs over unmodified ones. Simulations investigate binding with GACG, GACU, and GACA RNA motifs in complex with individual RRM domains and the full-length protein.