Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,338 datasets
A 25.5 KB dataset of associations between proteins and retinal nerve fiber layer thickness from the UK Biobank study. It was authored by Huangdong Li and last updated on June 2, 2026. The data is shared under a CC-BY-4.0 license.
55.0 MB of raw fluorescence microscopy images showing ROS staining in mouse aortas from a study on the anti-atherosclerotic effects of a polyphenol extract. The dataset, authored by He-Ping Liu and shared under CC-BY-4.0, was last updated in May 2026. Images were generated to investigate lipid plaques, oxidative stress, and CD36 protein expression in ApoE-deficient mice fed a high-fat diet.
He-Ping Liu uploaded raw immunofluorescence images of CD36 protein expression in the aorta on 2026-05-05. The 55.8 MB dataset is associated with a study investigating the anti-atherosclerotic effects of a polyphenol extract in ApoE-deficient mice. The images were generated as part of the in vivo experimental analysis described in the accompanying research article.
6.4 MB of raw cell images from a study investigating the anti-atherosclerotic effects of a polyphenol extract. The dataset contains fluorescence staining images for reactive oxygen species (ROS) from in vitro experiments using a THP-1 macrophage-derived foam cell model. The data was authored by He-Ping Liu and shared under a CC-BY-4.0 license on figshare.
Raw images of cell reactive oxygen species (ROS) fluorescence staining from a study investigating the anti-atherosclerotic effects of a polyphenol extract from the herb Syzygium brachythyrsum. The dataset is 6.4 MB in size, shared under a CC-BY-4.0 license by author He-Ping Liu, and was last updated on May 5, 2026. The study involved in vivo experiments with ApoE-/- mice and in vitro experiments using an ox-LDL-induced THP-1 macrophage-derived foam cell model.
Raw images of liver tissue stained with Hematoxylin and Eosin (H&E) from a study investigating the anti-atherosclerotic effects of a polyphenol extract. The 85.3 MB dataset, authored by He-Ping Liu and last updated in May 2026, was generated from ApoE-deficient mice fed a high-fat diet for 12 weeks. These images were used to examine hepatic lipid accumulation and morphological characteristics.
A study from Yunnan province, China, investigates the anti-atherosclerotic effects of a polyphenol extract from the herb Syzygium brachythyrsum. The dataset contains raw images of Oil Red O and H&E stained aorta and liver tissue from ApoE-/- mice fed a high-fat diet for 12 weeks. He-Ping Liu uploaded the 35.1 MB ZIP file to figshare under a CC-BY-4.0 license, with a last update timestamp of 2026-05-05.
A methodology paper from Lafayette College demonstrates a global sensitivity analysis technique for Anaerobic Digestion Model No. 1 (ADM1). The work transforms time-dependent model outputs using functional principal component analysis (fPCA) for input into Morris' screening design. Results indicate that 95-99% of output variation can be captured by principal components, with the first PC sufficient to represent model outputs.
Growth parameters, foliar nutrient concentrations, and ectomycorrhizal colonization data for two dipterocarp seedling species. The dataset results from a 20-month factorial experiment applying nutrients and fungicide in the Kabili-Sepilok Forest Reserve. It was authored by Francis Brearley and last updated in June 2026.
P3D-Bench provides 1,003 cases for evaluating parametric 3D generation models across three distinct tasks. SpatiaOS released this lightweight benchmark data in 2026, containing UID lists and derived annotations. The data splits include 400 text-to-3D cases, 400 image-to-3D cases, and 203 assembly-level 3D cases.
A collection of agent traces generated with Swival, an agent designed for open-source models. The traces focus on security audits of open-source software. The dataset was uploaded by author jedisct1 and was last updated on June 8, 2026.
The Canberra 1:100,000 Geological Sheet covers approximately 2500 km² of hilly, upland terrain in the Australian Capital Territory and southeastern New South Wales. The bedrock comprises Ordovician to Silurian sediments and acid volcanics which have been invaded by several generations of Silurian intrusions. The dataset is hosted by the Australian Ocean Data Network and was last updated on 2026-06-05.
Field measurements from four plots in Gonarezhou National Park, southeastern Zimbabwe, used to analyze selective foraging by aardvarks. The dataset includes structural attributes, spatial coordinates, and derived cost metrics for termite mounds on basalt and granite substrates. Data were collected by Justice Muvengwi and last updated on April 10, 2026.
A geospatial dataset of provincial Crown Land, including land managed by the Department of Energy. The data is hosted by the Government of New Brunswick on the Socrata platform and was last updated on 2026-05-29. The dataset's row count and temporal coverage are not specified in the available metadata.
Around 89.9k conversation examples for instruction tuning models in Algerian Darija, a dialect characterized by code-switching between Arabic, French, and local expressions. The dataset was created by the awras-ai project and is hosted on Hugging Face. It was last updated on 2026-06-18.
Global Affairs Canada periodically conducts evaluations of its priorities, programs, and projects. The reports serve as a practical management tool for reviewing performance and improving the design and implementation of upcoming initiatives. Each evaluation generates a report, with this collection likely focusing on reconstruction assistance in the Philippines from 2013-14 to 2018-19.
Four CSV files contain outputs and evaluation scores from experiments described in the paper 'MLLM-as-a-Judge for Financial Document Image Machine Translation'. The dataset likely includes translations generated by Gemma models, scores assigned by a judge model, and its reasoning. Yanco Amor and Torterolo Orta published this data via e-cienciaDatos Harvested Dataverse on June 14, 2026.
Bathymetry data for the Port Fairy Wave Energy Site was acquired by Deakin University Marine Mapping lab on March 30, 2021. The survey was conducted from the Motor Vessel Yolla using a Kongsberg EM2040c multibeam echosounder. These data were collected to assess the impact of a wave energy structure placed on the seafloor.
Evaluation reports are generated by Global Affairs Canada to review the performance of its priorities, programs, and projects. The reports serve as a practical management tool to improve the design and implementation of upcoming initiatives. The dataset is published under the OGL-CA-2.0 license and was last updated in May 2026.
February 2005 saw a Fisheries Science Partnership survey of sole and plaice in ICES Divisions VIIf&g in the eastern Celtic Sea. Sixty-four hauls were conducted using twin 4-metre beams and 80 mm mesh cod-ends aboard the commercial beam trawler FV Nellie, off the north coasts of Cornwall and Devon and the Bristol Channel. The dataset likely contains haul-level catch data for these two flatfish species.