DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

MSBench: A Large Mass Spectrometry Dataset for Proteomics | DataSalon

Home ChemistryMSBench: A Large Mass Spectrometry Dataset for Proteomics

Chemistry

MSBench: A Large Mass Spectrometry Dataset for Proteomics

Name: MSBench: A Large Mass Spectrometry Dataset for Proteomics
Creator: Gao, Yu
Published: 2026-05-06T21:09:48
Keywords: Biochemistry Data, Mass Spectrometry, Tabular, Proteomics, Foundational Models

by Gao, Yu / Harvard Dataverse·Updated 1mo ago

Available on 1 platform

Description

MSBench is a large dataset containing mass spectrometry data for bottom-up proteomics. The dataset was authored by Yu Gao and is hosted on Harvard Dataverse, with a last recorded update in May 2026. Its primary purpose is to serve as training data for foundational models in the field.

Use Cases

Training foundational AI models based on mass spectrometry data mentioned in the description
Benchmarking computational proteomics methods based on the described data
Developing spectral prediction or peptide identification tools based on the described mass spectrometry data

Strengths

Dataset is described as 'large', suggesting a substantial volume of data for model training
Data is specifically curated for training foundational models, indicating a targeted purpose

Limitations

Description metadata is limited; actual data quality requires manual inspection after download
Column-level documentation is absent; field semantics must be inferred after download

Provenance

Source: Harvard Dataverse
Collection Method: Likely contains experimental mass spectrometry data from bottom-up proteomics workflows.
Time Range: null
Freshness: Last updated 2026 05 06 21:09:48; freshness should be verified
Geography: null

null

Tabular Biochemistry Data Mass Spectrometry Proteomics Foundational Models

Related Datasets

Quality Score

D29

Description

Source

Reputation

Quality Score

D29

Description

Source

Reputation

Access

Community

0 views

Dataset Info

Author: Gao, Yu
Org: Harvard Dataverse
Created: May 6, 2026
Updated: May 6, 2026
Last synced: May 19, 2026

Access

Community

0 views

Dataset Info

Author: Gao, Yu
Org: Harvard Dataverse
Created: May 6, 2026
Updated: May 6, 2026
Last synced: May 19, 2026

MSBench: A Large Mass Spectrometry Dataset for Proteomics

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info