A table of non-synonymous single nucleotide polymorphisms (nsSNPs) consistently classified as damaging by three independent functional prediction tools: SIFT, PolyPhen-2, and PROVEAN. The dataset is 14.4 KB in size and was authored by Arwa Ibrahim Alwabran. It was last updated on 2026-05-21.
Use Cases
- Benchmarking new variant effect prediction tools based on consensus classifications from established methods.
- Identifying high-confidence deleterious variants for downstream genetic association studies based on multi-tool agreement.
- Training machine learning models for variant prioritization based on labels derived from expert tool consensus.
Strengths
- Consistent classification from three established prediction tools (SIFT, PolyPhen-2, PROVEAN) likely increases confidence in labels.
- Dataset is openly licensed under CC-BY-4.0, facilitating reuse and redistribution.
Limitations
- Row count is unknown, which may limit suitability assessment.
- Column-level documentation is absent; field semantics must be inferred after download.
Provenance
- Source
- figshare
- Freshness
- Last updated 2026-05-21 17:28:28; freshness should be verified.