Full SFS for CHES: Genetic Mutation Contexts and Derived Allele Counts
by Deepjyoti Ghosh · Updated 8d ago
106.4 MB · 2 files
Available on 1 platform
Description
A dataset of genetic mutation contexts and derived allele counts from a sample of 1,000,000 haplotypes. It includes columns for mutation type, ancestral and changed trinucleotide contexts, CpG site methylation levels, and derived allele counts. The associated mut_rates.csv file provides point mutation rate estimates for each unique context triplet, and the analysis code is available on GitHub.
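As a rough illustration of how a site frequency spectrum (SFS) could be tallied from the derived allele counts, the sketch below streams a gzipped CSV and counts sites per derived allele count. It assumes one row per site and a column named `derived_allele_count`; both are guesses, since the real header is not documented here, and the in-memory sample values are made up.

```python
import csv
import gzip
import io
from collections import Counter

# Hypothetical column name; check the real header after download.
COUNT_COLUMN = "derived_allele_count"


def sfs_from_gzipped_csv(fileobj, count_column=COUNT_COLUMN):
    """Stream a gzipped CSV and count sites per derived allele count.

    `fileobj` may be a path or a binary file object. Assumes one row per site.
    """
    sfs = Counter()
    with gzip.open(fileobj, mode="rt", newline="") as handle:
        for row in csv.DictReader(handle):
            sfs[int(row[count_column])] += 1
    return sfs


# Tiny in-memory stand-in for the real file (made-up values).
raw = "context,derived_allele_count\nACG,1\nACG,1\nTCG,3\nAAT,1\n"
buffer = io.BytesIO(gzip.compress(raw.encode("utf-8")))
demo_sfs = sfs_from_gzipped_csv(buffer)
print(sorted(demo_sfs.items()))  # [(1, 3), (3, 1)]
```

Streaming row by row avoids loading the whole file into memory, which may matter for a compressed file of this size.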
Use Cases
Inferring mutation rates based on trinucleotide context and methylation level data.
Modeling selection using derived allele counts from a large sample of haplotypes.
Analyzing the relationship between CpG methylation and mutation patterns.
Benchmarking new inference methods against the provided gamma distribution estimates.
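One simple way to start exploring the link between methylation and allele counts is to compare the share of singletons (sites with a derived allele count of 1) across methylation groups. The sketch below is only a descriptive summary, not a selection model, and differences between groups can reflect mutation rate as well as selection. The keys `methylation_level` and `derived_allele_count` and the sample rows are hypothetical.

```python
from collections import defaultdict


def singleton_fraction_by_group(rows, group_key, count_key):
    """Return, for each group, the fraction of sites whose derived count is 1."""
    totals = defaultdict(int)
    singletons = defaultdict(int)
    for row in rows:
        group = row[group_key]
        totals[group] += 1
        if int(row[count_key]) == 1:
            singletons[group] += 1
    return {group: singletons[group] / totals[group] for group in totals}


# Made-up rows using hypothetical column names.
demo_rows = [
    {"methylation_level": "high", "derived_allele_count": "1"},
    {"methylation_level": "high", "derived_allele_count": "5"},
    {"methylation_level": "low", "derived_allele_count": "1"},
    {"methylation_level": "low", "derived_allele_count": "1"},
]
fractions = singleton_fraction_by_group(
    demo_rows, "methylation_level", "derived_allele_count"
)
print(fractions)  # {'high': 0.5, 'low': 1.0}
```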
Strengths
Includes data from 1,000,000 haplotype samples for derived allele counts.
Provides mutation rate estimates (mean, standard deviation, point rate) for unique context triplets.
Associated analysis code is publicly available on GitHub for reproducibility.
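If the mean and standard deviation reported for each context triplet summarize a gamma distribution (an assumption here, suggested by the gamma estimates mentioned under Use Cases), the shape and scale parameters can be recovered by moment matching: shape = (mean / sd)^2 and scale = sd^2 / mean. The function name and the example numbers below are illustrative only.

```python
def gamma_from_moments(mean, sd):
    """Convert a mean and standard deviation to gamma (shape, scale).

    Uses the identities mean = shape * scale and variance = shape * scale**2.
    """
    if mean <= 0 or sd <= 0:
        raise ValueError("mean and sd must both be positive")
    shape = (mean / sd) ** 2
    scale = sd ** 2 / mean
    return shape, scale


# Illustrative numbers, not taken from mut_rates.csv.
shape, scale = gamma_from_moments(2.0, 1.0)
print(shape, scale)  # 4.0 0.5
```

The resulting pair uses the shape/scale convention; libraries that take a rate parameter instead would need 1 / scale.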
Limitations
The row count is not reported, so the number of records must be checked after download before judging suitability.
Column-level documentation is absent for some files; field semantics must be inferred after download.
Provenance
Source
Deepjyoti Ghosh via figshare
Freshness
Last updated 2026-05-28 20:22:55; freshness should be verified.
License is CC-BY-4.0. The primary data file is compressed (.csv.gz).