Respiratory Chain Gene Mutations Associated with Drug-Resistant Mycobacterium Tuberculosis
by Qiang Ji·Updated 18d ago
1.0 MB1files
Available on 1 platform
Sign in to view source links and access this dataset
Description
13,402 isolates of Mycobacterium tuberculosis were analyzed via whole-genome sequencing to identify mutations linked to multidrug resistance. The dataset, authored by Qiang Ji and last updated in May 2026, contains results from random forest, gradient boosting, and generalized linear mixed models. It identifies specific single nucleotide polymorphisms in genes like atpH, cydA, and qcrB significantly associated with phylogenetic clustering and MDR formation.
Use Cases
Identify genetic markers for multidrug-resistant tuberculosis based on respiratory chain gene mutations.
Train predictive models for drug resistance classification based on whole-genome sequencing data.
Analyze phylogenetic clustering patterns of resistant isolates based on identified single nucleotide polymorphisms.
Strengths
Includes 13,402 Mycobacterium tuberculosis isolates, providing a substantial sample size for analysis.
Identifies specific SNPs in genes like atpH A428G and cydA C942A with statistical significance for MDR.
Released under a permissive CC-BY-4.0 license, facilitating reuse and sharing.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count for the specific table is unknown, which may limit suitability assessment.
Data may reflect bias inherent to the specific collection of isolates sequenced.
Provenance
Source
figshare, author Qiang Ji.
Collection Method
Whole-genome sequencing performed on analyzed strains, with analysis via random forest, gradient boosting decision tree, and generalized linear mixed models.
Time Range
null
Freshness
Last updated 2026-05-20 04:38:20; freshness should be verified.
Geography
null
File is in XLSX format, requiring software like Excel or a compatible library to open.