7,554 Streptococcus suis genomes, including 195 newly sequenced isolates, analyzed by Ruanyang Sun in 2026. The dataset supports research into phylogenetic structure, antimicrobial resistance, virulence factors, and global transmission patterns of this zoonotic pathogen.
Use Cases
- Analyzing phylogenetic structure and global distribution based on cgMLST scheme and UMAP embedding
- Investigating antimicrobial resistance gene (ARG) distribution across different clusters and continents
- Studying virulence factor (VF) profiles across hosts and geographic regions
- Comparing genome reduction patterns in predominant serotypes (2, 1/2, and 9)
- Modeling transmission patterns and identifying epidemic hotspots like China, Europe, and Japan
Strengths
- Includes 7,554 genomes, combining 195 newly sequenced isolates with 7,359 publicly available genomes
- Analysis identifies 20 high-density clusters with significant geographical dispersion
- Focuses on three predominant serotypes (2, 1/2, and 9) with detailed genomic feature comparisons
Limitations
- Column-level documentation is absent; field semantics must be inferred after download
- Row count is unknown, which may limit suitability assessment
- Data may reflect geographic bias inherent to the source collections
Provenance
- Source
- Ruanyang Sun via figshare
- Collection Method
- Combination of newly sequenced isolates (195) and publicly available genomes (7,359)
- Freshness
- Last updated 2026-04-10 06:10:22; freshness should be verified
- Geography
- Global, with specific analysis of clusters in China, Europe, and Japan