Supplementary file 1_Complete genome sequencing of Lactobacillus pentosus HP-B1718 and ide
by Zhi-Wen Tan·Updated 16d ago
3.7 MB1files
Available on 1 platform
Sign in to view source links and access this dataset
Description
A 3.7 MB DOCX file uploaded on 2026-05-20 contains the complete genome sequence of the probiotic strain Lactobacillus pentosus HP-B1718. The genome is 3,257,491 bp with a GC content of 44.58% and includes a plasmid, 3,081 CDS, and various RNA genes. The study identifies the Aryl-phospho-beta-D-glucosidase (ApgA) enzyme responsible for converting liquiritin to liquiritigenin, with a reported conversion rate of 99% under optimized conditions.
Use Cases
Comparative genomic analysis of Lactobacillus pentosus strains based on the provided complete genome sequence.
Identifying genes related to probiotic properties and stress tolerance as mentioned in the genomic evaluation.
Studying the enzymatic function and optimization of Aryl-phospho-beta-D-glucosidase (ApgA) for biotransformation processes.
Screening for similar biotransformation enzymes in other microbial genomes using the identified apgA gene sequence.
Strengths
Provides a complete, circular chromosome sequence of 3,257,491 bp with detailed annotation counts (3,081 CDS, 67 tRNAs, 16 rRNAs).
Includes data on a separate circular plasmid (53,560 bp, 38.68% GC content).
Reports a specific, high enzymatic conversion rate of 99% for the key reaction under studied conditions.
Limitations
The primary data file is a DOCX document, which likely contains textual analysis rather than raw, machine-readable genomic data files (e.g., FASTA, GFF).
Row count and column-level documentation for any underlying data tables are unknown.
Data freshness should be verified, as the last update date is in the future (2026-05-20).
Provenance
Source
figshare, authored by Zhi-Wen Tan.
Collection Method
Complete genome sequencing technology and subsequent bioinformatic analysis (BLAST, genome annotation) as described.
Freshness
Last updated 2026-05-20 13:02:50.
Dataset is a 3.7 MB DOCX file; users may need to extract underlying data or sequence accession numbers from the document text. Licensed under CC-BY-4.0.