Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
1,072 non-redundant protein-coding sequences derived from comparative genomic analysis of bacteria frequently reported as probiotics. The dataset, named ProbioSML, was created by Diego Lucas Neres Rodrigues using pangenomic analysis and supervised machine learning models like Random Forest. It was last updated on April 22, 2026, and is available under a CC-BY-4.0 license.
Data is provided in an XLSX file format, requiring compatible software to open.