More than 300,000 oxygenase genes, primarily monooxygenases and dioxygenases, were identified from a global ocean gene dataset. The dataset comprises approximately 200 million predicted genes from global ocean metagenomes. It was authored by Changfei He and last updated on figshare in April 2026.
Use Cases
- Identify and classify oxygenase enzyme families based on predicted gene sequences.
- Study the distribution and diversity of monooxygenases and dioxygenases in global ocean microbiomes.
- Model the potential role of microbial enzymes in marine carbon cycling processes.
- Benchmark gene prediction and annotation pipelines for large-scale metagenomic data.
Strengths
- Contains a large-scale collection of ~200 million predicted genes from global ocean samples.
- Focuses on a specific, functionally relevant subset of more than 300,000 oxygenase genes.
- Data is openly available under a CC-BY-4.0 license, facilitating reuse.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count for the specific oxygenase subset is unknown, which may limit suitability assessment.
- Description metadata is limited; actual data quality and completeness require manual inspection.
Provenance
- Source
- figshare, author Changfei He.
- Collection Method
- Constructed from global ocean metagenomes; genes were predicted and oxygenases were identified.
- Freshness
- Last updated 2026-04-21 09:14:14; freshness should be verified.
- Geography
- Global ocean coverage.