DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

05_HGNC_Mapped_Data: TCGA Lower Grade Glioma Molecular Data | DataSalon

Home NLP & Text05_HGNC_Mapped_Data: TCGA Lower Grade Glioma Molecular Data

NLP & Text

05_HGNC_Mapped_Data: TCGA Lower Grade Glioma Molecular Data

Name: 05_HGNC_Mapped_Data: TCGA Lower Grade Glioma Molecular Data
Creator: Aaliah Aly
Published: 2026-05-07T02:48:35
License: CC-BY-4.0
Keywords: Gene Expression, ZIP, Multi Omics, CSV, Tcga, Tabular, Copy Number Alteration, Mutations, Excel, Synthetic

by Aaliah Aly·Updated 1mo ago

98.7 MB3files

Available on 1 platform

Description

05_HGNC_Mapped_Data contains standardized molecular data files from the TCGA Lower Grade Glioma Python pipeline. Aaliah Aly published this dataset on figshare in May 2026. The files include HGNC-mapped gene expression, copy number alteration, and mutation datasets.

Use Cases

Integrating expression, CNA, and mutation data based on standardized gene identifiers for multi-omics analysis.
Constructing graph databases or SQL tables based on reliable gene-linked molecular records.
Performing downstream cancer genomics analysis based on datasets with resolved gene naming inconsistencies.

Strengths

Gene identifiers were standardized using official HGNC information, including approved symbols and aliases.
Unmapped or invalid gene entries were filtered out to improve accuracy for downstream integration.
The dataset is 98.7 MB in size and includes CSV and XLSX files ready for integration.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Last updated 2026-05-07 02:49:04; freshness should be verified.

Provenance

Source: Generated from the TCGA Lower Grade Glioma Python pipeline.
Collection Method: HGNC-based gene identifier mapping performed using the hgnc_mapping.py script.
Freshness: Last updated 2026-05-07 02:49:04.

License is CC-BY-4.0.

Tabular ZIP CSV Excel Gene Expression Multi Omics Tcga Copy Number Alteration Mutations Synthetic

Related Datasets

Quality Score

C50

Description

Source

Reputation

Quality Score

C50

Description

Source

Reputation

Access

Community

0 views

Dataset Info

License: CC-BY-4.0
Author: Aaliah Aly
Files: 3
Created: May 7, 2026
Updated: May 7, 2026
DOI
Last synced: May 7, 2026

Access

Community

0 views

Dataset Info

License: CC-BY-4.0
Author: Aaliah Aly
Files: 3
Created: May 7, 2026
Updated: May 7, 2026
DOI
Last synced: May 7, 2026

05_HGNC_Mapped_Data: TCGA Lower Grade Glioma Molecular Data

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info