583 million records form a complete mirror of the OpenAlex knowledge graph, including works, authors, and institutions. The dataset, uploaded by Mearman, provides a pre-built graph index and tile repository for client-side visualization. It was last updated on Hugging Face in February 2026.
Use Cases
- Analyze publication trends and research impact based on the scholarly works subset.
- Map academic collaboration networks based on author and affiliation linkages.
- Build visual exploration interfaces for academic literature using the pre-built graph index and tile repository.
- Conduct large-scale bibliometric studies across the complete set of entities.
Strengths
- Contains approximately 583 million total records across all entity types.
- Includes a substantial subset of roughly 449 million scholarly works.
- Provides a pre-built graph index and tile repository for visualization, which suggests a structured format.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- OpenAlex
- Collection Method
- Complete mirror of the OpenAlex knowledge graph.
- Time Range
- null
- Freshness
- Last updated 2026-02-02 06:56:35; freshness should be verified.
- Geography
- null