This dataset collects function-level information from ArkTS (HarmonyOS Ark TypeScript) projects. It includes original functions, docstrings, abstract syntax tree (AST) representations, obfuscated versions, and source code metadata. The dataset was created by hreyulog and last updated on April 15, 2026.
Use Cases
- Code retrieval tasks based on function-level information and docstrings.
- Code analysis and understanding using the abstract syntax tree (AST) representations.
- Research on AST-based code representations, using the provided ASTs as input.
- Code obfuscation studies based on the included obfuscated versions of functions.
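As an illustration of the docstring-based retrieval use case, the sketch below ranks functions by token overlap (Jaccard similarity) between a query and each docstring. It is a simple lexical baseline, not the method used by the dataset authors. The records and the field names `name` and `docstring` are hypothetical, since the dataset's actual columns are not documented here.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def rank_by_docstring(query, records, top_k=3):
    """Rank records by Jaccard similarity between the query and each docstring."""
    query_tokens = tokenize(query)
    scored = []
    for record in records:
        doc_tokens = tokenize(record["docstring"])
        union = query_tokens | doc_tokens
        score = len(query_tokens & doc_tokens) / len(union) if union else 0.0
        scored.append((score, record["name"]))
    # Highest score first; ties keep their original order (stable sort).
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Hypothetical records; the real field names must be checked after download.
records = [
    {"name": "loadImage", "docstring": "Load an image from a resource path."},
    {"name": "formatDate", "docstring": "Format a date as a string."},
    {"name": "saveImage", "docstring": "Save an image to disk."},
]

for score, name in rank_by_docstring("load image from path", records):
    print(f"{score:.2f}  {name}")
```

A learned embedding model would normally replace the token-overlap score; the ranking loop stays the same.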
Strengths
- Includes multiple representations of code: original functions, docstrings, ASTs, and obfuscated versions.
- The dataset is open-source and links to a paper, evaluation code, and processing code.
- The last-updated date is stated explicitly (2026-04-15).
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- The row count is not stated, which makes it harder to judge whether the dataset suits a given task.
- Descriptive metadata is limited; assessing actual data quality requires manual inspection after download.
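Because column semantics have to be inferred after download, a first pass might simply tabulate which fields appear, what types they hold, and how often they are non-empty. The sketch below does this for a list of dictionaries. The sample rows and the field names `function`, `docstring`, and `ast` are hypothetical placeholders, not the dataset's documented schema.

```python
from collections import defaultdict


def summarize_fields(records):
    """Return, per field name, the value types seen and the count of non-empty values."""
    summary = defaultdict(lambda: {"types": set(), "non_empty": 0})
    for record in records:
        for name, value in record.items():
            summary[name]["types"].add(type(value).__name__)
            # Treat None and empty containers/strings as "empty".
            if value not in (None, "", [], {}):
                summary[name]["non_empty"] += 1
    return {
        name: {"types": sorted(info["types"]), "non_empty": info["non_empty"]}
        for name, info in summary.items()
    }


# Hypothetical rows standing in for downloaded data.
sample = [
    {
        "function": "function add(a: number, b: number) { return a + b }",
        "docstring": "Adds two numbers.",
        "ast": {"type": "FunctionDeclaration"},
    },
    {"function": "function noop() {}", "docstring": "", "ast": None},
]

for name, info in summarize_fields(sample).items():
    print(name, info)
```

Running the same summary over the first few hundred real rows would also give a rough sense of how many docstrings or ASTs are missing.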
Provenance
- Source: hreyulog
- Collection method: likely collected from open-source ArkTS projects.
- Freshness: last updated 2026-04-15