DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Tr Law Orige5 Hdbscan 45K: Turkish Legal Text Clustering Dataset | DataSalon

Home Government & LegalTr Law Orige5 Hdbscan 45K: Turkish Legal Text Clustering Dataset

Government & Legal

Tr Law Orige5 Hdbscan 45K: Turkish Legal Text Clustering Dataset

Name: Tr Law Orige5 Hdbscan 45K: Turkish Legal Text Clustering Dataset
Creator: erdem-erdem
Published: 2025-11-11T17:17:13
Keywords: Turkish Language, Legal Text, Text, Clustering

by erdem-erdem·Updated 7mo ago

Available on 1 platform

Description

45,000 Turkish legal documents processed with the HDBSCAN clustering algorithm, uploaded to Hugging Face by author 'erdem-erdem'. The dataset was last updated on November 11, 2025. Its columns and specific content are not described in the available metadata.

Use Cases

Benchmarking text clustering algorithms on Turkish legal corpora (inferred from domain, verify after download)
Analyzing topic distribution within a collection of Turkish legal documents (inferred from domain, verify after download)
Fine-tuning language models for the Turkish legal domain (inferred from domain, verify after download)

Strengths

Published on the Hugging Face platform, facilitating access for the ML community.
Last updated on 2025 11 11 17:17:29, indicating recent maintenance.

Limitations

Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.

Provenance

Source: huggingface
Collection Method: Uploaded by author 'erdem-erdem'; specific collection method is unknown.
Time Range: null
Freshness: Last updated 2025-11-11 17:17:29.
Geography: null

null

Text Turkish Language Legal Text Clustering

Related Datasets

Quality Score

D27

Description

Source

Reputation

Quality Score

D27

Description

Source

Reputation

Access

Community

5 downloads

1 likes

0 views

Dataset Info

Author: erdem-erdem
Created: Nov 11, 2025
Updated: Nov 11, 2025
Last synced: Apr 16, 2026

Access

Community

5 downloads

1 likes

0 views

Dataset Info

Author: erdem-erdem
Created: Nov 11, 2025
Updated: Nov 11, 2025
Last synced: Apr 16, 2026

Tr Law Orige5 Hdbscan 45K: Turkish Legal Text Clustering Dataset

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info