Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A text corpus for language modeling in automatic speech recognition systems, created by Jiejie and hosted on Hugging Face. The dataset was last updated in February 2022. Its size is categorized as 1K to 10K entries.
Data format is Parquet, requiring compatible tools like Polars or Pandas for loading.