This dataset contains between 10,000 and 100,000 expert-annotated sentences for token-level acronym identification in the scientific domain. It was created by Amirveyseh for the AAAI-21 Workshop on Scientific Document Understanding and includes standardized training, validation, and test splits.
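To make "token-level" concrete, here is a minimal sketch of how per-token labels can be turned back into acronym and long-form spans. It assumes a BIO-style tag scheme with tags named `B-short`, `I-short`, `B-long`, `I-long`, and `O`; those tag names, the function `extract_spans`, and the example sentence are illustrative assumptions, not taken from the dataset description above.

```python
from typing import List, Tuple


def extract_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Collect (kind, text) spans from BIO-style tags such as B-short / I-long."""
    spans: List[Tuple[str, str]] = []
    current_kind = None       # kind of the span being built, e.g. "long"
    current_tokens: List[str] = []

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always starts a new span; close any open one first.
            if current_kind is not None:
                spans.append((current_kind, " ".join(current_tokens)))
            current_kind = tag[2:]
            current_tokens = [token]
        elif tag.startswith("I-") and current_kind == tag[2:]:
            # An I- tag of the same kind extends the open span.
            current_tokens.append(token)
        else:
            # "O", or an I- tag that does not continue the open span:
            # close the open span and treat this token as outside.
            if current_kind is not None:
                spans.append((current_kind, " ".join(current_tokens)))
            current_kind = None
            current_tokens = []

    # Flush a span that runs to the end of the sentence.
    if current_kind is not None:
        spans.append((current_kind, " ".join(current_tokens)))
    return spans


# Hypothetical example sentence and labels (not drawn from the dataset).
tokens = ["We", "train", "a", "convolutional", "neural", "network", "(", "CNN", ")", "."]
tags = ["O", "O", "O", "B-long", "I-long", "I-long", "O", "B-short", "O", "O"]
print(extract_spans(tokens, tags))
```

Each token carries exactly one label, so a sentence is stored as two parallel sequences of equal length; the helper above only regroups them for reading.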
It is released under the MIT license. The data is stored as Parquet files, so reading it requires a Parquet-compatible tool such as pandas or Polars.