Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A 226.5 MB dataset compiled from biodiversity occurrence records with textual field notes and spatial information. It was created by Mahsa Hadikhah Mozhdehi for the paper "Habitat Inference from Biodiversity Field Notes" and last updated in June 2026. The main text column, `aggregated_text`, contains aggregated habitat descriptions for each spatial grid cell.
License is CC-BY-4.0. The accompanying code is in a linked GitHub repository; to reproduce experiments, download the dataset and update the `DATA_PATH` variable.