Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
CrossER is a benchmark dataset for context-dependent cross-system entity resolution where surface features are deliberately misleading. Match pairs average only 0.29 string similarity, while non-match pairs average 0.94 similarity, simulating real enterprise scenarios. The dataset was created by author smurthy5 and was last updated on Hugging Face in June 2026.
License is unknown; terms of use must be verified before application.