Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Harisundar R developed PALL, an open training corpus for a dental-domain Llama-3.1-8B model. It contains three subsets totaling at least 405,959 text entries, designed to cover the full CPT → SFT → DPO post-training pipeline. The dataset was last updated on 2026-06-12.
License is CC-BY-NC-4.0 for the composite corpus, but individual sources may carry additional terms.