Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Software Heritage is the largest existing public archive of software source code and development history. The dataset is a fully deduplicated Merkle DAG representation linking file content, directories, commits, and repository states from major forges, distributions, and package managers. Author and committer information is anonymized.
License is listed as 'other', requiring review before use.