code-archive
Software Heritage
A universal archive of source code — the 'Library of Alexandria of code' — preserving over 20 billion unique source files from more than 350 million projects. You can search for projects, look up files by hash (sha1/sha256/git), and resolve permanent SWHID identifiers, on the website or through its keyless REST API. The place to go when a repository has vanished from its original host, or when you need to identify a file by its hash.
Why it’s useful & how it works
Standout. A keyless JSON API that covers three jobs in one place: file lookup by hash (sha1/sha256/git), project and repository search, and SWHID resolution. Anonymous requests are rate-limited; an authentication token raises the limits.
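Looking a file up by hash starts with computing the hash locally. The sketch below is one way to do that in Python with only the standard library. The function name `file_hashes` is made up for illustration, and "git" is taken here to mean the git blob hash (SHA-1 over a `blob <size>\0` header plus the file bytes); how the API names that hash type is not stated in this entry.

```python
import hashlib


def file_hashes(data: bytes) -> dict:
    """Return the sha1, sha256 and git blob hashes of a file's bytes.

    Hypothetical helper for illustration; not part of any Software
    Heritage client library.
    """
    # git hashes a blob as SHA-1 over "blob <size>\0" followed by the content.
    git_header = b"blob " + str(len(data)).encode("ascii") + b"\0"
    return {
        "sha1": hashlib.sha1(data).hexdigest(),
        "sha256": hashlib.sha256(data).hexdigest(),
        "git": hashlib.sha1(git_header + data).hexdigest(),
    }
```

For a file on disk, `file_hashes(open(path, "rb").read())` gives the three hex digests; very large files would be better hashed in chunks.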
What’s inside
20B+ unique source files; 350M+ projects.
API access
https://archive.softwareheritage.org/api/1/ — /content/sha1:<h>/, /content/sha256:<h>/, /origin/search/<text>/, /resolve/<SWHID>/
Access
Programmatic API access. No key is needed for anonymous, rate-limited requests; an optional token raises the limits (see the API tag).