web-archive
Webrecorder / Browsertrix
The maker of Browsertrix and the successor toolchain to Conifer: self-hostable or hosted crawling software that produces WACZ/WARC web-archive files. There is no central public corpus to search — every deployment holds its own captures. Relevant when you want to create high-fidelity archives of sites yourself rather than search existing ones.
No programmatic check — opens the archive’s own search.
Why it’s useful & how it works
Tooling/SaaS, not a queryable public archive. Relevant only if we want to RUN captures (e.g. our own Save-Page feature via ReplayWeb.page/WACZ). Exclude from search fan-out.
What’s inside
Per-tenant; no central public corpus.
API access
Per-deployment REST API (token).
An API key is required — usually free; see the endpoints above for where to get one.
Access
Catalogue link only — open the archive to search it yourself.