web-archive
Arquivo.pt (Portuguese Web Archive)
Portugal's national web archive, preserving billions of files since 1996, including many non-Portuguese sites. Unusually among web archives it offers full-text search over the preserved pages as well as URL lookup, through both its website and free APIs. Turn to it when you need to search inside archived page text rather than only retrieve a known address, or when researching Portuguese and European web history.
Why it’s useful & how it works
FINDING: unreachable from our Indonesian direct IP (network/geo), but 200 via datacenter proxy — so route through the pool. Excellent documented CDX + text APIs, JSON, no key. Documented rate limit 250 req/180s per IP (not 2026-confirmed). High integration value.
What’s inside
Billions of files / hundreds of TB since 1996.
API access
CDX https://arquivo.pt/wayback/cdx?url=&output=json ; full-text https://arquivo.pt/textsearch?q= ; replay https://arquivo.pt/wayback/ <ts>/<url>
Access
Programmatic API access (a key may be required — see the API tag).