Archivarix · Echo

social-twitter

IA Twitter Stream Grab

A set of Internet Archive bulk dumps of the old Twitter public sample stream, stored as monthly JSON files and covering billions of tweets from roughly 2011 to 2023. You access it by downloading the dataset files, which are listed via the item's metadata API. It is meant for bulk, offline analysis of historical Twitter data rather than looking up an individual tweet; the dataset is frozen, since the source stream no longer exists.

Social API

Why it’s useful & how it works

Frozen bulk dataset (source firehose gone). For offline indexing, not per-tweet live lookup. Metadata API works both ways.

What’s inside

Billions of tweets ~2011–2023; frozen.

API access

https://archive.org/metadata/twitterstream ; files via https://archive.org/download/ <item>/<file>

Access

Programmatic API access (a key may be required — see the API tag).

Homepage

https://archive.org/details/twitterstream