social-twitter
IA Twitter Stream Grab
A set of Internet Archive bulk dumps of the old Twitter public sample stream, stored as monthly JSON files and covering billions of tweets from roughly 2011 to 2023. You access it by downloading the dataset files, which are listed via the item's metadata API. It is meant for bulk, offline analysis of historical Twitter data rather than looking up an individual tweet; the dataset is frozen, since the source stream no longer exists.
Why it’s useful & how it works
Frozen bulk dataset (source firehose gone). For offline indexing, not per-tweet live lookup. Metadata API works both ways.
What’s inside
Billions of tweets ~2011–2023; frozen.
API access
https://archive.org/metadata/twitterstream ; files via https://archive.org/download/ <item>/<file>
Access
Programmatic API access (a key may be required — see the API tag).