Twitter shut off API access; users volunteering their own data for an open API
Motivation for a Community Twitter Archive
- Goal: rebuild useful API-like access by letting users donate their own Twitter data exports.
- Seen as a way to escape Twitter’s lockdown, preserve conversation history, and enable new tools (e.g., question-answering over someone’s tweet history).
- Some frame it as helping people migrate off Twitter while keeping their “living wiki” of posts and threads.
Data Storage, Cost, and Infrastructure
- Skepticism about “just put everything in S3” as “cheap.”
- Suggestions to avoid big clouds due to bandwidth costs; proposals for dedicated servers (e.g., unmetered storage boxes) or S3‑compatible object storage / self-hosted systems.
API Lockdowns, Scraping, and the Changing Web
- Widespread frustration with Twitter/Reddit-style API restrictions and high pricing; many foresee a return to heavy web scraping.
- Others argue operators understandably don’t want AI companies and scrapers to extract huge value from their data for free.
- Historical context: early Twitter had RSS, SMS posting/alerts, and a more open API; shutdowns are tied to ad/engagement models.
Privacy, Abuse Risks, and Cognitive Security
- Concern that making identifiable archives queryable enables targeted phishing and other manipulation.
- Calls for explicit consent and clear warnings that data may be permanently mirrored by others.
- Some suggest making datasets private or invite‑only, or smaller community‑scoped collections.
- Worries about data poisoning (fake tweet archives); ideas include requiring multiple independent corroborating uploads and web‑of‑trust mechanisms.
Decentralized Alternatives and Their Trade-offs
- Some advocate moving to Mastodon, Bluesky, or Nostr; others report Mastodon’s culture, admin drama, defederation, and lack of post migration as major drawbacks.
- Debate over whether a federated, server‑oriented design is inherently flawed or simply the only non‑corporate option.
Crowdsourced Scraping via Browser Extensions
- Interest in extensions/userscripts that passively and anonymously upload what users already view (likened to RECAP for PACER or other crowdsourced tools).
- Recognized as TOS‑gray but potentially hard to distinguish from normal browsing; privacy and site‑specific tailoring are unsolved challenges.
Broader Social Media Reflections
- Many describe Twitter as increasingly toxic and engagement‑optimized; a minority say it still works well with careful curation.
- Lock‑ins are attributed to network effects, income from engagement programs, and behavioral inertia despite perceived harm.