Wikipedia deprecates Archive.today, starts removing archive links

Wikipedia’s policy change and rationale

  • Commenters quote Wikipedia’s new guidance: remove archive.today links when originals are still online; replace with other archives (Internet Archive, Ghostarchive, Megalodon); or replace the source with one that doesn’t need archiving.
  • Some users support the decision, arguing an archival site that alters stored content loses its core purpose and cannot be used for verifiable citations.
  • Others say archives are critical to Wikipedia’s integrity and claim there is “no credible alternative” for some content, especially paywalled or easily deleted material.

DDoS and manipulation of archives

  • Multiple users confirm archive.today is running JavaScript on its CAPTCHA page that repeatedly fetches a critic’s blog (with cache‑busting), characterizing this as a low‑rate but deliberate DDoS using visitors’ browsers.
  • Evidence is cited that archive.today temporarily modified existing snapshots (e.g., global find/replace of a name; changing logged‑in account names in archived social media), with third‑party archives capturing the altered state.
  • Many argue that this single act of retroactive tampering is enough to disqualify it as a reliable archival source.

Doxxing, retaliation, and “everyone sucks here”

  • There’s broad disapproval of attempting to unmask the operator, but also strong criticism of archive.today’s response (DDoS and smear threats).
  • Some say the operator is still a victim of doxxing regardless of their behavior; others see it as hypocritical given archive.today’s refusal to remove personal data for others.
  • Long subthread debates what “doxxing” means vs. OSINT using only public information, and when it becomes harassment.

Comparisons: archive.today vs Internet Archive and others

  • Defenders prefer archive.today because it:
    • Respects fewer takedown/robots requests, preserving controversial or unwanted content.
    • Often archives tricky, JS‑heavy sites and paywalled news more reliably than the Wayback Machine.
  • Critics note archive.org’s weaknesses (robots-based removals, JS execution that can alter rendered pages) but say these are categorically different from an operator editing stored HTML.
  • Perma.cc and self‑hosted tools (ArchiveBox, Readeck, Omnom) are discussed; Perma.cc’s cost and institutional focus are seen as a mismatch for “anyone can edit” Wikipedia.
  • Some advocate Wikimedia building its own archiver; others warn of legal and DMCA risks, especially around paywalls.

Paywalls and technical speculation

  • A major reason people rely on archive.today is paywall bypass. Alternatives include browser paywall‑bypass extensions and header tricks.
  • Speculation about archive.today’s methods: residential proxies, anti‑paywall scripts, possibly paid accounts; others doubt large‑scale account management is feasible.

Broader concerns and suspicions

  • Some see a coordinated campaign by news publishers or governments against archive.today, noting FBI interest and paywall economics; others think archive.today’s own actions “Streisand Effected” the controversy.
  • Several lament that a unique, very large corpus (especially social media and paywalled news) may now become practically unusable as a citation source, calling the situation a “stupid tragedy.”