2026-02-20

Wikipedia deprecates Archive.today, starts removing archive links

Wikipedia’s policy change and rationale

Commenters quote Wikipedia’s new guidance: remove archive.today links when originals are still online; replace with other archives (Internet Archive, Ghostarchive, Megalodon); or replace the source with one that doesn’t need archiving.
Some users support the decision, arguing an archival site that alters stored content loses its core purpose and cannot be used for verifiable citations.
Others say archives are critical to Wikipedia’s integrity and claim there is “no credible alternative” for some content, especially paywalled or easily deleted material.

DDoS and manipulation of archives

Multiple users confirm archive.today is running JavaScript on its CAPTCHA page that repeatedly fetches a critic’s blog (with cache‑busting), characterizing this as a low‑rate but deliberate DDoS using visitors’ browsers.
Evidence is cited that archive.today temporarily modified existing snapshots (e.g., global find/replace of a name; changing logged‑in account names in archived social media), with third‑party archives capturing the altered state.
Many argue that this single act of retroactive tampering is enough to disqualify it as a reliable archival source.

Doxxing, retaliation, and “everyone sucks here”

There’s broad disapproval of attempting to unmask the operator, but also strong criticism of archive.today’s response (DDoS and smear threats).
Some say the operator is still a victim of doxxing regardless of their behavior; others see it as hypocritical given archive.today’s refusal to remove personal data for others.
Long subthread debates what “doxxing” means vs. OSINT using only public information, and when it becomes harassment.

Comparisons: archive.today vs Internet Archive and others

Defenders prefer archive.today because it:
- Respects fewer takedown/robots requests, preserving controversial or unwanted content.
- Often archives tricky, JS‑heavy sites and paywalled news more reliably than the Wayback Machine.
Critics note archive.org’s weaknesses (robots-based removals, JS execution that can alter rendered pages) but say these are categorically different from an operator editing stored HTML.
Perma.cc and self‑hosted tools (ArchiveBox, Readeck, Omnom) are discussed; Perma.cc’s cost and institutional focus are seen as a mismatch for “anyone can edit” Wikipedia.
Some advocate Wikimedia building its own archiver; others warn of legal and DMCA risks, especially around paywalls.

Paywalls and technical speculation

A major reason people rely on archive.today is paywall bypass. Alternatives include browser paywall‑bypass extensions and header tricks.
Speculation about archive.today’s methods: residential proxies, anti‑paywall scripts, possibly paid accounts; others doubt large‑scale account management is feasible.

Broader concerns and suspicions

Some see a coordinated campaign by news publishers or governments against archive.today, noting FBI interest and paywall economics; others think archive.today’s own actions “Streisand Effected” the controversy.
Several lament that a unique, very large corpus (especially social media and paywalled news) may now become practically unusable as a citation source, calling the situation a “stupid tragedy.”

Related topics