I Stored a Website in a Favicon

Technical approaches to favicon storage

  • Several commenters explore encoding HTML directly in favicon pixels and decoding via JavaScript, similar to the article’s approach.
  • Others propose using SVG favicons and simply embedding markup or HTML inside the SVG, then fetching and injecting it into the page.
  • Suggestions include bespoke compression schemes for HTML tags to pack more content into limited pixels.
  • ICO format’s support for multiple resolutions is noted as a way to store more data.

Alternative carriers and polyglots

  • PNG comment chunks (tEXt, zTXt, iTXt) are mentioned as an easier way to stash arbitrary data in an otherwise normal image.
  • Ideas extend to animated GIFs, SVG embedding raster images, and even putting the whole site in an SVG <foreignObject>.
  • Some cite earlier polyglot tricks where a single file is valid HTML and PNG (and sometimes ZIP/PDF), eliminating the need for a separate loader.

Security, privacy, and tracking

  • Multiple comments highlight favicon caches as a fingerprinting / supercookie vector, especially if shared across normal and private browsing.
  • One idea is to encode unique session IDs or cookies in per-user favicons to evade standard cookie clearing.
  • People ask whether browsers and anti‑fingerprinting measures address favicon + canvas abuses; status of fixes is unclear.

Performance, caching, and browser behavior

  • Browser favicon caching is seen as both a challenge (stale data) and an opportunity (streaming data by rotating favicons).
  • Some discuss tricks with content negotiation so the same URL serves as both HTML and favicon, or symlink-based hacks on static hosting.
  • Past bugs with “unbounded” favicons that crashed browsers are referenced.

Use cases and practicality

  • Many treat this as a fun, impractical hack that makes the web more interesting.
  • Speculative uses include covert storage (passwords, data exfiltration), censorship workarounds, or “new ecosystems” built on hiding data in unexpected places.
  • Others question real-world value, especially when privacy tools already block parts of the setup.

Meta: writing style and AI debate

  • A substantial subthread debates whether the article text is LLM‑generated.
  • Some find the terse, bullet‑heavy style “AI‑like” and off‑putting; others find it clear and time‑respectful.
  • There is disagreement over the reliability of AI‑detection tools and concern that people over-attribute writing to LLMs.