2026-02-14

YouTube as Storage

Project concept & reactions

The tool encodes arbitrary files into video frames using fountain codes, then stores and retrieves them by uploading to and downloading from YouTube.
Many commenters find it clever and nostalgic (compared to cassette/VHS data storage, GmailFS, Flickr-as-storage, qStore, etc.), but most say they would never rely on it for real backups.
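
For readers unfamiliar with fountain codes, the idea can be sketched with a toy LT-style code. This is not the tool's actual implementation: the degree list, the use of a per-symbol seed, and all function names here are assumptions chosen for illustration. Each encoded symbol is the XOR of a pseudo-random subset of source blocks, and the receiver keeps collecting symbols until a simple peeling decoder recovers everything, no matter which symbols were lost along the way.

```python
import random

# Ad hoc degree choices for illustration; real LT codes use a soliton distribution.
DEGREES = [1, 1, 2, 2, 2, 3, 4]

def indices_for(seed, n_blocks):
    """Derive the subset of source blocks that the symbol with this seed combines."""
    rng = random.Random(seed)
    degree = min(rng.choice(DEGREES), n_blocks)
    return set(rng.sample(range(n_blocks), degree))

def encode_symbol(blocks, seed):
    """One encoded symbol: (seed, XOR of the selected source blocks)."""
    value = 0
    for i in indices_for(seed, len(blocks)):
        value ^= blocks[i]
    return seed, value

def decode(symbols, n_blocks):
    """Peeling decoder. Returns the source blocks, or None if more symbols are needed."""
    known = {}
    pending = [[indices_for(seed, n_blocks), value] for seed, value in symbols]
    progress = True
    while progress and len(known) < n_blocks:
        progress = False
        for entry in pending:
            idx, value = entry
            # Remove blocks we already know from this symbol.
            for i in [i for i in idx if i in known]:
                idx.discard(i)
                value ^= known[i]
            entry[1] = value
            # A symbol covering exactly one unknown block reveals that block.
            if len(idx) == 1:
                (i,) = idx
                known[i] = value
                idx.clear()
                progress = True
    if len(known) == n_blocks:
        return [known[i] for i in range(n_blocks)]
    return None

def receive_until_decoded(blocks, drop_every=3, max_symbols=10_000):
    """Simulate a lossy channel: every drop_every-th symbol never arrives."""
    received = []
    for seed in range(max_symbols):
        if seed % drop_every == 0:
            continue  # this symbol was lost
        received.append(encode_symbol(blocks, seed))
        result = decode(received, len(blocks))
        if result is not None:
            return result, len(received)
    return None, len(received)

source = list(b"fountain codes")  # each byte is treated as one source block
recovered, used = receive_until_decoded(source)
print(bytes(recovered), "recovered from", used, "symbols")
```

The receiver needs more symbols than there are source blocks, which is one source of the overhead commenters mention.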

Technical feasibility & YouTube compression

Multiple people ask how data survives YouTube’s re-encoding and lossy compression; some assume “after compression, all data is lost.”
Others infer that redundancy plus error-tolerant coding (fountain codes, QR-like patterns, heavy parity) can make it work, but with very poor storage efficiency.
Several note that it’s likely fragile: future transcoding passes, AI “enhancement,” or changes to codecs/bitrates could silently corrupt data.
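
The redundancy argument above can be illustrated with a minimal sketch. It assumes each bit is drawn as an 8x8 block of pure black or white pixels, and it uses bounded random per-pixel noise as a crude stand-in for real codec artifacts, which it does not model. The block size, noise level, and function names are all hypothetical, not taken from the tool.

```python
import random

BLOCK = 8  # assumed: each bit is drawn as a BLOCK x BLOCK square of pixels

def bytes_to_bits(data):
    return [(byte >> (7 - i)) & 1 for byte in data for i in range(8)]

def bits_to_bytes(bits):
    out = bytearray()
    for i in range(0, len(bits) - len(bits) % 8, 8):
        value = 0
        for bit in bits[i:i + 8]:
            value = (value << 1) | bit
        out.append(value)
    return bytes(out)

def encode_frame(data, bits_per_row=16):
    """Return a grayscale frame (list of pixel rows): 255 for a 1 bit, 0 for a 0 bit."""
    bits = bytes_to_bits(data)
    bits += [0] * (-len(bits) % bits_per_row)  # pad the last row of blocks
    frame = []
    for r in range(0, len(bits), bits_per_row):
        row = [255 * b for b in bits[r:r + bits_per_row] for _ in range(BLOCK)]
        frame.extend(list(row) for _ in range(BLOCK))
    return frame

def simulate_lossy(frame, noise=100, seed=0):
    """Perturb every pixel by up to +/- noise, clamped to the 0..255 range."""
    rng = random.Random(seed)
    return [[min(255, max(0, p + rng.randint(-noise, noise))) for p in row]
            for row in frame]

def decode_frame(frame, n_bytes, bits_per_row=16):
    """Recover each bit by thresholding the mean brightness of its block."""
    bits = []
    for by in range(0, len(frame), BLOCK):
        for bx in range(0, bits_per_row * BLOCK, BLOCK):
            total = sum(frame[y][x]
                        for y in range(by, by + BLOCK)
                        for x in range(bx, bx + BLOCK))
            bits.append(1 if total / (BLOCK * BLOCK) > 127.5 else 0)
    return bits_to_bytes(bits)[:n_bytes]

payload = b"hello, youtube"
noisy = simulate_lossy(encode_frame(payload))
print(decode_frame(noisy, len(payload)) == payload)
```

In this sketch one bit costs 64 pixels, which shows why commenters expect the scheme to work only at very poor efficiency. It also hints at the fragility concern: a transform that shifts or blurs blocks, rather than adding bounded noise, would break this naive decoder.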

YouTube infrastructure, growth, and deletions

An anecdote from early YouTube infra: the long tail of unwatched videos was “a drop in the bucket” compared to incoming data, so deleting for space wasn’t needed.
Commenters debate whether this still holds with explosive upload growth (including AI-generated “slop”).
Some argue storage is still cheap relative to revenue; others say Kryder’s Law is ending and that old, low-value videos will eventually have to be compressed harder or deleted.
People point out that videos already disappear for copyright/ToS, government requests, uploader deletions, and abandoned accounts; YouTube’s ToS explicitly bans using it as generic storage, so channels can be wiped at any time.

Ethics, “commons,” and exploitation

One side calls this “burden on the commons” and urges developers to pay for storage instead of abusing free platforms.
Others reply that YouTube is a profit-driven monopoly, not a true commons, and “siphoning back” value within legal limits is fair.
There’s tension between YouTube as corporate ad machine vs. YouTube as a massive cultural archive that should be preserved.

Alternatives and practical backup advice

Suggestions: Backblaze B2 + tools like restic/borg, other cloud storage, or cheap tape libraries (LTO) for large archives.
Some discuss par2’s limitations at modern scales and error models.
A few propose other “parasitic” vectors (Reddit text, other video hosts) but most agree serious backups should use paid, purpose-built storage.
