YouTube as Storage
Project concept & reactions
- Tool encodes arbitrary files into video frames using fountain codes, then stores/retrieves them via YouTube uploads/downloads.
- Many commenters find it clever and nostalgic (compared to cassette/VHS data storage, GmailFS, Flickr-as-storage, qStore, etc.), but most say they would never rely on it for real backups.
Technical feasibility & YouTube compression
- Multiple people ask how data survives YouTube’s re-encoding and lossy compression; some assume “after compression, all data is lost.”
- Others infer that redundancy plus error-tolerant coding (fountain codes, QR-like patterns, heavy parity) can make it work, but at very poor efficiency.
- Several note that it’s likely fragile: future transcoding passes, AI “enhancement,” or changes to codecs/bitrates could silently corrupt data.
YouTube infrastructure, growth, and deletions
- An anecdote from early YouTube infra: the long tail of unwatched videos was “a drop in the bucket” compared to incoming data, so deleting for space wasn’t needed.
- Commenters debate whether this still holds with explosive upload growth (including AI-generated “slop”).
- Some argue storage is still cheap vs revenue; others say Kryder’s Law is ending and one day old, low-value videos will have to be compressed harder or deleted.
- People point out that videos already disappear for copyright/ToS, government requests, uploader deletions, and abandoned accounts; YouTube’s ToS explicitly bans using it as generic storage, so channels can be wiped at any time.
Ethics, “commons,” and exploitation
- One side calls this “burden on the commons” and urges developers to pay for storage instead of abusing free platforms.
- Others reply that YouTube is a profit-driven monopoly, not a true commons, and “siphoning back” value within legal limits is fair.
- There’s tension between YouTube as corporate ad machine vs. YouTube as a massive cultural archive that should be preserved.
Alternatives and practical backup advice
- Suggestions: Backblaze B2 + tools like restic/borg, other cloud storage, or cheap tape libraries (LTO) for large archives.
- Some discuss par2’s limitations at modern scales and error models.
- A few propose other “parasitic” vectors (Reddit text, other video hosts) but most agree serious backups should use paid, purpose-built storage.