If it is worth keeping, save it in Markdown

Data hoarding vs selective saving

  • Some argue compulsively saving web content is wasted effort and cognitive overhead; others say future searchability and option value make local copies invaluable.
  • One camp trusts that “the internet never forgets” and prefers to let others bear storage costs; another reports an accelerating loss of semi‑obscure documents, especially government/organizational material.
  • Older participants note that decades of archives were rarely revisited more than a few years back, suggesting diminishing returns; others counter that AI may make large personal archives newly useful.

Web archiving & the Internet Archive

  • Several people rely on tools like SingleFile, Markdownload, or linkding to save full pages (HTML/webarchive/MHTML) rather than Markdown, especially for complex layouts, paywalled or app‑like sites.
  • There’s appreciation but concern for the fragility and legal risks of the Internet Archive; some want multiple IA‑like projects or partial mirrors, even with degraded media quality.
  • A recurring point: the ability to find and organize archives (full‑text search, tagging, annotations, vector search) matters more than the raw act of saving.

Why Markdown, and how people use it

  • Many keep nearly everything textual in Markdown: notes, blog drafts, research, meeting minutes, even daily journals, often synced via Git, Nextcloud, or cloud drives.
  • Advantages cited: plain text, tool portability, easy scripting (Pandoc, Python, Quarto), and the ability to rebuild new systems over the same files.
  • Some are building personal “knowledge bases” or LLM corpora from scraped Markdown (bookmarks, GitHub stars, HN posts, transcripts).

Limitations, fragmentation, and readers

  • Pain points: images and tables, math, diagrams, underlining, colored text, custom blocks, rich layout, and lack of standardized frontmatter.
  • People debate CommonMark’s design (emphasis syntax, ambiguity, complexity of parsers) and the profusion of flavors (GitHub, Obsidian, MDX).
  • A strong subthread complains about the scarcity of simple Markdown viewers (vs editors with preview) compared to ubiquitous PDF readers.

Alternatives and ecosystems

  • Org‑mode is praised as a more holistic PKM system (tasks, time management, literate programming) but is seen as Emacs‑centric.
  • Others advocate AsciiDoc, reStructuredText, MediaWiki markup, Typst, HTML, or even RTF for richer, WYSIWYG‑friendly, or more semantically precise documents.
  • Consensus: use Markdown for most plain text; switch to richer formats when layout, semantics, or exact styling are essential.