CSVs Are Kinda Bad. DSVs Are Kinda Good
Role and Limits of CSV
- Many see CSV as an “ancient, lowest-common-denominator” text format that persists because it’s simple, human-inspectable, and widely supported.
- Others stress that CSV itself is “kinda great” when used per RFC 4180; the real problem is ad‑hoc dialects and badly tested exporters/importers.
- Several note that non-technical users equate “CSV” with “whatever Excel does,” which drives inconsistent, often broken files.
DSV / ASCII Control-Character Delimiters
- The proposed DSV idea (using ASCII control characters like 30/31 as field/record delimiters) avoids quoting and newline edge cases.
- Supporters like that it’s efficient to parse, solves many CSV escaping issues, and builds on long-existing ASCII concepts.
- Critics say it sacrifices the main CSV benefit (human readability/editability in generic tools) without offering enough in return.
- Others note practical issues: these characters do appear in some real-world “CSV” containing binary/non-plain-text data; you still need an escape strategy.
Standards, Dialects, and Pragmatism
- One camp argues strongly for enforcing RFC 4180 and rejecting broken CSV to reduce chaos and improve interoperability.
- Another camp counters that “broken” is a sliding scale; power asymmetries mean small suppliers must accept whatever large partners send.
- RFC 4180 itself advises being liberal in what you accept; there’s debate over whether that helps standardization or entrenches sloppiness.
Alternatives and Tooling
- For data you control and don’t need to edit by hand, people recommend Parquet, SQLite files, JSON/NDJSON, or other structured/binary formats.
- TSV is popular for command-line work; seen as simpler and safer than CSV, though still fragile if tabs appear in data.
- Some suggest augmenting CSV with a standardized metadata header (delimiter, encoding, locale, etc.).
- Tooling quality matters: libraries like pandas, xsv, zsv, and LibreOffice Calc are praised for handling messy CSV better than Excel or naive parsers.
- Several conclude that any format needs clear escaping rules and that “just pick another delimiter” does not by itself solve the deeper interchange problems.