Hacker News, Distilled

AI powered summaries for selected HN discussions.

Page 42 of 778

UK to permanently ban future generations from buying cigarettes

Public health rationale vs personal freedom

  • Many see the ban as a straightforward health measure: cigarettes have no safe level of use, harm bystanders through second-hand smoke, and provide no societal benefit.
  • Others argue governments should not legislate lifestyle choices, even when harmful, and view this as an overreach into private life.

Role of NHS / socialized healthcare

  • A recurring argument: when healthcare costs are collectivized, restricting products that drive heavy costs (like cigarettes) is justified.
  • Critics counter that universal healthcare need not imply broad bans on risky behavior; the NHS is seen by some as a baseline safety net, not a mandate for paternalism.
  • Some libertarian-leaning commenters go further, arguing the core “problem” is collective healthcare itself.

Cost, taxation, and net-effect debates

  • One side: smoking-related illness costs the UK more than tobacco tax revenue; banning sales is “logical.”
  • Another side: heavy taxes could instead be raised until revenue exceeds costs.
  • Some argue smokers may be a net fiscal “saving” by dying earlier and using fewer pensions and late-life care; others call that morally abhorrent and contest the math.

Comparisons to alcohol, sugar, and other drugs

  • Alcohol: widely accepted culturally (e.g., wine in Christianity) and politically much harder to restrict, despite higher estimated NHS costs than smoking.
  • Sugar: some parallels, but defenders note moderate sugar use can be harmless, unlike cigarettes.
  • Illegal drugs: several note bans haven’t eliminated use; one suggests far fewer would use if they were legal, others argue prohibition clearly reduces prevalence.

Effectiveness of bans & black markets

  • Some expect a non-zero but reduced number of smokers; reduction is seen as sufficient.
  • Others predict smuggling and black markets, especially given existing drug trafficking networks, and note being an island doesn’t guarantee control.

Fairness, liberty, and slippery slopes

  • Critics fear a precedent for banning sugar, alcohol, extreme sports, or enforcing mandatory exercise, especially under single-payer logic.
  • Supporters respond that cigarettes are uniquely harmful and lack a safe or beneficial use, making them a reasonable special case.

Enforcement and practicalities

  • Age-cutoff design raises long-term ID-check complexity, but some think checking birth year actually simplifies enforcement.
  • Questions arise about continued nicotine via vaping, which remains legal and is viewed as less harmful.

GPT‑5.5 Bio Bug Bounty

Prize structure & incentives

  • Top prize is $25k for the first “true universal jailbreak” that answers five hidden bio questions without triggering moderation.
  • Many see this as a lottery: only one main payout regardless of how many people or distinct bugs succeed; partial successes may get nothing or only discretionary rewards.
  • Several posters call the reward insultingly low relative to OpenAI’s resources and the claimed existential stakes, comparing it unfavorably to six‑figure security bounties and OpenAI’s own prior $500k Kaggle contest.
  • Others respond that “first past the post” and discretionary partials are standard in bug bounties and contests.

NDA, access control, and secrecy

  • Participation is gated: applicants must already be ChatGPT users, be “vetted” bio red‑teamers, and sign an NDA.
  • Critics argue this turns it into unpaid or underpaid spec work where almost everyone gets nothing and also cannot publish results or even the questions.
  • Some worry the NDA allows OpenAI to reject payouts while still silencing participants. A few say this level of confidentiality is normal; others strongly disagree.
  • There is confusion/criticism around being asked to propose a jailbreak approach before even seeing the five questions.

Perceived goals: safety vs. marketing

  • Many commenters describe the program as a PR or “theatre” move:
    • To signal that models are extremely powerful and potentially dangerous.
    • To reassure regulators that OpenAI is responsibly self‑policing.
    • To contrast with and potentially stigmatize open‑source models.
  • Some think the real aim is to collect jailbreak attempts as training data for future safety systems and marketing claims (“safest model”).
  • A minority see value in a narrowly scoped, concrete biosafety red‑teaming effort.

Biorisk framing and model behavior

  • “Bio‑bugs” are described as ways to get the model to provide actionable guidance on harmful biological activities (e.g., weaponization steps), as opposed to high‑level or benign information.
  • Other AI companies’ CBRN/biorisk filters are mentioned as precedent.
  • Some users report current models already over‑blocking benign biology‑related tasks (e.g., sequence analysis, educational illustrations), calling it frustrating false positives.
  • One commenter notes that, despite flaws, over‑blocking is preferable to dangerous false negatives.

Trust in bug bounties

  • Several recount experiences of companies (including OpenAI) declaring impactful issues out‑of‑scope to avoid paying.
  • This fuels broader skepticism that corporate bug bounty programs are fair or trustworthy.

Show HN: A Karpathy-style LLM wiki your agents maintain (Markdown and Git)

Markdown, Git, and Durability

  • Markdown is praised as open, simple, widely supported, and likely to remain readable long term.
  • Git versioning is seen as a natural fit for tracking evolving agent-written artifacts.
  • Some question whether markdown itself improves LLM performance or is mainly about distribution and tooling.

Obsidian vs Dedicated Agent Wiki

  • Several suggest “just use an Obsidian vault + plugins.”
  • Project authors argue Obsidian is single‑user–centric, lacks promotion workflows and machine-facing APIs (MCP tools), but can still act as a read-only or parallel editor on the same markdown tree.

Retrieval Strategy: BM25, Vectors, Indexing

  • Many approve the “BM25-first” design over defaulting to vector databases.
  • Discussion on routing: text length/shape is a weak signal; using the agent’s task context may be better for choosing between exact-match vs narrative retrieval.
  • Others propose simple indices or TOCs; counterpoint is that cascaded filtering reduces context noise and makes reranking feasible.

Quality of Agent-Generated Wikis

  • Strong skepticism that “teams of agents” mostly produce low-quality “slop.”
  • Others report positive experiences when agents operate over a curated, git-based knowledge base, improving coordination across tools and repos.
  • Cited research suggests fully LLM-maintained docs can degrade quality vs human-maintained; hybrid setups with human curation work better.

Note-Taking Philosophy and Noise

  • Some reject automated note-taking entirely: the value is in humans building their own mental models.
  • Others use agents for structuring, tagging, and refactoring notes, while keeping humans responsible for actual understanding.
  • Concern that AI makes it too easy to generate mountains of text nobody reads.

Governance, Promotion, and Knowledge Decay

  • Multiple comments stress separating “capture” from “promotion”: agents can draft freely, but trusted entries need human review or multi-agent agreement.
  • Worries about confidently wrong entries compounding over time and being re-cited.
  • Questions raised about missing features like temporal vs atemporal memory, snapshots, rollbacks, and explicit handling of business rules.

Deployment, Privacy, and Provider Support

  • Current model is local use with git, without pushing to public hosts; some want easy self-hosted, multi-user setups.
  • OpenAI-compatible endpoints (including local or alternative providers) are reportedly supported via an intermediate runtime.

Ecosystem, Overlap, and Hype

  • Noted that multiple LLM-wiki systems hit the front page in a day; some see duplication and wish for collaboration.
  • Mixed reactions to the product’s playful branding: some find it slick; others see it as satire or fad-chasing, while maintainers emphasize prior serious CRM/context-infra work underpinning it.

New 10 GbE USB adapters are cooler, smaller, cheaper

Adapter capabilities & compatibility

  • New 10GbE USB adapters support a wide range of speeds (10/100/1000/2.5/5/10G), which users value because many embedded / industrial / IoT / legacy devices are still 10/100-only.
  • Some users report real-world throughput below 10 Gbit/s on USB 3.2 Gen 2x1 ports (around 5–7 Gbit/s), especially on Apple laptops that lack USB 3.2 Gen 2x2.
  • Thunderbolt-based 10GbE adapters generally reach near line-rate and are seen as the “safe” choice on Macs and other TB-equipped systems.

Thermals, power use, and PoE

  • 10Gbase‑T over copper is frequently criticized as hot and power‑hungry; some see it as inherently inefficient compared with fiber or DAC.
  • Users compare adapter behavior across laptop generations: same USB NIC runs hot on some models and cool on newer ones, suggesting driver or controller differences.
  • There is interest in 10GbE + PoE++ for laptop power and PoE-powered desktops/servers, but current products are rare, often limited to 2.5G and ~50–65 W.
  • Some see PoE as great for home automation, small devices, and even powering mini PCs; others point out PoE switches and injectors remain relatively expensive.

Copper vs fiber and SFP+

  • Strong debate: some want SFP+/fiber-based USB or TB adapters, arguing fiber/DAC are cooler, more efficient, and cheap on the used market; others prefer RJ45 because every endpoint already has copper.
  • Several report successful 10GbE over Cat5e/Cat6 at short to moderate distances, even “out of spec” cabling, while others emphasize that Cat6a is the proper choice.
  • Some argue 10GbE is already “old” and that 25GbE (often 1×25 or 4×25 for 100G) is becoming the real sweet spot in datacenter gear, though still uncommon in homes.

USB naming and ecosystem confusion

  • There is extensive frustration with USB 3.x / USB4 naming (Gen 1/2, x1/x2), inconsistent labeling on ports/cables, and unclear support for features like DisplayPort, PCIe tunneling, and power levels.
  • Some praise newer “USB 20/40/80 Gbps” marketing but note that older 3.x branding and mixed USB4/Thunderbolt support still confuse buyers.

Switches, NICs, and pricing

  • 2.5GbE gear is now considered cheap and common; 10GbE switches and NICs are still meaningfully more expensive, though used enterprise and low‑end Mikrotik/other brands have pushed prices down.
  • Opinions differ on whether 10GbE is “cheap enough”; some see it as finally practical for homelab backbones, others still consider the ecosystem (switches, cabling, power) too costly for casual use.

Use cases & practical value

  • Common home uses: fast transfers to NAS, multi‑TB research datasets, media / photo workflows, and RAID arrays where HDD mirrors or SSDs can saturate 1GbE.
  • Some argue 2.5GbE or 5GbE is sufficient for most, and that Thunderbolt/DAS is better than networked storage if you only need high speed between a workstation and local disks.

Linux & driver experience

  • Experiences vary: some report flawless operation with mainstream Realtek chipsets (e.g., RTL8156B for 2.5G), others see frequent link drops or speed issues tied to specific adapters and kernel versions.

Firefox Has Integrated Brave's Adblock Engine

Impact on uBlock Origin and Built‑In Blocking

  • Some wonder if Firefox’s new adblock engine benefits users of uBlock Origin or could be exposed to extensions.
  • Observations: current implementation seems to rely on uBO-style lists but lacks cosmetic filtering, leaving empty ad slots.
  • Several expect Mozilla is still integrating it and that it will primarily power Enhanced Tracking Protection rather than replace full adblockers.

Manifest V2/V3 and Extension Future

  • Strong concern that native blocking may be used to justify dropping MV2, limiting advanced adblockers.
  • Others argue:
    • Firefox’s MV3 keeps powerful APIs (e.g., webRequestBlocking) that Chrome removed.
    • Most MV2-only issues relate to Chromium, not Firefox.
  • Mozilla’s statement (via Reddit) says:
    • They are testing Brave’s Rust component to process tracker lists.
    • They have “no plans” to abandon MV2 and want adblockers to “work best” in Firefox.
  • Some distrust “no plans” language, reading it as eventual deprecation.

Browser Choice, Privacy, and Ecosystem

  • Many see uBlock Origin on Firefox as the gold standard; some say it’s the main reason to use Firefox.
  • Others have switched to Brave, praising speed, integrated blocking, and scriptlets, but often disable or distrust its crypto, ads, and promotional toggles.
  • Debate over whether Firefox is “alienating” users vs. unfairly singled out compared to Chromium browsers that add similar features.
  • Some advocate Firefox forks (LibreWolf, Waterfox, IronFox) but note they are dependent on Mozilla’s upstream work.

Alternatives and Network‑Level Blocking

  • Alternative browsers mentioned: Brave, Vivaldi, various Chromium forks on Android (Cromite, Ultimatum, Helium, Elixir), plus experimental engines like Ladybird.
  • Mobile users want Firefox‑level blocking and extensions on iOS/Android; iOS’s WebKit requirement limits parity.
  • Network/MITM proxies are proposed as browser‑independent blockers, though increasingly challenged by same‑domain ads and encrypted content.

Politics and Ethics

  • Some object to Firefox adopting technology from Brave given past controversies around leadership.
  • Others counter that open‑source reuse doesn’t benefit individual leaders directly and can reduce Brave’s competitive edge.

Plain text has been around for decades and it’s here to stay

Plain text, Unicode, and encodings

  • Debate over what “plain text” means:

    • Some equate it with ASCII only; others with Unicode text (usually UTF‑8); others use the cryptographic sense (any unencrypted data).
    • One view: Unicode is too complex and quirky to be “plain”; ASCII is the only truly universal baseline.
    • Counter‑view: today UTF‑8 is the de facto standard and “good enough,” giving back a meaningful notion of plain text.
  • Encoding issues:

    • Past pain with code pages, Shift‑JIS/EUC‑JP, and mojibake is cited as a reason to standardize on UTF‑8.
    • Others still prefer explicit encodings/BOMs and even code‑page‑like systems, arguing files are meaningless without a declared format anyway.

Size, efficiency, and tradeoffs

  • Some criticize UTF‑8 as space‑wasteful vs UTF‑16 or compact single‑byte encodings, especially for Cyrillic, Greek, and CJK.
  • Others present measurements where UTF‑16 only wins modestly for Japanese text and often loses once markup and compression are considered.
  • Consensus from many: small encoding gains rarely justify the interoperability costs of non‑UTF‑8 text.

Limits of plain text and structure

  • Plain text’s strengths: simplicity, tool ubiquity, longevity, easy parsing, and version control.
  • Limits noted: no native images/graphics, messy large configs, ambiguous grapheme clusters, RTL handling, and unreadability of unfamiliar scripts.
  • Some argue XML/JSON/YAML/Markdown/Org are “yes‑and” layers: still printable, but with defined structure.
  • Others point out formats like XML behave more like binary when you factor in encoding declarations and auto‑detection.

TUIs, terminals, and richer UIs

  • Strong enthusiasm for modern TUIs: perceived speed, consistency, low resource use, keyboard‑centric workflows, easy text selection/copying, and longevity.
  • Terminals now support layout primitives and even graphics protocols, blurring the GUI/TUI line.
  • Skeptics question rebuilding rich GUIs over terminals when we have abundant pixels and GPU power, especially for maps and images.
  • Ongoing tension between “constraint breeds good UX” vs “constraints limit what’s reasonable or possible.”

Plain‑text diagrams, plotting, and tools

  • Many tools are shared for ASCII/Unicode diagrams, box‑drawing, terminal plots, and Emacs modes.
  • Interest in hybrid text‑plus‑visual representations and concern about accessibility of ASCII diagrams for screen readers.

Plain text for personal data and accounting

  • Several describe successful use of plain‑text note systems, invoicing, and accounting (e.g., ledger‑style tools).
  • Benefits: no lock‑in, easy scripting, git history, timestamp attestation, and straightforward “escape plans” to other formats like CSV.

Google Flow Music

Pricing & Usage Expectations

  • Confusion over high song quotas (e.g., 600/month) for a consumer tool; some argue you need many generations because only a small fraction are usable.
  • Others see that much iteration as excessive “slot machine” behavior rather than meaningful creation.

Music Quality & Capabilities

  • Many describe the output as generic, “stock-library” or “corporate” music, suffering from a kind of “mean collapse” toward average genre clichés.
  • Repeated complaints about poor prompt adherence:
    • Struggles to do solo instruments, ambient with no beat, or genre-specific nuances (old-time banjo, microtones, dubstep breakdowns, prog metal arrangements).
    • Often adds unwanted instruments or shifts style mid-iteration.
  • Some users report decent, even impressive songs in various genres and languages, but note inconsistency and limited control.

UX, Interface, and Workflow

  • “ChatGPT-style” interface seen as ill-suited for detailed music work; people want section-level or stem-level control.
  • Site aesthetics and reliability criticized (buggy animations, odd layout, client-side errors, non-functional controls).
  • Age verification is disliked by some.

Comparisons to Other AI Music Tools

  • Frequently compared unfavorably to Suno and Udio: worse prompt following, less polish, weaker sound palette, though more features (e.g., music videos, “workspace builder”).
  • Some say Gemini’s built-in song generator is better than Flow Music itself.
  • Perception that Google is late to the party and behind current state of the art.

Use Cases & Motivations

  • Proposed uses: jingles, marketing, YouTube background tracks, algorithm-gaming “slop for revenue,” educational/mnemonic songs, personal emotional-regulation tracks, hobbyist experimentation.
  • Skepticism that AI-generated tracks will matter economically beyond low-value “slop.”

Ethical and Aesthetic Debates

  • Major debate over whether prompting counts as “creating,” with analogies to composers, photographers, cooks, and commissioners.
  • Some claim AI outputs aren’t “real music” or “art” because there is no human expression behind them; others argue beauty resides in the listener, not the source.
  • Concerns about unconsented training data, displacement of human musicians, environmental cost, and “AI slop” degrading culture.

Google Strategy & Trust Concerns

  • Noted as a rebrand of ProducerAI; some see it as just another side project.
  • Widespread distrust that Google will maintain the service, given past music product shutdowns and frequent product reshuffles.
  • Some worry Google is undermining its own music ecosystem; others think it’s a logical move to own production-to-distribution.

Do I belong in tech anymore?

Emotional impact and belonging

  • Many relate to feeling alienated or burned out, especially when AI is mandated or overused.
  • People fear being labeled “difficult” if they push back on AI practices, so they often stay silent.
  • Some say work has become bland: they enjoyed hand-writing and designing code, but “agentic” AI processes feel hollow.
  • A few suggest long breaks or even leaving mainstream tech can restore mental health.

Job market and career anxiety

  • Multiple mid- and senior-level devs report being unable to get interviews despite years of experience and using AI to tailor resumes.
  • Conflicting views on the market: some claim it’s “hot” in major hubs; others say remote roles are scarce and competition intense.
  • Advice offered: move to cheaper regions, avoid AI-written resumes, lean on referrals, or even switch careers—though many feel boxed in by debt, disability, or lack of alternative skills.

AI use in everyday engineering

  • Reports of AI-generated tickets, design docs, and huge PRs merged with little or no human review.
  • Some see AI as a powerful accelerator that helps good engineers deliver more value; others see it amplifying incompetence and knowledge debt.
  • There is frustration with AI note-takers and meeting summarizers that misrepresent discussions or add little value.

Code quality, safety, and “appearance of work”

  • Concern that AI encourages an emphasis on visible output volume over correctness, maintainability, or institutional learning via code review.
  • Some argue businesses already tolerated buggy, insecure systems long before AI; AI just makes the “fast, sloppy” equilibrium more extreme.
  • Counterpoint: good implementation details and thoughtful reviews still matter for long-term maintainability and incident response.

Culture vs. tools

  • Many insist the core problem is organizational culture and incentives, not AI itself: faddish “AI everywhere” mandates, performative work, and VC-driven hype.
  • Others view tech as inherently about automation; if you’re not comfortable with that, you may not enjoy staying in the field.

Possible paths forward

  • Suggestions include focusing on “master craftsman” niches, high-quality human-centric code, or open-source work.
  • Some foresee a future of mass-produced, disposable software plus small pockets of premium, human-crafted systems.

Could a Claude Code routine watch my finances?

Access to banking data (Plaid & alternatives)

  • Multiple comments discuss how individuals can or cannot use Plaid. Historically, sales channels required you to be a business with contracts and security reviews.
  • A new “hobbyist” / free-trial onboarding flow is mentioned as recently launched, aimed at non-business users.
  • Some report still having trouble getting such access in the recent past; timing and availability are somewhat unclear.
  • Alternatives mentioned: Yodlee, SimpleFIN (via SimpleFIN Bridge), Tiller (Plaid under the hood), Lunch Money, Actual Budget, Redbark (Australia CDR), Era Finance’s MCP, and direct bank APIs like Monzo’s.

Using LLMs and agents for personal finance

  • Several users describe building their own pipelines: e.g., bank → Tiller/CSV → Google Sheets → Supabase/SQLite → MCP → LLM for analysis.
  • LLMs are used for transaction categorization, subscription detection, cashflow projections (including Monte Carlo simulations), and scenario questions like “what mortgage can I afford?”.
  • Some see strong potential for LLM-based financial advisors, always-on agents, and email/chat-style “ambient” advisors that monitor and notify instead of gamified dashboards.
  • Others note companies have already tried and failed economically in this space.

Security, privacy, and trust concerns

  • Deep unease about giving Plaid bank credentials and about data sharing/sale risks.
  • Debate over whether Plaid still stores credentials vs using OAuth; consensus: big banks mostly use OAuth/tokenized APIs, smaller ones often still involve scraping and stored credentials.
  • Some argue the correct stance is to avoid such automation until proper open banking APIs exist.
  • The showcased product emphasizes read-only Plaid scopes, no money-movement tools, segregated encrypted tokens, strict IAM, and narrow MCP tools.
  • There are warnings about routine/agent modes: all tools (including write-capable ones in general) may be callable, so prompt-injection and “rogue agent” risks must be considered.

Perceived value vs traditional tools

  • Enthusiasts praise the flexibility of getting data out of closed banking apps and into systems they control, and the ease of wiring multiple data sources together.
  • Skeptics say spreadsheets + deterministic tools (Tiller, Actual Budget, plain-text accounting) are enough and more trustworthy, especially given LLM hallucinations with financial data.
  • Some worry about obsessive tracking and daily net-worth checks; prefer infrequent, calm financial reviews.
  • A subset wants self-hosted, open-source, or strictly local-LLM solutions due to strong privacy norms.

The Classic American Diner

Why the article resonated

  • Some question why the piece was front-page; others say it matched HN guidelines and satisfied their curiosity.
  • Many express affection for diners as cultural anchors and places of routine, especially for breakfast and late-night study or work.

Personal experiences & nostalgia

  • Numerous stories from across the US: South Bay (CA), Portland (ME), Austin (TX), Spokane (WA), Venice Beach, New Mexico, Massachusetts, San Francisco/LA.
  • Emphasis on:
    • Familiar staff who know regulars’ orders.
    • Cheap, filling breakfasts and endless coffee.
    • Diners as “movie-like” experiences for foreign visitors.
  • Some lament closures or fires that destroyed beloved local spots.

Global spread of “American diners”

  • 50s/60s-style American diners are common in Europe (Finland, Serbia, Austria, Germany, France, UK, Ireland).
  • Menus often feature burgers, shakes, pancakes, hot dogs; sometimes US-style breakfast items and hash browns.
  • Debate over authenticity:
    • Some find European versions quite similar.
    • Others say they lack bottomless coffee, 24/7 hours, or the “squeeze-in railcar” feel.

What counts as a diner

  • Disagreement over the term:
    • Loose definition: serves burgers, eggs, and coffee.
    • Stricter view: 24/7, breakfast all day, dessert case, railcar-like layout.
    • Some argue chains like Waffle House are closer to “true” diners than Denny’s/IHOP; others stretch the idea to places like Cheesecake Factory.

Economics, prices & inflation

  • Users compare historical menu prices to modern costs using CPI calculators.
  • Many feel inflation-adjusted figures underestimate real diner prices today.
  • Explanations discussed:
    • Rising labor costs and regulations.
    • Portion-size changes (“portion distortion” vs “shrinkflation”).
    • Complexities of CPI baskets and regional restaurant-price indices.
  • Broad concern that classic cheap, simple diner meals are disappearing; COVID-era price jumps are noted.

Layout, social dynamics & operations

  • Booths and fixed two-seaters seen as both cozy and limiting for large groups.
  • Operational reasons given against rearrangeable tables: turnover, staffing zones, safety (furniture as weapons), and crowd behavior.
  • Diners near courts are cited as models of efficient, attentive service versus slower, trendier neighbors.

Regional & historical notes

  • New Jersey’s dense diner culture is highlighted, plus anecdotes linking NJ diners to tech history (e.g., UTF-8).
  • Worcester, MA’s diner-manufacturing legacy is mentioned.
  • References to prefab diner “kits” and railcar dining (including chain restaurants) extend the theme of modular, movable diner architecture.

OpenAI releases GPT-5.5 and GPT-5.5 Pro in the API

Release timing & rollout

  • Some speculate the accelerated release was a response to DeepSeek; others think it was just final flag checks or that DeepSeek v4 is underwhelming.
  • Confusion over “safeguards and security requirements” mentioned the day before and how those could be resolved so quickly.
  • Rollout lagged for some enterprise and third‑party tools; a few users still saw only 5.4 initially.

Use cases & perceived value

  • Pro/expensive models are used for high‑value, infrequent tasks where cost is negligible compared with outcome (e.g., legal docs, ToS/PP drafting).
  • Some feel the marginal quality gain justifies the price; others don’t see meaningful improvements over cheaper models.

Safety, safeguards & liability

  • Strong disagreement over safety filters.
    • One side: filters are “counter‑productive,” harm access to medical and practical knowledge, and mainly shift liability away from providers.
    • Other side: hallucinations and mistranslations in contexts like medicine create serious risk; providers want to avoid PR/legal fallout.
  • Debate over real‑world alternatives for translation/diagnosis (professional interpreters vs AI vs “no help at all”).

Knowledge cutoff confusion

  • API docs list Dec 2025, but the model reports June 2024 in its own system prompt.
  • Several note model‑reported cutoffs have always been unreliable; practical testing suggests knowledge into early 2025.
  • Hypotheses: training data contamination, intentional older cutoff in prompts to encourage tool use; overall “unclear.”

Model quality & behavior

  • Mixed coding anecdotes: some see 5.5 as “shockingly good” and solving hard problems quickly; others see laziness (omitting obvious code) or no real gains over recent generations.
  • Long‑running automated coding workflows (hundreds of millions of tokens) reported as feasible and high quality by some; others are skeptical and expect “AI slop.”

Benchmarks & comparisons

  • Some benchmarks show 5.5 near or above top models (e.g., perfect SQL benchmark score, strong coding‑reasoning results).
  • Other user‑made benchmarks (e.g., WordPress plugin task) rank it poorly on both quality and value, with surprising underperformance versus some competitors. Methodology is debated.

Pricing, ecosystem & ethics

  • 5.5 (and especially 5.5 Pro) is significantly more expensive than 5.4 and Opus 4.7; concern that “subsidized AI” is ending and providers are clawing back margin.
  • Complaints about GitHub Copilot tiers and high multipliers; some predict migration to cheaper Chinese providers.
  • Ethical worries about financially supporting OpenAI, including references to alleged government surveillance contracts and concerns about astroturfing in online discussions.
  • Some report strict refusals on topics like benign SARS‑CoV‑2 analysis as evidence of over‑cautious safety policies.

There Will Be a Scientific Theory of Deep Learning

Overall reactions to the paper

  • Many find the survey engaging, comprehensive, and particularly value the open-problems section.
  • Others see the title as overconfident “flag planting,” but still consider the work useful as a synthesis and standardization of ideas.
  • Several note a gap between active theory research and public perception that “it’s all just a black box.”

History: why deep learning took off when it did

  • Key inflection points cited: AlexNet (2012) for vision, then attention and transformers for language.
  • Crucial enabling factors: GPUs, much larger curated datasets (e.g., ImageNet), and better software frameworks that made complex models practical.
  • Some argue transformers “could have existed earlier,” but most replies stress results at small scale would have been underwhelming or infeasible to train.

Architecture vs scale and data

  • Strong debate:
    • One camp emphasizes “bagillions of parameters” and data as the main driver (the “bitter lesson”).
    • Another stresses architectural inductive biases and optimizer interactions; not all scalable architectures work, and many design choices are the difference between success and failure.
  • Neural nets are compared to “learned kernels” with powerful compositionality; nonparametric and kernel methods hit limits at modern data scales.

Relation to brains, evolution, and biology

  • Some argue deep learning is quite unlike brains; others see evolution as an end‑to‑end optimization process that pretrains brain structure.
  • There is discussion of local vs global learning rules (predictive coding vs backprop), and whether the brain approximates gradient descent.

Can there be a real “theory of deep learning”?

  • Optimists draw analogies to statistical mechanics, information geometry, and information‑theoretic views (implicit regularization, compression, scaling laws).
  • Skeptics doubt we’ll get physics‑like theories because behavior depends massively on messy data and huge models; they question whether concentration‑of‑measure style simplifications apply.
  • Some raise computability concerns (Rice’s theorem, Turing completeness), but others argue typical feed‑forward nets are not Turing complete, so classical impossibility results may not apply.

Interpretability, hallucination, and failure prediction

  • Several emphasize that theory matters most for predicting failure modes, confidence, and hallucination.
  • OOD detection is seen as conceptually shaky; alternative approaches based on model misspecification are being explored but are currently expensive and niche.

Models vs other ML approaches

  • Neural nets dominate unstructured data (images, audio, text), where their inductive biases match data structure.
  • Tree-based methods (often with boosting) remain superior on tabular data due to more suitable inductive biases there.

SDL Now Supports DOS

Why SDL for DOS?

  • Many see it as fun and “because we can,” fitting the spirit of hobbyist hacking rather than “serious” use.
  • Others highlight that DOS (and FreeDOS) is still used in industrial control and legacy systems where “if it ain’t broke, don’t fix it” dominates.
  • Some argue older systems are conceptually simpler and more fully understandable by one person, which is attractive for learning and tinkering.
  • A project goal mentioned: getting modern engines (e.g., Diablo via DevilutionX, OHRRPGCE) running “on anything,” including DOS.

Technical/Platform Details

  • Implementation uses DJGPP and DPMI, so it’s 32‑bit protected mode, not “old-school” segmented real-mode DOS.
  • Requirements claimed: i386+ with VGA and 4MB RAM, comparable to Doom-era hardware; a K6-2 test machine reportedly runs Quake via SDL at ~45 fps in 640×480.
  • Input support includes gameport joysticks via BIOS with auto-calibration, contrasted with painful manual joystick calibration in classic DOS games.
  • Discussion notes that DOS as a target combines with browser DOSBox for easy, portable deployment of mid‑90s‑era games.

Bare-Metal / Pre‑OS Game Ideas

  • Multiple comments imagine SDL-style games running in pre‑OS environments: BIOS, UEFI, “SDL for bare metal.”
  • UEFI is compared (contentiously) to a “modern DOS”: simple shell, crude drivers, program loader model.
  • Limitations raised: lack of standardized sound in UEFI, no vsync indication in graphics protocol, messy Bluetooth/USB audio; several argue it’s easier to just boot a minimal Linux and run the game as PID 1.
  • Historical parallels: Amiga and PC “booter” games that boot directly into a game without a general-purpose OS.

Ecosystem, Maintenance, and Obscure Targets

  • Some are surprised upstream accepted DOS support, expecting maintenance-cost objections.
  • Others note that obscure ports often survive thanks to one dedicated maintainer and that SDL already spans many platforms.

FreeDOS and Policy Oddities

  • In some countries, laws requiring a bundled OS lead vendors to ship FreeDOS as the cheapest option.
  • Vendors sometimes layer FreeDOS inside Linux/QEMU when hardware isn’t DOS‑compatible.

Google plans to invest up to $40B in Anthropic

Deal Structure and “Circular” Financing

  • Many see Google’s up-to-$40B Anthropic investment as largely vendor financing: cash that will return to Google via TPU and cloud spend.
  • Some frame it as “circular” or accounting gimmickry that props up valuations; others call it standard large-scale vendor finance with strategic upside.
  • Several note past dot-com–era vendor financing blowups as a warning, though others argue $40B is only a fraction of Google’s quarterly profits and thus manageable risk.

Strategic Motives: Hedge, Hardware, and Competition

  • Common view: Google is buying a major compute customer and hedging against losing the frontier-model race, rather than purely backing a direct Gemini competitor.
  • Owning more of Anthropic gives Google influence over a key TPU customer, strengthens its non‑Nvidia hardware story, and hedges against OpenAI/Microsoft.
  • Some argue this diversifies Google’s AI exposure; others worry cross‑investment among giants is anti‑competitive but note regulators haven’t acted meaningfully.

Valuation, Bubble Risk, and Market Dynamics

  • Debate over Anthropic’s ~$350B round valuation vs reported $1T+ secondary pricing; consensus that large strategic investors get better terms than small secondary buyers.
  • Many see AI as a major bubble fueled by circular deals, excess capital, and FOMO, comparing it to dot‑com and pre‑2008 credit.
  • Others justify valuations by pointing to reported run‑rate growth and expectations of AI capturing a share of global wage and software spend.

Claude, Gemini, OpenAI, and Product Quality

  • Numerous comments say internal teams at big tech firms prefer Claude over Gemini for coding; some report heavy Claude usage inside those orgs.
  • Users describe recent Anthropic capacity constraints and perceived quality drops, blaming overloaded infrastructure and cheap consumer plans; some say quality has recovered.
  • There’s disagreement on how real reported revenue and usage figures are, with some directly calling them inflated or marketing-driven.

AI Productivity Gains vs. Slop and Risk

  • Many developers report 2–10x productivity boosts using Claude/Codex/Copilot, especially for internal tools, debugging, and glue code, and consider $100–200/month subscriptions cheap.
  • Others counter that this yields a flood of fragile “vibe‑coded” internal tools, more complexity, and unclear impact on actual business outcomes.
  • Broader concerns include rising layoffs, enshittification of products, concentration of power in hyperscalers, and potential systemic risk if AI economics disappoint.

I cancelled Claude: Token issues, declining quality, and poor support

Token limits, pricing & usage patterns

  • Many Pro ($20) users report Claude Code burning through 5‑hour session limits in minutes, sometimes on a single prompt, especially when agents spin up multiple background tasks or read large repos.
  • Some Max users (5x/10x/20x) say they rarely hit limits even with heavy daily use; others on the same tiers find metering opaque and inconsistent.
  • People complain about silent behavior changes (effort defaults, cache TTLs) that change token usage without notice.
  • Several feel Pro’s Claude Code access is “technically there but unusable,” pushing them toward pricier Max or other vendors.

Perceived quality regression & model behavior

  • Many say Opus 4.5–early 4.6 was a “peak”; later 4.6/4.7 feel more forgetful, lazier, more prone to shortcuts, over‑editing, and subtle bugs.
  • Anthropic’s own postmortems (thinking effort default change, cache bug, verbosity change) are cited as partial explanations; some think broader degradation and frequent silent tuning continue.
  • Others report no noticeable decline and find current Opus 4.7 on xhigh/max effort excellent, especially in Claude Code.
  • Some suspect “adaptive reasoning” or routing to cheaper back‑end models; others call this paranoia or misinterpretation.

Workflow differences: copilot vs autopilot

  • A major split:
    • Copilot users give small, well‑scoped tasks, prune context, and review everything; they rarely hit limits and are happy with quality.
    • Autopilot/“vibe coding” users let agents roam large codebases for hours; they see token explosions, tangents, duplicative code, fragile fixes, and lose trust.
  • Several argue that reviewing AI‑generated code is harder than writing it, making net productivity negative for serious systems; others say agentic workflows produce months of work in days if you manage them carefully.

Alternatives & local models

  • Codex (with recent GPT‑5.4/5.5) is frequently cited as a strong or superior coding alternative; some shift most work there.
  • Kimi 2.6, DeepSeek v4, GLM, Qwen3.6, Gemini, and others are mentioned as cheaper or “good enough,” often accessed via tools like OpenRouter, Opencode, Cursor, Pi.dev, Swival, and Crush.
  • Many are experimenting with local models (Qwen, Gemma, etc.) via LM Studio, OMLX, vLLM, llama.cpp; capability is improving but hardware and setup remain non‑trivial.

Business model, lock‑in & enshittification fears

  • Strong concern that all major LLM vendors are subsidizing now and will later jack prices, cut limits, or degrade quality once users are dependent (“vcware,” “enshittification,” “crack dealer” analogies).
  • Debate over closed vs open models: some insist open weights lag far behind SOTA; others say the gap is <10–20% and shrinking, with cost and sovereignty advantages.

Support & reliability

  • Multiple reports of poor or non‑existent human support: no refunds for failed generations or token mischarges, slow/absent responses, AI‑only frontline.
  • Outages, latency spikes, and opaque failures (e.g., long jobs ending in token‑limit errors) erode trust, especially for people building workflows or businesses around Claude Code.

Refuse to let your doctor record you

Perceived Benefits of Recording & AI Scribes

  • Many see “ambient” scribes as a genuine efficiency gain: less time charting, more visit capacity, and potentially shorter wait times for appointments.
  • Several reports from clinical settings claim: higher patient satisfaction (patients feel more “seen”), higher provider satisfaction, more standardized and detailed notes, and fewer insurance claim denials.
  • Some patients like recordings or detailed notes because they forget instructions, want precise technical language, or want better continuity of care across providers.
  • Others note this is not fundamentally different from long‑standing practices: human scribes, dictation with transcriptionists, and heavy electronic charting.

Privacy, Security, and Data Use Concerns

  • Strong worry about adding yet another third‑party system holding sensitive data; more vendors = larger attack surface and eventual breaches.
  • Disagreement over how protective HIPAA and similar regulations really are: some see them as strict and enforced; others as vague, toothless, or easily bypassed via SaaS middlemen.
  • Cloud-based tools using big providers’ AI raise distrust; several argue for strictly on‑prem or local-only solutions.

Accuracy, Errors, and Clinical Risk

  • Multiple anecdotes of serious misdocumentation: jokes about “coke” becoming cocaine use; side effects misrecorded as allergies; family comments turning into “alcoholism runs in the family.”
  • Studies cited in-thread show nontrivial AI omission and hallucination rates; concern these can be clinically significant and hard to audit if audio/transcripts are not retained.
  • Others counter that manual charting is also error-prone; core issue is provider review and accountability, not the tool itself.

Consent, Power, and Patient Autonomy

  • Some argue refusing recording will, over time, get patients labeled “difficult” and may degrade access or quality.
  • Fear that “optional” will drift into de facto mandatory, with social or financial penalties for opting out.
  • Frustration that providers demand patients accept lengthy institutional terms, but push back when patients assert their own limits (e.g., refusing recording or wanting their own copy).

Systemic Healthcare Issues Behind the Debate

  • Thread frequently shifts to structural problems: doctor shortages, training bottlenecks, insurance incentives, billing-driven documentation, and US cost structure.
  • Skeptics argue AI efficiency gains will be captured as more throughput and profit, not as more time per patient or lower costs.
  • Some see detailed records as a long‑term liability: data can be used against patients by insurers, licensing bodies, or courts, especially for mental health or substance‑related issues.

I'm done making desktop applications (2009)

Age and Context of the Post

  • Many point out it’s from 2009; tooling, distribution channels, and user norms are very different now.
  • Some argue its conclusions were reasonable then (thick clients, Swing, Flash/Java era), less so in 2026.
  • Others jokingly propose the inverse title for today: they’re “done with web apps” and rediscovering desktop / on-prem.

Desktop vs Web vs Mobile: Monetization and “Metrics”

  • Original thesis: web apps monetize and convert better than desktop apps.
  • Several commenters dispute how big the desktop “funnel” penalty really is; with good installers and flows it can be close.
  • Others say mobile now has the strongest B2C monetization, largely because app stores have payment info on file.
  • Counterpoint: mobile spend per user is often small; ad-supported models require huge scale.

Open Source, Hobby Projects, and Motivation

  • Many note the article assumes “goal = make money.”
  • For open source or hobby work, concerns like conversion, piracy, Adwords, analytics, and A/B testing often don’t matter.
  • Some push back: even non-commercial projects that want impact still need to care about onboarding, usability, and metrics.

Browsers, Electron, and “Universal App Engine”

  • Strong criticism of Electron as “worst of both worlds”; others note employers/governments still pay for Electron-based tools.
  • Debate over whether the browser effectively is the universal app engine versus a bloated, inefficient, historically-accidental one.
  • Some emphasize webapps’ huge distribution advantage; others stress native apps’ efficiency, offline use, and deeper system access.
  • Several lament GUI toolkit pain (GTK, early SwiftUI) and concede Electron/web often “just work” despite their flaws.

UX, Stability, and User Control

  • Desktop proponents value stable interfaces, predictable upgrades, offline capability, and local data/control.
  • Web critics complain about constantly changing UIs, onboarding popups, and dependence on remote servers.
  • Some highlight that self-hosted webapps on a LAN blur the “desktop vs web” distinction.

Infrastructure and Deployment

  • Web/SaaS is praised for centralized updates and simplified support.
  • Others note the cost: servers, DNS, DDoS risk, cloud complexity, surveillance potential.
  • A recurring theme: for many internal tools, simple desktop apps or on-prem webapps would have sufficed.

Side Tangents: Startups, Piracy, and Ethics

  • Discussion touches on indie SaaS eras, low-hanging-fruit niches being mostly gone, and the danger of building complex products without validating demand.
  • Debate over software piracy, search engines surfacing cracks, and whether “don’t be evil” ever genuinely guided large tech firms.

Norway set to become latest country to ban social media for under 16s

Evidence of Harm vs. Moral Panic

  • Some commenters say there are “plenty of studies” tying social media and smartphones to youth mental‑health problems and support bans as precautionary, likening them to cigarette restrictions.
  • Others point to research shared previously on HN that found no clear negative correlations, arguing the debate has shifted into a moral panic where the harmfulness is assumed rather than demonstrated.
  • Several note that harms are not uniquely about children; adults are also heavily affected.

Protection of Children vs. Surveillance / Control

  • Supporters emphasize giving kids a chance to “be kids,” reducing bullying, body‑image issues, grooming, doomscrolling and attention problems during formative years.
  • Critics see the child‑protection framing as a pretext for mandatory ID, OS‑ or hardware‑level age attestation, erosion of anonymity, and broader control of online discourse.
  • Some argue this also serves US soft power and corporate interests, by entrenching incumbents and enabling more identity‑linked tracking.

Enforcement, Circumvention, and Network Effects

  • Many expect kids to bypass bans via VPNs, fake birthdays, or borrowed devices, but others respond that imperfect deterrents (like alcohol or tobacco age limits) still reduce harm.
  • Bans may weaken network effects: if most peers are not on mainstream platforms, it’s easier for individual kids and parents to say no.
  • Concerns: bans may simply push teens to less visible channels (Discord, 4chan, underground “internets”).

Parenting, Collective Action, and Inequality

  • One camp frames this as primarily a parenting/education issue: use parental controls, set norms, teach critical thinking.
  • Another argues individual parents can’t realistically resist when “everyone else’s kid” is on social media; legal limits help solve a collective‑action and peer‑pressure problem.
  • Working‑class or time‑poor parents are seen as particularly constrained.

What’s Actually Harmful? Definition & Scope

  • Debate over what counts as “social media”: algorithmic, ad‑driven infinite feeds vs. chronological forums, chat, or text‑only sites like HN.
  • Many see the real problem as engagement‑optimized “addiction feeds” and opaque recommendation algorithms, not social interaction per se.

Age Verification and Alternatives

  • Strong worries about third‑party age‑verification vendors, data breaches, and “normalized credential harvesting.”
  • Alternatives floated:
    • OS‑level parental controls that only pass coarse age brackets.
    • Cryptographic proofs of age without identity.
    • Regulating algorithms (no personalized feeds, ads, or engagement‑based ranking), banning ad‑funded models, or mandating safer defaults for all ages.

Sabotaging projects by overthinking, scope creep, and structural diffing

Article format and focus

  • Several commenters note the piece reads more like a personal newsletter than a focused blog post, which explains its multiple topics and meandering structure.
  • Some think it should have been split into two: one on overthinking/scope creep, another on structural diffing.
  • Many comments focus only on over-engineering, even though a large part of the piece was about diff tools.

Scope creep, overthinking, and finishing projects

  • Common pattern: start with a simple goal → do lots of research → keep expanding scope → never ship.
  • Counter-strategy: lock scope early, ignore most “better ideas”, ship a minimal v1, then iterate.
  • Others warn against swinging too far: thoughtful upfront modeling and domain knowledge can prevent costly redesigns.
  • Scope creep can sometimes reveal the “real” problem being solved, so not all expansion is bad.
  • Deadlines (e.g., game jams) are seen as very effective constraints against endless expansion.
  • Advice for side projects: clarify “why” (learning, personal itch, business) and match the level of research/engineering to that goal.

PhD and academic research parallels

  • Many PhD experiences mirror the article: broad literature review leads to scope bloat, loss of excitement, and a brutal final 20–30% push.
  • One line of advice: treat finishing and narrowing scope as the core skill; keep the work small, replicable, and just novel enough.
  • Others argue that if narrowing makes the work uninteresting to the field, that is a serious problem.
  • Disagreement on literature strategy: some advocate exhaustive early review to avoid duplicating work; others suggest reading only a few key papers first, then deep review later to protect novelty.
  • There is concern about misaligned incentives: pressure for novelty discourages replications, contributes to the replication crisis, and makes “accidental duplication” career-threatening.

Perfectionism, “better,” and LLMs

  • Multiple comments frame overthinking as perfectionism: refusing to accept “good enough” leads to paralysis rather than excellence.
  • Incremental “better than before” is recommended over chasing an ideal design.
  • LLMs are seen as amplifying scope creep: faster coding encourages adding features and complexity, so gains in speed get canceled by expanding ambitions.

Structural diffing and tools

  • Some discussion notes that structural diffing was a major topic in the article but under-discussed in comments.
  • Specific tools mentioned include RefactoringMiner (for code-aware diffs) and Difftastic (syntax-aware, even for plaintext treated as a language).

The Rich and Powerful Want to Live Forever. What If They Could?

Class power, revolution, and immortal elites

  • Many fear that life-extension available only to the rich would harden class divisions and provoke unrest or violent revolution.
  • Some argue historical revolutions simply replace one exploitative elite with another, so revolution is no real solution.
  • Others stress modern gains (safety nets, labor protections, democracy) came from long struggles against entrenched elites, implying immortality would impede further progress.

Aging, death, and political power

  • Several see old age and death as socially beneficial: they force turnover of power and wealth, preventing gerontocracies.
  • Others counter that deciding who has “lived long enough” is dangerous; better tools are term limits, mandatory retirement ages, or cognitive tests for office.
  • There’s debate over whether older leaders bring wisdom or are simply out of touch in a fast-changing, tech-driven world.

Value of mortality vs desire for immortality

  • Some find the idea of living forever inherently bad: it could sap urgency, lead to boredom, and distort human relationships and ecosystems.
  • Others strongly want biological immortality and see death as undesirable; they’d delay it as long as possible, regardless of what happens afterward.
  • Several comments point out that no one actually knows what death feels like; analogies to pre-birth nonexistence are framed as plausible but still conjectural.

Ethics of sacrificing others for longevity

  • Hypothetical scenarios about trading others’ lives for lifespan extensions spark sharp disagreement.
  • A minority suggests many people would accept killing “unsaveable” or hated elites to extend their own lives; others call this straightforwardly evil and reject it.
  • Animal slaughter is raised as evidence that moral concern is highly contingent and culturally shaped.

Inequality, access, and tech critique

  • Some expect longevity tech to start as a privilege of the ultra-wealthy, reinforcing a “techno-feudal” order.
  • Others argue most technologies get cheaper over time and that making longevity widely accessible is the only just path.
  • There is criticism that today’s ultra-rich are less technically competent than past industrial titans, yet would wield immense extended power.

Fiction, systems, and feasibility

  • Multiple science fiction works about practical immortality are cited as thought experiments about stratified, semi-immortal societies.
  • One systems-oriented view holds that “forever” is incompatible with complex living systems without runaway maintenance costs, implying true immortality may be structurally unstable.
  • Some dismiss the whole discussion as premature, noting current science can’t reliably get humans even to 130 years.