Stories - Page 542 | HN Distilled

2025-02-10

SQL pipe syntax available in public preview in BigQuery

Origins of pipe syntax

Several comments trace pipe-style composition back to Unix pipelines in the 1970s.
The specific |> function-pipe operator is linked to ML-family languages (Isabelle/ML, F#, OCaml) from the 1990s, later popularized in F# and then picked up by Elixir.
People distinguish Unix “bytes in, bytes out” pipes from higher‑level, function-chaining pipes in functional languages.

Comparisons to other pipe / query languages

Kusto Query Language (KQL) is repeatedly cited as a strong precedent; some call it “the best query language,” others see it as only an incremental improvement over SQL.
PRQL and Malloy are mentioned as cleaner, pipe-oriented languages that compile to SQL; PRQL’s function system and extensibility are highlighted, as is a DuckDB extension.
LogsQL, ClickHouse’s KQL mode, and LookML/Malloy are given as further examples of pipe-based or transpiled query syntaxes.
Some note that InfluxDB’s Flux (also pipe-based) was dropped due to complexity and low adoption.

Perceived benefits of BigQuery’s pipe SQL

Many are enthusiastic: they find the stepwise, linear workflow more natural for data exploration and cleaning, especially for iterative filtering, aggregation, JSON/array handling, and DBT macros.
Advantages cited:
- Unifying WHERE/HAVING/QUALIFY.
- Reducing nested CTEs and ugly subqueries; easy to comment out or reorder stages.
- Closer to data-frame/tidyverse / KQL mental models and easier integration with host languages.
- Table-valued functions plus pipes allow reusable higher-order transformations.

Concerns and drawbacks

Some worry this further fragments SQL dialects and increases complexity; they’d prefer higher-level languages that transpile to SQL.
Others argue transpilers/ORMs have leaky abstractions and poor optimization compared to engine-native syntax.
A few find pipeline style harder to read than modular CTEs because it increases “state in your head,” especially for long chains; best practice suggestions include ending with an explicit SELECT to clarify outputs.
There is debate over whether this is essentially “SELECT * in disguise” and whether overuse will harm clarity.
Some simply prefer traditional SQL aesthetics and don’t feel a need for pipes.

Standardization and adoption

Several commenters hope this syntax spreads to Postgres, MySQL, SQLite, and note early support in Databricks and experiments in SQLite and DuckDB.
Others argue the real win would be a standardized, engine-aware pipelined query language used across databases, rather than many slightly different SQL extensions.

View on HN ↗ Original Article ↗

2025-02-10

Nvidia Security Team: “What if we just stopped using C?” (2022)

Scope and Reality of “Stopping C” at Nvidia

Several commenters note the title is misleading: Nvidia did not drop C wholesale; SPARK/Ada is used in specific, security‑critical components.
Case study material (linked in thread) indicates SPARK is used mainly for:
- GPU firmware image authentication and integrity checks
- BootROM and secure monitor firmware
- Parts of an isolation kernel for an embedded OS on a RISC‑V root‑of‑trust core
Overall usage is described as “significant in important places” but a tiny fraction of Nvidia’s total codebase; most drivers and CUDA stack remain C/C++.

Marketing vs Engineering Substance

Some see the piece as AdaCore marketing, arguing external consultants can recommend radical rewrites without bearing migration cost.
Others counter that multiple Nvidia security engineers, training investment, and multi‑year adoption suggest real internal buy‑in, not just PR.

Why SPARK/Ada for Firmware

SPARK offers:
- Strong typing, range types, and rich numeric/packet modeling
- Formal proofs of absence of run‑time errors and selected functional properties
This is seen as a good fit for small, stable, security‑critical firmware and isolation kernels, less for large, fast‑moving codebases like drivers or CUDA libraries.

Syntax, Approachability, and Interoperability

Debate over Ada’s “unapproachable” syntax:
- Some argue syntax is a red herring compared to safety/contracts; others claim a curly‑brace dialect would greatly boost adoption.
Interop concerns:
- Ada interoperates well with C, and to a degree C++, but heavy modern C++ (STL, unique_ptr, templates) complicates clean boundaries.
- Serialization/deserialization at language boundaries can add overhead and complexity; this is framed as a general multi‑language problem, not Ada‑specific.

Formal Verification vs Testing and C

Some commenters applaud emphasizing provability over testing for high‑integrity code; others warn verification only proves conformance to a (possibly flawed) spec and doesn’t cover timing side channels, microarchitectural flaws, etc.
Comparisons raised:
- SPARK vs Rust: SPARK can express and prove more properties (e.g., range types, functional proofs), while Rust primarily targets memory safety.
- Verified C paths (CompCert, Frama‑C, proprietary tools) are mentioned; several people who tried both say SPARK’s type system makes proofs easier than for pointer‑heavy C.
Skeptics doubt any language will truly “replace C” for OS/low‑level work; they see this more as adding a verified island inside a C sea.

AI, Proof Assistants, and the Future

Some argue the real future path to pervasive verification is AI‑assisted proof and code generation, not niche languages.
Others are cautious:
- Current LLMs struggle with non‑mainstream domains (Ada/SPARK, advanced math, borrow checkers) and often hallucinate APIs or incorrect arguments.
- AI‑generated code in non‑verified languages may increase subtle concurrency and logic bugs if developers over‑trust completions.
A promising niche proposed: AI as a helper for generating verification annotations and boilerplate in already‑formal ecosystems (SPARK, Coq‑based systems), where results are still checked by deterministic tools.

GPU Security and Threat Model

Some initially question why Nvidia needs strong security: “they just make graphics cards.”
Counterpoints:
- Modern GPUs carry RISC‑V roots of trust akin to T2/ME/PSP, enforce secure boot chains, isolation, and feature segmentation.
- GPUs are central to national‑scale AI infrastructure and used in safety‑critical domains (automotive, drones); compromise of firmware or RoT is high‑impact.
Discussion touches on GPU rootkits and DMA capabilities; concerns about “kludgey” GPU architectures and mailbox/VRAM transfer bottlenecks surface.

SPARK Tooling, Licensing, and Activity

Clarifications:
- SPARK and GNATprove are open source and freely available; commercial “SPARK Pro” mainly adds support, newer releases, and “wavefronts” (details not fully explained in thread).
- The project’s commit history is cited as evidence it’s actively developed, countering claims of being “largely untouched.”
Alire is highlighted as the de‑facto package manager/distribution channel for the free toolchain.

Language Design and Safety Features

Ada/SPARK features praised:
- Range types, derived types, subtype predicates, and bounds‑checked arrays/strings that prevent many classes of errors by construction.
- Ease of modeling bit‑packed registers and network/packet formats compared to many mainstream languages.
Some view these strong types as “creepy” or alien compared to plain integers; fans argue they eliminate whole bug categories and are closer to what safety‑critical domains need.

Alternatives and Broader System Ideas

Suggestions include:
- Using verifiable C subsets and verified compilers instead of switching languages.
- Avoiding opaque binary blobs entirely by distributing intermediate representations and compiling on‑device; critics label this unrealistic due to performance, IP, and ecosystem hurdles.
There is recurring tension between “ideal” verified stacks and messy real‑world constraints (existing C++ ecosystems, proprietary drivers, hardware quirks).

View on HN ↗ Original Article ↗

2025-02-10

Rust kernel policy

Governance and Policy Clarity

Several commenters feel the Rust-for-Linux effort lacks a clear, enforced policy: who decides what, what maintainers must accept, and how cross-language interfaces should work.
Some argue Linux’s distributed governance (subsystem maintainers with a lot of autonomy) makes it hard to impose a coherent Rust strategy without stronger direction from the top.
Others counter that Linus’s approach—allow Rust, but let subsystem maintainers decide—is appropriate given project size and complexity.

Linus’s Role and Maintainer Conflicts

Strong criticism that the current “political mess” is largely due to Linus approving Rust without clearly defining what that means in practice.
One side wants him to either explicitly back Rust (and overrule blocking maintainers) or clearly say “no Rust” to avoid wasting effort.
Others reply that he must trust maintainers, can’t centrally micromanage every subsystem, and that some behind-the-scenes coordination is already happening.

Technical and Toolchain Issues

Concerns about mixing GCC (for C) and LLVM/Clang (for Rust and bindgen) in kernel builds; some kernel developers historically oppose mixed toolchains.
Rust’s use of unstable language features is seen as a risk; defenders note that:
- Kernel has long depended on non-standard GCC extensions.
- Rust and kernel teams coordinate to stabilize what the kernel needs.
Questions around static linking of libgcc and long-term compatibility of Rust-generated code.

Rust vs. C: Safety, Complexity, and Culture

Pro-Rust arguments:
- Memory safety is increasingly mandated (governments, industry); Linux must compete.
- Rust is easier to write correctly than C; C is “simple to learn, hard to use safely.”
- Rust is particularly valuable for drivers and DMA-safe abstractions.
Skeptical views:
- Rust feels complex/“PhD-level” to some; fear existing C maintainers won’t adopt it.
- Cross-language codebases and bindings are called “cancerous” or maintenance nightmares.
- One commenter argues Linux’s unstable APIs, monolithic design, and GCC focus make deep Rust integration fundamentally mismatched; a full Rust OS (e.g., Redox) is seen as cleaner.

Project Identity and Support

Confusion over who “we” is in the Rust-kernel policy; some want clearer attribution and authority.
Clarified in-thread: Rust-for-Linux is led by kernel folks, not the Rust Foundation, but the Rust project treats “R4L on stable Rust” as a flagship goal and actively supports needed language work.
There is debate over naming individuals publicly due to concerns about harassment and misplaced responsibility.

Long-Term Outlook

Some predict old-guard C maintainers may eventually leave as Rust use grows and new generations of developers arrive.
Others think Rust will remain limited to drivers and select subsystems, never supplanting C wholesale.
Overall, commenters see high technical promise but unresolved governance, social, and tooling tensions.

View on HN ↗ Original Article ↗

2025-02-10

The state of Rust trying to catch up with Ada [video]

Why Ada Didn’t “Win” Broad Adoption

Historical timing: universities were teaching Fortran, Simula, Pascal, etc. long before Ada; if Ada was your “first language” that likely means a late-starting CS program.
Unix shipped with C and its own source; C was easy to bootstrap and port, Ada compilers were heavy, slow, and often expensive extras on Unix.
Commercial compilers: high prices, opaque licensing, and hardware demands limited hobbyist and startup use.
Ecosystem bias: Ada positioned as “mission critical / aerospace / defense,” not for web, games, or general apps, so it never built broad mindshare.
Chicken‑and‑egg: mainly “specialists” use it, so tooling and UX for newcomers lagged, which in turn discouraged wider adoption.
ACATS public conformance suite is both a strength and barrier: a new compiler starts with thousands of failing tests and no customers until it passes.

Safety Features, Subtypes, and Pattern/Liquid Types

Ada has long had range types and subtypes (and later dynamic predicates), allowing types like “angles 0..360” or “UUID_String” with embedded format checks.
These constraints can be enforced at runtime and, with SPARK/verification tools, often proven at compile time.
Rust’s “pattern types” / liquid types proposal aims for similar refinement-style types, expressed as patterns over existing types.
Current Rust niches (e.g., NonZeroI8, Option<&T> occupying same size as T) rely on compiler magic; pattern types would let users define such optimized, constrained types themselves (e.g., “BalancedI8” excluding INT_MIN).
Debate over utility: some see runtime-checked subranges as crucial for invariants; others find them historically underused or awkward without good verification tooling.

Rust vs Ada: Domains, Tooling, and Safety-Critical Use

Many see Rust and Ada as overlapping in “C/C++ replacement” and safety‑critical domains; Ada additionally has deep roots in hard real‑time control loops where post‑init allocation is forbidden.
Others argue Ada’s memory management model makes “browser‑like” workloads awkward, while Rust is built for heavy dynamic allocation with graceful failure.
Safety‑critical industries require certified/qualified toolchains; Ada vendors already offer such compilers and proof tools, largely proprietary.
Rust is building similar infrastructure (Ferrocene, safety‑critical consortium, MISRA‑aligned guidelines, model checkers like Loom).
Integer overflow semantics are contrasted: Ada treats overflow as a constraint violation; Rust uses debug‑panic vs release‑wrap plus explicit checked/saturating APIs, which some consider too opt‑in.

Syntax, Ergonomics, Adoption, and Community

Strong disagreement over readability: some find Pascal/Ada “BEGIN/END” clearer than C braces; others say that syntax “hurts their eyes” and prefer C‑style.
Rust deliberately adopted C‑like surface syntax as “weirdness budget” management to ease onboarding; internally it borrows heavily from ML/FP ideas.
Several commenters argue Rust’s success owes more to marketing, large‑company backing, platform roles (like Swift/Kotlin), and ecosystem investment than to purely technical merit.
Rising interest in Rust and “memory‑safe C replacements” has renewed attention to Ada as prior art; some are annoyed by Rust evangelism that appears to rediscover decades‑old ideas without credit.
Rust’s learning curve is widely acknowledged as steep; advice includes treating it initially like a mostly immutable, value‑passing language and adding advanced features later.

View on HN ↗ Original Article ↗

2025-02-10

To buy a Tesla Model 3, only to end up in hell

Battery drain and Sentry mode

Many focus on the reported ~8% battery loss per day while parked. Several owners confirm similar losses with Sentry mode enabled, estimating ~200–300W continuous draw (car never sleeps).
Others report near‑zero vampire drain (1–2% over several days) when Sentry is off, suggesting that behavior is abnormal for a healthy car.
In this case the author insists Sentry is off; some speculate a software or hardware fault (e.g. reboot loop, short, stuck subsystem, broken cameras confusing the system), but it’s ultimately unclear.

Broken cameras, safety systems and design choices

Commenters highlight that with cameras down, the car loses wipers, ADAS, and other safety features, and Tesla support still told the owner to drive it—seen as unacceptable.
Several see this as emblematic of Tesla’s “all‑in on cameras/software” architecture without robust fallbacks (e.g. wipers tied to vision, radar removed, heavy reliance on a single compute stack).

Tesla reliability and service – highly mixed reports

Some owners report years of trouble‑free use, low running costs, and high driving enjoyment; others recount repeated service visits, long parts waits, squeaks, misaligned panels, and inconsistent quality.
Service experiences vary widely by location: some praise responsive mobile service or new centers; others describe unresponsive apps, months‑long waits, few approved body shops, and feeling used as “free QA”.
Statistical views conflict: references to Consumer Reports place Model 3 as relatively reliable and well‑liked, while German and Danish inspection data show high defect/fail rates, especially on brakes, suspension, and lighting.

Buying a Tesla where there is no local presence

Many criticize importing a Tesla into a country without official sales/service, noting the risk of long travel to service centers and complicated cross‑border disputes.
Others point out that within the EU consumer protections are strong and cross‑border, and suggest the real failure is Tesla’s handling, not the buyer’s choice.

Modern car software, UX, and over‑automation

Large subthreads broaden this to “modern cars are terrible”: intrusive warnings, ADAS nags, touchscreens replacing buttons, unreliable infotainment (Volvo Android, VW, Toyota, Mazda, BYD, etc.).
Some argue OTA updates and centralization enable fixes; others say they incentivize shipping half‑baked software and increase failure modes in safety‑critical systems.
Several express preference for older, simpler cars (90s–2000s Honda, Toyota, BMW, Clio, etc.) with physical controls and minimal electronics.

Alternatives and EV landscape

Korean EVs (Hyundai, Kia) are recommended by some as safer bets: good efficiency, “boring” but stable software, wide service networks. Others report serious issues (ICCU failures, bugs, harsh suspension) and mixed UX.
Competitors like Polestar, Renault, and Volvo are cited as having better ride/handling or comfort, but also their own software and reliability problems.

Musk, politics, and ethics

A large tangent debates the ethics of buying Tesla given Musk’s behavior, political influence, and associations; some see purchase as tacit support, others find this moral signaling tiresome.
This is seen by some as separate from the product’s technical merits; others argue it can’t be disentangled, especially when Tesla’s governance and priorities affect quality and safety.

Consumer protection and next steps

Multiple commenters urge the author to stop waiting on Tesla and invoke EU consumer law: formal written notice, ECC‑Net, or a lawyer’s letter to seek refund, replacement, or enforced repair.
The consensus is that, given severe defects from day one plus a months‑long delay, legal escalation is appropriate rather than continuing to rely on Tesla’s voluntary goodwill.

View on HN ↗ Original Article ↗

2025-02-10

Apple's Best Option: Decentralize iCloud

Alleged UK Order & Its Scope

Discussion centers on a reported secret UK Investigatory Powers Act order compelling Apple to provide blanket access to iCloud data, including Advanced Data Protection (ADP), potentially for “any user anywhere in the world.”
It’s noted that such orders must be kept secret, similar to US National Security Letters, so details and Apple’s actual response are inherently unclear.

Legal Conflicts and GDPR

Some argue Apple simply “can’t comply” for EU users because of GDPR and other countries’ privacy laws; others point out GDPR has explicit law-enforcement and third‑country exceptions where “adequate” protections exist.
Several commenters think if Apple did comply with a UK global-access mandate, the EU and other jurisdictions would eventually react, potentially making it impossible to serve all markets with a single global cloud design.

Apple’s Strategic Options

Options discussed:
- Flat refusal and risk fines / being forced out of the UK.
- Comply only for UK users by disabling ADP or adding a UK‑only backdoor.
- Quietly backdoor everything and rely on secrecy.
- Restructure: separate legal entities and infrastructures per region (UK, EU, US, etc.), similar to the China iCloud setup.
- Threaten to leave the UK and turn it into a public “we refuse to spy on you” campaign.
There’s disagreement whether the UK or Apple has more leverage; some think losing Apple would be politically disastrous for the UK, others think Apple won’t abandon a large services market.

Decentralization & Technical Ideas

Many like the blog’s idea of decentralizing iCloud (protocols like IMAP/CalDAV; user‑selectable or self‑hosted endpoints; Time Capsule–style hardware; “Apple Cloud Edge”).
Counterpoint: as long as Apple controls OS updates and signing keys, governments can legally compel it to ship surveillance code; decentralization doesn’t solve the core legal problem.
Others note most users would still choose the “official” iCloud due to bundles and convenience, so a UK‑mandated backdoor would still capture the majority.

Politics, Intelligence, and US Role

Strong political undertone: UK portrayed by some as authoritarian, surveillance‑obsessed, and using “think of the children” narratives; others push back on exaggerated “failed state” rhetoric.
Speculation that Five Eyes partners (especially the US) might quietly favor such an order as a way to access data globally.
A minority argue the opposite: that US political leadership could threaten trade tariffs or sanctions to protect US tech firms, forcing the UK to retreat. This is contested and explicitly framed as conjecture.

Impact on Users & Personal Responses

Users ask what this means concretely: could authorities silently clone full device backups, access photo libraries, passwords, messages? The consensus: if Apple complies, ADP can be weakened or bypassed without notice; verification would be hard.
Some propose abandoning cloud services entirely and reverting to local or self‑hosted backups; others highlight the practical difficulty of replacing iCloud‑style sync and backup for non‑experts.

Apple’s Incentives and Precedents

Multiple comments stress Apple’s enormous and growing services revenue (iCloud storage upsells are ubiquitous) and desire for tight ecosystem control; decentralization is seen as against its financial interests.
Apple’s past stance in the San Bernardino case is cited as evidence it can resist governments, but many doubt it will take the same risk now, especially given its willingness to accommodate China through a local iCloud operator.
One line of argument: complying with a UK backdoor sets a precedent that other governments will demand, potentially creating mutually incompatible legal requirements and making compliance “suicidal” for Apple’s global business.

View on HN ↗ Original Article ↗

2025-02-10

How does Ada's memory safety compare against Rust?

Use cases & adoption

Several commenters note Ada is niche outside defense/transport and some European public-sector work, while Rust has rapidly growing use across kernels, servers, tools, and embedded.
Linux choosing Rust over Ada is attributed to ecosystem size, momentum, and personal preferences of maintainers, not just technical merit.
Game dev: Rust and Odin are seen as more promising; Ada game examples exist but are rare.

Memory-safety models

Consensus: “out of the box” Rust’s safe subset provides stronger, more composable memory-safety guarantees than baseline Ada.
Ada provides bounds-checked arrays/strings and runtime null checks; but manual allocation/deallocation with Unchecked_Deallocation can still cause use-after-free and leaks.
Some argue Ada’s “let the OS reclaim leaks” attitude is inadequate in many domains.

Aliasing, parameter passing, and history

Long subthread on IRONMAN/STEELMAN and Ada’s in/out/in out intent-based parameters vs Rust’s explicit ownership/borrowing.
One side criticizes Rust for requiring mechanism-level detail (by value vs reference); others argue Rust’s ownership/borrowing is semantically important (aliasing, lifetimes, thread safety) and more general than Ada’s parameter modes.
Ada’s aliasing pitfalls with in out parameters and compiler-chosen by-copy/by-reference semantics are demonstrated; SPARK tools can detect some of these issues, GNAT often does not.

SPARK vs Rust

SPARK + Ada can prove absence of many memory errors and data races, sometimes exceeding Rust’s guarantees, but:
- SPARK historically covered a subset of Ada (now much larger).
- Formal proof is seen as expensive and difficult to scale to large, crate-style ecosystems.
Rust’s borrow checker is praised as a scalable, lightweight approximation that works for large codebases without full formal verification.

Tooling, ecosystem, and “human factor”

Rust’s success is widely attributed to:
- Cargo, crates.io, good error messages, strong library ecosystem (e.g., serde, tracing).
- A large, active community and modern ergonomics drawing in developers whose first “systems language” is Rust.
Ada/SPARK tooling and docs are viewed as weaker and more fragmented; licensing and “market segmentation” between free GNAT and paid SPARK are criticized.

Performance, GC, and “fast enough”

Rust is framed as the first mainstream language to combine strong memory safety with near-zero-overhead abstractions, targeting C/C++ niches.
Debate over “fast enough”: some argue many GC’d languages already suffice; others emphasize that zero-overhead and no GC pauses are essential in embedded, OS, and some high-performance domains.
Some worry Rust’s popularity normalizes GC-less programming again; others note Rust can use GC libraries when desired.

Type safety and threading

Ada’s type safety is challenged with examples showing aliasing-related unsoundness in “safe” code; others respond that newer standards and SPARK address much of this.
Rust’s model is praised for statically preventing data races; Ada’s tasks/protected objects provide structured concurrency but don’t globally guarantee race freedom without SPARK or discipline.

General sentiment

Broad agreement that the industry shift toward memory-safe systems languages (Rust, Ada/SPARK, others) is a major positive.
Rust is seen as currently unmatched in the combination of safety, performance, ecosystem, and usability, though SPARK/Ada remain strong where formal verification is mandated.

View on HN ↗ Original Article ↗

2025-02-10

Don't Be Frupid

Value of Conferences

Strong disagreement on big vendor conferences: many see them as overpriced “paid vacations” with talks available free on YouTube; main in‑person benefits are networking, recruiting, and perks, not unique learning.
Some argue conferences function as rewards, tax-advantaged perks, or mini-holidays, not serious training.
Others stress value in:
- Focused, protected learning time away from day‑to‑day work.
- Specialized, industry‑specific user groups (e.g., regulated industries, vendor user conferences) where operational experience and roadmap details matter, sometimes “must have.”
- Local/regional, community-driven events (FOSDEM, USENIX, etc.) as higher-signal and cheaper than big vendor shows.
Concern that many attendees mainly job-hunt or socialize; some prefer 1:1 expert training instead.

Tools, SaaS, and “$15 Saves Hundreds of Hours”

Several find the article’s anecdotes (cheap tools saving hundreds of hours, a conference saving millions) exaggerated compared to the reality of SaaS sprawl and unused licenses.
Descriptions of dark SaaS patterns: seat blocks, forced “enterprise” upgrades for basic security/SSO, hard-to-remove accounts, viral per-seat growth leading to huge recurring bills.
Counterexamples show that small spends (e.g., a design tool, a VPS) can indeed unlock large productivity or cost savings.
Consensus: both “no-questions-asked spending” and blanket penny-pinching are harmful; the difficulty is knowing which tools pay off.

Cloud, Infrastructure, and Databases

Debate over “cutting cloud costs”: some advocate moving to cheaper VPS/colo/on‑prem; others say that’s naive for complex, regulated, or highly scalable systems relying heavily on managed services.
Example of cost-optimization gone right: consolidating many tiny services into one “expensive” instance that was actually dramatically cheaper overall.
Example of cost-cutting gone wrong: under-provisioned CI/build infra causing multi-day turnaround, flaky tests, and major productivity loss.
Disagreement on database consolidation: one side fears underpowered shared DBs; others note multi-DB, microservice-heavy setups often create race conditions, stale data, and more cloud spend.

Developer Hardware and Work Environment

Many support high-end laptops, fast CI, prod-like dev environments, and strong connectivity as obvious ROI for expensive engineers.
Others think “give them the fastest MacBook” is self-serving without quantitative justification; cheap-but-adequate machines plus remote build/SSH can suffice.
Measuring productivity effects (e.g., compile times vs throughput) is viewed as both important and very hard; risk of false precision and overconfidence in shaky models.

Business Travel, Everyday Frupidity, and Incentives

Business travel cited as classic “frupid” territory: rigid policies banning discounted business class, requiring receipts (favoring taxis over public transit), or mandating multi-hop flights to “save” cash while burning staff time and energy.
Story of removing in‑office coffee machines to save one salary, which led to long café queues and huge time losses for highly paid staff.
Other examples: homebrew NAS vs reliable storage, rolling your own internal tools instead of buying, forcing office work instead of WFH, and multi-week onboarding due to IT constraints.
Multiple comments tie frupidity to:
- Optimizing only easily measurable line items (hardware, cloud bills) while ignoring intangible costs (morale, flow, delay).
- Split budgets where the team that “saves” doesn’t bear the resulting productivity cost.
The term “frupid” is linked to internal Amazon culture around “frugality” and compared to concepts like “false economy,” “suboptimization,” and the “Vimes boots” theory.

View on HN ↗ Original Article ↗

2025-02-10

Trump says he has directed Treasury to stop minting new pennies

Scope and Motivation of Ending Penny Production

Many note the idea is old; other countries (Canada, Mexico, New Zealand, parts of the Eurozone, Japan) have already removed low‑value coins and used cash rounding.
Some see this as one of the few sensible policy moves, arguing nothing costs mere cents anymore and pennies mostly accumulate unused.
Others see timing and framing as political theater: a distraction during a major sporting event and part of a “flood the zone” strategy to normalize rule‑bending.

Legality and Constitutional Debate

One camp argues it’s unconstitutional because the Constitution gives Congress power “to coin Money,” and historically Congress has controlled denominations.
Others counter with U.S. Code Title 31 §5111: the Treasury Secretary must mint coins in “amounts the Secretary decides are necessary,” which could include deciding the necessary amount of pennies is zero.
There’s a distinction drawn between:
- Permanently eliminating a denomination (generally seen as needing Congress).
- Temporarily halting minting while leaving existing coins legal tender (argued to be within Treasury’s discretion).
Some worry that even if this particular move is arguably legal, the pattern of unilateral actions serves to acclimate the public to executive overreach.

Economic Effects: Rounding, Prices, and Inflation

Concern: with no 1‑cent coin (and no 2‑cent coin like the euro), the de facto minimum cash unit becomes 5 cents, potentially nudging prices and inflation upward.
Others note:
- Most transactions are digital; rounding usually applies only to cash (“cash rounding”).
- In practice, the total bill is rounded to the nearest 5 cents, often under published rules meant to average out; some retailers even always round down as a goodwill gesture.
- Examples from Canada and Europe suggest minor, mostly negligible impact if properly implemented.
There is debate over whether merchants would quietly reset sticker prices upward to exploit rounding; some think competition and consumer resistance limit that.

Costs, Denominations, and Seigniorage

It costs more than face value to mint both pennies and nickels, with nickels reportedly much worse on a per‑coin basis.
Some argue both pennies and nickels should end, leaving dimes/quarters (or even starting cash at higher values), noting other countries manage with 10c/20c as their smallest coins.
Others suggest instead reformulating the nickel to cheaper metals, but point out vending machines and coin validators would need updating.

Technical and Practical Issues

Point‑of‑sale systems in the U.S. may need minor updates to support rounding for cash; Canadian commenters say their registers haven’t required fundamental changes.
Sales tax complications are raised: states may need explicit rules to allow rounding without under‑ or over‑collecting tax.
Side discussions cover legal limits on melting coins, the role of coins as medium of exchange vs. their metal value, and speculative ideas like higher‑denomination bills or even deflation to “make the penny valuable again.”

View on HN ↗ Original Article ↗

2025-02-10

I Blog with Raw HTML

Debating what “raw HTML” means

Many argue the blog is not “raw HTML” because posts are written in Markdown and rendered via JavaScript in the browser; with JS disabled, users see unrendered Markdown and broken links/images.
Some say “raw HTML” should mean hand-authored HTML served as-is, without build steps, third‑party JS, or client-side transformations.
Others note that even this blog’s HTML is technically invalid (HTML4 doctype with HTML5 elements) and that browsers’ forgiving nature blurs what “HTML” practically means.
There’s a related debate about “static” sites: user-perspective (no code needed to view the page) vs developer-perspective (served as static files, even if heavy client-side JS makes it dynamic).

Critiques of the specific blog

Reliance on JS to turn Markdown into HTML contradicts the “raw HTML” framing and causes visible “markdown flash” on load.
People criticize the lack of minimal styling and reader-mode support, while noting that even “raw HTML” can be pleasant with a tiny amount of CSS.
Some see the “raw HTML” claim as an excuse to skip themes and design, or as a slightly performative “less tech than you” flex.

RSS and tooling

The original concern about missing RSS generates several suggestions:
- Manually maintain an RSS XML file alongside posts.
- Use a simple script or static-site-like tooling (pandoc, custom scripts) to emit both HTML and RSS.
- Use a GitHub Action that scrapes the site and produces a feed, then redirect /feed to it.
A few commenters mention writing RSS by hand as “raw XML,” or building small custom web servers that add RSS and sitemaps on top of raw HTML directories.

Static site generators, longevity, and hassle

Some defend static site generators (Hugo, etc.) as stable enough if you avoid needless upgrades.
Others report abandoned tools (e.g., Node-based generators) becoming painful due to old runtimes and dependencies, which motivates sticking to plain HTML files.
There’s disagreement over whether hand-editing HTML is simpler in the long run or an unnecessary chore compared to SSGs or Markdown-based workflows.

Alternatives, nostalgia, and humor

Alternatives raised: Gemini protocol, plain-text or Markdown-first blogs, org-mode, Emacs macros, SSI includes.
The thread contains a lot of playful one‑upmanship (blogging in machine code, SGML, stone tablets) and meta-critique of the trend of blogging about ultra-minimal stacks as if they were noteworthy feats.

View on HN ↗ Original Article ↗

2025-02-09

Three Observations

Megacorps, States, and Power over AI

Commenters note Altman contrasts “individual empowerment” vs authoritarian states, largely omitting the scenario of AI controlled by a few megacorps.
Several argue the line between state and corporation is already blurry; state–corporate fusion is seen as the realistic danger.
Historical analogies (e.g., chartered companies ruling territory) are used to show how corporate power can become de facto sovereign.

Economic Liberation, Inequality, and Land

Many doubt AGI will “liberate” the masses economically, arguing we already produce enough but distribution and rent extraction (especially land) block broad benefit.
Others counter that global GDP per capita is still too low for universal comfort even with perfect equality; productivity growth is still needed.
Debate centers on whether labor‑saving tech inherently fails workers or whether institutions (wages tied to hours, landlord capture, weak bargaining power) are the real problem.
Some use Georgist arguments: productivity mostly flows into higher land values and rents. Others question whether land prices must rise given sub‑replacement fertility and remote work.

Jobs, Displacement, and UBI

Many expect AI to first augment, then replace, large swathes of knowledge work, potentially faster than new roles appear.
Manual, embodied, and interpersonal work (plumbers, caregivers, hospitality, public safety) is widely seen as harder to automate in the medium term.
UBI is the dominant proposed response, but there’s deep skepticism about funding (multi‑trillion scale), inflation, and political will; extensive argument over sales vs wealth vs asset taxes.
Some foresee heavy AI taxation/regulation; others think elites will mainly use AI to entrench power, not to underwrite mass welfare.

Altman’s Three “Observations” and Hype

Point (1) “intelligence ≈ log(resources)” is heavily contested: critics say we’ve poured huge compute in since GPT‑4 with limited visible gains; supporters cite recent reasoning models and SWE‑bench jumps.
Point (2) “10× cost drop per year” is seen by many as cherry‑picking OpenAI’s own prices; others distinguish falling inference cost from still‑exploding training cost.
Point (3) “super‑exponential socioeconomic value” is widely called unfalsifiable marketing meant to justify “exponentially increasing investment” and sky‑high valuations.

Definitions and Reality of AGI

Altman’s AGI definition (“human‑level across many fields”) is viewed as so vague that one could almost claim we are there already.
Some note that older anchors like the Turing test have effectively been passed but did not yield “true” general intelligence.
Others argue focusing on brain‑scale simulation (C. elegans, synapse counts) misses the point; as with flight vs birds, artificial general problem‑solving need not mirror biology.

Capabilities and Limits of Today’s Models

Practitioners report models feel like overconfident junior devs: extremely useful for boilerplate, CRUD, and exploration, but unreliable for anything “slightly complicated” or niche.
There is disagreement over progress since GPT‑4: some see stagnation, others point to major CoT reasoning and coding gains, especially under tool‑using or “agentic” setups.
Benchmarks (ARC‑AGI, SWE‑bench) are cited both as evidence of rapid progress and as examples of over‑fitting and benchmark gaming.

Access, Empowerment, and Regulation

Optimists highlight open and local models plus rapidly dropping costs (and upcoming prosumer hardware) as evidence that individuals will have strong AI “at their fingertips.”
Pessimists expect lock‑down: closed models, B2B‑only APIs, and tightly controlled training data, with individuals getting at best rationed “compute budgets.”
AI is widely expected to “seep into everything,” but many fear a “smart TV” future: pervasive surveillance, dark patterns, and ad optimization rather than genuine empowerment.

Trust, Governance, and OpenAI

The blog post is broadly interpreted as crafted for investors and policymakers: defend huge capex, promise exponential upside, and downplay distributional harms.
OpenAI’s nonprofit origin, Microsoft AGI contract, and prior broken commitments are raised as reasons not to trust its assurances about “benefiting all of humanity.”
Several see growing existential and political risk from concentrated AGI, yet find the piece largely hand‑waves concrete mechanisms to prevent extreme inequality or abuse.

View on HN ↗ Original Article ↗

2025-02-09

AI Demos

Access, Region Locks & Cookies

Several users in the US see “Our site is not available in your region,” especially in Illinois and Texas; others work around it via VPN.
The site explicitly blocks Illinois and Texas, likely due to biometric/AI laws there; some note over‑blocking when ISPs geolocate to Chicago.
Users also encounter a heavy-handed Meta cookie prompt; some are unsure whether to accept just to see demos.

Legal & Biometric Law Discussion

Commenters link the exclusions to Illinois and Texas biometric statutes covering face/hand geometry and requiring consent and safeguards.
Some argue the laws are reasonable; others note ambiguities (e.g., is any face photo a “record”?).
There’s concern that Meta may store uploaded biometric data, but also recognition that “every AI company saves all the data” is common.

Seamless Translation: Impressive but Inconsistent

Several bilingual users call the voice translation “pretty incredible,” close to their real speech in another language.
Others report generic voices that sound nothing like them (sometimes wrong gender or age), or outright mistranslations that feel jarring.
Debate arises over whether we really want near‑perfect voice cloning (deepfakes) versus a deliberately imperfect voice.
People note it’s “good enough” for casual tasks (travel, directions) but not yet suitable for high‑stakes or artistic translation.

Segment Anything & Other Demos

Segment Anything 2 gets strong praise: video cutouts tracking objects across occlusion feel “incredible” and very useful for editing tools.
Some mention it’s already integrated into third‑party products and open-sourced via GitHub.
Other demos (animated drawings, Audiobox) are seen as fun but “half‑baked,” more like tech toys than products.

Meta’s AI Strategy & Business Angle

Many see the core motive as better ad targeting, content generation, and engagement—more personalized ads, auto‑generated creatives, and “AI slop.”
Others frame it as “commoditizing your complement”: open, cheap models weaken closed competitors while protecting Meta’s advertising moat and social graph.
There’s skepticism that users want AI‑generated feeds, but also acknowledgment that engagement metrics may still go up.

Ethics, Reputation & Employment

Strong disagreement over Meta as an employer: some call it top‑tier AI work, others say you need to “have no ethics” to join.
LLaMA’s open(-weights) releases are praised by some as a public good and derided by others as PR that doesn’t erase wider harms.

View on HN ↗ Original Article ↗

2025-02-09

Persistent packages on Steam Deck using Nix

Using Nix on the Steam Deck / Immutable Arch

The Deck’s immutable Arch root makes traditional AUR/pacman installs fragile; Nix is seen as a strong fit because it respects immutable roots, supports atomic upgrades, and has a large, up-to-date package set.
Common pattern: Flatpak for GUI apps, Nix for CLI/dev tools; others use overlayfs or re-run pacman scripts after each SteamOS update.
Users report success running Nix-installed tools and using Home Manager–created user systemd services alongside SteamOS, including in game mode, though the exact boundaries vs. full NixOS remain somewhat unclear.

Graphics / OpenGL and Desktop App Issues

Several people hit serious problems packaging GUI apps with Nix on non-NixOS systems: OpenGL/Vulkan and hardcoded X11 paths often break when the app is run outside the Nix environment.
nixGL and nix-gl-host can wrap executables or the compositor to inject correct driver paths, but wrapping everything is seen as tedious.
Broader complaint: decades of software assuming fixed /usr paths makes Nix-style per-package prefixes painful; Nix’s design is described as a workaround for badly written, non-relocatable software.

What “Nix” Is (and Comparisons to Guix/Spack)

Clarifications:
- Nix = package manager + build system; Nix language/expressions = config language; nixpkgs = package repo; NixOS = distro built/configured with Nix.
- Some distinguish three parts: language, interpreter, and build/sandbox engine.
Guix is praised for its Scheme/Guile-based language and service manager, but criticized for fewer/staler packages and strict FOSS-only policies; nonguix exists for nonfree bits, and patch workflow via mailing lists frustrates some.
Spack is mentioned as another option in HPC settings for mixing system and source-built dependencies.

Installers, Forks, and Governance (Determinate, Lix)

Determinate Systems’ Nix installer is widely praised: clear about changes, supports clean uninstallation, and was explicitly tested on the Steam Deck.
Some criticize the tight relationship between this company and upstream Nix (conflict-of-interest concerns, “Determinate Nix” enabling features not yet default upstream).
Lix is a fork of Nix (language + implementation) and also a community project; claims include better defaults, fewer footguns, faster parsing, and regular releases, plus a fork of the Determinate installer.
An “official” installer in the style of Determinate’s is reportedly in progress but constrained to upstream’s official feature set.

Nix for Development Environments

Positive experiences: per-project environments via Nix + direnv on macOS/Linux; exact tool versions pinned in Git; same configs reused in CI and production.
Use cases cited: consulting across many stacks; tooling like Node, kustomize, poetry, linters, language servers, and shell tools (grep/sed/awk) that fall outside language-specific package managers.
Some are skeptical, saying they’ve rarely needed this complexity; others respond that Nix is painful but often the only unified, reliable solution, especially compared to conda, which is described as brittle at scale.
Nix works well for C/C++, but Python+ML packaging via Nix is reported as still tricky for complex stacks.

Retro Gaming and Game Ports

Several users use Nix/NixOS to trivially build and bundle retro ports (Ocarina of Time, Majora’s Mask, SM64) and want these setups on Steam Deck; Nixpkgs contains most of these ports, with a few missing due to packaging difficulty.
Nix is seen as attractive for “retro rigs” because it can bundle emulators, drivers, and supporting tools declaratively.
Outside of Nix, EmuDeck and RetroDeck are discussed:
- EmuDeck is feature-rich but considered messy (absolute symlinks, sync issues, different Windows/Linux behavior).
- RetroDeck is a newer, more self-contained flatpak-based alternative.

Practical Concerns on the Deck

Some report Nix daemon/socket issues after suspend on the Deck; single-user (no-daemon) mode is considered safer there.
Storage usage concerns are raised: between Flatpak, Nix, and games, space can be tight; replies note that both Nix and Flatpak deduplicate shared libraries, but the overall impact remains a consideration.

Terminology, Arch, and Channels

There’s a side discussion clarifying that “nix” is historically a censorship/trademark dodge (“Unx”), not a literal glob, and that “Unix-like” is a clearer term.
Running Nix on Arch/Steam Deck is jokingly framed as “peak Linux”; some note Arch’s Nix package has been subtly broken in the past, and suggest using upstream installers or AUR variants instead.
One commenter questions why Nix stable channels do not expose a “latest” alias, forcing users to manually update their channel URLs; no definitive answer is given.

View on HN ↗ Original Article ↗

2025-02-09

Why blog if nobody reads it?

Blogging for self: memory, thinking, enjoyment

Many posters say they blog primarily for themselves: as a time capsule, public diary, or “web log” of what they were doing and thinking.
Technical blogs often function as personal documentation. People frequently Google something and land on their own post, or use posts as long-form gists.
Writing is described as “thinking”: it forces clarification, reveals gaps in understanding, and improves communication skills. Some only truly understand topics once they’ve written a post about them.
Several mention the intrinsic pleasure and therapeutic value of writing, independent of readership.

Audience, discovery, and traffic

Some dispute “if you write it, they won’t come”, noting that over years certain posts unexpectedly gain significant search traffic or go viral (including via HN).
Others emphasize that most content gets little attention; blogging is like other creator markets where the top fraction captures most views.
Social referrals are said to have collapsed; for some, almost all traffic now comes from HN or search. Algorithms on platforms like Substack can still bootstrap niche newsletters.
A recurring theme: it’s better to value a small, high-quality audience than chase large numbers.

Credibility, careers, and self-marketing

Blogging is framed as a low-cost way to demonstrate expertise and seriousness; posts can be sent to interviewers, clients, or colleagues and have helped some land jobs or consulting work.
Older pre-LLM posts and public Git histories are seen as harder to fake than AI-generated portfolios. There’s discussion of timestamping and archive.org as partial guards against backdating.
Others warn that writing “to build a brand” can shift focus from teaching the reader to marketing the author, reducing educational quality.

Public vs private writing

Debate: if blogs are mostly for the author, why publish instead of keeping local notes?
Arguments for publishing even with few readers: easier cross-device access; linkability; the mere possibility of an audience improves rigor and honesty; occasionally someone is helped or a good conversation starts.
Some say they would stop if no one ever read; others explicitly write “as if no one will” and treat readers as a bonus.

AI and blogs

Multiple people notice LLMs (ChatGPT, Perplexity, etc.) citing their blogs.
Some welcome this as amplifying knowledge; others dislike that their work helps enrich large AI companies and may be misattributed or distilled into low-quality summaries.

Indie web, tooling, and discovery

HN commenters strongly favor simple, JS-light blogs with proper semantics and RSS; one thread helps a blogger fix button-links and add a feed.
There’s concern that search makes personal sites hard to find; suggestions include niche search engines, webrings, and custom directories.
Several avoid detailed analytics to reduce anxiety and keep focus on writing itself.

View on HN ↗ Original Article ↗

2025-02-09

UnitedHealth Is Sick of Everyone Complaining About Its Claim Denials

Denial Rates, Profits, and Exec Pay

UnitedHealth is seen as emblematic but not unique; others (e.g., Cigna) also deny heavily. One stat cited: ~31% denial rate, about 30% above industry average.
Back-of-envelope use of SEC filings: wiping out all profit plus executive comp would allow ~7–9.7% more spend on claims, not 9.7% more claims.
Several commenters argue this would still leave many denials; exec pay is “a rounding error” compared to overall profits and system-wide costs.
Others counter that even ~10% more paid claims is substantial and worth pursuing.

Systemic Drivers: Insurers vs Providers

One camp: insurers are parasitic intermediaries, increasing administrative burden for providers and patients, and consuming ~20% of ACA premiums as overhead.
Another camp: core problem is high provider prices, hospital consolidation, and professional guild behavior (physician supply limits, resistance to scope expansion, lobbying against Medicare for All).
Disagreement over how big administrative overhead actually is; some cite studies showing it’s a modest fraction of excess costs, others cite on-the-ground experience of large billing/RCM teams devoted to navigating insurers.

Single Payer vs Free-Market vs Hybrid Models

Strong support for single-payer: current US system seen as “worst of all worlds,” tying care to employment and generating poor outcomes at extreme cost.
Some advocate the opposite: strip government incentives/regulation, detach coverage from employers, let true market competition and charity handle care; critics respond this would abandon people with chronic/expensive conditions.
Fears about single-payer: loss of choice, politicized rationing, lifestyle policing (“we all pay for that”). Counterpoints: Medicare works without those dystopias; many countries mix public baseline with optional private coverage.
Examples offered of Swiss and Dutch tightly regulated multi-payer systems as alternatives to both US-style and pure single-payer.

Patient Experiences and Insurer Conduct

Multiple anecdotes of pre-authorized surgeries, births, and drugs later denied; patients forced to “prove a negative” or fight billing-code pretexts.
Reports of 10–100x copay jumps after carrier switches, and insurers hanging up when told calls are being recorded.
Hospitals sometimes record insurer approvals yet still cannot collect; patients and providers describe insurers as systematically obstructionist, not accidentally mistaken.

Fraud, Oversight, and Appeals

Some note that claim review/denial is one of the few practical brakes on fraud and unnecessary care; cite Medicare fraud cases, overtesting, and unnecessary procedures.
Others argue insurers’ profit motive makes them poor stewards of this role and shifts risk/pain onto patients.
Proposed fixes include a government-run, low-cost “DMCA-style” appeal channel, stricter regulation of profits, standardized price catalogs, or banning most private health insurance altogether.

View on HN ↗ Original Article ↗

2025-02-09

A drill bit that can also drive screws

Perceived Problem vs. Real Workflows

Many argue the “bit‑swapping problem” is overstated: on real jobs you either:
- Use two tools (drill + impact driver/driver), often multiple drills preloaded with different bits.
- Drill all holes, then swap once and drive all screws.
For people who already own common drill/driver combo kits, this bit offers little advantage.
A minority notes that even a few seconds saved can make you more likely to grab the tool for quick fixes, especially for hobbyists.

Time Savings vs Existing Solutions

The claimed ~50% time savings is widely seen as unrealistic.
Alternatives seen as strictly better:
- Two drills/drill + impact driver.
- Quick‑change hex chucks and hex‑shank drill bits.
- Self‑drilling / self‑countersinking screws, nailguns, or “screw guns.”
Several mention that on ladders or in tight spots, not needing to juggle bits could be modestly useful.

Bit Geometry & Pilot Hole Issues

Strong criticism that:
- The drill diameter shown is far too large for a typical screw pilot; threads barely bite.
- The tip is flat with no brad/split point, so it’s likely to “walk” and require high axial force, especially in anything harder than softwood.
- Four “flutes” plus compromised geometry make it a mediocre, tear‑out‑prone drill.
Seen as a classic “two tools in one, both worse” compromise.

Phillips vs Torx/Robertson (Square) Debate

Many say Phillips is obsolete for construction:
- Designed to cam out, strips easily, especially with power or impact drivers.
- Mismatch between similar standards (Phillips vs Pozidriv) adds confusion and stripping.
Strong preference expressed for:
- Torx (“star”) screws for high torque and minimal cam‑out.
- Robertson (square) for good bit retention, especially in some regions.
This tool being Phillips‑only is considered a major drawback; some doubt the design adapts well to Torx geometry.

Durability, Wear, and Safety

Concerns that:
- The combined tip will wear quickly and unevenly; once dulled, both drilling and driving degrade.
- Sharpening looks impractical without specialized jigs.
- Using a cutting tip right where hands may hold screws raises injury concerns.

Who Might Actually Use It

Skeptics see it as “solution in search of a problem” or a disposable homeowner gadget.
Possible niche: light DIY deck/fence/shed work in softwood, or quick repair tasks where only one tool is handy.
Overall sentiment: clever idea, but inferior to current workflows and hardware for anyone serious about fast, reliable fastening.

View on HN ↗ Original Article ↗

2025-02-09

LIMO: Less Is More for Reasoning

Compressibility of Reasoning

Several comments speculate that there may be only a small set of generic “reasoning patterns,” but effective reasoning also needs domain‑specific steps (e.g., across math subfields) and strategies for handling impasses.
The LIMO result is interpreted as evidence that much of this structure is already present in large pretrained models; small, high‑quality datasets can elicit rather than create reasoning ability.

Self‑Play, “Chatbot‑Zero,” and Code Proofs

People compare AlphaGo Zero–style self‑play to LLMs and ask why we don’t have a “chatbot‑zero.”
Objections: conversations lack a clear win signal; pure self‑play would likely produce an idiosyncratic language unintelligible to humans.
DeepSeek‑R1‑Zero is discussed as a partial analogue: RL only, no CoT SFT, but still dependent on a heavily pretrained base model and labeled problems.
A detailed subthread imagines using RL/LLMs to generate formal code correctness proofs, with SMT solvers as the objective signal; concern about safely running arbitrary generated code is raised.

Math Reasoning, Theorem Proving, and Latent Skills

One tension: claims that LLMs can’t generalize in theorem proving vs this and related papers arguing models already contain rich mathematical knowledge that needs elicitation.
Some argue human knowledge is finite enough to be pattern‑matched; others note models still fail on niche expert domains.
A popular mental model: pretraining builds latent mathematical competence, but because the internet is mostly non‑reasoning text, models must be nudged (e.g., with a few curated CoT examples) to reliably use those circuits.

“Less Is More” Caveats and Data Curation

Strong criticism: the 817 math examples were distilled from ~10M using powerful reasoning models (e.g., R1), and the base Qwen model was already trained on large curated math datasets.
Thus, “less data” is contingent on:
- A huge, high‑quality pretrained model.
- A very expensive selection pipeline driven by stronger models.
Many liken this to human textbooks: generations of effort distill millions of problems into a few hundred maximally instructive ones.
Some see this as scientifically important (elicitation threshold, pedagogy of LLMs); others say the title overstates “less is more” and want full performance‑vs‑data curves.

Tools, Arithmetic, and True Reasoning

Debate on whether LLMs should be perfect calculators: most argue they should call external tools (calculators, Python, SMT, theorem provers) rather than emulate exact arithmetic internally.
This feeds into a broader skepticism: current systems often “mimic” reasoning and must be paired with verifiers, especially for safety‑critical tasks.

Broader Implications

Comments imagine:
- Iterative power transfer from huge to small models via distilled reasoning datasets.
- Swarms of small, specialized submodels collaborating.
- Using curated LIMO‑style sets as human teaching material.
There is also concern about using LLMs’ malleable “reasoning” for advertising and political persuasion, given how easily preferences can be embedded in prompts or training.

View on HN ↗ Original Article ↗

2025-02-09

Advanced Magnet Manufacturing Begins in the United States

Meaning and Mining of “Rare Earths”

Commenters stress rare earth elements aren’t geologically rare in the US; they’re “rare” because ore deposits are dilute and processing is complex and toxic (including low-level radioactivity).
The main environmental problem is in chemical processing and separation, not just mining; references to radioactive waste ponds and long-running disputes over waste in places like Malaysia.
Some clarify that “rare” historically referred to their scarcity in concentrated mineral form and difficulty of isolation, not absolute crust abundance.

US–China Supply Chains and MP Materials

Thread highlights that China holds a near-monopoly on rare-earth production, both mining and especially refining, and is not a US ally.
MP Materials’ vertically integrated chain (mine → beneficiation → separation → alloy → magnets) is seen as significant for reducing reliance on Chinese processing, especially after China’s ban on exporting processing tech.
Past bankruptcies at Mountain Pass are tied to price crashes and Chinese overproduction; several note that rare-earth location is mostly a function of cost and regulation, not physical scarcity.
Some argue this long supply chain re‑build should have been preserved years ago instead of letting US capacity be offshored.

Alternative Magnet Technologies (Niron, Iron Nitride)

Niron Magnetics’ iron-nitride approach attracts interest but also strong skepticism.
Critics note: tiny planned capacity (5 tons/year vs thousands elsewhere), decade-old patents with no visible large-scale product, and fundamental material challenges (unstable crystal phases, low coercivity).
Some suggest their early patents may be “bogus” or overly broad, potentially delaying others; others see it as a typical high-risk deep-tech effort that may still fail.

Motor Design Without Rare Earths

Question raised: why not avoid rare-earth magnets entirely via synchronous reluctance or separately excited synchronous motors.
Responses: non-permanent-magnet motors are cheaper and used where possible, but generally have lower efficiency and torque density, hurting EV range and heat management.
Rotor electromagnets introduce cooling challenges and slip-ring/inductive-coupling complexity, limiting practicality, though new inductive schemes avoid brushes at some efficiency cost.
Mention of a growing push toward rare-earth‑free motors and generators that could capture a sizable market share over time.

Industrial Policy, National Security, and Markets

Many argue key sectors (rare earths, shipbuilding, munitions, PPE) should not be left purely to “cheapest wins” logic due to national security and supply risk.
Suggested tools: subsidies, tariffs, government stockpiles, and guaranteed offtake, analogous to some agricultural and defense programs.
Others counter that heavy protectionism (e.g., Jones Act analogies) often backfires, raising costs and weakening industries, and that subsidies must be targeted and limited.
Debate over whether offshoring was “obviously stupid” or a complex, reasonable decision given lower prices, higher living standards, and expectations that high-value work would remain domestic.
Several note voters and consumers consistently chose cheaper imported goods, reinforcing corporate offshoring.

Globalization, Strategic Assurances, and Autarky

Distinction made between full autarky (seen as unrealistic and too costly) and “strategic assurance” – enough domestic or allied capacity to avoid coercion.
Some regret that earlier focus on pure price delayed development of alternative sources (e.g., in Australia) and eroded microelectronics and manufacturing bases.
Others argue the US has still benefited enormously from trade, and attempts to freeze low-skill industries domestically could have diverted talent from more valuable tech sectors.

Prices, Subsidies, and “Market Forces”

Side debate on whether stabilizing prices (e.g., via buying excess production) truly helps low-income households or just raises average prices.
One camp sees “market forces” as real, emergent human behavior that policy must work with; another calls them artificial constructs shaped by laws and power, not natural like gravity.

Non-US Producers and Capacity

Mention of non-Chinese producers (e.g., European magnet makers, Australian miners/processors) that still depend heavily on Chinese feedstock or are only now building full chains.
Multiple exploration-stage REE deposits in the US are noted, but none yet in production beyond Mountain Pass.

Recycling and “American-Made” Magnets

Some note that “recycling-based” US magnet production can amount to importing finished Chinese magnets, crushing them, and reforming, which still counts as domestic manufacture under some origin rules.
Parallels drawn with steel recycling as a way to skirt tariffs and origin constraints.

Miscellaneous

One subthread is a magnet pun exchange; another notes neodymium magnets’ role in elevators, synchrotrons, and toys.
A few comments criticize the article’s implicit assumption that readers should automatically prefer US over Chinese production, suggesting better trade relations may be another path.

View on HN ↗ Original Article ↗

2025-02-09

Baffled by generational garbage collection – wingolog

When generational GC helps (and when it doesn’t)

Several comments stress that generational GC only shines when workloads match the “most objects die young, survivors live long” pattern and there is a large mature heap.
Benchmarks like splay may churn the entire heap, so minor collections give little benefit and can even add extra copying cost.
Big real-world systems (servers, GUIs, dynamic languages) often do fit the generational profile: many short-lived temporaries per request/event, plus a large, mostly-stable object graph.

Java and JVM techniques

JVM is said to handle short-lived objects very well via Thread-Local Allocation Buffers (bump-pointer allocation) and escape analysis, sometimes effectively turning heap allocations into stack-like behavior.
Generational, pauseless collectors (G1, Shenandoah, ZGC) are described as state-of-the-art; ZGC in particular is cited for sub-millisecond pauses.
Object pools in Java were historically used to reduce GC pressure but are criticized:
- Can harm generational GC by creating many old-to-young references, forcing more frequent/expensive full GCs.
- Are error-prone (double-returns, use-after-free of pooled objects).
Binary-trees benchmark is used as an example where Java’s generational + TLAB approach competes with or beats arena-based native code.

Go’s non-generational, concurrent GC

Go’s GC is concurrent, non-generational, and explicitly optimized for latency, with “GC assist” that throttles heavy allocators.
Supporters like that it rarely breaks SLOs and usually “just works” without tuning.
Critics say throughput is worse than modern JVM/.NET GCs and there are scaling issues at large heaps or high core counts, with few tuning “escape hatches.”
Go relies heavily on stack allocation via escape analysis, so it simply creates less garbage.

.NET and C# perspectives

.NET has long used generational GC with server/workstation modes and newer options to behave more or less “cooperatively” with other processes.
It’s portrayed as close to Java in GC sophistication, often more memory‑efficient, and helped by value types and lower allocation rates.
Discussion diverges into whether highly optimized C# can rival C/Rust performance; opinions differ, but many agree C# has gained powerful low-level features (spans, ref structs, custom allocators).

Object pools, arenas, and the memory-management continuum

Commenters emphasize memory management as a continuum: GC, malloc variants, arenas, static layouts, custom allocators can all be mixed.
Java now has arena allocators; performance‑sensitive code in multiple ecosystems sometimes bypasses GC-managed heaps entirely.

Benchmarks, bias, and the generational hypothesis

Several note a methodological risk: if you design benchmarks and software assuming cheap young-object allocation, you’ll naturally validate the generational hypothesis.
Some suggest richer instrumentation (e.g., per-callsite lifetime profiling, pretenuring, multiple generations sized to cache/request lifetimes) to better test and tune generational collectors.

Debate over Whippet and Immix

One thread criticizes the blog author’s GC choices (mark-sweep, Immix) as “not proper” compared to classic semispace+generational designs; others push back, calling Immix state-of-the-art and noting the author has written extensively about advanced GC techniques.
There’s some confusion around what was said in a talk vs what exists in the code and prior posts, and no clear consensus on the quality of the author’s GC implementations.

View on HN ↗ Original Article ↗

2025-02-09

Don't "optimize" conditional moves in shaders with mix()+step()

What “branching” means on GPUs

Multiple commenters note a key distinction: a branch is a conditional jump that changes the program counter; a conditional move/select does not.
On GPUs, threads in a warp/wavefront execute the same instruction stream. If threads disagree on a branch, the hardware usually runs both paths sequentially with masks, idling non‑taken lanes.
If all lanes make the same decision (“uniform branch”), only one path runs and a real branch can be beneficial.

step()+mix() vs ternary/if

The criticized pattern is using step() + mix() to “avoid branches” that weren’t there to begin with; the original ternary compiles to conditional moves/selects, not jumps.
step() itself is typically implemented as a conditional, so you’re just hiding logic, not removing it, and often adding extra arithmetic.
Some note that using mix() with a boolean/vector mask is fine when that’s the natural form, but it’s not an optimization over a ternary that already works.

Performance tradeoffs and when branches hurt

Divergent branches reduce effective throughput because portions of a warp do useless work; uniform branches can skip work and be faster.
For short, cheap expressions, computing both sides and selecting is often best; for very asymmetric or expensive branches, a real branch can win.
Several people emphasize: you can’t reliably reason this out in your head—profile on target hardware.

Compiler behavior and tooling

Whether step/mix gets optimized back into a conditional move is compiler‑ and driver‑dependent; shader compilers are latency‑sensitive and can’t run every heavy optimization.
There’s debate about adding passes to detect and undo the “fake optimization”; some say it’s straightforward pattern‑matching, others expect many variants and corner cases.
Multiple tools are mentioned (DXIL, SPIR‑V, vendor ISAs, Radeon GPU Analyzer, driver disassembly) and people advocate inspecting generated code to see real branches, masking, and unrolling.

CPU conditional moves tangent

A large subthread discusses cmov on CPUs: sometimes faster than unpredictable branches, sometimes worse due to data dependencies and good branch predictors.
People complain about not being able to force a cmov in C/C++; compilers use heuristics, sometimes undo cmovs, and there are flags and intrinsics to influence this with mixed success.

Driver and ecosystem quirks

GPU vendors sometimes replace or tweak game shaders in drivers for performance or correctness, sometimes keyed by executable name or shader hashes.
This can yield big speedups but also odd behaviors and compatibility issues when games or mods deviate from what drivers expect.

Misinformation, LLMs, and best practices

Commenters note that the “branches are always bad, use step/mix instead” meme is old, platform‑specific, and wrong for modern GPUs, yet persists online.
LLMs are criticized for repeating this folklore, since they mirror common but incorrect advice.
General guidance from the thread: write clear code (e.g., ternary/if), inspect generated code when in doubt, and measure on representative GPUs rather than relying on myths.

View on HN ↗ Original Article ↗

Hacker News, Distilled

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics