Hacker News, Distilled

AI powered summaries for selected HN discussions.

Page 104 of 780

Mullvad VPN: Banned TV Ad in the Streets of London [video]

Ad content & reception

  • Linked 4‑minute ad is widely described as powerful but also long, confusing, and niche.
  • Some viewers didn’t understand the “and then?” teaser posters in the London Tube even knowing Mullvad.
  • Style is compared to political stunt campaigns; some expected a more direct, “Led By Donkeys”-style critique.
  • Several note the dystopian vibe of surveillance imagery set against the London skyline.

Clearcast rejection, “ban,” and free‑speech debate

  • Clearcast (industry-owned pre‑clearance body) rejected the TV ad as unclear and “inappropriate/irrelevant” to average VPN users, especially references to serious crimes and sensitive groups.
  • Big debate over whether this is censorship:
    • One side: prior approval (especially when tied to statute) is de facto government‑mandated censorship and dangerous “prior restraint.”
    • Other side: this is broadcaster self‑regulation to prevent misleading or harmful ads, similar to standards in many countries; not remotely like political censorship under dictatorships.
  • US vs UK/EU perspectives collide:
    • US‑leaning voices emphasize the First Amendment, opposition to pre‑approval, and worry about speech “freezing.”
    • European voices stress that advertising isn’t normal discourse, that lies can cause damage, and that freedom of expression always has legal limits.
  • Some see the “banned on TV” framing as a deliberate viral marketing angle; others accept Mullvad’s account at face value. The extent of any formal “ban” is unclear.

Advertising ethics vs. regulation

  • Arguments over whether misleading ads should be:
    • Pre‑screened and stopped,
    • Punished after the fact with scaled fines or forced corrective ads,
    • Or both, with harsher penalties for intentional political or commercial lies.
  • Broader concerns raised about normalization of censorship in UK/EU and, conversely, about “freedom to lie” in the US.

Mullvad’s marketing strategy & brand perception

  • Some praise Mullvad’s strong privacy stance but dislike the loud, stunt‑driven campaigning; they want a “quiet” utility service.
  • Others think the rejection is a “gift” enabling Mullvad to market a “banned ad” narrative.

Effectiveness and limits of VPNs

  • Skepticism that VPNs meaningfully solve mass surveillance, which is framed as a legislative/regulatory problem.
  • Supporters argue VPNs at least shield activity from ISPs and make tracking harder, especially with easy account rotation and non‑traceable payments.
  • Counterpoints note EU legal frameworks and international cooperation can still compel VPN providers, including Mullvad, to comply with law‑enforcement requests.

Product- and ecosystem-related issues

  • Complaints about Mullvad dropping port forwarding, seen as hurting legitimate file‑sharing but acknowledged as abuse‑prone.
  • Practical problems: Mullvad IP ranges are increasingly blocked by banks, YouTube, and other sites; some switched providers over speed or accessibility.
  • One user worries Mullvad’s rapid growth and heavy ad spend feel “sus,” though this is subjective and unsubstantiated in the thread.

Claude's Cycles [pdf]

Overview of the Result

  • The paper describes how a reasoning-focused language model, guided by a human collaborator, explored many programmatic approaches and eventually discovered an algorithm that solved an open combinatorial problem for all odd cases.
  • The human then proved correctness and wrote up the formal math; the even case remains unsolved.

Was This Genuine Novelty?

  • Some commenters assert the model must have simply regurgitated part of its training set; others counter that:
    • The problem was presented as open in the literature.
    • The successful approach emerged only after ~30 failed explorations.
    • The model refined and reused earlier partial ideas, suggesting genuine search rather than memorization.
  • Several note that if this were a known solution, it likely would have appeared immediately, not after a long iterative search.

What This Suggests About LLM Capabilities

  • Many see this as strong evidence of nontrivial problem-solving: pattern search, hypothesis generation, code synthesis, and refinement under feedback.
  • Others emphasize the human–model synergy: the person chose directions, restarted when outputs degraded, and translated the final algorithm into a proof.
  • There is debate over whether this counts as “thinking” or simply “very powerful next-token prediction plus good tooling.”

Intelligence, Memory, and Learning

  • Long back-and-forth on whether models that can’t update their weights at inference time are truly “intelligent,” with analogies to human amnesia and external memory tools.
  • Some argue that adding tool use, external memory, and agents on top of a base model can approximate long-term learning; others insist this remains fundamentally different from self-updating cognition.

Keeping Models Up to Date

  • Concern about models as “time capsules” with fixed knowledge cutoffs.
  • Discussion of:
    • Continual training vs. continual learning in-context.
    • Huge context windows, compaction, and the “dumb zone” when too much prior detail is lost.
    • Using user interactions and reasoning traces as future training data, with attendant privacy and consent worries.

Broader Implications and Skepticism

  • Enthusiasts see this as an early sign that hard open problems (including in physics or pure math) might fall to similar approaches.
  • Skeptics stress current systems still make silly errors, struggle with many novel problems, and rely heavily on human steering.
  • Ethical concerns arise around surveillance, concentration of power, and the future role of human cognitive labor.

The Xkcd thing, now interactive

Overall Reception

  • Many commenters found it delightful, funny, and oddly satisfying, comparing it to Angry Birds, Little Inferno, and old Box2D playboxes.
  • Several people reported spending notable time just toppling the stack or trying to clear the screen or rearrange into a stable configuration.
  • A few praised the polish and the “feel,” acknowledging the work required to make simple interactions feel good.

Metaphor & Interpretation

  • The auto-collapse after enabling physics is widely read as metaphor: infrastructure that looks stable is actually already collapsing.
  • Some appreciate that it decays even if you “do nothing,” aligning with views of real-world tech stacks and maintenance.
  • Others explicitly say they wanted amusement, not existential dread, yet still acknowledge the metaphor as “very real.”
  • Observations that the “Nebraska” block or tiny maintainer piece often remains stable longest are seen as poetically accurate.

Physics, Friction, and Technical Critiques

  • Frequent complaints about blocks feeling too “slippery” and friction being set too low; several suggest increasing friction.
  • Some note unrealistic behavior: small blocks nudging larger ones sideways/upwards, wedged blocks being squeezed out despite heavy load, and an initial “bump” when physics starts because the pre-drawn state isn’t a relaxed physical state.
  • One commenter points out stroke/border not matching collision bounds and shows a code tweak.
  • Input handling critiques: dragging feels rigid, force applies from center rather than cursor, and blocks can “quantum tunnel” through others.
  • For drag behavior, suggestions range from registering mousemove on window to using pointer events with setPointerCapture.

Performance and UX Issues

  • Some users report browser or mobile app jank, especially with back navigation, and at least one Android HN client effectively freezing.
  • Others note frame-rate and device differences affecting initial stability.

Ideas, Variants, and Related Work

  • Requests for: editing labels, a multiplayer Jenga-style game, generating stacks from real dependency graphs (e.g., package.json), and integrating with external diagram tools.
  • A prototype site that builds XKCD-style stacks from GitHub repos is mentioned, along with concerns about overly broad OAuth permissions.
  • Related XKCDs, memes, and a recent video citing the original comic are referenced for context.

AWS outage due to drone attacks in UAE

Geopolitical context and targeting rationale

  • AWS confirms drone strikes on three facilities in UAE and Bahrain, causing outages.
  • Commenters link this to AWS’s contracts with the US Department of Defense, arguing that makes AWS infrastructure a symbolic and strategic target.
  • Others stress Amazon’s visibility as an American icon: hitting it is framed as “we can destroy your stuff too,” aiming at US morale rather than direct military effect.
  • Heated debate over whether Iranian actions are “terrorism” or legitimate self-defense after being attacked by the US/Israel.
  • Strong disagreement on moral frameworks:
    • One side emphasizes civilian casualties, hotel and residential hits, and labels Iran a terrorist actor.
    • Another side insists this is state-on-state retaliation; argues you can’t attack a sovereign nation and then treat its response as proof your attack was justified.

Israel–Iran–Palestine spillover debate

  • Long subthread argues over:
    • Whether Israel is a “religious ethnostate” and its “right to exist.”
    • Who is committing or threatening genocide.
    • Which side is more dangerous with nuclear weapons.
    • Translation and meaning of Iranian slogans like “Death to America.”
  • Conflicting claims over famine/starvation in Gaza, civilian targeting, and support/funding of Hamas, with links cited on both sides.
  • Several note a perceived shift in Western public opinion toward Palestinians and say this changes the information environment.

Cloud reliability, DR, and AWS architecture

  • Practitioners describe real-time response: some clients’ UAE-region systems are largely nonfunctional; data recoverability still unclear.
  • Strong reinforcement of multi-region design and offline/backups in other regions.
  • Key lessons:
    • Region-level events nullify intra-region AZ redundancy.
    • DR plans must work when every API call to the primary region times out; any dependency on the dead region will fail.
    • Old-school DR practices (separate sites, tested runbooks, tape backups) are still relevant.
  • Some argue not every startup system needs full multi-region DR; others counter that major outages show even large companies underinvest.

Anthropic/Claude angle

  • Speculation that Claude’s issues might stem from reliance on affected AWS regions; others find it more likely due to user influx after a controversial DoD–OpenAI deal and ensuing ChatGPT uninstalls.
  • Jokes about Anthropic models assisting US strikes that then hit AWS, causing Anthropic’s own outage.

Media framing and broader reflections

  • BBC is criticized for wording that mentions US/Israeli strikes but not Iran as the attacker by name.
  • Some connect drone attacks, cable cutting, and infrastructure strikes to asymmetric warfare aimed at raising economic and political costs.
  • People note psychological impact on civilians near data centers and the large expat population in the Gulf questioning why they stay under such risk.

Arm's Cortex X925: Reaching Desktop Performance

Apple Silicon vs Cortex-X925 and Other ARM Cores

  • Many commenters find it odd the article doesn’t compare X925 to Apple’s M-series, given Apple’s status as ARM performance leader and similar “desktop” positioning.
  • Counterpoint: Apple doesn’t sell its CPUs as components, so they’re irrelevant for “industry” use; the article focuses on licensable ARM cores.
  • Rebuttal: The piece already compares to AMD/Intel, so excluding Apple feels arbitrary; X925-based systems (e.g., GB10/DGX Spark) compete with Macs as products.
  • Benchmarks cited from Geekbench: M5 beats a Ryzen 9 9950X in single-thread but lags in multi-thread; M4 Max/Mac Studio offer strong laptop/desktop performance.
  • Some see Apple as single-thread “king,” but note that server chips like Graviton 4 or AmpereOne win on core count and different workloads.

Ecosystem, OS, and Linux Concerns

  • Several participants reject Apple hardware despite performance due to macOS UX, ecosystem lock-in, repairability, and lack of official Linux support.
  • Others highlight Asahi Linux’s progress but note poorer battery life and power management vs macOS.
  • Virtualization on Apple Silicon is praised (Apple’s framework, containers, QEMU), though Rosetta 2’s future changes raise concerns.

Relevance of “Desktop Performance” and Product Timing

  • Debate over what “desktop performance” even means today given diversity of form factors (mini PCs, laptops, workstations).
  • Some argue power and perf/W should be central to any desktop comparison; others prioritize raw performance when plugged into mains.
  • Criticism that ARM core announcements often precede real products by years; others note X925-based hardware is already shipping and that long pre-briefing isn’t unique to ARM.

Memory Models and Concurrency Bugs

  • Discussion on whether wider ARM desktop use will expose more race conditions due to ARM’s weaker memory model vs x86’s stronger ordering.
  • Explanation that C/C++ memory models allow behaviors x86 happens not to expose, so bugs may surface when moving to ARM.
  • Historical note: multiple architectures (PowerPC, MIPS, Alpha, Windows NT’s early multi-ISA design) were useful for uncovering such portability bugs.

SIMD, Cache, and Microarchitecture Details

  • X925 praised for high IPC but seen as disadvantaged vs Zen 5 in heavy vector/FP workloads due to 128-bit SIMD width and unbalanced FMA vs load throughput.
  • Some argue wide SIMD (e.g., AVX-512) matters mainly for specialized workloads; everyday desktop use is more latency/UI bound.
  • Separate discussion on 4KB vs 16KB page sizes constraining L1 cache design on x86 vs recent ARM cores, and how that affects performance and power.

Daily Driving GrapheneOS

Installation & Reverting to Stock

  • Several users report that installing GrapheneOS on Pixels and reverting to stock is straightforward via web installers in a desktop browser with USB.
  • Both directions (stock → GrapheneOS and back) wipe device data; backups are required.
  • eSIMs generally survive an install of GrapheneOS and can now be set up directly on GrapheneOS, but whether they always survive a full “round trip” back to stock is unclear.

Banking Apps, 2FA & Contactless Payments

  • Experiences with banking apps are mixed but mostly positive: many report multiple banking and trading apps (Germany, Netherlands, UK, US, Sweden, Belgium) working fine.
  • A community-maintained compatibility list is referenced; some apps require toggling “exploit protection compatibility mode.”
  • Device attestation is the main technical friction for some banks; a detailed attestation guide is linked.
  • Contactless payments are a recurring pain point:
    • Google Wallet often fails due to lack of Google whitelisting.
    • Some succeed using alternatives like Curve, PayPal, or bank-specific NFC solutions; others fall back to a physical card tucked behind the phone case.
    • For some, loss of phone-based tap-to-pay is a dealbreaker; others argue using a card (or even cash) is an acceptable trade-off.

Daily Use, Apps & Backups

  • Many daily drivers report GrapheneOS as stable and smooth over months/years, praising:
    • Minimal default apps, zero bloat/ads, and fast security updates.
    • Sandboxed Google Play and the ability to selectively install Google apps while restricting permissions (e.g., disabling network for Gboard or Google Camera).
  • Ride-hailing (Uber and others) generally works, though one user had payment verification failures.
  • RCS via Google Messages was broken for months but is now working again; some left GrapheneOS over this and would consider returning.
  • Seedvault backups are widely criticized as unreliable; some recommend relying on web apps and file sync instead.

Security, Privacy & Trade-offs

  • Enthusiasts highlight strong privacy, reduced Google privilege, and better carrier/VPN controls versus stock Android and iOS.
  • Some argue mobile banking is safer on modern phones than on desktops; others refuse banking apps entirely and use hardware tokens or card readers.
  • A minority is strongly skeptical: one calls GrapheneOS a “toy” for privacy enthusiasts and claims their device was compromised as easily as stock Android (no technical details given).

Alternatives, Ecosystem & Outlook

  • Other de-Googled or minimal Android options mentioned: LineageOS, /e/OS, iodéOS, de-Googled Samsung with work profiles.
  • Some users just want a near-plain AOSP OS; others dream of open iPhone hardware or government-mandated openness and APIs.
  • The upcoming Motorola–GrapheneOS collaboration is seen by some as a path to more mainstream adoption; others fear it could dilute the project’s focus, though current user growth is described as strong.

U.S. Troops Were Told Iran War Is for "Armageddon,"

Religious Framing of War and Christian Nationalism

  • Many commenters see the “Armageddon” framing as evidence that US leadership and parts of the electorate are dangerously influenced by apocalyptic evangelical theology.
  • Specific movements like dispensationalism and the “New Apostolic Reformation” are cited as feeding Christian nationalism and end-times obsession.
  • Some note the irony of Armageddon theology playing out in or around Iran, a cradle of Zoroastrian apocalyptic ideas.

US Governance, Electorate, and War Policy

  • Several argue the people in power are “insane,” but others stress they reflect voters’ choices, including tens of millions who don’t vote.
  • Counterpoint: structural issues (gerrymandering, Senate/EC bias, media capture) and donor control make it misleading to say outcomes purely reflect popular will.
  • There is frustration that both major parties support highly aggressive foreign policy (e.g., toward Iran and Gaza), making “don’t vote for it” a false choice.

Elections, Representation, and System Design

  • Debate over responsibility of nonvoters vs. those who voted for current leadership.
  • Some stress primaries and local elections as points of maximum voter influence.
  • Proposals: multi‑party proportional representation, ranked-choice/ranked-pairs voting, independent redistricting.

Two-Party Failures, Centrism, and Compromise

  • Widespread dissatisfaction with “pathetic options on both sides.”
  • One camp sees Democrats as already centrist but bad at messaging; another says the message itself is compromised by corporate capture.
  • Big argument over whether Democrats should moderate on issues like abortion and cultural questions to win red states, or instead embrace unapologetic economic leftism and refuse “lesser-evil” compromises.

Healthcare and Political Economy

  • ACA is seen by some as useful but structurally captured by private insurers; others defend it as the only durable reform feasible.
  • Medicare for All (or phased expansion) is raised as the reform that never got a serious vote, reinforcing perceptions of Democratic timidity and donor control.
  • Broader critiques describe US healthcare as an inefficient, quasi-redistributive drag on the economy.

Evidence for the “Armageddon Briefing” Claim

  • The main factual basis cited is complaints to the Military Religious Freedom Foundation; some readers find this plausible given military religious rhetoric.
  • Others are skeptical and request stronger independent confirmation, noting potential bias or overstatement.

Elevated Errors in Claude.ai

Uptime and Reliability Concerns

  • Many see 98.9% availability as “one nine,” i.e., poor for a core dev tool; frequent red bars on the status page reinforce the perception of instability.
  • Some heavy users report almost no downtime over a year; others say outages and errors feel “half the time,” especially recently.
  • People note this outage feels larger/longer than prior blips and complain about vague, slow incident updates.

Possible Causes and Infrastructure

  • Thread references rapid user growth: Super Bowl ad + recent migration from OpenAI as likely stressors.
  • Some speculate about links to a recent US DoD ban and/or Middle East data center disruptions (AWS regions hit by drone strikes). Others find this connection doubtful or unclear.
  • Clarification that Anthropic models on AWS Bedrock/Google Vertex are hosted in those clouds, giving some “backup Claude” capacity.

User Experience, Limits, and Pricing

  • Complaints that Claude Code quotas are tight; a single moderate session can exhaust a 4‑hour window.
  • Some users spend large amounts on API usage, arguing it’s easy to burn through context; others are shocked at the cost.
  • Confusion and frustration around email verification codes not being delivered for some providers.

Claude vs OpenAI/Codex

  • Mixed views: several say OpenAI’s latest coding models are more accurate and less hallucination-prone; others claim Claude has become their primary tool due to better behavior on complex tasks.
  • A notable migration from OpenAI is driven by politics/ethics (military, surveillance concerns), not just quality.

Over-Reliance on AI and Skill Atrophy

  • Strong debate about “coding only with AI”:
    • Critics warn of vendor lock-in, lost coding skills, shallow understanding of architectures, and poor supervision.
    • Supporters argue AI frees them to focus on design, logic, and higher-level engineering, and that learning to orchestrate agents is itself a valuable new skill.
  • Hiring signals are shifting: many companies now explicitly ask how candidates use AI tools; some interviews already test AI workflow skills.

Alternatives, Architecture, and OSS/Local Models

  • Outages push some toward OpenAI again, or to consider OSS/local models or GPU purchases; consensus is OSS models still lag Claude for general use but can be good for narrow tasks.
  • Architectural advice for production systems: treat LLM APIs as unreliable externals, use multi-provider fallback (e.g., Claude → GPT on errors), async queues with retries, and graceful degradation paths (handoff to humans).
  • Several commenters contrast AI’s “single-nine” reality with truly critical infrastructure, arguing expectations and system design should match that lower reliability.

Ars Technica fires reporter after AI controversy involving fabricated quotes

Scope of the failure

  • Commenters see the core violation as severe: fabricated / AI-hallucinated quotes presented as real, attributed to a specific person, and published under a major tech masthead.
  • Many argue this is close to the “worst thing” a journalist can do short of outright corruption: not verifying sources, putting words in someone’s mouth, and possibly relying on AI for the actual writing.
  • Some emphasize that the reporter’s beat was AI, making the lapse worse: an AI reporter should be more skeptical of LLM output than anyone.

Firing vs “learning moment”

  • A large group says the firing was necessary to preserve credibility; apology or illness does not erase the ethical breach.
  • Others think the outlet missed an opportunity for a “blameless postmortem” approach: keep the reporter, publicly dissect what went wrong, strengthen policies, and treat it as a systemic failure.
  • Several note that newsroom pressures (speed, “do more with less,” possible implicit AI encouragement, working while sick) likely contributed, but most still see personal accountability as non‑negotiable.

Responsibility beyond the reporter

  • Many fault the publication’s editorial process: co‑bylined editor, lack of fact‑checking, and rapid article deletion are seen as institutional failures.
  • Debate over whether co‑authors or editors should apologize or face consequences, given practical limits on re‑doing each other’s research but also shared responsibility for bylines.
  • Some criticize the outlet for retracting and deleting the article and comments rather than clearly documenting corrections and consequences on-site.

AI, plagiarism, and slop

  • Disagreement on terminology: is misattributing LLM‑generated paraphrases as quotes “plagiarism,” fabrication, or something adjacent? Consensus that undisclosed AI use and false attribution are unethical regardless of label.
  • Several highlight a broader trend: management pushing AI use without clear guardrails, while the public and many readers are deeply skeptical of AI‑generated “slop.”
  • Commenters repeatedly note that LLMs hallucinate; some worry people still over‑trust them, especially when outputs are plausible.

Trust in media and Ars Technica

  • Some readers say this confirms a longer decline in quality, clickbait headlines, and weak standards; a few are dropping the site entirely.
  • Others still see it as one serious but contained incident in an outlet that also employs strong reporters, and expect heightened vigilance going forward.

Meta’s AI smart glasses and data privacy concerns

Scope of Meta Glasses Data Collection

  • Many assumed any AI/assistant use meant video/audio would be uploaded and used to train models with human-in-the-loop labeling.
  • Key concern: contractors can see highly intimate clips (sex, nudity, toilets, children, bank cards, private homes).
  • Unclear from the thread exactly when/how uploads happen:
    • Some think only AI-invocation (“Hey Meta…”) triggers uploads.
    • Others fear all recordings by opted‑in users may be ingested.
    • People with devices are unsure whether local‑only settings truly prevent server-side use.
  • Confusion persists about whether glasses ever record/stream when not explicitly in “record” mode.

Privacy, Law, and Ethics

  • Strong view that this is a “surveillance-as-a-service” model, not just an AI feature.
  • EU/GDPR implications debated:
    • Some argue uploading others’ faces/voices to Meta without consent likely violates GDPR/biometric laws.
    • Others note filming in public is often legal, but large‑scale processing and training is a different legal category.
  • Several European examples: security cameras tightly regulated in some countries; always-on glasses seen as worse.

Indicators, Consent, and Stealth

  • Glasses have a recording LED, but:
    • Hard to see in daylight or at distance.
    • Can reportedly be disabled or bypassed via hacks or hardware damage.
    • Even if intact, bystanders cannot be expected to scrutinize tiny LEDs for consent.
  • Many argue “you’re in public, no privacy” is outdated in an era of ubiquitous, machine-processable video.

Social Norms and Reactions

  • Repeated comparisons to Google Glass “glassholes”; many predict or advocate social ostracism.
  • Some workplaces and homes already have explicit “no Meta glasses” rules.
  • Suggested responses range from:
    • Polite requests to remove them in private spaces.
    • Social shaming and refusing to interact.
    • Legal bans on covert recording devices.
  • Physical retaliation (e.g., knocking glasses off) is discussed but widely flagged as assault and dangerous.

Benefits, Accessibility, and Use Cases

  • Some owners like them for:
    • Hands-free POV recording with kids, travel, work documentation.
    • Audio (music, calls, podcasts) without earbuds.
  • Strong accessibility upside for blind/low-vision users (object description, reading, navigation).
  • Tension noted: genuine assistive value vs. systemic privacy harm to everyone in view.

Trust in Meta and Calls for Regulation

  • Many see this behavior as entirely consistent with Meta’s long history of privacy abuses; others are still surprised by the brazenness and legal risk.
  • Internal planning around future facial recognition (timed for periods when critics are “distracted”) intensifies distrust.
  • Frequent calls for:
    • Stricter privacy laws around always-on/wearable cameras.
    • Limits on data retention and human review.
    • Strong penalties rather than symbolic fines.

Show HN: I built a sub-500ms latency voice agent from scratch

Overall reaction & significance

  • Many find the write-up unusually clear and useful, especially the latency breakdown and the visual explanation of the loop.
  • Sub‑500ms is seen as an important UX threshold where voice agents start to feel conversational rather than IVR-like.
  • Several commenters built similar systems and confirm the framing: voice agents are primarily an orchestration and turn‑taking problem, not a pure model problem.

Latency, TTFT, and orchestration

  • Latency is described as distributed across the pipeline: network, STT, LLM, TTS, and telephony hops.
  • Time‑to‑first‑token (TTFT) is repeatedly called out as more important than total generation time. Streaming an acknowledgment early feels faster than delivering the whole answer slightly sooner.
  • Co‑location of services, warm WebSocket connections, and caching (e.g., common TTS phrases) are suggested as key optimizations.
  • Geography matters: when callers are far from the infra region (e.g., India → US‑East), carrier and edge routing add noticeable delay.

VAD, endpoint detection, and turn-taking

  • Multiple approaches are discussed: classical VAD, semantic “end of turn” based on text, and fused models that use both audio and semantics.
  • Some argue semantic endpoint detection or integrated turn‑taking models outperform raw VAD and fixed silence thresholds.
  • Handling “barge‑in” (user interrupting mid‑response) is highlighted as a complex area, especially when downstream actions (bookings, webhooks, DB writes) may already be in flight.

Cascaded STT→LLM→TTS vs end‑to‑end speech models

  • Some call cascaded pipelines a “dead end” and see end‑to‑end speech‑to‑speech (full‑duplex models, Moshi‑style) as the future.
  • Others with production experience argue cascades will persist because they’re auditable, modular, and easier to regulate and debug, especially for enterprises.
  • Concern is raised that end‑to‑end models can blow up KV/cache size and latency, especially on device.

UX, human timing, and filler speech

  • Several comments connect to conversation analysis: humans often have ~0ms or even negative gaps (predictive overlap) and use nonverbal cues, backchannels, and fillers.
  • Ideas: use small models or pre‑cached audio clips for “thinking” fillers (“hmm… let me check”), or even predictive response generation while the user is still speaking.
  • Others warn of “uncanny valley” risk if fillers are mistimed and of frustration when assistants interrupt during brief pauses.
  • Some suggest explicit verbal end markers (“over to you”) or keyword‑based turn endings to avoid mis‑detections, though this is seen as unnatural by some.

Frameworks, tooling, and alternatives

  • Commenters compare hand‑rolled Python orchestration to frameworks like LiveKit, Pipecat, and commercial STT/endpoint providers (Deepgram Flux, Soniox).
  • Some advocate building from scratch once to truly understand latency sources; others favor production‑grade frameworks for robustness and configurability.
  • Several share alternative stacks: Twilio, various STT/TTS vendors, Groq/Cerebras/Claude/Gemini, local Qwen/MLX pipelines, and fully in‑browser or offline agents.
  • Costs, reliability, and rate‑limit issues with some hosted LLMs are mentioned as practical constraints.

Big‑tech assistants and business constraints

  • There’s frustration that consumer assistants (Alexa/Siri/Google) feel dated compared to bespoke setups.
  • Possible reasons cited: GPU cost at scale, the need for very strong guardrails when controlling real‑world devices, weak monetization for simple voice queries, and the complexity of upgrading legacy assistants into fully agentic systems.

Welcome (back) to Macintosh

Time Machine and Backup Reliability

  • Many report Time Machine as effectively abandonware: corrupting over time, especially on network shares (NAS/SMB, sparse bundles), forcing full reset/erase.
  • Some long‑time support folks say they rarely see corruption and view frequent failures as environment‑specific (bad disks/cables), but others counter that failures often appear only after years.
  • Trust in Time Machine has eroded; several have switched to restic, Carbon Copy Cloner, ChronoSync, Backblaze, or manual strategies and emphasize regularly testing restores.

macOS Tahoe (26) Stability, Performance, and UI

  • Large contingent describes Tahoe as unstable, “death by a thousand paper cuts”: Finder hangs or fails to refresh, Spotlight/tag indexing glitches, UI lag (especially with external 4K displays), weird window resize hitboxes, and DisplayPort/monitor issues.
  • Others report Tahoe as broadly “fine,” with no major bugs beyond prior releases and acceptable performance on newer M‑series Macs.
  • The new “liquid glass” UI is widely disliked: inconsistent iconography, changing corner radii, transparency over content, and perceived regressions in built‑in apps (Music, Mail controls moved/removed, Reminders typing lag).
  • Some see accumulated cruft and config corruption over years/migrations; clean installs reportedly behave better.

iOS/watchOS 26 and Ecosystem Sentiment

  • iOS 26 and watchOS 26 draw heavy criticism: sluggishness, UI glitches, anti‑user choices, harder‑to‑read watch UI, degraded keyboard behavior, iPadOS performance regressions.
  • Several long‑time, deeply invested Apple users say this is the first time they are seriously planning to leave the ecosystem.

Linux and Windows as Alternatives

  • Many are actively migrating workloads to Linux (Fedora, Debian, Pop!_OS/COSMIC, KDE, ElementaryOS, cachyOS, etc.), citing control, fixability, and open source.
  • Others warn that Linux desktop still has rough edges (Wayland breakage, audio/BT, sleep, drivers) and will disappoint users expecting “just works” Apple‑style polish.
  • Windows 11 is described as “fine” but with its own UX and telemetry issues; some find SMB/networking and certain apps better there.

Networking, SMB, Printing, Enterprise

  • Persistent complaints about macOS SMB: slow/broken compared to Windows, problematic with NAS and creative workflows; Apple’s in‑house SMB stack blamed.
  • Printer stack and CUPS changes reportedly broke drivers, label printers, job management, and increased flakiness.
  • In business contexts, some IT professionals say Macs’ network/file‑share quirks can harm non‑technical users’ reputations at work.

Apple’s Strategy, Culture, and Trust

  • Recurrent themes: Apple prioritizing flashy UI churn, services, and lock‑in (iCloud, notarization, JIT restrictions, Rosetta phase‑out) over reliability and user needs.
  • Several longtime fans feel Apple’s hallmark care for users has decayed into indifference or contempt; others argue hardware (Apple Silicon) still excels and hope software will course‑correct, perhaps via a “Snow Leopard‑style” bug‑fix release.

British Columbia is permanently adopting daylight time

Overall reaction to BC’s move

  • Many celebrate ending clock changes, calling twice-yearly shifts “insanity” and “madness,” especially for parents and developers.
  • Some are disappointed BC chose permanent Daylight Time (DST offset) instead of permanent Standard Time; others say they’ll accept “anything permanent” over switching.
  • A few predict public opinion may shift after one dark winter of permanent DST.

Permanent DST vs Permanent Standard Time

  • Pro‑DST arguments:
    • More usable daylight after work/school, especially in winter, is seen as a major quality-of-life gain.
    • Morning light is considered “wasted” because people are commuting or indoors; evenings are when socializing, sports, and outdoor activities happen.
    • BC already spends ~65% of the year on DST; locking that in is framed as a smaller change.
  • Pro‑Standard-Time arguments:
    • Standard Time aligns better with solar noon; “noon should be at noon.”
    • Several commenters cite chronobiology and sleep research organizations, plus past failed permanent‑DST experiments (US, Russia, UK) as evidence that permanent DST harms health.
    • Preference for more light in the morning, easier waking, and safer school commutes.

Health, safety, and kids

  • Widely shared view: eliminating clock changes should reduce short‑term spikes in accidents, sleep loss, and circadian disruption.
  • Disagreement on safety:
    • Some stress dangers of dark winter mornings for children walking or busing to school, especially at northern latitudes.
    • Others emphasize depressing early sunsets and dark trips home, arguing evening light improves mental health and outdoor time.
  • Several note that just shifting school/work hours seasonally could address these issues without moving clocks, but coordination is seen as politically/organizationally harder.

Time zones, UTC, and alternatives

  • Side debates explore:
    • “UTC for everything,” decimal time, Swatch beats, location‑based solar clocks, or continuous gradual shifts; most are deemed elegant in theory but impractical socially.
    • China’s single time zone and Spain’s effective permanent DST as real‑world oddities; interpretations differ on whether they work well.

Technical and coordination concerns

  • Developers discuss tzdata updates and the need to propagate new rules (e.g., America/Vancouver) across systems before the final “would‑have‑fallen‑back” date.
  • Some worry “Pacific time” will become ambiguous; city‑based labels (e.g., “Vancouver time”) are suggested as clearer.

Comparisons and politics

  • BC previously waited for US West Coast states; frustration with US federal inaction is noted.
  • EU’s stalled plan to end seasonal changes is contrasted with BC’s unilateral move.

A case for Go as the best language for AI agents

Role of static typing & compile-time checks

  • Many argue that pushing more checks to compile time helps agents: fewer runtime surprises, clearer feedback loops, and easier automated refactoring.
  • Go, Rust, Haskell, OCaml, TypeScript, and C# are cited as good targets because compilers and linters catch many errors that LLMs routinely make.
  • Counterpoint: benchmarks (e.g., AutoCodeBench) show that “more static” doesn’t automatically mean better LLM performance; Rust in particular scores mid-tier despite its strong types.
  • Some note that for human‑in‑the‑loop workflows, dynamic languages can be fine since you can quickly rerun and patch errors.

Go-specific pros and cons

  • Pros mentioned: simple syntax, one main way to do things, fast compilation, stable APIs, strong tooling (formatter, vet, golangci-lint, govulncheck), easy static binaries, and limited framework churn.
  • govulncheck is highlighted for symbol-level vulnerability checking, though it’s clarified that it analyzes usage of known-vulnerable libraries, not arbitrary logic bugs.
  • Go’s uniform style and lack of advanced abstraction features are seen as a plus for LLM predictability and human review.
  • Criticisms: verbose error handling (if err != nil), comparatively weak type system, no enforced purity, and APIs that are easier to misuse than in Rust or richer FP languages.

Comparisons with other languages

  • Rust: praised for safety, expressive types, excellent compiler errors, and unit tests colocated with code; criticized for slow builds, complex lifetimes, ecosystem churn, and heavy dependency graphs.
  • Python: wins on ML tooling and ecosystem; loses on runtime safety, dynamic typing, and messy historical code corpus.
  • TypeScript: strongly promoted by some as the ideal mix of static types, massive training data, and web‑ecosystem alignment.
  • Others mentioned positively: Haskell, OCaml, Clojure, Elixir, F#, C#, Java, D; often framed as theoretically strong but with less training data or more ecosystem complexity.

Training data, ecosystem, and benchmarks

  • Volume and “purity” of training data are seen as critical: Go’s boring, uniform code vs Python’s huge but noisy corpus.
  • Some research and anecdotal tests suggest LLMs currently perform better in C#/Elixir than in Rust or Go, challenging claims that Go is empirically “best.”

Agent workflows and practical experience

  • Several report strong real‑world success with Go for agent tooling, but others see better results with Python, TypeScript, Ruby/Rails, or Haskell.
  • One view: language choice is less important than agent architecture—state management, error handling, observability, and tool orchestration dominate performance.

Bars close and hundreds lose jobs as US firm buys Brewdog in £33M deal

Brewdog deal and business context

  • Brewdog had heavy losses and entered a process likened to Chapter 11; a US buyer acquired the assets for £33m.
  • Some see this as a straightforward rescue of a failing, over‑leveraged business, not “evil company buys good company”.
  • Others emphasize Brewdog’s self‑image as “punk” and anti‑corporate, arguing the outcome exposes it as a conventional, aggressive growth play that overexpanded and burned out.

Equity for Punks & liquidation preferences

  • Around 200k “Equity for Punks” retail investors likely lose everything, leading to broader skepticism about startup equity and employee stock.
  • Discussion focuses on preference shares: institutional investor TSG reportedly had preferred shares with an 18% compounded return in a liquidation priority stack.
  • Some argue liquidation preferences are standard investor protection; others see them as a legal way for insiders to self‑deal and subordinate common shareholders and employees.
  • There’s disagreement over whether any equity class (including preferred) actually gets money given the sale price vs total obligations; this is noted as unclear.

Crowdfunding and retail investors

  • Several commenters conclude equity‑style crowdfunding is usually a bad deal for small investors; “perks + stock” is more like a donation than an investment.
  • Non‑equity crowdfunding tied to specific products or projects is viewed more favourably.

Perceptions of Brewdog & UK pub culture

  • Mixed views on Brewdog: once important in bringing IPAs/craft styles to the UK, now seen by some as a corporate, TGI‑Fridays‑style chain with mediocre beer and tourist vibes.
  • Others push back, noting plenty of people clearly did like it, or it couldn’t have grown so large.
  • Broader point: UK pubs have been in structural decline, but Brewdog’s problems go beyond the “one village, fewer pubs” story.

Trends in alcohol and nightlife

  • Factors cited for industry pressure: high on‑premise prices, oversaturated craft market (too many hazy IPAs, fewer diverse styles), “TGI‑Fridays‑ification” of brewpubs, and young people drinking out less.
  • Non‑alcohol substitutes and changes: cannabis (where accessible), GLP‑1 drugs reducing desire to drink, online dating reducing bars’ role as meeting spots.
  • Some argue drinking out has simply become too expensive relative to drinking at home.

Employment law and “redundancy”

  • “Made redundant” is clarified as a specific UK legal term akin to “laid off because the role no longer exists”, with associated rights and redundancy pay.
  • It is contrasted with being “fired” for cause, and described as both a protection and something that can be gamed (e.g., reshaping roles to dismiss specific people).

Broader capitalism and fairness debate

  • The thread broadens into arguments about free‑market capitalism, unequal investor access, and whether ordinary people can realistically benefit from equity markets.
  • Some stress index funds and broad market access; others highlight two “classes” of investors with different terms, tools, and protections, using Brewdog as an example.

Ask HN: Who is hiring? (March 2026)

Overall hiring landscape

  • Very large, diverse set of companies hiring: early‑stage startups, profitable bootstrapped firms, growth‑stage, and big tech.
  • Domains span AI/ML infrastructure and agents, fintech and payments, healthcare, education, civic tech, gaming, robotics, hardware/supply chain, real estate, energy, and devtools.
  • Many roles are senior or staff‑level (backend, full‑stack, infra, ML/AI, product engineers), but there are also junior roles, internships, IT/system admin, product, design, sales, and support.

Remote vs onsite and geography

  • Mix of fully‑remote, hybrid, and strictly onsite roles.
  • Remote jobs often restrict to specific regions or time zones (e.g., US only, EU only, GMT‑8 to GMT+2, CET±2h).
  • Some posts are explicit about not sponsoring visas; a few call out which visas they do support; others require US persons or specific US states.
  • Commenters ask whether “US‑based” roles can be done fully remote from EU; some posts clarify they will not consider US‑based candidates at all for certain low‑comp roles.

Tech and AI trends

  • Heavy emphasis on AI and “agentic” systems: LLM apps, RAG, multimodal models, MLOps, AI observability, AI‑native products, and AI‑assisted development.
  • Common stacks: TypeScript/React/Next.js, Python/FastAPI/Django, Go, Rust, Elixir, Java, Postgres, Kubernetes, Terraform, AWS/GCP.
  • Several teams explicitly state daily use of tools like Claude Code, Cursor, Copilot, and other AI coding assistants.

Compensation and transparency

  • Compensation ranges from relatively low (one global remote job in the ~$24–48k range, which draws criticism) to very high (multiple $200k+ base or $280k+ total comp roles).
  • Some companies provide public compensation calculators or detailed band ranges; others are vague or say “very competitive”.
  • Equity is frequently offered, especially at startups; some highlight profit‑sharing or phantom stock.

Meta discussion and concerns

  • Users question vague marketing sites (“what does your software actually do?”), downed links, and job posts listing many cities apparently for search visibility.
  • One salary level and “no US candidates” stance is explicitly criticized and downvoted.
  • A moderator intervenes on a flagged comment, reminds posters of HN guidelines, and notes that spammy accounts are banned.
  • There are requests to prune outdated “search/aggregator” tools from the thread index.

Ask HN: Who wants to be hired? (March 2026)

Overall thread focus

  • This “Who wants to be hired?” thread is essentially a marketplace of individuals advertising availability, not a debate.
  • Posts cover a broad spectrum: IC engineers, designers, data scientists, product and engineering leaders, and non‑engineering roles (PM, UX, DevRel, project/program management).

Roles and seniority

  • Many posters are senior or staff‑level engineers (backend, full‑stack, DevOps/SRE, data/ML, mobile, embedded).
  • Several executives and fractional leaders: CTOs, heads of AI, product leaders, fractional CxOs, and consultants.
  • There are also new grads, early‑career developers, and students seeking internships or first roles.
  • A few non‑coding specialists appear: UX/visual designers, UX researchers, data analysts, technical project managers, and legal/HR/ops profiles.

Technologies and stacks

  • Common stacks:
    • Backend: Python, Go, Java/ Kotlin, C#/C++, Node.js, Ruby/Rails, PHP/Laravel, Rust, Elixir.
    • Frontend: React/Next.js, TypeScript, Vue, Svelte, Tailwind, design systems.
    • Infra: AWS/Azure/GCP, Kubernetes, Docker, Terraform/Ansible, CI/CD.
    • Data: Postgres, MySQL, MongoDB, Redis, Kafka, Spark, dbt, Snowflake/BigQuery.
  • Niche skills: embedded and drivers, FPGA, game engines (Unity/Unreal), CFD and HPC, geospatial/GIS, financial trading/HFT, safety‑critical automotive, telecom/networking, DevTools, and security.

AI/ML and LLM work

  • Very strong AI/LLM presence: RAG, agents/agentic workflows, MCP, vLLM/llama.cpp, fine‑tuning/LoRA, evaluation, voice pipelines, computer vision, and traditional ML.
  • Some specialize in “AI infra” or “AI systems” (latency, observability, orchestration) rather than model research.
  • Multiple posters emphasize heavy use of AI coding tools (Claude Code, Cursor, etc.) in their daily workflow.
  • A minority explicitly state they are not interested in AI/crypto/ads, while others refuse adtech, gambling, or “enshittification” work for ethical reasons.

Industries and domains

  • Frequent domains: fintech, health/medtech, e‑commerce, SaaS/B2B, devtools, security, data platforms.
  • Also represented: aviation, mapping, robotics, games, edtech, civic/gov, automotive, adtech, and blockchain/crypto (with mixed enthusiasm in the thread).

Remote work, geography, and relocation

  • Posters are globally distributed: North America, Europe, UK, India, LatAm, Africa, and Asia–Pacific.
  • Strong bias toward remote‑first roles; many highlight 10–20+ years of remote experience.
  • Some are open to relocation (often with constraints: region, visa, family), others insist on remote‑only.
  • Time‑zone overlap (US or EU) is frequently mentioned.

Engagement types and preferences

  • Mix of full‑time job seekers and contractors/freelancers; many open to either.
  • Several explicitly offer fractional/part‑time leadership or consulting (DevOps, AI, product, data).
  • Rate transparency varies: a few list hourly or monthly rates; most offer “on request.”
  • Common preferences:
    • Early‑stage or 0→1 product work.
    • Small, high‑agency teams.
    • Mission‑driven or “non‑toxic” domains.
    • Interesting technical challenges over pure maintenance.

Judge finalizes order for Greenpeace to pay $345M in ND oil pipeline case

Case outcome and legal basis

  • Jurors found Greenpeace USA liable for defamation and incitement related to Dakota Access Pipeline protests; sister entities (Greenpeace Fund, Greenpeace International) were not found liable for on‑the‑ground harms.
  • Many commenters accept that Greenpeace likely crossed legal lines by spreading misinformation or paying for “direct action training,” arguing this is outside First Amendment protection.

Fairness of trial and jury composition

  • Critics say the jury pool was structurally biased: small county (~30k), heavily pro‑Trump, economically dependent on oil, with some jurors allegedly having financial ties to Energy Transfer.
  • Venue change motions were filed and denied multiple times; some readers found the venue arguments weak, others see this as evidence of a stacked deck.
  • A group of legal observers claimed “multiple due process violations” and a constrained defense; others question that group’s neutrality and credibility, noting it appears purpose‑built for this case.

SLAPP, free speech, and incitement

  • Some frame the suit as a SLAPP: a strategic effort to chill protest and bankrupt a high‑profile NGO, especially in a state without anti‑SLAPP laws.
  • Others counter that a successful, evidence‑tested case by definition isn’t a frivolous SLAPP.
  • Debate over how high the bar should be for turning political rhetoric into actionable “incitement”; references to Brandenburg and concern about a chilling precedent for civil society groups.

Scale of damages

  • Many doubt Greenpeace’s statements could reasonably cause $345M+ in harm and see the award as punitive or retaliatory.
  • Others argue delays, equipment damage, and lost revenue could plausibly reach large sums, depending on construction and throughput economics (details in thread remain unclear).

Greenpeace’s role and reputation

  • Some environmentalists criticize Greenpeace as dogmatic, anti‑nuclear, and strategically counterproductive, even welcoming its potential collapse.
  • Others emphasize the case’s importance not because of Greenpeace per se, but because it may be used to target any NGO that threatens major corporate interests.

Climate, energy, and pipelines

  • Strong disagreement on tactics:
    • One camp favors supply‑side obstruction (blocking pipelines, “direct action”) given the climate crisis and historical fossil‑fuel deceit.
    • Another stresses harm‑reduction and demand‑side policy: pipelines vs rail risk, global oil pricing, and focusing on cheaper renewables rather than constraining specific projects.
  • Side debates cover “too late” climate pessimism, tipping points, and whether decarbonization should rely mainly on market incentives vs restriction.

Corporate structures and asymmetric power

  • Several note Greenpeace USA’s separate legal entity may now absorb the judgment and go bankrupt, while a new entity could be spun up—mirroring tactics used by large corporations.
  • Extended discussion of “Texas two‑step” bankruptcies and fraudulent transfer law highlights perceptions that corporations routinely game liability in ways activist groups cannot.
  • Comparisons drawn to other cases (e.g., Chevron vs. an environmental lawyer) to argue courts systematically favor major corporate actors.

Broader institutional distrust

  • Many commenters express deep skepticism toward the US legal system, juries in polarized locales, and “expert” NGOs on all sides.
  • Others push back, emphasizing that imperfect jury trials plus appellate review remain the best available mechanism, and that both environmental groups and oil companies can behave badly.

Parallel coding agents with tmux and Markdown specs

Perceived productivity vs. “where’s the software?”

  • Some are skeptical: if multi-agent setups are so productive, they ask why we don’t see lots of clearly “great” AI-built software.
  • Others argue it’s early: tools only got good recently, feedback cycles for software quality are long, and much of the output is internal tools or personal projects.
  • Several note that AI mostly increases volume of “mundane” but useful software, not necessarily greatness; average quality may even drop as volume rises.

Reported use cases and concrete wins

  • Many describe internal or personal tools: finance and chat apps, window compositors, status dashboards, automation scripts, filesystem/Ansible helpers, browser automation, CI tooling, ZFS backends, games, and reverse‑engineered docs for legacy systems.
  • One user claims a ~20%+ reduction in PR time-to-merge via parallel review agents, but this is challenged as over-extrapolated and not business‑proven.
  • Others mention multi-agent code review at large companies and production-grade products built on agentic coding.

Orchestration patterns: tmux, specs, worktrees

  • Common pattern: one markdown spec per agent/pane; agents work in parallel against git worktrees or repo copies to avoid clashes.
  • Some prefer 2–3 focused agents (backend/frontend/tests) over 6–8, citing merge conflicts and cognitive overhead.
  • Others build higher-level “factory” abstractions with a supervisor agent that decomposes work, spawns workers, and manages worktrees/merges.

Context management and documentation

  • Major challenge is context drift across sessions; solutions include:
    • Per-agent spec docs plus an orchestration doc.
    • Tools like agent-doc, Beads, or entity-centered “NERDs” documents.
    • PROJECT.md / SPEC.md to record direction, key decisions, and avoid scope creep.

Costs, quotas, and optimization

  • Parallel agents rapidly exhaust top-tier subscriptions; several hit weekly limits in 3–4 days.
  • Strategies: mix cheaper models, “oracle” agents for code questions, cheap supervisors to detect spec gaps, and strong planning/checkpointing to reduce wasted “thinking” tokens.

Quality, validation, and safety

  • Strong emphasis on tests, verification commands, and sometimes separate reviewer agents.
  • Concern about agents bypassing deny lists; some invert the model and require “proof of safety” (explicit intents, path checks, diffs, tests) before any action.
  • Long-term impact on maintainability and design quality is seen as unclear and unmeasured.

Anthropic Cowork feature creates 10GB VM bundle on macOS without warning

VM-based Cowork design & rationale

  • Claude Cowork on macOS/Windows runs inside a Linux VM via Apple’s Virtualization framework or Microsoft’s Host Compute System.
  • Stated goals:
    • Give the model its “own computer” to freely install tools and run scripts without touching the host.
    • Strong isolation guarantees versus lighter sandboxing (containers, seatbelt, etc.).
    • Reduce risk for non-technical users who can’t safely vet arbitrary commands and suffer from “approval fatigue.”
  • Some commenters see this as the right tradeoff for “agents” and regulated environments; others argue containers or alternate sandboxes could suffice.

Storage, performance, and UX complaints

  • Cowork creates a ~10 GB VM bundle on macOS (and similar on Windows) without an explicit warning.
  • Users report:
    • Surprise at the disk usage, especially on small SSDs or metered connections.
    • The VM image starting mostly full, leaving little room for tasks; it can fill up and break Cowork.
    • VM RAM usage and power draw even when Cowork is disabled.
  • Some workarounds: delete or resize the VM bundle, move/symlink to other disks, or avoid the desktop app and use the web.
  • Many want:
    • A clear prompt before downloading/creating the VM.
    • A one-click way to remove or relocate it.
    • Options to disable Cowork, use a lighter sandbox, or restrict host filesystem access to chosen folders.

Broader macOS disk-usage frustration

  • Thread broadens into complaints about opaque “System Data” and aggressive caching (Docker/Podman VMs, Time Machine snapshots, Podcasts, Messages, Photos).
  • Tools like ncdu, GrandPerspective, DaisyDisk, and others are recommended to find large, hidden directories.
  • Some note SIP and permissions make it hard to even see or delete certain files and snapshots.

Quality, development practices, and “vibe coding”

  • Several users praise the models and Claude Code CLI, but describe desktop apps/Cowork as buggy, resource-heavy, and shipped “too fast.”
  • Concerns that parts of the product and even GitHub issues themselves are “vibe coded” or AI-generated, leading to misleading bug reports.
  • Others counter that VM isolation and fast iteration are acceptable tradeoffs, especially for non-expert users and SMBs, but call for better diagnostics (“doctor” tools) and error messaging.