Hacker News, Distilled

AI powered summaries for selected HN discussions.

Page 56 of 779

Nothing Ever Happens: Polymarket bot that always buys No on non-sports markets

Bot concept and scope

  • Bot automatically buys “No” on Polymarket non‑sports markets, based on the observation that ~73% of markets resolve to “No.”
  • Described by the author as a meme project with no risk management and no claimed returns; also useful as a template for writing custom bots.
  • Excludes sports partly because of platform plumbing (some sports markets are internally Yes/No in non‑obvious ways).

Profitability and math debate

  • Many commenters stress that “73% of markets resolve No” does not imply profit; pricing and payouts matter.
  • Example: if “No” resolves correctly 83% of the time but pays too little, the strategy still loses.
  • Some report backtests and small live trials: modest or no profits in practice, especially once resolution timing and opportunity cost are accounted for.
  • Others claim they’ve seen positive edge (e.g., always selling overpriced longshots), but note high variance and limited capacity.

Market efficiency, biases, and crowding

  • Discussion of whether prediction markets are reasonably efficient: insiders and sophisticated traders tend to arbitrage away simple edges.
  • Known bias: dramatic/long‑shot “Yes” outcomes often overpriced because they’re more fun, suggesting some structural edge on “No.”
  • Any systematic edge is expected to shrink once widely publicized; open‑sourcing a working strategy would help price it away.

Risk, variance, and “pennies before steamroller”

  • Several compare the approach to selling options or “picking up pennies in front of a steamroller”: many small wins, occasional big losses.
  • Counterpoint: downside is capped per bet and can be managed with position sizing (e.g., Kelly‑style), but correlation between events and thin liquidity make real‑world implementation tricky.

Data and backtesting

  • Data quality is a challenge; realistic backtests need full order books and accurate resolution times.
  • A large Polymarket dataset on Hugging Face is referenced for research and strategy testing.

Prediction markets, ethics, and regulation

  • Strong debate over whether these platforms are just unregulated casinos exploiting gamblers or valuable tools revealing insider and expert beliefs.
  • Concerns raised about manipulation, house behavior (fees, disputes), and moral issues (e.g., markets on wars, assassinations, leaders losing power).
  • US access and KYC rules are murky; some mention official US entry paths, others mention crypto and VPN workarounds.

This year’s insane timeline of hacks

Public Apathy and Outrage Fatigue

  • Many see public indifference not as ignorance but as exhaustion: constant crises (wars, economic instability, political scandals, data breaches) have drained people’s capacity to care.
  • Others argue it’s distraction rather than fatigue: social platforms “DDoS” human attention.
  • Several note that for most people these events don’t visibly change day-to-day life, so “end of the world” rhetoric is tuned out.

Why Hacks Feel Abstract to Most People

  • Non‑technical people often don’t understand what GitHub, npm, or “supply chain attacks” are, so stories don’t land.
  • Repeated breach notices, with few tangible personal consequences, create “another hack, who cares?” attitudes.
  • Ordinary users have little agency: they can’t control corporate security or prevent their data from being stored and exposed.

Incentives, Accountability, and Security Culture

  • Posters stress that big actors rarely face meaningful punishment; costs of breaches are low relative to savings from under‑investing in security.
  • Security is framed as inconvenience and pure cost, so it’s underfunded and overruled by executives demanding exceptions.
  • Compliance is often treated as checkbox theater, not real risk reduction.

AI as Force Multiplier in Cyber Offense and Defense

  • Many see gen‑AI as a “godsend” to criminals: better phishing, deepfakes, malvertising, vulnerability discovery, ransomware-as-a-service, and exploit scaling.
  • Others think similar tools can harden defense (e.g., automated auditing, formal reasoning), restoring parity.
  • The thread debates a specific frontier model’s alleged ability to find vulnerabilities and the seriousness of central bank briefings; some see genuine risk, others see marketing and fear‑driven “Security™”.

Debate on the Importance of Cybersecurity and Privacy

  • One line of argument downplays data exfiltration: most consequences are borne by corporations; leaks could even aid research or weaken harmful IP monopolies.
  • Strong pushback emphasizes discrimination, political persecution (e.g., abortion travel, medical histories), and the societal role of privacy as an “escape hatch.”

Systemic Risk and Future of the Internet

  • Several expect a Morris‑worm‑scale (or worse) event: mass compromise of repos, payment systems, critical infrastructure, or banks, especially under AI‑enabled scanning.
  • Concerns include public clouds hosting sensitive source, fragile global credit card systems, and attacks on OT/ICS (e.g., Rockwell Automation).
  • Some foresee partial de‑globalization of the internet and migration into walled gardens or segmented “human vs AI” networks; others think people may simply use the internet less.

Security Careers and Labor Market

  • One camp says this is a massive growth area: rising attacks, talent shortages, and increasing demand for serious security engineering.
  • Another highlights burnout, stress, and low organizational support; some senior leaders report planning to exit the field or move to low‑risk consulting and non‑tech trades.
  • Consensus that meaningful roles require real software/OS/network fundamentals, not just paper “cybersecurity” credentials.

Tools, Architectures, and Mitigations

  • Suggested mitigations: stricter network and data tiering, air‑gaps, local stacks, use of ephemeral VMs for browsing, and defense‑in‑depth.
  • Some doubt whether “air‑gap” style isolation can work at scale given human behavior and complex supply chains.

Language, Media, and Hype

  • A side thread criticizes vague or hyped terminology (“cyber”, “order of magnitude”, stylized LLM prose) and fear‑centric marketing.
  • Others counter that language evolves and that media coverage naturally optimizes for attention, not technical accuracy.

Make tmux pretty and usable (2024)

Overall Themes

  • Tmux is widely seen as powerful but unfriendly by default: great once tuned, awkward out of the box.
  • The thread centers on usability (keys, scrollback, copy/paste), theming, and whether tmux is still the right tool versus newer multiplexers or smarter terminals.

Tmux vs Alternatives (Zellij, screen, native terminals)

  • Many switched from tmux to Zellij, praising its default UX, pane/tab handling, mouse support, and vim-like keybindings.
  • Others returned to tmux due to Zellij crashes, missing features (e.g., status bar toggling, window previews), or its stance against tmux-style keyboard copy/multiple buffers.
  • Screen is mentioned as smaller and long-proven; tmux offers better UTF-8 and saner defaults, but screen remains attractive on very constrained systems.
  • Some argue modern terminals (kitty, Wezterm, KDE Konsole) with built-in tabs/panes and scripting can replace tmux entirely.
  • Size matters on embedded devices: Rust-based tools like Zellij are criticized for large static binaries.

Core Use Cases & Session Persistence

  • Main tmux use: detaching/reattaching SSH sessions, working on headless VMs, and surviving flaky connections or mobile devices.
  • Some want only persistence, not panes/tabs; lighter tools suggested: dtach, abduco, shpool, zmx, mosh, etc.
  • Others integrate tmux tightly with editors/IDEs (vim/neovim, Emacs) and use project “sessionizer” scripts, fzf switching, or byobu.

Customization, Keybindings, and UX

  • Frequent rebinding of the prefix (Ctrl‑a, Ctrl‑Space, Ctrl‑s, Ctrl‑z, backtick) and pane navigation to vim-style keys.
  • Some warn against heavy customization because it breaks muscle memory on stock servers; others maintain dotfiles or scripts to deploy configs quickly.
  • Control mode (-CC) with terminals like iTerm2 is praised for making tmux panes look and behave like native tabs, though the protocol is called ugly and support is limited.

Scrollback, Copy/Paste, and Mouse

  • Many complaints about tmux scrollback and copy, especially with mouse mode, browser terminals, macOS Terminal, and OSC52 issues.
  • Workarounds: vi-style copy-mode, custom keybindings, mouse-on configs, plugins (for fuzzy extraction or hint-based copying), and mapping system clipboards.
  • Zellij is criticized for lacking robust keyboard-based copy and multiple buffers, which is a dealbreaker for heavy keyboard users.

Aesthetics & Theming

  • Default bright-green status bar is widely disliked; users apply themes, powerline-style bars, color-coded hosts/root, or hide the bar entirely.
  • Some view prettification as nice but secondary; others say tmux only becomes “usable” after substantial theming and config.

Microsoft isn't removing Copilot from Windows 11, it's just renaming it

Renaming Copilot and AI Branding

  • Many see the “removal” of Copilot as mostly a rebranding/entry‑point shuffle rather than a functional retreat.
  • Renaming “AI” options (e.g., to “Advanced features”) is viewed as an attempt to reduce visible backlash while keeping the tech.
  • Some predict generic “AI” icons and silent integration will replace explicit branding over time.

AI Features in Notepad and Other Apps

  • Notepad gaining AI is widely criticized as bloat for what users expect to be a tiny, simple tool.
  • Concerns that AI in Notepad is cloud/LLM-backed, not NPU-local, and has already correlated with a security CVE.
  • Several people wish Notepad had stayed simple and that richer functionality had gone into a distinct “WordPad-like” app.

User Control, Privacy, and the Need for a Global Toggle

  • Strong desire for a single, easy‑to‑find “Disable AI everywhere” system switch.
  • Frustration that AI is on by default and that opting out requires hunting through scattered or misleadingly named settings.
  • Some distrust any setting at all, expecting features to re‑enable themselves after updates.

Windows 11 Experience and Bloat

  • Many describe Windows 11 as sluggish and cluttered with background services, ads, AI hooks, and unwanted integrations (OneDrive, Copilot, Edge/Bing).
  • Nostalgia for earlier Windows versions (7, 10, XP) as snappier and more focused; LTSC/IoT editions are praised as what Windows “should” feel like.
  • There is skepticism that promised 2026 “fixes” to Windows 11 will materially change this trajectory.

Alternatives: Linux, macOS, and Dual‑Boot

  • Large subthread on moving to Linux (often for all non‑gaming tasks) or macOS, with Windows kept as a “console” for games or specific enterprise apps.
  • Linux desktop is described as mature enough for most workloads; gaming via Proton/Steam Deck is considered viable for many titles.

AI Utility and UX Concerns

  • Many see Copilot‑style integrations as poorly thought out: can’t perform obvious in‑context tasks (e.g., operate directly on selected spreadsheet cells).
  • Comparisons are made to Clippy: intrusive assistants that add friction instead of solving real problems.
  • General sentiment that vendors prioritize AI marketing and KPIs over longstanding usability requests.

US appeals court declares 158-year-old home distilling ban unconstitutional

Scope and Immediate Legal Effect

  • Ruling comes from the 5th Circuit; commenters agree it’s only binding in that circuit (TX, LA, MS) and not automatically nationwide.
  • Debate over nationwide injunctions: some say they used to make such decisions effectively national; others argue nationwide injunctions are a very recent, rare tool.
  • Where state law still bans home distilling, people note this federal ruling doesn’t yet translate into practical legality on the ground.

Commerce Clause, Wickard, and Federal Power

  • Many see tension with Wickard v. Filburn and Gonzales v. Raich, where the federal government was allowed to regulate personal, in-home production under “interstate commerce.”
  • This case explicitly avoided Commerce Clause issues because the government dropped that argument; another pending case in the 6th Circuit is said to raise it directly.
  • Some hope this is part of a broader move to rein in federal power and revive the 10th Amendment; others warn overturning Wickard could endanger the legal basis of federal civil-rights and regulatory regimes.
  • There’s discussion of “necessary and proper” and taxing powers versus true commerce powers, and frustration that “interstate commerce” has been stretched to cover almost any activity.

Methanol, Safety, and Risk Perception

  • Several brewers/distillers argue methanol fears are largely misplaced for grain-based and typical fruit-based ferments; methanol is produced only in tiny amounts, and poisoning cases mostly involve denatured/industrial alcohol being sold as drinkable.
  • A common myth is that “throwing out the heads” meaningfully solves methanol; some say physics (azeotropes) makes this effect limited.
  • Consensus in the thread: the main real risk of home distilling is fire, not poisoning.

Home Distilling Practice and Technology

  • Multiple anecdotes from regions where home distilling is widespread (e.g., Balkans) are used as informal evidence of its relative safety.
  • Several posts dive into technical details: pot stills, electric vs propane heat, condenser cooling, reflux management, PID control, and automation with microcontrollers and MQTT.
  • Safety practices emphasized: open systems (not pressure vessels), careful heat control, adequate cooling and ventilation, and human supervision during runs.

Broader Policy Analogies

  • Commenters draw parallels to federal marijuana bans, homegrown cannabis cases, machine-gun regulations, and even civil-rights laws, seeing this decision as one skirmish in a larger fight over the scope of federal authority versus state and personal autonomy.

They See Your Photos

Perceived Accuracy and Behavior

  • Most users report the tool is wildly inaccurate beyond obvious visual facts (age band, race, clothing, rough setting).
  • Personality traits, income, religion, politics, sexuality, and “biases” are often incorrect, contradictory, or mutually inconsistent across runs.
  • Same photo uploaded twice can yield different personality and bias profiles; different photos of same person can yield opposite traits.
  • Many liken it to horoscopes, astrology, or Mad Libs: vague, generic, sometimes accidentally resonant, but not systematically reliable.

Geolocation and EXIF Debate

  • Several users saw surprisingly precise location guesses (e.g., specific landmarks), which some attribute to EXIF metadata.
  • Others stripped EXIF and still got good or at least plausible geolocation, suggesting visual geoguessing from background cues.
  • Some users, however, saw completely wrong countries or cities, especially for nature scenes.

Bias, Stereotyping, and Harmful Inferences

  • Outputs frequently lean on demographic stereotypes: race, age, clothing, and setting strongly drive assumptions about income, religion, political affiliation, and even criminality or substance abuse.
  • Many examples show offensive or absurd attributions (e.g., casteism, racism, addictions, low self-esteem, specific party alignments) with no visible justification.
  • Sexual orientation is usually defaulted to heterosexual; queer couples or individuals are misclassified.
  • Users characterize it as a “stereotype machine,” with repeated canned labels like confirmation bias, in‑group bias, ageism, classism.

Purpose, Messaging, and Trust

  • Some see the site as dishonest fear‑mongering or “rage‑bait,” exaggerating current capabilities to scare users about big tech.
  • Others argue the accuracy isn’t the point: it illustrates the kinds of inferences companies might attempt and how wrong guesses could still affect people when embedded in opaque decision systems.
  • There is concern it doubles as an advertisement for a competing encrypted photo service and possibly a data‑collection honeypot; several users refuse to upload real or friends’ photos.

Broader Reflections on AI Profiling

  • Commenters note that existing trackers, purchase history, and browsing behavior likely provide much stronger targeting signals than such image-based psychometrics.
  • Some worry that even partially accurate guesses, when combined with other data, could significantly enhance profiling.
  • A minority find the demo “fascinating” or illuminating, preferring such capabilities to be visible to the public rather than hidden.

AI could be the end of the digital wave, not the next big thing

Impact on creativity, skills, and learning

  • Several developers report feeling less creative and more dependent on AI, with fewer original project ideas and difficulty coding without AI (e.g., on planes / offline).
  • Others say their “muscle memory” hasn’t degraded despite heavy AI use; they see skill loss as normal “use it or lose it,” not AI-specific.
  • Mention of cognitive offloading: when a tool is trusted, humans remember less and think less deeply; studies cited showing lower ownership and weaker recall when using AI for writing.
  • Some argue not remembering boilerplate is fine and even desirable; the important part is understanding logic and architecture.

Online experience degradation & retreat to offline

  • Many complain that search and shopping are being flooded with “AI slop” (bad images, SEO text), making it harder to learn or buy reliable products.
  • This drives people back to physical spaces: clothing stores, libraries, film rental shops, social clubs, and in‑person services.
  • Some foresee a partial “rewind to 1990”: in‑person shopping, paper exams, proctored tests, and distrust of online text/images.

Economic & labor implications

  • Strong expectation that AI will be used first for cost-cutting and layoffs, especially in back-office and low–medium complexity white-collar work.
  • Concern that even if society benefits overall, individual workers (especially mid‑career engineers) may be “the horses” in the car analogy.
  • Others note historical patterns: tech often lowers costs, compresses margins, increases competition, and benefits consumers more than workers.

Is AI the end of the digital wave or a new surge?

  • One camp: AI/LLMs are late-stage digital optimization; software has already “eaten the world,” and AI just accelerates commoditization and margin collapse.
  • Another camp: current investment behavior (massive, speculative, infrastructure-heavy) looks like an “installation phase” of a new techno‑economic surge, not an endgame.
  • Some view AI as fundamentally new because it can “do” rather than merely “help,” potentially clashing with existing economic and social structures.

Alternative “next waves”

  • Robotics, renewable energy (solar + batteries + EVs), biotech (protein folding, mRNA), and space are proposed as deeper or more important transformations than LLMs.
  • Debate over how tightly robotics is tied to LLMs; some see them as largely separate, others see LLMs as a coordination layer for robot behaviors.

Adoption patterns, UX, and infrastructure

  • Rapid mainstream uptake of chatbots is contrasted with the slower PC adoption curve; non‑technical users often adopted ChatGPT early.
  • Disagreement over whether a “chatbox for everything” is actually a better UI than traditional interfaces; sometimes doing the task directly is quicker than describing it.
  • Some predict ubiquitous on-device models (even in appliances), but others question cost, accuracy, hallucinations, and user tolerance.
  • Concerns about hyperscale data centers: seen locally as symbols of job loss, high energy use, and “your livelihood being replaced.”

Open vs proprietary models and moats

  • Open-weight, local models already handle many everyday tasks well for some users.
  • Skepticism that proprietary AI firms have durable moats if “good enough” local models catch up; AI might erode existing software moats rather than create new ones.

I went to America's worst national parks so you don't have to

Reception of the Article / Tone

  • Many readers interpret the piece as satire or “rage bait,” with exaggerated negativity used for humor.
  • Others find the tone cynically dismissive of nature, arguing it reflects a narrow, joyless way to experience parks.
  • Some enjoyed the writing style and found it genuinely funny and entertaining, even when they disagreed with the judgments.

Grand Canyon Reactions

  • Several commenters strongly defend the Grand Canyon as transformative, awe-inspiring, and beyond what photos convey.
  • Others, especially non-hikers/non-campers, report being impressed but “done in 30 minutes,” finding it less engaging without backcountry activity.
  • Multiple people describe life-highlight multi-day hikes or river trips and argue the canyon can’t be judged from the rim alone.
  • Safety is a recurring theme: unprepared hikers frequently get into serious trouble; deaths and rescues are common.

Crowding, Access, and Strategy (Yosemite, Zion, etc.)

  • Overcrowding in “marquee” parks (Yosemite, Zion, Yellowstone, Grand Canyon rim) is widely acknowledged; traffic and shuttle lines can feel like theme parks.
  • However, many note crowds vanish a few miles from trailheads or at sunrise, or in winter/off-season.
  • Yosemite and Zion are praised as spectacular if you:
    • Hike beyond the usual viewpoints.
    • Use less popular entrances.
    • Seek backcountry routes or traverses.
  • Some argue the article overstates Zion’s crowding; others confirm intense congestion on famous hikes (e.g., Angels Landing, Narrows approaches).

Other Specific Parks

  • Congaree: locals and visitors describe it as eerie, beautiful, and unique (old-growth forest, cypress, synchronized fireflies), though it lacks “wow” vistas and can be buggy.
  • Gateway Arch: widely questioned as a “national park–level” site; some accept it as “fine,” more like an urban monument/museum.
  • Cuyahoga Valley, Mammoth Cave, Pinnacles, Acadia: mixed views, with some seeing them as underrated or contextually appropriate (especially for the East Coast), others underwhelmed.

Alternatives and Broader Context

  • Many recommend state parks and nearby National Forests as quieter, equally beautiful alternatives to crowded National Parks.
  • Several note the exceptional overall quality and diversity of U.S. parks, while also praising the Alps for hut infrastructure and contrasting European “restoration” with U.S. “preservation.”

Servo is now available on crates.io

Overview of the Release

  • Servo is now published as a crate on crates.io, with documentation on docs.rs still stabilizing.
  • Related components Stylo (CSS engine) and WebRender (rendering engine) are also published and usable standalone, with plans for ongoing monthly releases.
  • Slint provides an example of embedding Servo in a GUI via wgpu, showing how to use the embedding API.

Maturity and Production Readiness

  • Consensus: not ready as a drop-in replacement for Blink/WebKit for JS-heavy, modern web apps.
  • Works better for simpler, mostly static content; users are advised to test it standalone before embedding.
  • Current limitations mentioned: layout issues, problems with some heavy JS sites, and incompatibility with Cloudflare Turnstile.

JavaScript Engine and Rust JS Ecosystem

  • Servo’s scripting still uses SpiderMonkey (C++), which some find disappointing for an otherwise Rust-centric project.
  • Others argue SpiderMonkey is mature and self-contained, and replacing it is risky without clear wins.
  • Multiple Rust JS engines exist, but are generally 10x+ slower, often interpreter-only, and not “browser-grade” (e.g., lacking high-performance JITs).

Standards and Feature Coverage

  • No single “caniuse”-style feature table; closest references:
    • Servo’s Web Platform Tests dashboard.
    • Auto-generated WebIDL API docs.
    • AreWeBrowserYet as a high-level capability overview.
  • Some experimental features exist, including partial WebGL/WebGL2 support via feature flags.

Use Cases and Integrations

  • Envisioned as an embeddable engine/webview akin to CEF, and potentially as a standardized engine across platforms.
  • Example tool: a “servo-shot” CLI that renders web pages to images, with noted need for more work around cookies and robustness.
  • Mentioned possible roles: Tauri integration, qutebrowser backend, headless rendering, and alternatives to system webviews.

History with Firefox and Mozilla

  • Servo originated as a Rust-based experimental engine at Mozilla.
  • Key pieces (Stylo, WebRender) were integrated into Firefox; the broader rewrite effort was halted after layoffs around 2020.
  • Views differ on whether Servo was ever intended to fully replace Gecko versus remaining a long-term testbed.

Versioning and SemVer Debate

  • Substantial side discussion on Rust crates staying at 0.x:
    • Some criticize Cargo’s treatment of 0.x as “stable enough,” causing projects to linger below 1.0.
    • Others defend 0.x as a clear signal of instability and freedom to change APIs.
    • Techniques like the “semver trick” are mentioned to ease the 0.x → 1.0 transition.

Michigan 'digital age' bills pulled after privacy concerns raised

GDPR, HTTP 451, and Local News Sites

  • Many focus on the irony that an article about privacy is blocked in the EU due to GDPR (HTTP 451).
  • One view: this shows privacy regulation “working” — the site refuses to track/sell data under EU rules, so it simply blocks access.
  • Counterview: small US outlets lack resources to understand and comply with complex foreign laws, especially where there’s little audience/advertising value.
  • Some argue blocking EU users signals an intent to monetize user data without transparency; others say it just reflects legal and administrative overhead (e.g., EU representatives, retention rules, DSA duties).

Debate on GDPR’s Merits

  • Supporters: users shouldn’t need extensions or expertise to get privacy; GDPR empowers data access/erasure and protects the less tech-savvy.
  • Critics: cookie banners and geo-blocking degrade the web; they argue browser tools already block most tracking, and enforcement against small US sites is unrealistic.
  • There is confusion and disagreement on what “European law” entails and how far extraterritorial obligations reach.

Michigan “Digital Age” / Age-Verification Bills

  • Bills were pulled after privacy concerns, but sponsors are reportedly working on replacements with advocacy groups.
  • Some see this as proof democratic feedback still works; others suspect coordinated efforts and temporary tactical retreats rather than a real change of heart.
  • A recurring theme: you cannot restrict children online without verifying everyone’s age, which creates broad privacy and identification implications.

Age Verification, Porn, and the First Amendment

  • Discussion compares ID checks in physical porn stores vs mandated online age verification.
  • Key distinctions raised:
    • In-person checks are ephemeral and already non-anonymous; online checks create persistent, high-risk identity databases.
    • Physical sellers can rely on visual judgment; online systems effectively require ID for all.
  • Some argue claims that such laws are unconstitutional are weak and based more on policy dislike than solid legal reasoning; others emphasize chilling effects and anonymity loss.

Broader Fears: Surveillance and Power

  • Several comments frame age-verification and ID mandates as steps toward a global panopticon and regulatory capture by large tech and identity vendors.
  • There is deep cynicism about bipartisan alignment: many note that widely popular reforms stall, yet surveillance- and control-oriented laws advance quickly.
  • Others stress that public anger about social media’s impact on children creates political incentives to act, even if implementation harms privacy for everyone.

Android now stops you sharing your location in photos

Scope of the Change / What Actually Happens

  • Android’s scoped storage from Android 10 onward hides EXIF GPS data by default unless an app requests ACCESS_MEDIA_LOCATION and the user grants it.
  • Mobile browsers like Chrome and Firefox on Android do not request this permission, so any image selected via the system photo picker is delivered with GPS coordinates zeroed out.
  • Many share flows that use MediaStore URIs (including Bluetooth or QuickShare from gallery apps in some setups) also strip GPS; file managers using raw file access often do not.
  • The native Android photo picker also renames files (or hides originals) in ways that confuse users and some apps; Google classifies filename stripping as “intended behavior.”

Privacy and Safety Arguments (Pro Change)

  • Most users are unaware that photos contain precise location, timestamps, device details, etc.
  • Stripping EXIF on upload is seen as aligning with user expectations and reducing risks of stalking, doxxing, and inadvertent exposure (e.g., on mapping or social apps).
  • Some argue the OS should treat “my phone” as private and “browser / third‑party apps” as non‑private, enforcing data minimization by default.
  • Several commenters note similar behaviors on iOS and GrapheneOS and praise “maximum privacy by default,” especially for less technical users.

Criticism and Regressions (Against Change)

  • Power users and developers lose legitimate workflows: crowdsourcing (e.g., mapping, ecology), law‑enforcement documentation, photo organization, duplicate detection, and research projects.
  • Web apps cannot ask for EXIF GPS even with user consent; suggestions include HTML attributes or pickers with explicit “include location” toggles.
  • Some see this as turning general‑purpose phones into locked‑down appliances and as another unilateral, undocumented breaking change by a dominant platform.
  • Complaints extend to filename stripping and other “simplifying” changes (disallowed characters, hidden metadata), which break cross‑device sync and expert workflows.

Comparisons, Workarounds, and Broader Concerns

  • iOS reportedly allows per‑share control over including location; some wish Android copied this UI rather than hard‑stripping.
  • Workarounds mentioned: USB transfer, file managers, zipping files, alternative ROMs (e.g., GrapheneOS), or native apps that request the new permission.
  • Several note the irony that EXIF to websites is blocked while large platforms and data brokers still collect highly detailed location streams through other channels.

Show HN: I built a social media management tool in 3 weeks with Claude and Codex

AI-assisted development workflow

  • OP built a full-featured social media management tool in ~3 weeks using detailed specs plus two AI coding tools (one for initial implementation, one for review/security).
  • Specs, architecture docs, and style guide were largely AI-drafted then heavily refined over several days.
  • Work was structured into “layers” and parallel “streams” (e.g., content pipeline, providers, media, notifications), with 3–4 agents running in parallel; merging and human review became the bottleneck.
  • High parallelism also hit token/session limits and raised costs.

Where AI coding worked vs broke down

  • Worked well for: Django CRUD, models/views/serializers, Tailwind + HTMX UI, provider modules for well-documented APIs, tests, and cross-file refactors.
  • Failed or was risky for: poorly documented APIs (TikTok), multi-tenant permission logic (data leaks across workspaces), OAuth edge cases, and background job orchestration. These bugs often passed AI-generated tests.
  • Significant time went into UX polish; initial AI-built UI was feature-complete but confusing and inconsistent.

Stack and database debates

  • Discussion around Django + HTMX being “old” vs FastAPI/SvelteKit; some see Django/HTMX as pragmatic and well-documented, ideal for AI agents and solo devs.
  • Database debate: several argue Postgres should be the default for serious apps (strictness, transactional DDL, battle-tested), others say SQLite is enough for this kind of tool, while managed MySQL/Postgres favored for cloud hosting.

Platform APIs and coverage

  • LinkedIn posting appears to use their website/API; details examined via the repo.
  • X/Twitter integration initially omitted due to high API costs; recent per-request pricing may make it feasible, but some question whether it’s worth integrating given declining engagement.
  • Questions about whether automated posting is allowed or throttled; answers: depends on platform and can change, but all major platforms have official posting APIs for developers.

Monetization, cloning, and “SaaS-pocalypse”

  • Several see this as evidence that generic SaaS is easily cloned with AI; sustainable businesses may need:
    • Access to unique data/insights, or
    • Cutting-edge tech outside current training distributions, or
    • Strong distribution/brand.
  • Some argue many users will just “vibe code” bespoke tools instead of adopting generic open source.

Quality, trust, and “vibe coding”

  • Some potential users are wary of a “built in 3 weeks with AI” product for production use, preferring mature proprietary tools.
  • Others note much commercial software is already hurriedly built; AI doesn’t automatically make quality worse.
  • Debate over “vibe coding”: for some it means fully surrendering to the model and ignoring the code; others mean AI-assisted but with human review and testing.
  • Concerns center on multi-tenant bugs, maintenance of AI-generated code, and long-term support; some think AI also makes maintaining legacy systems easier.

Use cases and alternatives

  • Interest from agencies managing many client accounts; confirmed that multiple accounts/clients are supported.
  • Separate desire for a “social media reader” aggregating feeds into a calm, ad-free UI; considered hard to do without violating platform T&Cs, though RSS, open networks (e.g., Mastodon/Bluesky), and scraping tools were mentioned.
  • Some view social media as increasingly bot-driven and toxic; tools like this are seen as both practical and emblematic of that trend.

The AI Layoff Trap

AI Layoffs and Economic Risk

  • Several comments argue that AI-driven layoffs are already happening and may accelerate, creating fear and insecurity among workers.
  • Others are unconvinced that current “AI layoffs” are genuinely caused by automation, seeing them as normal cost-cutting with AI as PR cover.
  • A central concern: if AI displaces workers faster than they’re reabsorbed, consumer demand may fall, risking economic instability and civil unrest.
  • Some think this is a large, uncertain “if”; others say the stakes are high enough to justify proactive planning.

Taxing Automation and New Economic Models

  • The paper’s main proposal is a Pigouvian tax on AI/automation to compensate for the negative externality of job destruction.
  • Supporters see this as a way to avoid a “prisoner’s dilemma” where every firm cuts labor and collectively destroys demand.
  • Critics call this “neo-luddism,” arguing that taxing efficiency guarantees stagnation and would have blocked past progress.
  • Others suggest broader shifts: more tax on capital and corporate surplus, perhaps even on unrealized gains, as labor’s share of output shrinks.
  • Practical challenges noted: AI firms often lack profits; “simply” taxing AI is nontrivial in design and enforcement.

Historical Analogies: Luddite Fallacy vs. “This Time Might Be Different”

  • One side frames concern as the classic Luddite fallacy: tech displaces some jobs but creates others, and has never collapsed demand.
  • The opposing view says past transitions (e.g., agriculture) were slower and narrower; near-general automation could surpass all human comparative advantages, making history a poor guide.

Labor Demand, Robotics, and Sector-Specific Issues

  • Some argue there is “practically infinite” unmet demand in construction, manufacturing, agriculture; robotics, not LLMs, would be the real disruption trigger.
  • Others counter that demand at livable wages is limited, construction productivity has stagnated, and corruption/safety/contracting constraints make U.S. construction uniquely dysfunctional.

Human Role and Post-Work / Machine Economies

  • Multiple comments explore scenarios where most labor and even consumption are automated, with a tiny elite owning capital and a large underclass excluded.
  • Some outline dystopian outcomes: extreme inequality, humans as “pets,” or a purely machine economy that no longer needs humans.
  • There is recurring anxiety about how people will afford housing and food in a “post-work” setting and whether the state can or will manage the transition.

Current AI Capabilities and Reliability

  • Participants debate what “AI” refers to (LLMs vs broader techniques) and whether current systems justify the alarm.
  • Examples: useful for coding, translation, content generation; but also serious failures on tasks requiring up-to-date legal reasoning and robust logic.
  • Some say intelligence-on-tap is overhyped; others note that widespread disruption can occur long before AI matches expert human reliability.

The economics of software teams: Why most engineering orgs are flying blind

LLM Agents, Code Quality, and Maintainability

  • Many commenters reject the idea that messy, AI-generated code is fine if agents will maintain it.
  • Reports of AI-only projects stalling: agents get stuck, make wrong changes, and can’t recover without heavy human guidance.
  • Even when agents pass tests, code can be structurally unsound: “walls made of foam” analogies; defensive, contradictory logic that destroys invariants.
  • Consensus that good human practices (modularity, types, documentation, tests) matter even more with agents; rewrites may get faster, but not easier or safer.

Economics of Teams, ROI, and Cost Awareness

  • Strong agreement that most engineering orgs underthink ROI: huge efforts on internal tools or features that save minimal time or affect few users.
  • Some find “back-of-the-envelope” cost math (e.g., three weeks on a 2% feature = tens of thousands of euros) clarifying and underused.
  • Others argue precise dollar attribution per feature/team is often impossible or misleading, especially in multi-feature, multi-team products.
  • Several note indirect and strategic value: reliability, compliance, support load, retention, and option value rarely show up in simple ROI formulas.

Platform Teams, Internal Tools, and Indirect Value

  • Skepticism about treating platform teams purely as “time savers”; they also provide reliability, security, standardization, and risk reduction.
  • Point that platform/internal tools are akin to shared infrastructure or admin overhead; the choice is centralizing vs duplicating effort, not just “hours saved”.

Slack Clone and LLM Productivity Claims

  • Widespread dismissal of the “95% Slack replica in 14 days” as conflating UI-level cloning with enterprise-grade product (scale, compliance, legal holds, SSO, APIs, mobile, etc.).
  • Historical analogies: many “lightweight clones” of complex products (e.g., word processors, Twitter) failed despite feature overlap.

Technical Debt, Metrics, and Organizational Blindness

  • Many agree organizations “fly blind” mainly by ignoring technical debt, reliability, and maintenance costs until productivity collapses.
  • Suggested metrics for debt/liability: time to ship, change failure rate, rework, MTTR, dependency age, complexity, churn.
  • Others caution against over-reliance on numbers: risk of Taylorism, Goodhart’s law, and defending pet projects with dubious calculations.

What’s Actually Hard in Software

  • Strong theme: the hardest part is understanding what to build and iterating toward the right solution, not keystrokes.
  • Some push back, noting that for non-trivial domains, designing workable architectures and algorithms is itself very hard, even when requirements are clear.
  • LLMs may speed up iteration and prototyping, but they don’t remove the need for human judgment, product sense, and responsibility.

Apple's accidental moat: How the "AI Loser" may end up winning

Apple’s AI Strategy and “Accidental Moat”

  • Many see Apple following its usual pattern: let others burn cash, then ship a polished, late product once use‑cases are clearer.
  • Others argue AI shows Apple is not executing 4D chess: Apple Intelligence rollout felt half‑baked, Vision Pro underwhelmed, and some see the current position as luck more than strategy.
  • Several note Apple’s focus on privacy and on‑device ML long predates the LLM boom; this may have accidentally positioned their hardware well for local AI.

Hardware, On-Device AI, and Local Models

  • Strong agreement that Apple Silicon’s unified memory, Neural Engine, and fast SSDs are excellent for local inference and “LLM in flash” approaches.
  • Some expect open/local models (Gemma, Qwen, etc.) to be “good enough” for most users within a few years, eroding hyperscaler moats.
  • Others counter that state‑of‑the‑art models are too large; compression has limits, and frontier capabilities won’t fully fit on consumer devices with current architectures.
  • Nvidia is expected to defend its position with segmentation (consumer vs datacenter GPUs, Arm laptops), but local AI on Apple hardware is still seen as a serious alternative.

Siri, Software Quality, and UX

  • Widespread frustration with Siri: perceived as years behind Google Assistant/Alexa, unreliable even for simple OS tasks, accent issues.
  • Multiple comments describe a long‑running decline in Apple software UX and consistency, contrasting with earlier Mac OS design rigor.
  • Some say typical users don’t notice; others insist the “iOS‑ification” and “Liquid Glass” design are obvious regressions.

Business Model, Services, and Gatekeeping

  • Services are a large, high‑margin revenue stream; App Store commissions on AI subscriptions (e.g., ChatGPT) already generate substantial income.
  • Apple is criticized for App Store ads and search results that surface scammy or misleading apps, despite its curation narrative.
  • Several highlight Apple Intelligence as an orchestration layer that lets Apple:
    • Pre‑screen AI requests,
    • Decide when to route to third‑party models,
    • Collect data on demand patterns,
    • Act as a gatekeeper and rent‑collector over AI services.

Market Position, Ecosystem Lock-In, and Competition

  • Debate over why people buy iPhones: some emphasize iMessage lock‑in (especially in the US), others say messaging is mostly WhatsApp/other apps outside the US and that people simply prefer iPhones.
  • Thread notes that globally Android dominates by share, but Apple captures outsized revenue in rich markets.
  • Several argue that in an “LLMs are commodities” world, distribution and devices win; big platforms (Apple, Google, Meta, Microsoft) are better positioned than standalone labs like OpenAI/Anthropic.

Attitudes Toward AI Hype and Use Cases

  • Many users report “AI fatigue”: dislike for AI‑branded features everywhere, pop‑ups in productivity apps, and AI meddling in core tools (Maps, Workspace, etc.).
  • Consensus that users care about concrete benefits (battery life, speed, specific features) rather than “AI” as such; AI branding is viewed as overused and often hostile.

All elementary functions from a single binary operator

Basic idea and example constructions

  • EML is defined as eml(x,y) = exp(x) – ln(y); with EML and the constant 1, one can build all elementary functions.
  • Simple derived forms:
    • exp(x) = eml(x, 1)
    • ln(x) = eml(1, eml(eml(1, x), 1))
  • From exp and ln:
    • Subtraction: x - y = eml(ln x, exp y)
    • Addition via x + y = ln(exp(x) * exp(y))
    • Multiplication, division, powers, roots, trig and hyperbolic functions are then composed using standard identities.
  • Expanded EML trees become large; e.g., multiplication can require depth-8 trees with 40+ leaves.

Expressiveness, math context, and edge conventions

  • The result is likened to NAND/NOR functional completeness, but for continuous/elementary functions rather than Boolean logic.
  • Some note that hypergeometric or multi-argument “selector” functions already encode many functions; the novelty here is a binary operation plus one constant.
  • The completeness proof sometimes relies on extended real conventions like ln(0) = -∞, e^{-∞} = 0; this is called out explicitly in the paper and debated:
    • Some see this as a non-standard caveat.
    • Others argue it is standard when working over the extended reals and IEEE‑754 behavior.
  • There is discussion of domain issues (e.g., log not a single-valued function over ℂ), and that some constructions pass through ln(0) or infinities.

Practicality, efficiency, and hardware

  • Consensus: this is mainly a theoretical/symbolic result, not a better way to numerically compute basic functions.
  • Using EML to express simple operations like + or * is far more complex and inefficient than standard primitives.
  • Analogies are drawn to:
    • NAND/NOR as universal logical bases, but rarely used directly in optimized designs.
    • Lambda calculus/Iota as minimal universal formalisms with little direct practical use.
  • Some speculate on:
    • EML-based symbolic regression and function discovery, potentially using gradient descent on EML trees.
    • Specialized EML coprocessors or analog EML circuits, though others doubt performance benefits versus existing FPUs and polynomial/rational approximations.

Verification, tooling, and reactions

  • Several participants reconstruct or verify EML expressions (e.g., using SymPy or small interpreters) and confirm correctness for many constants and operations.
  • Others propose using EML as a benchmark challenge for LLMs (“express 2x+y or sin(x)/x in EML+1”).
  • Overall tone mixes excitement at the conceptual elegance with skepticism about real-world impact or novelty relative to existing analytic frameworks.

Taking on CUDA with ROCm: 'One Step After Another'

Overall sentiment on ROCm vs CUDA

  • Many see AMD as years behind CUDA, due to historic underinvestment and lack of vision despite early AI signals.
  • Several comment that AMD’s ROCm stack feels like it’s still in a “plan/cleanup” phase while CUDA is a mature, feature‑rich ecosystem.
  • Some think AMD can still succeed in data centers even if ROCm never fully catches CUDA, but most argue the software gap is now a critical competitive issue.

Hardware support, lifecycle, and value

  • Major frustration: ROCm’s limited and shifting hardware support, especially for consumer GPUs and APUs.
    • Only recent RDNA generations are officially supported; older and even high‑end RDNA2 cards are often left behind.
    • “Unofficial” workarounds sometimes function, but can break with kernel/driver updates.
  • ROCm is seen as “mayfly‑lifetime” compared to CUDA’s long support window for older NVIDIA GPUs.
  • On price/perf, AMD and Intel cards (including new Radeon Pro and Arc Pro) can be very attractive, but support headaches make some users regret not buying NVIDIA.
  • Several users report good local LLM performance on new AMD cards (especially RDNA4), but describe setup as fiddly.

Developer experience and ecosystem

  • ROCm is criticized as buggy, hard to install, poorly packaged, and slow to support popular frameworks and features.
  • Compared to CUDA, AMD lacks:
    • Rich, polyglot libraries.
    • First‑class IDE/debugger tooling.
    • Smooth experiences in PyTorch, vLLM, etc. (though things are improving).
  • Vulkan backends often “just work” and can match or beat ROCm in some LLM workloads, but Vulkan is viewed as low‑level, verbose, and ergonomically painful.

Open source, trust, and corporate culture

  • ROCm’s openness is praised (fully open userspace, community projects like TheRock), but undermined by:
    • Narrow official device support.
    • Complex, brittle build systems (especially on musl/Alpine).
  • NVIDIA’s proprietary stack is criticized on transparency and FOSS grounds, but many note the market overwhelmingly prioritizes performance, features, and stability over openness.
  • Internal AMD bureaucracy and conservative open‑source policies are described as serious drags on progress.

Proposed directions and alternatives

  • Calls for:
    • Supporting every new AMD GPU/APU with ROCm at launch.
    • Longer support windows for consumer hardware.
    • First‑class C/C++/Fortran (HIP) alongside Triton/Python, not just AI‑centric paths.
    • Better packaging in mainstream distros.
  • Alternatives discussed:
    • Vulkan and SYCL/oneAPI, OpenVINO on Intel.
    • Rust‑GPU, Triton, higher‑level abstractions.
    • Using LLMs/agents and RL to help port CUDA code, though current models are seen as not yet capable of this at scale.

Sam Altman's home targeted in second attack

Overall reaction to the attacks

  • Near-universal condemnation of the specific attacks; many stress that targeting individuals or homes is “unacceptable” and “counterproductive.”
  • Some worry this may be the beginning of a trend: attacks on AI executives, data centers, and other “elite” targets as anxiety over AI and the economy grows.
  • Several express concern for bystanders such as security guards or family, and predict heavier security and potential government crackdowns.

Debate over political violence

  • One camp insists political violence is never acceptable in a democracy and only strengthens repression and polarisation.
  • Others argue that political violence has historically created or reshaped democracies (US independence, labor movements, French Revolution, anti‑colonial struggles) and sometimes “works,” though at immense cost.
  • A darker minority suggests elites rely on courts and police violence, so non‑elite violence is an inevitable response when legal and electoral channels feel captured.

Systemic causes, class conflict, and AI

  • Many tie the attacks to broader resentment: job precarity, inequality, corporate capture of government, perceived impunity for the rich, and fear that AI will destroy livelihoods.
  • Some note that ordinary people may see AI CEOs as personally responsible for layoffs, surveillance, or military uses, regardless of the actual causal chain.
  • Others push back: evidence of large‑scale AI job destruction is still “murky,” and violence is seen as misdirected and unjust to those building or supporting AI.

Democracy, effectiveness of institutions, and alternatives

  • Several argue formal democratic mechanisms (voting, petitions, referenda, lobbying) can still work; others counter that these tools are structurally biased toward wealthy interests and often ineffective.
  • Suggested nonviolent levers: unions and strikes, boycotts and disinvestment, ballot initiatives on AI, sustained organizing, and public pressure campaigns.
  • There is disagreement over whether the US is still meaningfully a democracy or has slid into oligarchy; this colors people’s attitudes toward both violence and institutional remedies.

Perceptions of AI leaders and AI itself

  • Some see the attacked CEO as just one highly visible avatar of an almost-inevitable global AI “arms race,” so killing or intimidating individuals would not stop the trend.
  • Others enumerate grievances against AI executives: Pentagon deals, business‑model shifts, lobbying against regulation, privacy‑hostile projects, and public rhetoric about replacing jobs.
  • A few note possible ideological influences (e.g., AI‑doom writings about “everyone dies if AGI is built”), but available information on the attacker’s exact motivation is described as uncertain.

A perfectable programming language

Writing style and presentation

  • Several comments focus on the article’s lowercase-first-word style and heavy swearing.
  • Some see it as an informal chat/IM style imported into blogs; others find it painful to read in longform.
  • A few deliberately use all-lowercase as a personal or aesthetic choice; others call for more conventional grammar for readability (including for machine readers).
  • Capitalization is also described as a tone marker (e.g., to signal sarcasm or dismissiveness).

Lean tooling, docs, and distribution

  • The interactive site is praised for hover-based code documentation, attributed to the Verso system.
  • Lean is commonly installed via VS Code; some find this amusing, others say it’s sensible to polish that path.
  • Emacs and Neovim integrations are reported as solid alternatives.
  • A major complaint is Lean 4’s distribution size (≈2.5 GiB unpacked vs ~15 MiB for Lean 3), mostly due to compiled IR/environment files (.olean/.ilean/.ir).
  • Suggestions like compressing IR files have been floated but (per comments) not effectively realized. Some see this as bloat and a regression.

Lean’s role among proof-oriented languages

  • Lean 4 is praised as an expressive, high-quality functional language; some want it to eventually replace Haskell for typed FP in production.
  • Others feel its “perfectability” is limited by non-constructive axioms in the standard library.
  • F* is recommended as more practical for proof-oriented programming and low-level verified code; Agda and Idris2 are preferred by some for “mathy” or program-oriented dependent types.

Types, safety, and language evolution

  • There is debate over the claim that dynamically typed languages “tend to grow types.”
  • One side stresses that all values already have types; exposing them helps tooling and linters, explaining trends in PHP/Python.
  • Others argue this is driven by a vocal pro-type minority and that type advocates understate trade-offs.
  • Some note dynamic languages accreting static features, while static languages rarely move toward dynamism.

Semantics surprises and Lean “footguns”

  • Examples: converting 256 to UInt8 and subtracting 4 - 5 : Nat both yield 0 instead of compile-time errors.
  • Critics call this surprising and unsafe; they want safe-by-default operations with explicit “unchecked/saturating” variants and distinct notation.
  • Defenders argue these total, saturating/wrapping semantics are reasonable defaults and avoid burdensome proof obligations for every subtraction, while strict proofs are still possible where needed.
  • This is used both as criticism of Lean’s API design and as a discussion of the practical limits of encoding invariants in types.

Lisp, homoiconicity, and culture

  • Some equate Lean’s “perfectable” self-description with Lisp-style homoiconicity.
  • The “Lisp Curse” (power causing fragmentation and odd local cultures) is raised; others counter that modern Lisp ecosystems can be highly productive and unify disparate runtimes.
  • There’s debate over how much Lisp is actually used in everyday software, with one commenter noting we lack visibility into much infrastructure code.

Dependently typed languages compared

  • Idris2 is highlighted as particularly well-designed and program-centric, with an interactive “hole-driven” workflow that feels like Haskell with stronger types and good editor support.
  • F* and similar systems integrate heavy SMT solving; commenters find these powerful but cognitively taxing for complex software because of constant context switching.
  • Lean is described as proof-goal–centric: work is framed around adjusting proof strategies and lemmas, suiting mathematics but feeling less like ordinary programming.
  • Some feel there is still no dependently typed language that is clearly aimed at everyday software development first.

“Perfect language” wishlists and trade-offs

  • One detailed wishlist combines Go’s speed and single-binary deployment, Kotlin-like types, OCaml-style result/option and pattern matching, Elixir-like docs/tests, strong REPL, and minimal transitive dependencies, without a heavy VM.
  • Others respond that such a “one size fits all” language conflicts with the reality that languages are designed for specific contexts (e.g., Go for Google-scale services, Erlang for telecoms).
  • Examples like Common Lisp/SBCL and Delphi are proposed as already meeting many wishlist items; differing opinions remain on type-system sophistication (Kotlin vs Scala/Haskell) and VM trade-offs.

Practical use and ecosystem

  • Questions arise about whether Lean and other provers run real-world businesses.
  • Replies cite use of Isabelle and Lean at a major cloud provider, and formally verified compilers (like CompCert) in regulated industries, plus tools like TLA+ for practical design work.
  • Some express fatigue with Lean 4 due to regressions (e.g., unification issues) and the default reliance on GitHub-based packaging.

I ran Gemma 4 as a local model in Codex CLI

Overall impressions of Gemma 4 for local use

  • Many commenters report Gemma 4 (especially 26B and 31B) as the first local model that feels “good enough” for serious coding and doc navigation.
  • Several say it’s close to GPT-OSS tier for one-shot coding, but weaker in iterative / agentic workflows and non-coding reasoning.
  • Some users find it flails on moderately complex codebases or specific tasks where other models succeed.

Hardware, engines, and performance

  • Successful setups span:
    • Nvidia 4090 / dual 4090, 3090, GB10/Spark, 24–128 GB+ RAM.
    • Mac M1/M3/M4/M5 Pro/Max/Ultra with 16–64+ GB RAM.
    • AMD RX 7900 XTX, CPU-only 64 GB machines.
  • Mac M5 Pro reported as ~8x faster tokens/s vs M4 Pro for the same MoE Q4 model; unclear how much is CPU vs RAM.
  • Some consider Macs poor ROI for inference vs Nvidia GPUs; others report great real-world speed with MLX/LM Studio when supported.
  • Engines tried: llama.cpp, LM Studio, Ollama, vLLM, OpenWebUI. Opinions differ: some say Ollama is worst; others find vLLM finicky and prefer llama.cpp.

Quantization, context, and quality

  • Strong consensus: for coding, high-quality quants (Q6_K, Q8_0) are much better; heavily compressed (Q4) hurts reliability.
  • Advice: choose the highest quant your memory can handle, even if slower.
  • Larger context windows (64k–512k) are possible but can constrain speed or memory; some offload MoE to CPU to trade speed for context.

Coding, tool calling, and agents

  • Tool/function calling remains a weak spot: models get stuck in loops, miscall tools, or fail follow-up calls.
  • Newer tokenizer/chat templates reportedly improve Gemma 4 tool use, but results are mixed.
  • Some pair Gemma 4 with lighter agents (e.g., Pi) for lower overhead, or use draft models / speculative decoding for speed.
  • Emerging pattern: local Gemma handles bulk experiments and small refactors, cloud frontier models handle architecture and hard bugs.

Safety, censorship, and uncensored variants

  • One thread criticizes Gemma 4’s “censorship,” especially on medical questions, arguing that best-effort answers are needed offline.
  • Others defend refusals as safer than potentially wrong high-stakes advice.
  • Several note that “uncensored” or “abliterated” Gemma variants exist and can be used instead.

Comparisons and alternatives

  • Qwen 3.5, GLM Flash, some distilled Qwen-based models, and other open models are cited as stronger in some coding or tool-calling tasks.
  • Benchmarks shared in the thread show Gemma 4 26B-A4B exceptionally strong in one-shot coding but weaker in agentic scenarios and large-context tasks.