Stories - Page 556 | HN Distilled

2025-01-28

Almost one in 10 people use the same four-digit PIN

How secure are 4‑digit PINs really?

Several comments note that 4‑digit space (10,000 combos) overstates real security because humans choose predictable values (1234, years, birthdays, patterns like 4321).
Attacks often don’t need your PIN, just any valid one (e.g., shared gates, car washes, calling cards), making “dictionary attacks” of common numbers very effective.
Some mention that many devices only require the last 4 digits and treat overlapping keypresses as multiple attempts; de Bruijn sequences can test all codes in near‑minimal keypresses.
Others argue 4‑digit PINs are acceptable when they’re only a second factor, backed by bank fraud detection and the difficulty of stealing the physical card.

What is the PIN in the auth model?

One side: card = “password” (secret you have), PIN = “username” (identifier), account number = underlying identity.
Opposing view: PIN behaves like a password, not a username, since many people can share the same PIN and even multiple cardholders on one account can use identical PINs.
Consensus leans toward the standard “something you have” (card) + “something you know” (PIN) two‑factor framing, rather than username/password analogies.

Terminology and data‑model pedantry

Debate over whether PINs and phone numbers are “numbers” or digit strings; risk highlighted when developers store identifiers as integers and lose leading digits.
Others note dictionaries explicitly define “number” as any figure(s) used for identification, so identifiers with digits/letters still count.
Recommendation appears: treat such identifiers as strings; represent phone numbers in standardized E.164 format.

Visualization and data quirks

Many like the ABC visualisation but criticize lack of hover labels and difficulty seeing relationships in only two spatial dimensions; some suggest gridlines or interactive versions.
Observations from the heatmap:
- Strong clustering around dates (DDMM, MMDD) and birth years.
- PINs starting with 0 and not forming dates are noticeably rarer (leading‑zero bias).
- Some culturally meaningful numbers (e.g., 2112, 1701, 6969) are less common than expected.

Usability, reuse, and alternatives

People commonly reuse the same PIN across phones, cards, and banking apps, effectively turning PINs into shared, weaker passwords.
Complaints about finance apps forcing 4‑digit “fast access” PINs instead of allowing strong passwords or password managers.
Some propose biometrics on cards; others push back that biometrics can’t be changed if compromised.

View on HN ↗ Original Article ↗

2025-01-28

IAC confirms existence of a Super-earth in the habitable zone of a Sun-like Star

Meaning of “habitable zone”

Thread spends a lot of time arguing what “habitable zone” means and how it’s communicated.
Technically: the orbital range where a rocky planet could have surface liquid water, given stellar flux. It’s a term of art in astrobiology, not “comfortable for humans.”
Some argue the term misleads the public into thinking “Earth‑like and ready to move in,” and wish outreach pieces would avoid or rename it. Others say using the proper term and defining it once is fine.

Eccentric orbit and habitability

This planet has a highly elliptical orbit, moving from the outer to inner edge of the habitable zone.
People debate whether “intersecting” the zone is meaningfully “in” it; parts of the orbit may be too cold or too hot.
Several note that life on such a world might adapt via global hibernation or deep subsurface/ocean refuges, citing Earth extremophiles and SF examples (Vinge, Three‑Body Problem).

Water, biochemistry, and where to look for life

One camp: water + carbon are the only chemistry we have hard evidence for supporting life; with finite resources we should prioritize water‑and‑carbon‑rich environments in habitable zones.
Another camp: with only one data point, we shouldn’t be too Earth‑centric and should also consider icy moons, tidal heating, and possible non‑water solvents (ammonia, methane), even if experiments have not yet yielded alternative biochemistries.
There’s back‑and‑forth over whether this is open‑mindedness or just “you never know” speculation that can’t yet drive mission design.

Detection methods & PLATO mission

Strong enthusiasm for ESA’s upcoming PLATO telescope at Sun–Earth L2, optimized to find Earth‑like planets around Sun‑like stars via the transit method.
Contributors explain:
- Why staring at one field for years maximizes chances of catching multiple transits and characterizing small planets.
- How CCDs are arranged and optimized for dynamic range.
- Why transits are better than radial-velocity for detecting small, Earth‑like worlds in bulk.
Some technical discussion of transit depth (~~0.01%), stellar variability, and geometric alignment (~~order‑percent of systems are favorably aligned).

Super‑Earth gravity and technology limits

The planet’s minimum mass (~6× Earth) triggers discussion of surface gravity: depends on radius/density; could be much less than 6g, but likely higher than Earth.
Speculation on what lifeforms might look like under high g (stocky, many‑legged, aquatic, or rolling morphologies).
Several note that chemical rockets may be unable to reach orbit from a high‑g super‑Earth; advanced options (nuclear thermal, railguns, exotic drives) are suggested as requirements for any spacefaring civilization there.

Distance and interstellar travel

“Only 20 light‑years” is repeatedly contrasted with the enormous travel times at current or near‑term speeds: tens to hundreds of thousands of years with today’s fastest probes.
People estimate required speedups (factors of ~10,000) to reach human‑lifetime transits, discuss continuous 1g acceleration, relativistic time dilation, and the huge energy cost.
Alternatives raised: robotic or AI probes, self‑replicating machines, seeded life, solar gravitational lens telescopes to image exoplanets instead of visiting them, and concepts like Breakthrough Starshot.

Fermi paradox and broader implications

One commenter notes: if we ever found a technological civilization only 20 ly away, Bayesian reasoning would suggest that intelligent life is common and that the “Great Filter” likely lies ahead of us, which would be ominous.
Others push back on grand conclusions; we’re still very early in exoplanet and SETI surveys.

Emotional and cultural reactions

Mix of awe and escapist fantasy (“can I move there?”) with reminders that interstellar migration is effectively impossible for now.
Numerous science‑fiction references (Helldivers’ “Super Earth,” Vinge, Banks, The Expanse, classic SF propulsion) are used to frame plausibility and inspire imagination.
One side thread questions whether such discoveries are “ultimately meaningless”; others argue that fundamental discovery (like NMR → MRI) often finds major applications decades later.

View on HN ↗ Original Article ↗

2025-01-28

Promising results from DeepSeek R1 for code

DeepSeek R1 writing most of a llama.cpp PR

A llama.cpp PR claims ~99% of the WASM SIMD code was written by DeepSeek R1, guided by a human over a weekend.
Workflow was iterative: repeated re-prompts (4–8 times in hard cases), constraints like “optimize only this part,” and manual debugging and test-writing.
Some functions were pure translations (ARM NEON → WASM SIMD); at least one was “invented from scratch” after earlier attempts failed.
Commenters disagree on significance: some see a genuine milestone in practical codegen; others call it “glorified translation” and note that review/validation still require full expertise.

Chain-of-thought as the main value-add

Many find R1’s visible reasoning more useful than its final answers—for refactoring, bug-hunting, and understanding overlooked edge cases.
Several anecdotes describe wrong final answers but correct or inspiring ideas inside the CoT.
This is contrasted with models that hide their internal traces; some argue OpenAI hurt itself by not exposing o1’s reasoning.

Quality, limits, and “jagged frontier”

Experiences are mixed: some users say R1 (and its distills) match or beat o1/Claude/Qwen on coding and math; others report gaslighting, wrong assumptions, and destructive edits on complex logic.
Rust and bespoke APIs remain hard: models often hallucinate methods, traits, or crate names, even when given examples.
Consensus: LLMs excel at clear, localized tasks (ports, boilerplate, SQL, tests); they struggle with underspecified, domain-heavy or highly coupled changes.

Tools, hosting, and distill models

Popular setups: Ollama, LM Studio, EXO, Continue.dev, and Aider. Aider’s own releases are now ~70–80% AI-generated by line count.
Most people use distilled Qwen/Llama variants (e.g., 32B Q4–Q6) locally on 20–30GB machines; full 671B R1 is out of reach for most.
Some report API outages and latency; others route via third-party hosts.

Economic and governance debates

Large subthreads debate whether this heralds mass SWE displacement or just another productivity jump that creates more software and shifts roles toward “product/solution engineers.”
Concerns focus less on usefulness than on wages, junior hiring, and concentration of power.
DeepSeek’s openness and Chinese origin trigger discussion about geopolitics, motives, censorship (e.g., Taiwan queries), and the lack of any real “moat” in foundation models.

View on HN ↗ Original Article ↗

2025-01-28

Boom Supersonic to break sound barrier during historic test flight today

Market and Value Proposition

Many see clear appeal in cutting transoceanic flight times by 30–50%, especially for:
- Frequent business travelers crossing the Atlantic/Pacific.
- Parents on very long-haul flights and people with limited vacation time.
Others doubt the addressable market:
- Expectations that tickets will cost business/first-class or higher (often quoted as $10k+).
- Question whether many will pay 2–4× business class for only a few hours saved when work and video calls are possible in-flight.
- Likely limited to a handful of prestige routes, similar to Concorde.

Economics and Efficiency

Repeated point: small capacity (≈64–80 seats) plus higher fuel burn per seat-mile almost guarantees premium pricing.
Some argue supersonic can approach regional-jet fuel efficiency (e.g., Embraer 175) and that time savings can offset higher operating costs.
Others counter that:
- Airlines already slow planes and ships for efficiency.
- If supersonic is affordable, subsonic will always be cheaper and lower-emission for the same route.

Environment and Externalities

Strong concern about CO2 and climate at a time when aviation is trying to decarbonize, optimize routes, and manage contrails.
Discussion of carbon pricing estimates and how “full cost” tickets would be substantially higher.
One proposed compromise: mandate synthetic or low-carbon fuel for supersonic flights, using this ultra-premium segment to fund early scaling.
Some worry about impacts on marine life and aviation systems; others downplay noise over open ocean.

Noise, Overland Flight, and Social Friction

Official plan (per website) is: supersonic over water, only modestly faster over land.
Several fear a later political push for overland supersonic, with resentment if even “muffled” booms regularly rattle homes.
Debate over NASA’s quiet-boom research:
- Supporters claim dramatic reductions in ground-level noise.
- Skeptics say decades of work haven’t yet produced truly acceptable results.

Technology, Concorde, and Risk

Comparisons to Concorde:
- Boom’s design is expected to improve safety (e.g., engine placement) and maintenance economics.
- But it still faces high development and certification costs, especially for a new engine.
Linked critical analysis suggests commercial supersonic may be structurally uneconomic; others think modern tools and demand justify another attempt.

View on HN ↗ Original Article ↗

2025-01-28

HD Hyundai set to debut production 14 ton hydrogen wheeled excavator

Headline and initial reactions

Some readers found “hydrogen wheeled excavator” unclear or clickbaity; others noted “wheeled vs tracked excavator” is standard industry terminology.
A few comments highlight how well electric heavy vehicles can work already (quiet, no fumes), citing real-world dump trucks and small EV excavators.

Hydrogen vs battery-electric for heavy machinery

Many argue the machine is “dead on arrival” compared with battery-electric (BEV) excavators, which already exist in similar or larger sizes.
Counterpoint: BEVs supposedly “don’t have the capacity” for 8–12 hours of continuous work with acceptable downtime; hydrogen’s fast refueling is seen as the main advantage.
Others respond that endurance can be solved by larger or swappable batteries and that heavy equipment can easily carry extra battery mass.

Refueling, infrastructure, and remote sites

Strong criticism that hydrogen distribution is a “nightmare”: sparse stations, no simple jerrycans, high-pressure or cryogenic handling, and safety concerns.
Pro-BEV comments point out that worksites can use grid hookups or on-site diesel generators to charge equipment, often more efficiently than running diesel engines directly.
Hydrogen proponents ask how you power equipment on new highway builds or remote mines; trucking in hydrogen is seen as simpler than provisioning high-capacity grid power or lots of batteries.

Emissions and “zero-emission” claims

Multiple comments stress that “zero-emission” only applies at the tailpipe; upstream emissions depend on how electricity or hydrogen are produced.
Criticism that 98–99% of current hydrogen is fossil-derived; in that case it is effectively a fossil fuel with extra losses, though some note potential for cleaner production and carbon capture.
Several users provide data and links showing EVs typically emit less CO₂ over their lifecycle than ICE vehicles, even on relatively dirty grids.

Hydrogen technology, economics, and politics

Repeated points: hydrogen’s poor round-trip efficiency, storage difficulty, leaks, and embrittlement make it an “ultimate tarpit” technology for transport.
Some argue hydrogen persists because it fits subsidy regimes and oil/gas interests: produced from methane, generates CO₂ for enhanced oil recovery, yet still qualifies as “green” on paper.
Others counter that it remains early for large-scale green hydrogen; infrastructure and electrolysis might improve, and not all applications should be judged by current costs.

Niche and strategic use-cases

Mining is highlighted as a plausible economic niche: air-quality permits can cap diesel use, making hydrogen-powered trucks and excavators cheaper than battery logistics or reduced production.
Heavy long-duration applications (large ships, planes, remote heavy machinery) are cited by some as areas where batteries struggle and some kind of fuel (hydrogen or derivatives like ammonia/methanol) may be necessary.
Others think synthetic fuels or methane made from captured CO₂ plus clean electricity are more practical than hydrogen itself.

Excavator-specific technical discussion

Some speculate excavators are well-suited to electrification because boom/bucket lowering could, in principle, regenerate energy, unlike current hydraulic systems that waste it as heat.
There is disagreement over how much of that is realistically recoverable given typical hydraulic designs.

View on HN ↗ Original Article ↗

2025-01-28

DeepSeek could represent Nvidia CEO Jensen Huang's worst nightmare

Market reaction and perceived irrationality

Several comments note there was little reaction to DeepSeek-V3 or R1 themselves, but a sharp reaction once a consumer-facing app appeared, seen as evidence of shallow, narrative-driven markets.
Some argue this “signals money on the table” for those who paid attention to the underlying tech earlier; others say timing lags are normal (e.g., COVID).

Jevons paradox, efficiency, and Nvidia’s valuation

Jevons paradox is heavily debated:
- One camp: greater efficiency in model training/inference will increase total compute use (and thus GPU demand).
- Another camp: Jevons applies to compute in general, not to Nvidia’s profits or growth specifically; high margins and growth expectations are what got repriced.
Some stress that even modest downward revisions in long-term growth assumptions can justify large market-cap drops.

Is DeepSeek bad or good for Nvidia?

Bearish arguments:
- More efficient training means fewer GPUs needed to hit a given capability; hyperscalers may slow new cluster purchases and rely on existing capacity.
- Long term, breakthroughs could move more workloads to cheaper or non-GPU hardware; Nvidia’s “training monopoly” may soften.
Bullish/neutral arguments:
- R1-style “thinking” models increase inference compute per query.
- Lower training cost democratizes model-building, inducing many more models and thus more total compute.
- Nvidia’s moat (CUDA, NVLink, Mellanox/Infiniband ecosystem) is seen as extremely strong; “Nvidia sells clusters and a full stack, not just chips.”

Costs, hardware, and sanctions

DeepSeek-V3 training is widely cited as ~2.8M H800 GPU hours ≈ $5–6M at $2/hr, but multiple comments emphasize this excludes capex, experimentation, RL steps, data generation, and staff.
Back-of-envelope estimates put a 2,048‑H800 cluster into the ~$100–200M range including infrastructure; the “$6M model” narrative is viewed as technically narrow but still notable for showing efficiency.
Export controls to China are discussed: DeepSeek was likely trained on Nvidia cards acquired before/around sanctions; this undercuts the idea that restrictions would block Chinese AI progress.

Democratization and new opportunities

Many see DeepSeek as enabling smaller or mid-stage companies to train competitive domain-specific models instead of paying incumbents’ API “rent.”
Some foresee more on-prem deployments (for legal/privacy reasons), family or small-org “AI stations,” and induced demand for mid-range hardware.

Skills, education, and systems knowledge

A strong thread argues DeepSeek’s success highlights the value of deep systems knowledge (OS, compilers, Mellanox/InfiniBand tuning, scheduling, concurrency) over “glue-code ML.”
US CS programs are criticized for weakening OS/architecture requirements, contrasted with stronger systems pipelines in places like Israel and India.
Concrete resources for leveling up: classic OS/systems courses (e.g., UIUC CS241, Berkeley CS162, MIT 6.1810), then HPC, then ML.

Media hype, bubble concerns, and future direction

Several participants are uneasy with media exaggeration around DeepSeek and the sudden explosion of “instant experts” (e.g., on Jevons paradox).
Some think this is a rational correction of an “obvious bubble”; others think markets still underprice the broader productivity impact of current LLMs.
There is disagreement on whether we are at “the end of brute-force scaling” or at the beginning of “test-time scaling” and algorithmic refinement that will still demand massive compute over time.

View on HN ↗ Original Article ↗

2025-01-28

AI, but at What Cost? Breakdown of AI's Carbon Footprint

Scope of the Problem & Comparisons

Some argue AI’s footprint is minor compared with aviation, tourism, fast fashion, or Bitcoin; others call this whataboutism that distracts from legitimately new demand.
Several comments note Bitcoin still likely uses more energy than current AI, but projections suggest AI could surpass it soon.
A recurring theme: comparing AI to “worse” sectors is seen by some as an excuse to avoid improving anything.

Per-Query Impact vs. Aggregate Demand

Multiple commenters criticize the article’s math: misused units, dubious “daily active users,” unrealistic prompts-per-user assumptions, and ignoring batching.
Back-of-envelope estimates suggest one LLM query or a few images cost about the energy of seconds of video streaming or heating a meal—negligible at individual scale.
However, others stress that at billions of queries and images, total demand is large enough to justify new data centers and even restarting nuclear or gas plants, which is evidence in itself.

Usefulness, Waste, and Where to Cut

Some argue we should judge energy use by both emissions and the activity’s societal value (e.g., hospitals vs. cruises; AI as assistive tool vs. novelty images).
Others see generative AI art/text as pseudo‑creativity that displaces genuine skill-building and is both materially and existentially wasteful.
There’s disagreement on whether AI meaningfully boosts “human flourishing” or mainly erodes the skill premium and pushes more work toward minimum-wage tasks.

Climate Concern, Responsibility, and Policy

Debate over how much people really care about climate change vs. what they’re willing to sacrifice (flying less, consuming less, not using AI frivolously).
Some say focusing on AI’s footprint is “fearmongering” or anti‑AI bias; others reply that every marginal increase matters in a collective-action problem.
Several propose taxing or pricing carbon (and letting markets decide which uses survive) rather than moralizing about specific technologies.

Energy Source vs. Energy Use

Strong thread arguing the right lever is cleaning up the grid (solar, wind, nuclear) rather than suppressing new uses like AI.
Others counter that we already lack clean capacity; adding big new loads (AI data centers) now mostly means more fossil generation, at least in the short term.
Suggestions include colocating data centers with abundant renewables (e.g., Nordic wind) and accelerating nuclear/renewables build‑out.

Water, Infrastructure, and Transparency

Comments highlight water use for data-center cooling and power generation, with concern about aquifer depletion and local impacts.
Several note that providers are secretive about real energy and water numbers; calls for mandatory disclosure and better empirical studies.
Some point out the article ignores training costs, failed models, overhead (cooling, PSUs, buildings), and the broader cloud footprint, likely underestimating total impact.

Growth vs. “Enough”

Philosophical split: one side sees continuous technological and energy growth as necessary for progress (health, knowledge, quality of life).
The other questions whether humanity needs endless acceleration, arguing we haven’t defined “enough” and risk overshooting ecological limits.

View on HN ↗ Original Article ↗

2025-01-28

US pauses all federal aid and grants

Scope and Impact of the Aid/Grant Pause

Order described as extremely broad and vaguely written; initial interpretations suggested almost all grants and many aid programs could halt, with exceptions for Social Security, Medicare and “direct benefits” like SNAP.
Commenters highlight real-world fallout: missed payrolls, stalled research, disrupted universities, nonprofits, and clinical trials; one estimates roughly a multi‑trillion drop in GDP exposure.
Later updates note administration clarifications and a federal court’s preliminary blocking of the order, but confusion and short‑term damage are seen as already significant.

Legality, Impoundment, and the Courts

Multiple comments argue this is classic “impoundment” of congressionally appropriated funds, explicitly constrained by the Impoundment Control Act.
Debate over whether a “temporary” delay is also illegal; some point out the statute explicitly covers delays.
Several expect rapid lawsuits from states and affected entities; others worry about slow litigation and the Supreme Court’s deference to expansive presidential power after the recent immunity ruling.

Impeachment, Checks and Balances, and Authoritarian Drift

Many see this as part of a pattern: firing inspectors general, ignoring Congress, using EOs to test boundaries, and learning from a first term with few constraints.
Consensus that a Republican Congress will not impeach or remove, regardless of scale of misconduct; impeachment is described as politically neutered.
Some argue the system now relies almost entirely on courts and individual civil servants; others warn that ignoring court orders would leave only mass resistance or institutional collapse.

Voters, “Chaos Agents,” and Party Symmetry

Big thread on whether voters intentionally chose a “chaos agent” versus a lesser‑evil in a corrupt two‑party system.
One side emphasizes Trump’s uniquely brazen personal corruption and authoritarian rhetoric; the other focuses on systemic “policy corruption” and donor capture across both parties.
Disagreement over Democratic performance: some cite ACA/IRA and a normal policy process; critics say these are modest half‑measures that didn’t alter deep inequality.

Foreign Aid, PEPFAR, and Moral vs. Self‑Interest

Strong concern about suspension of HIV, malaria, and other health programs (PEPFAR) affecting tens of millions abroad.
Moral arguments (“less disease benefits everyone”) clash with “why are US taxpayers on the hook” and “should be zero cost to me” positions.
Others defend aid as enlightened self‑interest and soft power: disease control, geopolitical influence, and economic ties; critics counter with debt, prioritizing domestic needs, and skepticism of US “empire.”

Spending, Deficits, and What to Cut

Deep disagreement on where the real fiscal problem lies: some target social insurance and healthcare; others insist the main drivers are military and tax cuts for the wealthy.
Several want rigorous evaluation of every line item and complain about odd or symbolic foreign‑aid projects; others note these are tiny relative to the overall budget and often mischaracterized.

NGOs, Incentives, and Administrative Chaos

Some suggest NGOs have perverse incentives: more homelessness or border crossings can mean more grant money.
Others respond that NGO staff are generally mission‑driven and that such cynicism ignores real outcomes.
Broad worry that abrupt, poorly planned freezes destroy institutional capacity that can’t be quickly rebuilt, even if courts later reverse the policy.

Geopolitics and Soft Power

Multiple comments link the cuts to a broader US retreat from global leadership, arguing China is filling the gap via Belt and Road and targeted aid.
View that dismantling US soft‑power programs will accelerate a shift toward a China‑centered order, even among traditional US allies.

HN Meta: Politics Fatigue and Moderation

Non‑US readers and some regulars complain about constant US‑politics content on a tech forum; others insist this decision is clearly relevant to technology, research, and startups.
Discussion about flagging, flamewar detection, and whether controversial US political threads are being suppressed by users or site staff.

View on HN ↗ Original Article ↗

2025-01-28

Cleveland police used AI to justify a search warrant. It derailed a murder case

AI, Deepfakes, and Individual Misuse

Commenters connect this case to broader fears about AI misuse: deepfakes, swatting, mass personalized scams, and even homebrew automated weapons.
Several note that “pictures don’t lie” was never fully true, but now cheap, scalable manipulation makes evidentiary video/photos far less trustworthy, and easy tools empower even teenagers or angry vigilantes.

US Evidence Law and “Fruit of the Poisonous Tree”

Much debate centers on the exclusionary rule: illegally obtained evidence must be suppressed, even if it is incriminating.
Some argue this is necessary to give the Fourth Amendment teeth; otherwise police can violate rights, then “use what they find.”
Others compare systems where all evidence is admissible (e.g., Sweden/Norway as described in-thread), suggesting that can work only where police are broadly trusted.

AI as Anonymous Informant & Parallel Construction

The judge’s framing of AI face ID as akin to an anonymous informant resonates: it can be an investigative tip but not probable cause for a warrant.
Multiple comments explain “parallel construction”: using inadmissible intel (like AI or foreign spying) to guide a separate, fully legal investigation.
Many note Cleveland police skipped this step and allegedly misrepresented how they got the ID in the warrant affidavit, which is what poisoned the search.

Guilt, the Gun, and Unclear Evidence

Some readers think the narrative implicitly assumes the AI was right and the suspect is the killer. Others push back: the article only says police “say” the gun is the murder weapon, with no disclosed ballistics or forensic link.
There’s concern that readers treat “a gun in the house” as proof, in a context where guns are common and forensic methods (like ballistics) can be shaky.

Police Behavior, Incentives, and Accountability

Many see this less as an “AI problem” and more as standard police misconduct: lying or omitting material facts in a warrant application.
Several argue that unless detectives and prosecutors are personally sanctioned (perjury, civil liability, career consequences), exclusion alone won’t deter rights violations.
Others respond that even the risk of losing a homicide case, media scrutiny, and possible discipline are significant deterrents—though critics cite how rarely officials are actually punished.

Clearview AI’s Position

Clearview’s own disclaimer (“not admissible in court”) is seen as an attempt to market powerful surveillance while dodging legal responsibility.
Commenters note its promise to “fight crime” yet insistence that results not be used as evidence, suggesting its primary value is as intelligence feeding parallel construction or non-judicial actions.

Race, Neighborhood, and Perception

One commenter stresses the murder occurred in a predominantly Black neighborhood, arguing that calling this straightforwardly “racial profiling” overshoots what’s in the story.
Others highlight that vague descriptors like “build, hairstyle, clothing, gait” in a heavily Black area risk funneling suspicion toward a broad racial category, especially when combined with opaque AI matching.

Anonymous Tips, AI Tips, and Legal Lines

Several explore analogies: if an anonymous caller or psychic tip can lawfully prompt further investigation (but not alone justify a warrant), why can’t AI?
The consensus from quoted legal reasoning: AI or anonymous tips are permissible as leads, but must be transparently disclosed and supplemented with independent evidence to establish probable cause.

View on HN ↗ Original Article ↗

2025-01-28

US Civil servants are being asked who they voted for in 2024 election

Parallels to Authoritarianism and Historical Precedent

Many see asking civil servants about their 2024 vote or “loyalty” as a classic authoritarian move.
Explicit comparisons are drawn to early Nazi-era civil service purges and to standard “dictator’s manual” tactics: purge non-loyal staff, centralize power, remove appeal to courts.
Some argue this is a late-stage warning sign, not an early one, given years of norm erosion and prior firings.

Legality, Civil Service Protections, and Scope

Several commenters insist this is likely illegal under U.S. civil service rules and democratic norms that voting should never affect employment.
Others note that NSC career staff are “detailed” from home agencies and simply return there if removed, framing it as reassignment, not firing.
There is dispute over scope: some claim it’s limited to National Security Council staff; others read it as a broader civil service threat.
A few say the headline overstates things, asserting staff were asked if they can support the administration’s agenda, not literally “who they voted for.”

Loyalty vs Competence and the “Deep State”

Strong concern that the administration prioritizes personal loyalty over expertise, especially in national security.
Critics warn this revives a spoils system that U.S. law tried to end in the 19th century, undermining a neutral, professional bureaucracy.
Others push back on “deep state” continuity at NSC, arguing elected leaders must be able to change course and control high-level policy staff.

Individual Responses, Fear, and Self-Censorship

Some advise workers to “just say Trump” or lie, treating it as self-preservation in an unsafe system.
Others warn that forcing people to lie about loyalty is itself a control tactic: it demoralizes, isolates potential whistleblowers, and binds them to the regime.
There’s unease that people are starting to normalize lying to government rather than insisting government not ask.

Broader Political and Social Trajectory

Several see this as part of the U.S. “speedrunning” toward failed-state or fascist dynamics, with checks and balances already badly weakened.
Others argue it’s a (dangerous) reaction to earlier bureaucratic resistance to Trump and reflects deeper failures of governance, education, and political parties.
Some commenters dismiss the article as exaggerated or partisan “fearmongering,” while others stress that prior clear warnings (e.g., Project 2025) mean no one should be surprised.

View on HN ↗ Original Article ↗

2025-01-28

Run DeepSeek R1 Dynamic 1.58-bit

Model design, scaling, and training approaches

Some hope future base models will target 128GB-class consumer hardware, e.g. MoE with ~16B active params, leveraging heavy quantization and strong routing.
Commenters note DeepSeek already uses multi-stage training where smaller reasoning models generate synthetic data for larger ones; this is compared conceptually to “dreaming”.
Discussion on FP8/INT8 training: DeepSeek’s large‑scale FP8 training without loss spikes is seen as technically notable.

1.58‑bit / dynamic quantization findings

Naive uniform 1.58‑bit quantization leads to “fried” models: infinite repetition, forgetting context, and general nonsense.
Several argue repetition penalties or advanced samplers (DRY, min_p, temperature tweaks) can mitigate symptoms but cannot restore true accuracy if probabilities are too distorted.
The “dynamic” scheme—keeping sensitive components (e.g. attention, some projections) at higher precision and applying 1.58‑bit mainly to MoE experts—largely removes the repetition problem while delivering ~80% size reduction.
Debate on how far such extreme quantization can go before it’s better to use a smaller but higher‑precision model.

Running huge MoE models: hardware and parallelism

MoE is described as memory‑bound: only a fraction of experts are active per token (e.g. 8/256), but routing incurs heavy all‑to‑all GPU communication.
Inference strategies discussed: pipeline parallelism (layer‑wise sharding), tensor parallelism, and combinations thereof.
Many compare options: multi‑3090 rigs vs 192GB Mac Ultra vs upcoming AMD APUs and Nvidia “Digits”; trade‑offs revolve around VRAM+RAM, bandwidth, power, and portability.
CPU‑only (EPYC/Threadripper) is seen as workable but slow; bandwidth, not capacity, is usually the main bottleneck.

Practical usability and benchmarks

1.58‑bit R1 reportedly reaches ~140 tok/s on dual H100s; some users get a few tok/s on multi‑GPU consumer rigs—usable but not snappy.
Several ask for standard benchmarks; lack of direct evals versus full‑precision R1 leaves “how lobotomized is it?” somewhat unclear.

Ecosystem, OpenAI, and market impact

Strong disagreement on claims that DeepSeek “kills” OpenAI: some predict OpenAI’s decline; others argue large labs will adopt similar efficiency tricks and still win via scale (Jevons paradox).
Many stress DeepSeek’s significance is cost/efficiency and the open training recipe, not just the released weights.
Ongoing concerns about censorship (e.g. on politically sensitive topics) and about calling “open weights” models truly “open source.”

Distills, Ollama, and local use

Distilled R1 variants (Qwen/Llama 7–70B) are widely used locally but are consistently reported as weaker and less knowledgeable than full R1, merely imitating its reasoning style.
Some accuse tools of marketing distills as “R1” and confusing non‑experts.
For many real workloads (RAG, narrow tasks), moderate‑size quantized models remain sufficient and cheaper to run.

View on HN ↗ Original Article ↗

2025-01-28

FTC takes action against GoDaddy for alleged lax data security

FTC action and political context

Several comments praise the FTC’s case against GoDaddy as the kind of enforcement they want to see, but note the press release predates the new administration.
Discussion branches into politics: commenters worry that the new FTC leadership will prioritize anti-DEI moves over consumer protection, citing recent FTC press releases.
Broader anxiety appears about U.S. democratic “guardrails,” presidential immunity, and the potential for abuses of power by the executive branch; others argue courts and Congress might still limit excesses, though optimism is fading.

GoDaddy’s reputation and customer base

Many describe GoDaddy as sleazy, insecure, and hostile to users: aggressive upsells, confusing UX, lock-in via tools like a non-exportable site builder, and frequent breaches.
Despite this, they remain dominant due to early advertising (e.g., Super Bowl ads), strong brand recognition, and appeal to non-technical users who are unaware of the bad reputation.
Commenters contrast this with technically oriented alternatives (Cloudflare, AWS, Gandi, Namecheap/spaceship, Porkbun), but note casual users rarely know or switch.

Security incentives, penalties, and regulation

Security professionals express frustration that breaches rarely hurt companies financially; fines are seen as a “cost of doing business,” leading executives to de-prioritize security.
Healthcare is cited as an exception: HIPAA penalties per affected person and regular training make organizations take security more seriously, though some argue real-world consequences are still weak (e.g., Change Healthcare incident).
There is debate over how high per-user penalties should be: some push for very strong fines up to or beyond profit; others warn this could destroy small services or be abused, and suggest scaled, risk-adjusted penalties instead.
Several argue that if companies had to return all revenue associated with leaked customers, they would radically minimize stored data and use existing security features properly; others oppose allowing insurance to blunt these incentives.

Security practices and training

Multiple anecdotes describe very poor practices: unchanged passwords for a decade, GoDaddy “security” add-ons that introduce new vulnerabilities (e.g., caching admin pages publicly), and support-driven social engineering takeovers of domains.
Commenters describe the wider web-hosting and “cybersecurity” industries as normalizing lax security and superficial compliance, with certifications (e.g., ISO 27001) seen as proof of spend, not of real safety.
Security awareness tools and frameworks (KnowBe4, ProofPoint, NIST guidance) are mentioned as useful starting points, though often boring or superficial; tailoring to audience and using “painful” incentives (extra training) is seen as effective.

Specific GoDaddy issues and practices

GoDaddy is criticized for:
- Charging extra for MFA and enhanced security, seen as irresponsible.
- Selling privacy services while allegedly failing to protect underlying data (Domains by Proxy dataset mentioned as leaked).
- Possible domain front‑running/parking behavior after searches, and steep markup on “premium” domains.
Network Solutions is cited as somehow worse in UX and DNS management, underscoring that the registrar market is broadly low-quality and inertia-driven.

View on HN ↗ Original Article ↗

2025-01-28

Open-R1: an open reproduction of DeepSeek-R1

What “open” means for R1 and Open-R1

DeepSeek-R1 is seen as “open-weights” only: weights are public, but training code and datasets are not.
Several commenters argue a truly reproducible “open” model needs at least code + data, ideally also weights; others say expecting full datasets is unrealistic given legal and competitive risks.
Open-R1’s goal is explicitly to rebuild the missing pieces (recipes, code, data) so others can train similar or better reasoning models, not just use DeepSeek’s weights.

Compute, cost, and feasibility of reproduction

Confusion over the widely quoted ~$5.5M figure: some clarify this was for DeepSeek V3 base model, not R1 reasoning tuning.
R1 reportedly used ~800k samples for reinforcement learning, leading some to think the “R1 trick” could be comparatively cheap once a strong base model exists.
Skepticism remains about whether Open-R1 can match R1’s performance without comparable resources or hidden tricks.

Datasets, legality, and “knowledge laundering”

Many believe no major lab will release raw training data due to copyright and terms-of-service liability, plus competitive advantage.
One discussion describes a multi-step scheme: train on copyrighted data, generate synthetic data, then train a new model on that—framed as “knowledge laundering.”
There is interest in fully open datasets (e.g., Allen Institute work, RedPajama), and proposals for a decentralized, deduplicated, community-maintained training-data archive.

Geopolitics, censorship, and trust

Debate over whether Chinese models are especially untrustworthy or just differently “massaged” compared to US/European models.
Some point out Western models are also heavily aligned and censored (especially around sexuality, politics, and safety topics).
A few commenters “trust” some Western labs slightly more on political independence, but others argue US tech firms also “bend the knee” to power.

Open source vs big tech framing

Several see DeepSeek and projects like Open-R1 as part of a broader battle: heavily-capitalized US incumbents vs open or non‑US efforts, not simply “US vs China.”
Others push back on romanticizing open models as “gifts” or morally superior, and emphasize precise terminology (“open source” vs “open weights”).

Other domains for RL with verifiable rewards

Suggested areas: law (case outcomes, codes), medical diagnosis (test results, outcomes), stochastic processes, robotics and chip design with simulators, RFP responses, management consulting, and any domain with good simulators or automated checks.

AI hype vs early web nostalgia

Some compare today’s rapid AI progress to the early web or Web 2.0—continuous excitement, but with faster information flow now.
Others express burnout: generative AI is seen as flooding the internet with low-quality content and undermining human connection.

Status of Open-R1 and criticism of the announcement

Multiple readers stress this is just an announcement of an effort, not a working R1 reproduction; some call the headline misleading without evaluation numbers.
Nonetheless, many welcome an independent, open attempt to replicate DeepSeek’s reasoning methods.

Security and backdoor concerns in local LLMs

Worry that people are now “running anything,” reminiscent of the early Windows/Internet era.
While runtimes like Ollama/llama.cpp are likened to relatively safe interpreters, commenters note that models used as agents—with tool and code-execution access—could, in theory, be trained to trigger hidden behaviors (date‑based or keyword‑based attacks).
No concrete backdoor examples are given; risk is discussed as a plausible future vector, especially for widely adopted “open” models.

Crowdsourcing and how to help

Some ask how to contribute data or effort; suggestions include crowdsourcing domain-specific data (e.g., local-language stories, speech), BOINC‑style distributed training, and building shared infrastructure for open datasets.
One commenter half-jokingly says “we don’t need human help anymore, we have DeepSeek,” reflecting both excitement and anxiety.

View on HN ↗ Original Article ↗

2025-01-28

Why OpenAI's $157B valuation misreads AI's future (Oct 2024)

Capital intensity, valuations, and funding risk

Several comments frame OpenAI’s CapEx as historically large and paradigm‑shifting, but likely damaging to the broader startup ecosystem by crowding out non‑AI funding.
Many expect a “haircut” on AI valuations when revenue/profits underwhelm, comparing this cycle to SoftBank’s failed blitzscaling bets and predicting a possible “AI nuclear winter.”
OpenAI’s ~$157B valuation is seen as disconnected from fundamentals: high revenue growth but costs scaling with usage and huge infra plans, with multiples viewed as “crazy” even by big‑tech standards.

DeepSeek’s impact: cost, moat, and credibility

DeepSeek is widely cited as evidence that model training can be much cheaper and that OpenAI’s technical moat is weak or nonexistent.
Some argue DeepSeek merely stacked known techniques (MoE, fp8, attention compression, PTX optimization, RL) and optimized under harsh constraints; impressive, but not fundamentally new.
Others question DeepSeek’s cost claims (omitted pretraining, GPU acquisition, data) and see them as partly geopolitical signaling, but agree inference efficiency is real and verifiable.

Cloud vs edge and the shrinking API margin

Many see plummeting training/inference costs and strong open models as bearish for API margins and centralized cloud AI; if high‑quality models run on phones or local clusters, why pay OpenAI?
Counterpoint: top models will still outstrip local hardware; enterprises will pay for the very best, at least for some workloads.

Is AI a fad? Lived experience vs skepticism

One camp dismisses “AI is a fad,” pointing to concrete productivity gains: coding assistants, game prototypes, customer service, legal/medical workflows, etc.
Another is unimpressed by current UX (prompting overhead, undermining personal skills) and uses “self‑driving cars” as a benchmark; they see more hype than transformative value.
Multiple engineers report huge productivity boosts (e.g., using Sonnet/Cursor as “power tools” for large codebases), insisting doubters are “holding it wrong.”

Where value will accrue: platforms vs applications

Many think foundational models will commoditize; the durable value will be in vertical, workflow‑integrated applications and niche domain models (medicine, logistics, hospitality).
There’s debate over whether application‑level moats (data lock‑in, switching costs, personalization) will be strong enough to sustain margins.

Open source and long‑term structure of the market

DeepSeek, Llama, etc. are likened to Linux: open ecosystems that eventually overpower proprietary stacks.
Some predict that in a few years, nobody will remember “Open”AI, and that human‑AI collaboration on widely available open models will be where the real breakthroughs occur.

View on HN ↗ Original Article ↗

2025-01-27

I trusted an LLM, now I'm on day 4 of an afternoon project

Role of LLMs: Junior Dev, Tool, or Copilot?

Many frame LLMs as “junior devs faking competence” or “cocky grads”: can be productive but require heavy supervision.
Others say that’s the wrong framing: LLMs don’t learn or build trust over time; the human is the one who improves at “AI wrangling.”
Several argue they’re closer to power tools or nail guns than coworkers: massive leverage if you’re in control, dangerous if you aren’t.
Some push back on “we’re not expecting a copilot” because commercial products are explicitly marketed as such.

Where They Work Well

Boilerplate, scaffolding, repetitive translation (e.g., SDKs across languages, Rust no_std setups, WASM SIMD optimizations).
Acting as an “augmented search engine” or fluent interface to docs/Stack Overflow: surfacing concepts, APIs, package options, RFC details.
Placeholder or mundane code (tests, endpoints, simple screens), code completion/autocomplete, and research on common topics.
Learning aid for some: interactive tutoring, synthesizing tutorials and Wikipedia-like content, generating questions from notes.

Where They Fail or Mislead

Niche, hardware, and low-level work (Raspberry Pi/Arduino, Linux device trees, USB quirks, C alignment rules) where training data is noisy or sparse.
Subtle logic, TS generics, UI/UX details, multi-component interactions, and “second-order” issues.
Hallucinated APIs/behaviors, plausible but wrong math, and inability to admit ignorance.
Long conversations accumulating context drift; repeated “fixes” that reproduce the same bug; 99%-right code that hides a brutal 1% error.

Effective Usage Patterns

Seniors or people with strong fundamentals benefit more: they can say “this is obviously wrong” and use LLMs for acceleration, not substitution.
Strategies:
- Maintain explicit specs/CONVENTIONS and feed them in each time.
- Use LLMs to generate tests/boilerplate, then do refactoring and design yourself.
- Restart chats when the model gets stuck; load specific docs instead of relying on generic knowledge.
- Use adversarial/custom prompts to fight sycophancy and force self-critique.

Broader Concerns and Disagreements

Split between users who find LLMs transformational for side projects and those who waste days in “wild goose chase” debugging.
Worry that juniors will become dependent and never build deep skills, while seniors get even more leverage (“LLM paradox”).
Skepticism about hype that frames LLMs as imminent job-killers; many see them instead as mess-creators demanding more human oversight.

View on HN ↗ Original Article ↗

2025-01-27

Developers should embrace creative coding again

Business pressure, utilization, and lost creativity

Several comments argue that high “utilization” targets and distrust of employees have squeezed out slack time, which used to support experimentation, quality improvements, and creativity.
Queueing theory is cited: maximizing resource utilization (keeping devs 90–100% busy) increases wait times and harms throughput, even ignoring creativity.
Many see “Agile” as having degenerated into Taylorism plus dashboards, despite formal agile frameworks warning against overloading teams.

Is programming inherently creative?

Some say businesses treat creativity as belonging to UI/UX or “creative roles,” while engineering is seen as mere implementation to be optimized.
Others push back: engineering itself is described as deeply creative problem-solving, often more like a craft or trade than pure “execution.”
There’s frustration at interview practices focused on algorithms over craft, which reinforces a non-creative view of software work.

Figma, AI, and replacing workers

Several comments note the irony of a Figma “developer advocate” urging creativity while working for a company many see as automating away jobs (including developers via AI).
Broader point: most developers already build automation that replaces non-developers; some see it as hypocritical to object when automation targets developers.
A long tangent debates socialism vs capitalism as responses to job-displacing automation and who benefits from productivity gains.

What “creative coding” the article is actually about

Multiple readers initially find the article’s thesis unclear; others summarize it as: “do more digital art/expressive web work using HTML/CSS/SVG and modern browser features, instead of template-driven sameness.”
It’s framed as a reaction to the dominance of Wix/Squarespace/Bootstrap/Tailwind-like sameness despite powerful browser capabilities.
Critics say the post focuses on “unique design” and shiny CSS features rather than the deeper math/graphics/algorithmic side of creative coding (e.g., Processing, p5.js, demoscene-style work).
Some see it as corporate, sanitized “career advice” and subtle marketing for Figma, not a serious exploration of creative coding.

Native desktop vs web apps (big subthread)

Many lament the dominance of Electron/web apps and wish for fast, native desktop software that fully uses modern hardware. Examples of snappy native apps are contrasted with sluggish chat/music Electron clients.
Others argue the browser is effectively a cross-platform app runtime with good abstractions and sandboxing; rebuilding that natively is costly, especially for multi-platform support and updates.
There’s broad agreement that web UI primitives are too limited for rich “full-fat” desktop-style apps; lack of built-in, high-quality widgets leads to many buggy, custom reimplementations and a perception of bloat.
Supporters of web apps highlight: centralized updates, better default sandboxing, easy syncing and multi-device access, and organizational simplicity when managing many users.
Critics counter that native apps can be sandboxed, synced and tabbed as well, and that web-based distribution shifts control to cloud providers and weakens user control over data.

Tooling, AI, and the creative process

One thread argues that to “creatively code” you must genuinely code: relying heavily on AI/IDE assistance can drown out the reflective, exploratory thinking that sparks creativity.
Some developers intentionally use simpler editors and minimize AI/docs to “wrestle with the code” and treat programming as a way of thinking out loud.

Crypto, NFTs, and contemporary creative coding

Commenters point out that generative-art NFT communities (especially on Tezos) were vibrant centers of creative coding from about 2019–2022.
Others admit the scene produced interesting work but say the surrounding greed and hype made them dismissive of it, and question how much of it was real “art.”

UI creativity vs usability and business needs

Several participants stress the tension between surprising, delightful interfaces and predictable, learnable ones.
For portfolios and personal experiments, “wild” creative sites are celebrated; for corporate or productivity tools, predictability and efficiency usually win, or clients will reject the work.
Some frame the author’s call as really about “unconstrained, personal play” with technology—projects that don’t need to justify themselves with business cases.

Template cultures and Tailwind

Tailwind is cited as both an enabler of uniqueness (after the Bootstrap era) and an example of convergence: over time a recognizable “Tailwind style” emerged around influential examples.
Broader point: tools that can support creativity don’t guarantee creative outcomes; social imitation and familiarity pull designs back toward common patterns.

Fatigue with “you should…” culture

One commenter voices exhaustion at constant pressure to learn new frameworks, do side projects, study AI, and treat every hobby as a startup opportunity.
This is linked back to the article: the same industry that pushes relentless productivity and careerism is now telling developers to “be more creative” in their off-hours.

View on HN ↗ Original Article ↗

2025-01-27

The Illustrated DeepSeek-R1

Author and Foundations vs. Fast-Changing Details

Commenters praise the “Illustrated …” series as high quality and view the author’s name as a reliability signal.
There’s skepticism that an LLM book can stay current, countered by the argument that core foundations (gradient descent, tokenization, embeddings, self-attention, MLPs, SFT, RLHF) change slowly even if products evolve quickly.
Minor debate over mentioning “gradient descent” but not “Transformer encoders”; clarification that modern top LLMs are decoder‑only, with self‑attention + MLP as the core, while encoders remain for some tasks.

DeepSeek Training Data, Distillation, and “I’m GPT‑4” Confusion

Several people don’t understand how DeepSeek trained so cheaply, or what the 14.8T tokens actually were, given the vague corpus description.
Some suspect heavy use of GPT‑4 outputs or outright distillation; others counter that:
- Distillation via OpenAI API would not be cheaper (you still pay for compute and API).
- Lack of probability outputs from OpenAI makes true distillation harder; more likely they fine‑tuned on public GPT‑4‑style datasets and broad web “radiation.”
Multiple users note many models say they’re ChatGPT/OpenAI due to training on web text where those terms dominate, not because they literally are those models.

Synthetic Data, Reasoning, and Creativity Evaluation

Commenters highlight that large‑scale synthetic chain‑of‑thought data (hundreds of thousands of long CoT traces) is novel and expensive, and that CoT is explicitly part of training, not just an inference trick.
There’s excitement that verifiable reasoning tasks look “solvable” via synthetic data; concern shifts to qualitative/creative domains.
Debate on whether creativity can be evaluated:
- One side: art is irreducibly subjective and culture‑dependent.
- Other side: you can model raters + artworks, decompose images into features (composition, rule of thirds, familiarity vs. surprise), and learn predictive “creativity/appeal” scores.
Distinction is drawn between combinatorial “card‑shuffling” creativity vs. inventing truly new concepts; some think the latter might emerge from noisy latent reasoning.

Technical Innovations Behind DeepSeek V3/R1

Summary of claimed improvements:
- Latent Multi‑Head Attention: low‑rank compression of KV matrices to trade memory for compute with small accuracy loss.
- MoE with one shared expert + many small experts, with only a subset active per token; plus improved load balancing via biasing underused experts.
- Multi‑token prediction heads that train on several next‑token predictions, thought to be key for better sequence modeling.
- FP8 where possible and extensive infrastructure work (e.g., DualPipe, efficient all‑to‑all) to overcome bandwidth limits on H800 GPUs.
Commenters are unsure which change drives the reported ~10× efficiency; many suspect multi‑token prediction + infrastructure as main contributors and note some skepticism versus the marketing claims.

R1 Capabilities: Impressed vs. Underwhelmed

Some users find R1 or its distilled variants dramatically better than prior open models for coding, especially when run locally, and see RL‑style reasoning (search over action sequences + reward) as a fundamental qualitative jump over pure RLHF “vibe checks.”
Others report R1 performing notably worse than o1/o1‑pro on complex real‑world coding and scientific tasks, with visible hallucinations and reasoning loops; they point to R1’s own paper admitting only modest gains over V3 on software engineering tasks.
Consensus: R1 is a significant research and engineering milestone, but its practical reasoning quality vs. top proprietary models is contested and task‑dependent.

Cost, Openness, and Geopolitical Shock

Many see the real story as:
- A Chinese team, constrained by export controls and weaker hardware, achieved near‑frontier benchmarks at dramatically lower reported training cost.
- Open weights and unusually detailed training disclosures reduce the mystique around closed‑model “secret sauce” and may weaken Nvidia’s and OpenAI’s perceived moats.
Some emphasize that DeepSeek is not a casual side project but a serious spin‑out repurposing hedge‑fund compute and talent.

Censorship, Alignment, and Political Influence

Long subthread debates using a Chinese model vs. Western ones:
- One concern: large‑scale deployment of a censored/“approved” model could shape homework help, political recommendations, and global narratives.
- Others respond that Western models are also heavily “aligned” (censored) and trained on dubiously obtained data; “nobody is innocent,” so cost and openness matter more.
Some propose using a panel of diverse models to balance national and ideological biases.
Several note that R1 itself exhibits both censorship and alignment; diversity of models is seen as a strength.

HN Meta and Presentation Style

Users are puzzled that such a “high signal” post fell off the front page quickly while lower‑point posts linger; algorithm behavior remains opaque.
One commenter criticizes the “illustrated” style as mostly text‑in‑boxes rather than deeply visual explanations, suggesting inspiration from more sophisticated visualization thinkers.

View on HN ↗ Original Article ↗

2025-01-27

Go 1.24's go tool is one of the best additions to the ecosystem in years

Perceived Benefits of go tool

Many see first-class support for per-project tools as closing a long-standing gap around “metaprogramming” and codegen that had been handled ad‑hoc with go generate, Makefiles, or tools.go.
Advocates say it:
- Ensures consistent tool versions across developers and CI.
- Integrates with Go’s reproducible builds and security scanning.
- Is simpler and more “Go-native” than Bazel/Buck for pure-Go shops.
Some teams at large companies report that something like go tool is “sorely needed” even with sophisticated internal tooling.

Concerns About go.mod Pollution and Dependency Isolation

A major complaint: tool dependencies are added as regular indirect deps in go.mod, with no clear annotation or separation from runtime deps.
This:
- Bloats go.mod (one example: 93 → 247 lines), making manual review difficult.
- Merges tool and app dependency graphs, so version conflicts and vulnerability noise become more likely.
Go team rationale: keeping a single dependency graph simplifies security scanners and tooling; module graph pruning prevents most tool deps from propagating downstream.
Workarounds and alternatives:
- Use comments (desired but not yet implemented) or a separate tools.mod with -modfile.
- Continue using go install, Makefiles, local bin/, or env tools (Nix, flox, devenv, mise).

Build Performance and Caching Behavior

Some worry that rebuilding on every go tool call is too slow, especially on CI.
Clarifications:
- Tools aren’t fully rebuilt each time; Go checks they’re up to date and uses caching (extended in 1.24 to cache final executables for both go run and go tool).
- Benchmarks show large speedups from caching, but there is still noticeable overhead for large packages and version resolution.

“Dev Dependencies” and Version Management Model

go tool is essentially Go’s take on dev-dependencies, but:
- There are no version ranges; Go uses Minimum Version Selection with semver-major split into distinct modules.
- Some argue this is elegant and deterministic; others think relying on “newer is always compatible” is unrealistic and risky.
Critics argue tools should keep their own isolated dependency graphs (like go install module@version or a Nix shell) so that:
- Tools aren’t run against untested dependency versions.
- Application upgrades aren’t blocked by tool constraints (and vice versa).

Language-Specific vs General Build/Tooling Systems

Several compare go tool with:
- Bazel/Buck/Pants (powerful, cross-language, but complex and heavy).
- Nix, conda, flake-based or containerized dev environments (more generic & tool-agnostic).
One camp argues language-specific tooling gives a better UX, stronger conventions, and less configuration.
Another favors language-agnostic environments where all tools (Go and non-Go like protoc) are declared and versioned together, seeing go tool as partial and Go-centric.

Broader Sentiment About Go Tooling and Process

Some praise Go modules and MVS as best-in-class vs Java/Python/JS ecosystems; others describe Go’s module tooling as confusing, “non-deterministic,” or historically late and opinionated.
A subset sees go tool as a modest quality-of-life improvement, not a game-changer, and worry it nudges the community into a problematic pattern of mixing tool and app deps.
Others are unconcerned because the feature is optional and compatible with existing approaches like tools.go, Makefiles, or external env managers.

View on HN ↗ Original Article ↗

2025-01-27

We're bringing Pebble back

Overall reaction

Many commenters are intensely excited and nostalgic; several still daily‑drive original Pebbles, have multiple units, or kept one “waiting for this day.”
Others are openly wary: they felt “rug‑pulled” when Pebble shut down and/or Kickstarter devices never shipped, and say they won’t pre‑order or crowdfund again without strong assurances.

Why people loved Pebble

Core virtues repeatedly cited:
- Always‑on reflective/MIP display that’s sunlight‑readable with long battery life (5–10+ days).
- Physical buttons that work without looking, with gloves, in the shower, and while biking or swimming.
- Simple, coherent, non‑touch UI (timeline for calendar, easy media controls, quick alarms/timers).
- A playful, “soulful” UX with great animations and little flourishes.
- Hackability: C SDK, CloudPebble, JS companions, watchface generators, vibrant hobbyist ecosystem.
Many see Pebble as correctly designed as a “phone extension,” not a tiny phone on your wrist.

Skepticism and trust

People burned by the shutdown, bricked devices, and the canceled Time 2/Core stress “fool me once…” and want:
- No VC‑driven growth at all costs.
- A sustainable business with realistic pricing and modest ambitions.
Some distrust Google but also praise it for open‑sourcing PebbleOS instead of letting it rot.
A few complain the repo doesn’t compile; others explain missing pieces are third‑party proprietary components (BT stack, vendor libs).

Comparisons to other wearables

Garmin: closest in spirit (MIP displays, buttons, long battery, strong fitness), but widely criticized for clunky, inconsistent UX, confusing menus, proprietary charging, and cloud‑dependent app. SDK viewed as limited and sandboxed.
Apple Watch: praised for health and swim tracking, payments, and OS integration; criticized for short battery life, touch‑heavy UI, and “soulless,” locked‑down feel.
Other mentions: Amazfit, Fitbit, Withings/Fossil hybrids, Bangle.js, PineTime, SensorWatch, Casio/Timex, Oura. None are seen as hitting the same UX + openness + battery niche.

What people want from a new Pebble

Broad agreement:
- Keep reflective/always‑on display, long battery, buttons, and simple UX.
- Maintain or modernize the Timeline‑style calendar and great notification/media controls.
- Robust, repairable hardware; nicer designs including a Round‑style option; ideally user‑replaceable battery.
Divided wishes:
- Some want solid health/fitness sensors (HR, SpO2, GPS, sleep, swim tracking, fall detection).
- Others explicitly want to avoid fitness bloat and preserve thin, light, “just a tool” design.
- Contactless payments and LTE/eSIM are highly desired by some, seen as overkill or unrealistic by others.

Openness, ecosystem, and privacy

Strong calls for:
- Keeping PebbleOS open, publishing companion apps, and ideally making cloud services self‑hostable (no forced subscriptions).
- Good Linux/Android integration and compatibility with Gadgetbridge/Home Assistant.
Rebble is widely praised for keeping old devices alive; people want the new effort to embrace that ethos.

Site / messaging

The opening animation and the playful “No” redirect (Apple/Pixel watch) are widely enjoyed.
Some feel sending people to an Apple Watch is a bit salty toward Google; others find it perfectly on‑brand humor.

View on HN ↗ Original Article ↗

2025-01-27

Google open-sources the Pebble OS

Pebble’s Appeal and Nostalgia

Many commenters still consider Pebble the best smartwatch they’ve owned.
Praised traits:
- Always-on, highly readable transflective LCD (“epaper-like”) with great daylight visibility.
- Multi-day battery life (often ~5 days or more) without “power-saving” compromises.
- Simple UI with physical buttons, minimal features, and polished, playful details (animations, notification handling).
- Cheap, thin, “second screen for the phone” rather than a heavy health platform.
Several say they never found a satisfactory replacement after their Pebble died.

Clarifying the Display Myth

Multiple comments correct the common misconception that Pebble used e‑ink.
All Pebbles used low-power reflective / transflective LCDs (Sharp Memory-in-Pixel), visually similar to e‑paper but technically different.
Debate over tradeoffs: washed-out colors vs. dramatically better battery life and sunlight readability.

What Google Released (and What’s Missing)

The repo is explicitly “for information only” and does not build as-is.
Removed proprietary pieces:
- System fonts
- Bluetooth stack (only an emulator stub left)
- STM32 peripheral library
- Voice codec
- ARM CMSIS (old, awkward licensing)
- Pebble 2 HR heart-rate driver
Commenters note:
- CMSIS and STM libraries are now available under more permissive terms, and BT stacks, codecs, and fonts have better open alternatives than a decade ago.
- The missing Bluetooth stack is seen as the most serious gap for real devices.

Impact for Existing and Future Hardware

Current Pebble users hope this will:
- Extend the life of old devices via community fixes.
- Enable new hardware (“rePebble” and similar projects) reusing the OS.
Some warn that new hardware is hard to make sustainable for small companies unless existing ODM platforms can be reused.

Reactions to Google’s Move

Strong appreciation that Google open-sourced a dead product rather than burying it.
Counterpoint: the platform was effectively killed years earlier via Pebble → Fitbit → Google acquisitions.
Discussion on internal dynamics: this likely required persistent advocacy inside Google, but had little direct business value and some legal risk.
Speculation on motives ranges from pure goodwill to low-cost PR and “commoditizing complements” in the smartwatch ecosystem.

Technical Side Threads

Light discussion of:
- Pebble’s FreeRTOS-based architecture and custom malloc/heap implementation.
- Floating-point comparisons in the math library and when they’re acceptable.
- Broader RTOS landscape (FreeRTOS, Zephyr, NuttX, Rust-based systems).

View on HN ↗ Original Article ↗

Hacker News, Distilled

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics

Related topics