Niantic plans a “Large Geospatial Model” trained on Pokémon Go player data
Scope and status of Niantic’s “Large Geospatial Model” (LGM)
- Several commenters stress this is mostly a plan/vision document, not proof a large unified model has been trained.
- Confusion is attributed to an editorialized HN title using past tense (“trained”) versus Niantic’s more aspirational language.
- Some see the post as positioning Niantic as an “AI company” to investors, leveraging its dataset more than demonstrated model capabilities.
How the tech works and what’s new
- Described as a large-scale evolution of existing Visual Positioning Systems (VPS): photo → camera pose, using dense 3D point clouds built from many scans.
- Key challenges mentioned: scale beyond room-sized point clouds, keeping localization robust to lighting/weather, and avoiding “hallucinated” wrong locations under uncertainty.
- LGM is framed as replacing explicit point-cloud databases plus feature matching with a single learned model; some are skeptical it will scale better.
Data sources, scale, and quality
- Niantic reportedly has ~10M scanned locations, ~1M “activated” for VPS, and ~1M new scans/week; people debate whether that implies multiple scans per location.
- Much data comes from deliberate “scan this POI” tasks in Pokémon Go/Ingress, not from casual AR battle use.
- Commenters note data quality issues: outdated or removed POIs, obstructed views, poor GPS in dense cities, night-time scans rejected, and users often scanning sidewalks or hands instead of targets.
Privacy, consent, and intelligence ties
- Strong privacy concerns: location + imagery can reveal habits, events, seasons, and trajectories; risk of deanonymization at sparsely visited places.
- Debate over whether models will encode only pose or richer “cultural” signals, given Niantic’s mention of broader applications.
- Multiple references to Niantic’s historic ties to In-Q-Tel/CIA and possible interest from intelligence agencies; others question how much extra value this adds beyond existing data holdings.
Ownership, fairness, and “free labor” debates
- Many players feel “tricked” into unpaid data collection; others counter that they received value via a free game, items, and exercise.
- Large argument over whether contributors to crowdsourced datasets deserve access to resulting models or datasets.
- Some propose geospatial data should be treated as a public commons; others argue Niantic added significant value by organizing and processing it.
- Questions raised about informed consent (especially for children) and whether ToS-based consent is ethically or legally adequate (GDPR mentioned).
Potential applications and risks
- Positive uses cited: AR navigation and HUDs, robotics and autonomous vehicles, indoor/outdoor relocalization, search-and-rescue, VR/AR world-building, procedural city/planet generation, GeoGuessr-like tasks.
- Darker possibilities: mass OSINT/geolocation of images, surveillance, military/intelligence targeting, AI-guided weapons.
- Some see it as a natural extension of prior work (Photosynth, NeRF-like models); others are impressed by the decade-long data-collection vision.