How we run Firecracker VMs inside EC2 and start browsers in less than 1s

Overall focus

  • Discussion centers less on the article’s mechanics (already known) and more on:
    • Why Firecracker VMs vs containers/Lambda.
    • Startup-time tricks and scaling strategies.
    • Security/isolation tradeoffs.
    • Ethical implications of “stealth” browsers and bot evasion.

Firecracker, nested virtualization, and EC2 choices

  • Some note that EC2 itself is a VM; Firecracker here runs as a VM-in-VM via nested virtualization.
  • Only certain EC2 instance families currently support nested virtualization; supply is limited and some report reduced stability vs metal.
  • Others highlight that metal instances have better performance but are slow to start/stop.
  • Several comments stress Firecracker’s stronger isolation vs containers, especially for untrusted browser workloads and frequent container-escape CVEs.

Containers vs microVMs for browsers

  • Multiple posters ask why not just use Docker/containers.
  • Arguments against containers:
    • Not considered a strong security boundary; share host kernel.
    • Slower startup and heavier resource use vs Firecracker snapshots.
  • Counterpoints:
    • Docker escapes are “very hard” according to some; disagreement exists on how weak the boundary really is.
    • For some workloads, Chromium in containers or simple subprocesses is workable, just harder to scale and secure.

Performance, snapshots, warm pools

  • Firecracker VM startup (20 ms) is fast; browser startup (300 ms) dominates.
  • Snapshotting of post-OS, pre-Chromium state is used to cut startup; future work aims to snapshot with Chromium already initialized, targeting ~50 ms end-to-end.
  • Debate over “warm pools” of pre-running browsers:
    • Proponents: simple, near-zero latency if pool sizing is right.
    • Opponents: idle capacity is costly, complex to tune for many feature combinations; snapshots may beat warm pools in both cost and speed.

Browser engine, GPU, and Android

  • Chromium retained primarily for “stealth” and anti-detection; alternatives like Lightpanda seen as easier to fingerprint.
  • GPU passthrough is not available in Firecracker; workloads rely on CPU and software rasterization, which is considered acceptable for most automation.
  • Android browser VMs are reported as very heavy, slow to start, and harder to scale; currently not production-ready for this use case.

Ethics and impact of stealth automation

  • Major subthread debates whether bypassing anti-bot measures is ethical:
    • Critics: services like this increase bot traffic, drive harsher CAPTCHAs/anti-bot tech, externalize costs to site operators, and often rely on residential proxies with questionable consent.
    • Defenders: distinguish between abusive mass scraping and light, purpose-driven automation (price monitoring, ticket watching, link checking, archival, privacy tools, counterfeit detection, legacy SaaS automation).
    • Some argue ToS violations ≠ inherent unethical behavior; context and intent matter.
    • Others see it as entitlement to others’ resources “because you want it.”
  • Concerns raised that escalating bot/anti-bot arms races make the web more hostile for humans, but some users claim automation is now necessary because many websites have become so user-unfriendly.

Alternative approaches and cost comparisons

  • Some run Chromium in AWS Lambda container images, citing simpler architecture and automatic scaling; tradeoff is cold starts and 15-minute limits.
  • For short screenshot jobs, Lambda billed per millisecond is reported as cheaper; for longer-running automation, per-minute VM billing may win.
  • Other alternatives mentioned: Cloudflare Workers-like products, Shellbox-style sandboxes, Kubernetes + Kata, or just processes with Chrome’s own sandbox.
  • View that EC2 (plus Firecracker) is “more scalable than your wallet,” while others note opportunity cost of idle capacity and the value of spot/preemptible pricing.

Security and broader web ecosystem

  • Repeated emphasis that containers are not widely treated as hard security boundaries, whereas microVMs are.
  • Some see bot traffic as a long-standing cost of running any public service; others argue that scale and impact have worsened significantly.
  • Thread surfaces broader frustration with anti-user web practices (tracking, ads, CAPTCHAs) and the increasing shift toward AI and automation as a more pleasant interface to web content.