Will Amazon S3 Vectors kill vector databases or save them?

Integrated document + vector storage

  • Some want a single doc store that natively handles both text/metadata and vectors; many vector DBs are perceived as “just storing vectors.”
  • Others note existing options that combine both: Chroma, Azure AI Search, Elasticsearch, Vespa, MongoDB Atlas, Postgres, SQLite, and various vector indexes layered on general-purpose DBs.
  • There’s interest in upcoming or niche systems that tightly integrate search with storage and return exact spans, not just documents.

AWS S3 Vectors: role, design, and limitations

  • Many see S3 Vectors as a “lightweight, good-enough” primitive rather than a full search engine: useful for cheap, cold, low-QPS retrieval, but not a replacement for systems like Milvus, Elasticsearch, or Turbopuffer.
  • Limitations raised: topK capped at 30 (smaller when filters apply), no clear hybrid (dense + sparse) search story, unknown or undocumented latency characteristics.
  • Some argue it’s premature to judge performance from a Preview release, since AWS historically raises quotas and improves behavior at GA.

Cost, performance, and vector DB value

  • One cited case: an AI note-taking app spends more on vector search than on LLM calls, surprising some readers and provoking discussion about memory-heavy HNSW indexes and expensive managed services.
  • Commentary that vector DBs earn their keep with latency, recall, hybrid search, and integrated pipelines; S3-backed systems and services like Turbopuffer or LanceDB aim to cut storage costs while caching hot data.
  • Others emphasize that if you start sending full documents as context, LLM costs can easily dominate again.

Documentation opacity and AWS internals

  • Multiple comments lament AWS’s sparse documentation on internal behavior (e.g., S3 Vectors filtering pipeline, ALB load balancing, DynamoDB scaling).
  • Arguments:
    • Users need to understand performance trade-offs (indexing, filtering, scaling) to design architectures.
    • AWS teams fear that documenting details makes them de facto contracts, complicating future changes and migrations.
  • Counterpoints: Hyrum’s Law means customers will depend on observed behavior anyway; reverse-engineering is now an implicit “shadow cost” of cloud use.

Security, censorship, and data access

  • One view: by hosting vectors, AWS could “meta-optimize” infrastructure, support censorship more cheaply (re-using customer embeddings), and increase lock-in via proprietary embedding models.
  • Pushback:
    • AWS’s data-plane vs control-plane separation means they supposedly can’t casually inspect customer data; specialized regions (GovCloud, HIPAA-eligible services) are more about compliance and segmentation than routine access.
    • Skepticism about the censorship thesis: similar concerns would apply to any managed database.
  • Some speculate (unclear, not evidenced) that cloud providers may already be synthesizing/deriving training corpora from customer data, even if PII is scrubbed.

Postgres/pgvector and general-purpose DBs vs. vector DBs

  • One camp: Postgres + pgvector (and similar extensions) is “good enough” for most workloads (up to millions of vectors), keeps data co-located, is OSS, and avoids operational overhead and vendor risk of specialized vector DBs.
  • Another camp: for “hot loop,” low-latency, or very large-scale workloads, Postgres/pgvector is inadequate; you’ll hit performance and replication gymnastics, and dedicated systems provide better recall, latency, and indexing.
  • Rough consensus: pgvector is great for prototyping, small/medium or non-core workloads; specialized DBs shine at 10^8–10^9 vectors, complex filters, and heavy throughput.

Alternative tools and directions

  • Mentioned options: Turbopuffer (S3-backed with caching, BM25, recall tuning), LanceDB (object-store-based, S3-compatible, cheap), Cloudflare Vectorize (very low per-vector cost), Qdrant, on-device/edge stores like ObjectBox.
  • Some see S3’s move as part of a broader play against data platforms like Databricks by making S3 more query- and analytics-capable over time.
  • A few think S3 Vectors is “game changing”; others see it as another tier in a maturing, multi-layered vector ecosystem rather than a killer of vector databases.