The semantic web is now widely adopted
LLMs vs. Semantic Web
- Many argue LLMs will make manual semantic markup redundant: they already do high‑accuracy extraction/categorization for many tasks and will get cheaper and better.
- Counterpoints:
- LLMs “routinely get stuff wrong,” including catastrophic errors (e.g., mislabeling people as criminals).
- Automation scales errors: even a 1% error rate can mean thousands of serious mistakes per day.
- LLMs are opaque, non‑deterministic, and trained on SEO spam and blogspam, so they inherit those pathologies.
- Some see LLMs and semantic tech as complementary: LLMs can help build or fill out metadata and knowledge graphs; structured data and ontologies can ground and constrain LLMs (“neuro‑symbolic” approaches).
Incentives, Trust, and Business Case
- Key reason cited for “classic” Semantic Web failure: no business case for publishing rich open data. It reduces clicks and helps competitors and aggregators.
- Strong incentives exist to lie or game metadata (SEO), so publishers cannot be the sole source of semantic truth.
- Trust is unsolved: users need ways to judge reliability, provenance, and authority; simple vocabularies don’t address this.
Current Adoption and Practical Uses
- JSON‑LD + schema.org is widely deployed for SEO, rich Google results, and link previews; CMS plugins generate it automatically.
- Other formats appear in niches: RDF/XML in PDFs and archives, MARC/BibTeX in libraries, RSS/Atom, microformats, RDFa/Microdata.
- Semantic tech is reportedly used in enterprise/sector contexts (e.g., European electricity grids, procurement, enterprise knowledge graphs), often internally.
Gap from Original Semantic Web Vision
- Many see today’s usage (author/title/image/date for previews, basic structured data) as a drastic retreat from the original “Web‑scale queryable knowledge graph” dream.
- Lack of a “killer app” for ordinary users is emphasized; benefits mainly accrue to large platforms and scrapers.
Formats, Tooling, and Developer Experience
- JSON‑LD is seen as pragmatic for CMSes but aesthetically disliked (data in
<script>blobs, duplication, namespaces). - Microformats/RDFa embed semantics inline but are harder to maintain and poorly supported by tooling and browsers.
- Ontology work is considered cognitively heavy; global, static ontologies are viewed as unrealistic, and mapping between competing schemas is hard.
Search, Semantics, and User Control
- Pure keyword search is limited but transparent and user‑driven; semantic/ML search can obscure user intent and favor “average” interpretations.
- Some propose user‑controlled classifiers and knowledge graphs to mitigate the publisher vs. reader incentive misalignment.
- Overall, participants see the “semantic web” ideas as valuable, but adoption, incentives, and human–machine meaning gaps remain unresolved.