20 years of Google Scholar

Perceived value of Google Scholar

  • Widely praised as a “net good” and “fantastic” resource; many say they use it daily and can’t imagine pre-Scholar research workflows.
  • Seen as vastly superior to most university library search tools and as having broken proprietary search monopolies (e.g., Scopus/Web of Science).
  • Used informally as a CV/author-profile system and for tracking citations, including long-forgotten papers.
  • Especially valued for indexing theses, preprints, and case law in addition to journal articles.

Concerns about Google as steward

  • Strong worry that, like other beloved Google products, Scholar could be killed or degraded, leaving inferior alternatives.
  • Some argue it survives mainly because it is cheap to run, piggybacks on core search infrastructure, and is useful to Google’s own researchers and AI efforts.
  • Others lament the “soulless” / unsupported feel: no clear strategy, minimal support channels, and fragile dependence on a single proprietary provider.

Features, limitations, and annoyances

  • No official public API is a recurring grievance; attempts to build tools on top of Scholar hit rate limits and IP bans.
  • Sorting “by date” is criticized for implicitly restricting to the last year; users report confusion and see it as intentionally “broken,” possibly for anti-scraping or contractual reasons (though this is disputed).
  • Access is often blocked for users on self-hosted VPNs or after “unusual traffic” (e.g., from browser extensions), with no effective recourse.
  • Metadata quality is sometimes “wildly inaccurate.”
  • Positive notes include email citation alerts, case-law coverage, and new features like PDF reader/AI outlines and article audio with word highlighting.

Bibliometrics and citation metrics

  • Some criticize Scholar for reinforcing the “bibliometrics game” and want author pages default-sorted by date instead of citation count.
  • Others strongly defend citation-based sorting as the most useful default for typical readers seeking influential work; they see date-sorting as a one-click alternative.
  • Debate arises over the value of low-citation “hidden gems” versus the collective judgment encoded in highly cited work.

Access to papers and open access

  • Common workflow: use Scholar to discover papers, then rely on Sci‑Hub, Anna’s Archive, LibGen, etc. for full texts.
  • Several note Sci‑Hub’s stagnation and declining coverage; open-access (OA) mandates and repositories are seen as improving but uneven across fields.
  • Institutional access disparities are highlighted; some rely on emailing authors for “legal” copies, but this is seen as high-friction and impossible for older/obscure work.

Alternatives and complementary tools

  • Semantic Scholar is frequently recommended: smaller but higher-perceived data quality, open data/API, willingness to fix errors, plus AI features (summaries, “influential citations”).
  • Critiques of Semantic Scholar include overemphasis on a questionable “highly influential citations” metric, nudging toward its Semantic Reader UI, and use of Google Analytics.
  • OpenAlex is praised as an open successor to Microsoft Academic Graph, with a strong free API and good Python tooling.
  • Field-specific tools: PubMed/Entrez (life sciences), dblp (computer science), Scopus/Web of Science, ResearchGate, and institutionally licensed systems.
  • Consensus: no single alternative fully matches Scholar’s breadth; effective practice is to combine several tools.

Broader views on Google products

  • Thread contains broader comparisons of Google with other tech giants and discussions of other “net-positive” products (Search, Maps, Docs, Gmail) versus concerns about surveillance, ads-driven priorities, and product shutdowns.
  • Some fear public celebrations of Scholar’s anniversary might draw unwanted managerial attention and raise its risk of cancellation.