2024-11-18

20 years of Google Scholar

Perceived value of Google Scholar

Widely praised as a “net good” and “fantastic” resource; many say they use it daily and can’t imagine pre-Scholar research workflows.
Seen as vastly superior to most university library search tools and as having broken proprietary search monopolies (e.g., Scopus/Web of Science).
Used informally as a CV/author-profile system and for tracking citations, including long-forgotten papers.
Especially valued for indexing theses, preprints, and case law in addition to journal articles.

Concerns about Google as steward

Strong worry that, like other beloved Google products, Scholar could be killed or degraded, leaving inferior alternatives.
Some argue it survives mainly because it is cheap to run, piggybacks on core search infrastructure, and is useful to Google’s own researchers and AI efforts.
Others lament the “soulless” / unsupported feel: no clear strategy, minimal support channels, and fragile dependence on a single proprietary provider.

Features, limitations, and annoyances

No official public API is a recurring grievance; attempts to build tools on top of Scholar hit rate limits and IP bans.
Sorting “by date” is criticized for implicitly restricting to the last year; users report confusion and see it as intentionally “broken,” possibly for anti-scraping or contractual reasons (though this is disputed).
Access is often blocked for users on self-hosted VPNs or after “unusual traffic” (e.g., from browser extensions), with no effective recourse.
Metadata quality is sometimes “wildly inaccurate.”
Positive notes include email citation alerts, case-law coverage, and new features like PDF reader/AI outlines and article audio with word highlighting.

Bibliometrics and citation metrics

Some criticize Scholar for reinforcing the “bibliometrics game” and want author pages default-sorted by date instead of citation count.
Others strongly defend citation-based sorting as the most useful default for typical readers seeking influential work; they see date-sorting as a one-click alternative.
Debate arises over the value of low-citation “hidden gems” versus the collective judgment encoded in highly cited work.

Access to papers and open access

Common workflow: use Scholar to discover papers, then rely on Sci‑Hub, Anna’s Archive, LibGen, etc. for full texts.
Several note Sci‑Hub’s stagnation and declining coverage; open-access (OA) mandates and repositories are seen as improving but uneven across fields.
Institutional access disparities are highlighted; some rely on emailing authors for “legal” copies, but this is seen as high-friction and impossible for older/obscure work.

Alternatives and complementary tools

Semantic Scholar is frequently recommended: smaller but higher-perceived data quality, open data/API, willingness to fix errors, plus AI features (summaries, “influential citations”).
Critiques of Semantic Scholar include overemphasis on a questionable “highly influential citations” metric, nudging toward its Semantic Reader UI, and use of Google Analytics.
OpenAlex is praised as an open successor to Microsoft Academic Graph, with a strong free API and good Python tooling.
Field-specific tools: PubMed/Entrez (life sciences), dblp (computer science), Scopus/Web of Science, ResearchGate, and institutionally licensed systems.
Consensus: no single alternative fully matches Scholar’s breadth; effective practice is to combine several tools.

Broader views on Google products

Thread contains broader comparisons of Google with other tech giants and discussions of other “net-positive” products (Search, Maps, Docs, Gmail) versus concerns about surveillance, ads-driven priorities, and product shutdowns.
Some fear public celebrations of Scholar’s anniversary might draw unwanted managerial attention and raise its risk of cancellation.

Related topics