20 years of Google Scholar
Perceived value of Google Scholar
- Widely praised as a “net good” and “fantastic” resource; many say they use it daily and can’t imagine pre-Scholar research workflows.
- Seen as vastly superior to most university library search tools and as having broken proprietary search monopolies (e.g., Scopus/Web of Science).
- Used informally as a CV/author-profile system and for tracking citations, including long-forgotten papers.
- Especially valued for indexing theses, preprints, and case law in addition to journal articles.
Concerns about Google as steward
- Strong worry that, like other beloved Google products, Scholar could be killed or degraded, leaving inferior alternatives.
- Some argue it survives mainly because it is cheap to run, piggybacks on core search infrastructure, and is useful to Google’s own researchers and AI efforts.
- Others lament the “soulless” / unsupported feel: no clear strategy, minimal support channels, and fragile dependence on a single proprietary provider.
Features, limitations, and annoyances
- No official public API is a recurring grievance; attempts to build tools on top of Scholar hit rate limits and IP bans.
- Sorting “by date” is criticized for implicitly restricting to the last year; users report confusion and see it as intentionally “broken,” possibly for anti-scraping or contractual reasons (though this is disputed).
- Access is often blocked for users on self-hosted VPNs or after “unusual traffic” (e.g., from browser extensions), with no effective recourse.
- Metadata quality is sometimes “wildly inaccurate.”
- Positive notes include email citation alerts, case-law coverage, and new features like PDF reader/AI outlines and article audio with word highlighting.
Bibliometrics and citation metrics
- Some criticize Scholar for reinforcing the “bibliometrics game” and want author pages default-sorted by date instead of citation count.
- Others strongly defend citation-based sorting as the most useful default for typical readers seeking influential work; they see date-sorting as a one-click alternative.
- Debate arises over the value of low-citation “hidden gems” versus the collective judgment encoded in highly cited work.
Access to papers and open access
- Common workflow: use Scholar to discover papers, then rely on Sci‑Hub, Anna’s Archive, LibGen, etc. for full texts.
- Several note Sci‑Hub’s stagnation and declining coverage; open-access (OA) mandates and repositories are seen as improving but uneven across fields.
- Institutional access disparities are highlighted; some rely on emailing authors for “legal” copies, but this is seen as high-friction and impossible for older/obscure work.
Alternatives and complementary tools
- Semantic Scholar is frequently recommended: smaller but higher-perceived data quality, open data/API, willingness to fix errors, plus AI features (summaries, “influential citations”).
- Critiques of Semantic Scholar include overemphasis on a questionable “highly influential citations” metric, nudging toward its Semantic Reader UI, and use of Google Analytics.
- OpenAlex is praised as an open successor to Microsoft Academic Graph, with a strong free API and good Python tooling.
- Field-specific tools: PubMed/Entrez (life sciences), dblp (computer science), Scopus/Web of Science, ResearchGate, and institutionally licensed systems.
- Consensus: no single alternative fully matches Scholar’s breadth; effective practice is to combine several tools.
Broader views on Google products
- Thread contains broader comparisons of Google with other tech giants and discussions of other “net-positive” products (Search, Maps, Docs, Gmail) versus concerns about surveillance, ads-driven priorities, and product shutdowns.
- Some fear public celebrations of Scholar’s anniversary might draw unwanted managerial attention and raise its risk of cancellation.