AI Engineer Claims to Have Cracked Linear A
Overall claim and status
- Thread discusses a claimed decipherment of Linear A as an extinct Semitic language (possibly ancestral/related to Biblical Hebrew).
- Researcher reportedly has a draft manuscript shared with linguists at major universities; no preprint or full data is public yet.
- Many commenters stress this is, at best, an interesting attempt and should not be treated as a solved decipherment until experts review it and further evidence appears.
Role of AI tools
- An AI coding assistant was used mainly to:
- Build Python tools to parse and organize the Linear A corpus (GORILA, SigLA).
- Run large numbers of simulations to test whether a Semitic signal is statistically stronger than chance.
- Multiple commenters emphasize the human did the conceptual work; the AI sped up coding and analysis.
- Others argue the achievement is inseparable from modern tooling; without AI, the work might not have been feasible for a non‑specialist.
Reproducibility and skepticism
- Strong pushback on the lack of a public manuscript, code, or translation tables (despite claims of ~300 translated words).
- Concerns that LLM-based workflows are “softly reproducible” due to stochastic outputs and prompt sensitivity.
- Some liken this to past high‑profile “breakthroughs” that collapsed under scrutiny (e.g., other Linear A claims, room‑temperature superconductors).
- A classics-focused community elsewhere is reported as dismissive of the claim.
Linguistic and technical issues
- Linear A is notoriously difficult:
- Tiny corpus (~7,500 characters across ~1,500 mostly list-like inscriptions).
- Possibly multiple languages and heavy use of abbreviations.
- Unknown underlying language and incomplete understanding of the script.
- Debate over:
- Whether assuming Linear A signs share values with Linear B is justified.
- How much can be inferred from recurring formulas like the “libation formula.”
- Whether mapping fragments of words to Semitic roots is convincing or just cherry-picking amid huge degrees of freedom.
- Why a Semitic language would be written with a CV syllabary including vowels, given typical consonantal Semitic scripts.
Broader implications
- Some see promise that AI-augmented methods could help with other undeciphered scripts (e.g., Indus, Isthmian, Voynich).
- Others argue information-theoretic limits may make fully cracking an unknown language/script pair impossible without more data or bilingual texts.