2024-10-23

Everything I built with Claude Artifacts this week

Overall reaction to the artifacts

Many see the examples as simple “toy apps” (QR readers, YAML→JSON, URL extractors) that any web dev could write quickly without an LLM.
Others argue that the key change is how fast such tools can now be built (often in minutes), making things worth building that previously weren’t worth an hour of effort.
Some express fatigue with yet another demo of small web utilities amid a perceived “AI glut”.

Usefulness of LLMs for coding

Clear divide:
- Enthusiasts say LLMs are a big productivity boost for scripting, glue code, one‑off utilities, ETL, research tooling, and small web apps.
- Skeptics say they’re of little use on complex, long‑lived systems and often slower than just coding or googling.
Several non‑experts and researchers report 10–20x gains for small Python/shell/SQL tasks.
Multiple people describe building entire small apps or CLIs with ~5–10% manual coding and iterative LLM refinement.

Integration with large or complex codebases

People working on very large monorepos (tens of millions of LOC) say current tools can’t realistically “understand” the whole codebase; at best they help on small, localized changes.
Tools like Cursor, Aider, VS Code extensions, and “projects” with large context windows are cited as partial solutions, but still require careful scoping and human guidance.
Strongly typed or niche languages (Scala, Haskell, Rust with advanced types) are reported as much harder for LLMs to handle correctly.

Code quality, reliability, and prompting

Recurrent reports of:
- Compiling but incorrect code, subtle bugs, non‑idiomatic style, and over‑complex solutions.
- Models looping on wrong fixes or rewriting too much, deleting comments, or making unnecessary changes.
Supporters counter that:
- You must treat the LLM as a junior dev: give precise specs, break work into small steps, review everything, and often paste in relevant files.
- LLM‑written tests plus human review can mitigate many issues.
Some criticize “prompt hacks” (e.g., “I’ll tip you for good work”) as superstition; others cite papers showing small gains.

Broader concerns and optimism

Worries about:
- Over‑reliance on plausibly wrong outputs (“false knowledge”), long‑term erosion of skills, and bad code entering production.
- Hype outpacing real capabilities, especially for non‑trivial business logic and integration.
Optimists expect:
- Fewer hours spent on boilerplate, more on design/architecture.
- Expanded demand for custom software as development gets cheaper, not necessarily fewer developer jobs, but changed roles.

Related topics