AI is code – and can't be prompted into being smarter
AI vs. Traditional Code and “Smartness”
- Some argue prompting an LLM is similar to configuring a program: you don’t change the algorithm, but you can make behavior more “fit for purpose.”
- Others emphasize a key difference: traditional code can be formally specified and (partially) verified; LLM behavior cannot be reliably specified or proven correct.
- A counterpoint notes that AI is still “just a program”: if you fix seed/temperature, you can verify it meets a trivial spec (map input tokens to output tokens), but not semantic correctness.
Prompting, Capability, and Determinism
- Several comments say prompts can clearly make models “dumber,” so they can also make them “smarter” in practice by better exploiting existing capabilities.
- Prompting, retrieval, and “context engineering” are compared to improving data flow and instrumentation in conventional systems.
- Determinism is contrasted: classic algorithms are deterministic; LLM outputs vary, which complicates reliability and verification.
Prompt Injection and the jqwik-Style Booby Trap
- Prompt injection (e.g., “disregard previous instructions…”) is seen as a persistent, fundamental issue because LLMs mix “instructions” and “data” in one token stream.
- Others push back, predicting future architectures will separate control/data planes with token metadata and roles.
- The specific case of a testing library embedding a hidden prompt that causes AI agents to delete its own tests sparked heavy debate:
- Supporters see it as a protest against using their work in AI tools, or as “benign” sabotage that only harms those ignoring clear warnings.
- Critics call it malware, a supply-chain attack, or at least a “dick move” that punishes unsuspecting users and projects.
Legal and Ethical Disagreement
- Some claim this behavior clearly violates computer misuse laws (e.g., unauthorized access/damage is illegal regardless of mechanism).
- Others argue the user/agent chose to run the tool and follow the instructions; responsibility lies with those who delegated unsafe authority to the AI.
- Analogies used: SQL injection, loosening wheel bolts vs. placing a visible log in the road, booby-trap packages, “shouting fire in a crowded theater.”
- No consensus emerges; legality is widely debated and flagged as unclear.
Licensing, Open Source, and Control Over Use
- One side: open-source licenses (especially those meeting the “no field-of-endeavor discrimination” rule) cannot restrict AI use; picking such a license implies consent.
- Another side: even if legally open, authors’ stated wishes about how their work is used should be respected to avoid a chilling effect on contributors.
- Distinction is drawn between copyright conditions (copying/redistribution) and attempts to control how software is used, which open-source licenses typically do not restrict.
Backups, Source Control, and Practical Impact
- Some mock users who claimed “months of work destroyed,” suggesting proper use of Git and remote repos should have prevented permanent loss.
- Others stress that source control is not automatically a backup; without offsite copies, deletion can still be catastrophic.
- The incident highlights how AI agents with broad file permissions can cause real damage, whether or not a prompt injection is present.