2024-12-20

Building Effective "Agents"

Workflows vs “Agents” in Practice

Many commenters agree the real current value is in workflow automation rather than fully autonomous agents.
“Agentic” planning steps are often removed in production because companies want predictable, repeatable behavior; what’s left is a deterministic workflow with LLM calls.
DAG-like or state-machine workflows with clear decision trees (e.g., email triage, customer support, marketing flows) are seen as much more deployable than open-ended agents.
Narrow, well-defined tasks with constrained choice spaces work far better than “boil the ocean” general-purpose agents.

Frameworks vs Simple Patterns

Strong sentiment against heavyweight frameworks like LangChain/LangGraph for many real-world cases: seen as adding verbosity, indirection, and complexity without proportional benefit.
Several practitioners report ripping out such frameworks and replacing them with plain code and simple async patterns, often with large LOC reductions.
Lightweight abstractions or “transitional software design” is favored: mostly traditional code, with LLMs used only for “LLM-hard” subproblems (creative writing, fuzzy classification, nuanced decisions).

Reliability, Testing, and Limits of LLMs

Recurrent theme: LLMs are inherently probabilistic and unreliable; you must test hard, with archives, edge cases, and regression suites.
Some argue occasional failures are acceptable in domains already dominated by probabilistic methods (e.g., spam-like tasks); others warn that “magic thinking” about reliability is widespread and dangerous.
Concerns about compounding errors in multi-step workflows; mitigation via verifiable checks, tests, and possibly LLM-as-judge.
Complaints about lack of robust structured output and degradation of JSON-like patterns, making agents brittle.

Definitions and Ethics of “Agents”

Long debate over the term “agent”:
- One camp emphasizes the legal/ordinary sense (acting on behalf of someone, with responsibility).
- Another cites the AI-textbook sense (anything that perceives and acts in an environment).
Some see current marketing use of “agent” as confusing or disingenuous for non-technical audiences, especially when responsibility for errors is at stake.

Architecture, Orchestration, and Tooling

Several view LLMs as components in larger orchestrated systems: prefrontal-cortex analogies, system‑1 vs system‑2 splits, and complex multi-model architectures are discussed.
Durable workflow engines (Temporal, Inngest, Hatchet, Windmill) are highlighted for long-running, fault-tolerant, human-in-the-loop processes, though they introduce their own complexity and learning costs.
Tool use is seen as central to powerful agentic workflows, but defining tools and handling stateful tools correctly is non-trivial.

Related topics