Building Effective "Agents"
Workflows vs “Agents” in Practice
- Many commenters agree the real current value is in workflow automation rather than fully autonomous agents.
- “Agentic” planning steps are often removed in production because companies want predictable, repeatable behavior; what’s left is a deterministic workflow with LLM calls.
- DAG-like or state-machine workflows with clear decision trees (e.g., email triage, customer support, marketing flows) are seen as much more deployable than open-ended agents.
- Narrow, well-defined tasks with constrained choice spaces work far better than “boil the ocean” general-purpose agents.
Frameworks vs Simple Patterns
- Strong sentiment against heavyweight frameworks like LangChain/LangGraph for many real-world cases: seen as adding verbosity, indirection, and complexity without proportional benefit.
- Several practitioners report ripping out such frameworks and replacing them with plain code and simple async patterns, often with large LOC reductions.
- Lightweight abstractions or “transitional software design” is favored: mostly traditional code, with LLMs used only for “LLM-hard” subproblems (creative writing, fuzzy classification, nuanced decisions).
Reliability, Testing, and Limits of LLMs
- Recurrent theme: LLMs are inherently probabilistic and unreliable; you must test hard, with archives, edge cases, and regression suites.
- Some argue occasional failures are acceptable in domains already dominated by probabilistic methods (e.g., spam-like tasks); others warn that “magic thinking” about reliability is widespread and dangerous.
- Concerns about compounding errors in multi-step workflows; mitigation via verifiable checks, tests, and possibly LLM-as-judge.
- Complaints about lack of robust structured output and degradation of JSON-like patterns, making agents brittle.
Definitions and Ethics of “Agents”
- Long debate over the term “agent”:
- One camp emphasizes the legal/ordinary sense (acting on behalf of someone, with responsibility).
- Another cites the AI-textbook sense (anything that perceives and acts in an environment).
- Some see current marketing use of “agent” as confusing or disingenuous for non-technical audiences, especially when responsibility for errors is at stake.
Architecture, Orchestration, and Tooling
- Several view LLMs as components in larger orchestrated systems: prefrontal-cortex analogies, system‑1 vs system‑2 splits, and complex multi-model architectures are discussed.
- Durable workflow engines (Temporal, Inngest, Hatchet, Windmill) are highlighted for long-running, fault-tolerant, human-in-the-loop processes, though they introduce their own complexity and learning costs.
- Tool use is seen as central to powerful agentic workflows, but defining tools and handling stateful tools correctly is non-trivial.