Asterisk AI Voice Agent
Perceived Uses & Abuse Potential
- Many commenters immediately associate an Asterisk AI voice agent with more spam, scams, and “horrible customer support lines.”
- A counter-use is proposed: running it as a honeypot to waste spam callers’ time (e.g., “Lenny”-style scripts).
- Some fear businesses will use such systems to justify cutting remaining human support staff.
Caller Experience & Interface Preferences
- Strong dislike for current voice-driven IVRs: unreliable in noisy environments, socially awkward to speak commands in public, and inconsistent UX.
- Several prefer simple DTMF menus; many systems still accept keypad input even when pushing voice.
- One practitioner notes that when IVR trees get complex, callers just mash “0” and demand a human; AI intent capture can be “less bad” than long menus in that scenario.
Legitimate Use Cases vs. “No Value”
- People in the industry report large fractions of calls are simple or already self-serviceable (password resets, FAQ-style questions, “did you power cycle it?”, checking amenities) where an AI agent could help.
- Concrete positive examples: dealership service line where an AI instantly books oil changes instead of putting callers on hold; potential hands-free tools for field staff or drivers.
- Skeptics respond that if something is simple enough for AI, it should be a web/app flow instead; many only call when self-service has already failed or can’t be trusted.
Ethics, Deception & Expectations
- Debate over background noise/“on brand” ambience: some see it as harmless branding; others call it deceptive, especially when it nudges callers to think it’s a human.
- Strong sentiment that phone calls implicitly promise a human; using human-like voices without clear disclosure is framed as a “switcheroo” or scam.
- Others push back: users mainly want to express needs in natural language; they don’t necessarily care about human vs machine if problems get solved.
Latency, Turn-Taking & Technical Challenges
- Concern about the repo’s stated 2–3s latencies being “rage inducing.” Multiple commenters say SOTA can be sub-second, even ~300ms, with 2s+ causing hangups.
- Practitioners report 500–1000ms as common and acceptable today, with major effort going into interruption handling and turn detection rather than raw speed.
- Techniques discussed: streaming partial LLM output, chunking at punctuation, using fast TTS on short fragments, canceling/resyncing when the user interrupts, and blending with “thinking” sounds to mask latency.
Stacks, Deployment & VAD
- Twilio integration: possible via SIP trunks or Twilio MediaStream WebSockets.
- Pipecat mentioned as an open-source framework with many integrations (STT/LLM/TTS, turn detection model, state machine library); compared against proprietary players like Vapi/Retell/Sierra.
- Deployment complexity is a pain point; some prefer Cloudflare Workers + Durable Objects with external STT (AssemblyAI/Deepgram with built-in VAD) and LLM/TTS for low-latency, low-cost scaling.
- Discussion touches on where to keep conversation state (e.g., in Durable Objects) and compatibility with OpenAI-style realtime APIs.
Asterisk-Specific Notes & Nostalgia
- Commenters are pleased to see Asterisk back on HN; default music-on-hold is widely recognized.
- One person asks how to correlate CDRs with voicemail recordings for a unified dashboard; others suggest using channel vars, voicemail metadata files, or AGI/ARI logging.
Repository Style & Trust
- Several note the GitHub repo looks heavily AI-generated: emoji-heavy headings, AI-like commit logs, Cursor traces.
- This style triggers distrust for some: they assume docs may not be carefully reviewed, making them hesitant to rely on the project.
Overall Sentiment
- Split roughly between:
- Enthusiasm from builders and some users who see real operational value in handling simple, high-volume calls.
- Deep skepticism from others who see little benefit to customers, expect more hostile support experiences, and are uncomfortable with human-mimicking automation on the phone.