2026-05-03

DeepClaude – Claude Code agent loop with DeepSeek V4 Pro

Purpose of DeepClaude / Using DeepSeek with Claude Code

Main idea: route Claude Code’s CLI through DeepSeek V4 (Pro/Flash) or other models via Anthropic-compatible/OpenRouter endpoints and env vars.
Several comments show one‑liner shell scripts achieving this; some note this has been possible “since the beginning” via Anthropic_BASE_URL + token + model env vars.
DeepClaude adds a local proxy that can switch models mid‑session and track combined costs across Anthropic and non‑Anthropic models, though this is under‑emphasized in its README.

Is DeepClaude Necessary?

Some see the repo as trivial “env-var glue” unworthy of a full project or HN post.
Others value a packaged solution that streamlines setup and highlights the pattern.
There’s confusion about tool-call compatibility; clarifications note DeepSeek exposes an Anthropic-compatible endpoint but it’s missing some features.

Alternative Harnesses and Tools

Many suggest ditching Claude Code entirely in favor of open-source harnesses: OpenCode, Pi, Hermes, Forge Code, Langcli, custom CLIs, etc.
Opinions differ on quality: some find OpenCode less effective than Claude Code; others claim Forge Code or Codex-style harnesses outperform Claude Code in benchmarks.
Claude Code is praised for rich plugins/skills and MCP ecosystem, criticized for regressions, cost-optimization at quality’s expense, and being closed-source.

Model Quality and Use Cases

DeepSeek V4 Pro/Flash are widely reported as strong coding models, often “close to” or better than Claude Sonnet and approaching Opus for some tasks, at much lower cost.
Other favored models: GLM 5.1 (surprisingly strong, sometimes near Opus), GPT 5.5 + Codex for some languages, Gemini Flash for speed, Kimi/Qwen/MiniMax for cost.
Some argue you need the best model for planning, architecture, and debugging; others say “good enough” cheaper models suffice, especially in mixed workflows (e.g., premium for design, cheaper for implementation).

Cost, Subsidies, and Data Privacy

Cost engineering is a major theme; some report burning tens of dollars quickly even with cheap models.
DeepSeek’s very low pricing is noted as heavily discounted and time-limited; debate over whether inference is subsidized.
Repeated concerns about data training/retention, especially with Chinese providers; mitigations include OpenRouter’s zero‑data‑retention flags and alternative inference providers.
Some distrust US firms less, others point out US data protections are also limited; geopolitical tensions color perceptions.

Harness Design, Contracts, and Local Inference

Building robust agentic harnesses is described as harder than expected: models must reliably honor tool formats, planning structures, permissions, and error recovery; weaker models expose harness fragility.
Local and hybrid setups (llama.cpp, Ollama, custom harnesses, DGX‑like boxes, upcoming DeepSeek V4 local support) are seen as key for privacy and long‑term cost control.

Meta: LLM Slop, Benchmarks, and OSS Process

Strong backlash to “vibe‑coded” LLM-generated repos that are minimal but heavily promoted; some call for age/history gates on HN projects.
Benchmarks (Terminal Bench, LiveCodeBench) are cited but also criticized as unrepresentative or vulnerable to “cheating.”
Separate thread on OSS projects auto‑closing issues to cope with low‑quality, LLM‑generated bug reports and PRs.

Related topics