Codestral: Mistral's Code Model
Model quality vs. existing tools (Copilot, GPT‑4/4o, others)
- Mixed impressions vs. GitHub Copilot: some say Codestral is “miles better” and fast enough to stop using GPT‑4; others report serious hallucinations (e.g., made‑up SDKs).
- Several compare indirectly via benchmarks: claim it slightly beats Llama 3‑70B; Copilot is said to rely mostly on GPT‑3.5 for completions, which some consider outclassed.
- Against GPT‑4o: some find Codestral a bit weaker overall; others prefer Codestral’s consistency and lower hallucination, criticizing GPT‑4o’s long‑output failures and repetition.
Usefulness and limitations for coding
- Works well for boilerplate, refactoring, and explaining or modifying existing code; less reliable for complex, multi‑constraint tasks (e.g., intricate ASGI middleware, tricky multi‑tenant schemas, Rust lifetimes).
- Several note that expecting perfect one‑shot solutions is unrealistic; iterative prompting, specs first, and diff‑based workflows are recommended.
- Some use personal “challenge prompts” (hard Python/Rust/Node tasks) as informal benchmarks and report most models still fail them.
Local deployment, hardware, and quantization
- Raw 22B FP16 weights ≈44 GB; plus extra for KV cache and activations.
- Unquantized model needs ~50 GB RAM; too large for many single GPUs, but quantized versions (e.g., 4‑bit ≈11 GB) fit on cards like 3090/4090 and high‑RAM Macs.
- Discussion of Apple Silicon vs. PC+Nvidia: Macs praised for unified memory capacity; PCs for cost, flexibility, and Linux support.
Licensing, “open‑weight,” and legal/ethical debate
- License (MNPL) is non‑production: allows research, testing, and some “development” use; bans commercial and most “live” uses, including internal business usage.
- Many see it as “demoware” or “weights available,” not open source. Concern that it’s practically unusable for companies compared to permissive models like Llama.
- Strong criticism of asymmetry: community code is used for training, but model outputs are tightly restricted. Others argue legality and copyright implications are unsettled and jurisdiction‑dependent.
Ecosystem, tooling, and business model questions
- People seek VS Code/IDE plugins that support Codestral via generic backends (Ollama, Continue, LlamaCoder, Cody, Tabnine).
- Some view this as a viable business model: free non‑commercial weights plus paid API/commercial licenses; others doubt it can compete with cheaper, stronger proprietary models and ubiquitous Copilot.
Broader impact on programming
- Opinions split: some see LLMs as democratizing coding and boosting productivity; others fear skill atrophy, poor debugging ability, and an influx of low‑quality “AI garbage” code and libraries.