Ask HN: Machine learning engineers, what do you do at work?
Reality of the ML Engineer Role
- Work is far from “just training models.”
- Many describe ~80–95% of time on data collection/cleaning, feature engineering, ETL, infra, and tooling; only a small fraction on fitting/tuning models.
- Others say their title is ML engineer but the day-to-day is mostly backend/software engineering or MLOps for ML systems.
- A minority argues that if you don’t spend most of your time on model development/research, it’s not really an ML role.
Role Boundaries and Team Structure
- Frequent confusion between “ML engineer,” “data scientist,” “applied scientist,” and “data engineer”; in small orgs these often blend.
- Some argue for specialization (research vs infra vs ops) due to limited time and deep expertise needs.
- Others insist engineers should understand at least one layer above and below their stack (e.g., drivers and infra vs model math) to avoid Conway’s Law / coordination issues.
Data, Experimentation, and Model Work
- Emphasis on being “knee‑deep” in data to discover patterns and ask the right questions.
- Tasks include experiment design, A/B testing, metrics definition, model deployment, retraining pipelines, and long-running experiments with careful monitoring.
- Classical ML required explicit feature engineering; with deep learning, architectural choices and data quality/diversity matter more.
Tooling, Environments, and Pain Points
- Heavy frustration with Python environments, native/CUDA dependencies, and package managers.
- pip, conda/mamba, venv, poetry, Nix, Docker, pyenv, and newer tools like uv are all mentioned; each has tradeoffs and failure modes.
- ARM MacBooks are seen as problematic for cutting-edge local ML; many prefer Linux GPU servers or cloud images.
- Dependency hell and constant breakage are seen as a systemic drag on productivity.
LLMs and Changing Work Patterns
- Some roles shifted from training models to integrating LLM APIs, prompt engineering, and RAG; feels closer to standard SWE to some.
- Observed that very few people work on LLM training itself compared to many “AI engineers” calling APIs.
Collaboration, Domain Experts, and Explainability
- Collaborating with nontechnical domain experts (e.g., in healthcare, business units) is seen as highly valuable and rewarding.
- Explaining stochastic model behavior and misclassifications is hard; expectations from traditional deterministic software often clash with ML reality.
- Teaching Python and basic tooling to less-technical colleagues is a notable part of some MLE jobs.
Healthcare, Privacy, and Ethics
- Several work on healthcare ML (claims, diagnosis from images, vital signs, etc.).
- There is concern that sensitive health data is widely accessible to engineers despite HIPAA; “dead privacy” is discussed.
- Insurance/claims ML can move millions of dollars (e.g., subrogation, upcoding detection); some criticize models used to increase billing.
Career Satisfaction and Demand
- Mixed feelings about the GenAI hype: more demand for flashy LLM work, less focus on “boring” but valuable ML.
- Some feel marginalized as “old-school” data/ML people while budgets flow to AI APIs and non-coding “AI scientists.”
- Others report high satisfaction when given low-meeting time, good infra, and real ownership of ML products.