Mistral AI · Paris/London · Hybrid

AI Engineer, Product (Le Chat)

18.8.2025

Description


Embedded directly in the Le Chat product team, you will build the evaluation and A/B testing framework, add end-to-end observability, and run a reliable model release process. You will work with Science to ship measurable improvements to quality, latency, safety, and reliability.



• Build and maintain an LLM evaluation framework (reference tests, heuristics, model-graded checks).
• Define and track metrics: task success, helpfulness, hallucination proxies, safety flags, latency/cost.
• Run A/B tests on prompts, models, and system prompts; analyze results; and recommend rollout or rollback.
• Set up observability for LLM calls: structured logging, tracing, dashboards, alerts.
• Operate the model release process: canary and shadow traffic, sign-offs, SLO-based rollback criteria, regression detection.
• Improve core behaviors: memory write/retrieve policies and evals, intent classification, follow-ups, routing, tool-call reliability.
• Create templates and docs so other teams can author evals and ship safely.
• Partner with Science, diagnose regressions, and lead post-mortems.

Qualifications


• Strong TypeScript or Python skills.
• Production LLM experience: prompts, tool/function calling, and system prompts.
• Hands-on experience with evals and A/B testing; you can design metrics and make rollout decisions from data.
• Observability: logging, tracing, dashboards, alerting.
• Product mindset: form hypotheses, run experiments, interpret results, iterate.
• Clear written and spoken communication; autonomous and product-oriented.

Ideally, you also have experience with:
• Safety systems: moderation, PII handling/redaction, guardrails.
• Release operations: canary/shadowing, automated rollbacks, experiment platforms.

Benefits

💰 Competitive salary and equity
🧑‍⚕️ Health insurance
🚴 Transportation allowance
🥎 Sport allowance
🥕 Meal vouchers
💰 Private pension plan
🍼 Generous parental leave policy
🌎 Visa sponsorship

Application

View listing at origin and apply!