
Fin x Mistral AI: Evaluating your AI Agent
Presented by: Kelly Farrell
Thursday, July 2, 2026, 06:00 PM to 08:00 PM GMT+1
Price
Free
About the Event
Is your AI agent actually working? It's a deceptively simple question, and one of the hardest to answer honestly in production. As AI agents move from prototype to core product, the gap between benchmark performance and real-world results is becoming impossible to ignore. The teams building at the frontier are finding that smaller, fine-tuned models often outperform bloated generalists, but only if you have the eval infrastructure to know what "better" actually looks like.

Join Fin and Mistral for a senior practitioner-level conversation spanning both ends of the stack, from frontier model development to production AI agents, on what it actually takes to evaluate, iterate, and trust AI in the real world.

What you'll learn:
Why offline metrics alone will mislead you, and what a production-grade eval framework actually looks like
When fine-tuned, specialised models outperform larger generalist ones, and what it takes to get there
How teams building at the model layer and the product layer think about evaluation differently
What building your own models teaches you about the limits of benchmarks

Speakers:
Pedro Tabacof, Principal Machine Learning Scientist, Fin
Henry Lagarde, Software Engineer, Mistral AI
Venue Details
2 Rue des Mathurins, Paris, Île-de-France