Fin x Mistral AI: Evaluating your AI Agent

Name: Fin x Mistral AI: Evaluating your AI Agent
Start: 2026-07-02T17:00:00+00:00
End: 2026-07-02T19:00:00+00:00
Location: 2 Rue des Mathurins

Presented by: Kelly Farrell

Start: Thursday, July 2, 2026, 06:00 PM GMT+1

End: Thursday, July 2, 2026, 08:00 PM GMT+1

Price: Free entry

In person: 2 Rue des Mathurins, Paris, Île-de-France


About the Event

Is your AI agent actually working? It's a deceptively simple question, and one of the hardest to answer honestly in production. As AI agents move from prototype to core product, the gap between benchmark performance and real-world results is becoming impossible to ignore. The teams building at the frontier are finding that smaller, fine-tuned models often outperform bloated generalists, but only if you have the eval infrastructure to know what "better" actually looks like.

Join Fin and Mistral for a senior practitioner-level conversation spanning both ends of the stack, from frontier model development to production AI agents, on what it actually takes to evaluate, iterate, and trust AI in the real world.

What you'll learn:

- Why offline metrics alone will mislead you, and what a production-grade eval framework actually looks like
- When fine-tuned, specialised models outperform larger generalist ones, and what it takes to get there
- How teams building at the model layer and the product layer think about evaluation differently
- What building your own models teaches you about the limits of benchmarks

Speakers:

- Pedro Tabacof, Principal Machine Learning Scientist, Fin
- Henry Lagarde, Software Engineer, Mistral AI