Anish is the Co-Founder and CTO of Cleanlab, a platform optimizing AI workflows through data-centric machine learning. He previously conducted research at OpenAI, focusing on adversarial robustness in neural networks. An MIT PhD in Computer Science, Anish brings deep expertise in AI systems, security, and model reliability.
Building a prototype AI application is easy, but getting it to work reliably in production is not. Just ask the developers of New York’s MyCity, whose AI got caught telling businesses to break the law. (And there are many other stories like this!) We envision a future where AI applications avoid such issues by design, using automated detection and remediation of bad responses. In this hands-on workshop, we’ll walk you through a case study of building a reliable retrieval-augmented generation (RAG) application. We’ll start by implementing a baseline application, then integrate Cleanlab’s automated real-time evaluations to detect general issues such as knowledge gaps and hallucinations as well as custom application-specific evaluation criteria, and finally add real-time remediation of these issues.