Overview
In this 39-minute conference talk from Conf42 Python 2025, learn practical strategies for developing robust AI applications with Large Language Models. Discover how to overcome common challenges like LLM inconsistencies through proper implementation of retries and timeouts. Master techniques for handling structured outputs, implementing streaming responses for improved user experience, and managing long-running tasks with background jobs. Explore methods for evaluating prompt effectiveness and gain insights into various foundational models. The presentation provides a comprehensive walkthrough of best practices for building reliable AI applications, covering everything from initial development considerations to advanced implementation techniques.
Syllabus
00:00 Introduction to AI Application Development
01:03 Understanding LLM Inconsistencies
05:35 Implementing Retries and Timeouts
10:47 Handling Structured Outputs
15:44 Streaming Responses for Better UX
21:26 Using Background Jobs for Long Tasks
29:45 Evaluating Prompts with Evals
32:35 Overview of Foundational Models
38:57 Conclusion and Contact Information
Taught by
Conf42