Presentation: Building Embedding Models for Large-Scale Real-World Applications

Sahil Dua discusses the critical role of embedding models in powering search and RAG applications at scale. He explains the transformer-based architecture, contrastive learning techniques, and the process of distilling large language models into production-ready student models. He shares insights on optimizing query latency, handling document indexing, and evaluating retrieval quality.

By Sahil Dua