Learn how to speed up your multimodal Retrieval-Augmented Generation (RAG) pipelines by combining NVIDIA’s GPU-accelerated tools with open-source technologies. This talk demonstrates how NVIDIA’s optimized inference tools, such as TensorRT-LLM, integrate with popular community tools to build high-performance applications. We’ll explore how this combination improves processing speed and accuracy across text, image, and other modalities. Join us to see how you can use this integrated approach to build scalable, efficient multimodal AI applications.
About the Speaker
Jay Rodge is a developer advocate for large language models (LLMs). He demonstrates how developers can use GPU acceleration in their LLM workflows with widely used tools and frameworks.
Not a Meetup member? Sign up to attend the next event:
https://voxel51.com/computer-vision-ai-meetups/
Recorded on Aug 29, 2024, at the AI, Machine Learning and Computer Vision Meetup.