This repo demonstrates how to perform RAG on audio data with LangChain using AssemblyAI for transcription, HuggingFace for embeddings, Chroma as a vector database, and OpenAI's GPT 3.5 as a language model.
Check out the companion article Retrieval Augmented Generation on audio data with LangChain