/vectordb-recipes

High quality resources & applications for LLMs, multi-modal models and VectorDBs

Primary LanguageJupyter NotebookApache License 2.0Apache-2.0

VectorDB-recipes


Dive into building GenAI applications! This repository contains examples, applications, starter code, & tutorials to help you kickstart your GenAI projects.
  • These are built using LanceDB, a free, open-source, serverless vectorDB that requires no setup.
  • It integrates into python data ecosystem so you can simply start using these in your existing data pipelines in pandas, arrow, pydantic etc.
  • LanceDB has native Typescript SDK using which you can run vector search in serverless functions!

Join our community for support - DiscordTwitter

This repository is divided into 3 sections:

  • Examples - Get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes!
  • Applications - Ready to use Python and web apps using applied LLMs, VectorDB and GenAI tools
  • Tutorials - A curated list of tutorials, blogs, Colabs and courses to get you started with GenAI in greater depth.

Examples

Applied examples that get right into the code with minimal introduction, aimed at getting you from an idea to PoC within minutes! Examples are available as:

  • Colab notebooks - that builds the application is stages allowing you to investigate results at every intermediate stage.
  • Python scripts - for cases where you'd like directly to use the file or snippets to integrate in your application
  • JS/TS scripts - Some examples are written using lancedb's native js library! These script/snippets can also be directly integrated in your web applications.

If you're looking for in-depth tutorial-like examples, checkout the tutorials section!

Example   Notebook & Scripts   Read The Blog!       
Youtube transcript search bot Open In Colab Python JS LLM
Langchain: Code Docs QA bot Open In Colab Python JSLLM
AI Agents: Reducing Hallucination Open In Colab Python JSLLM Medium
Multimodal CLIP: DiffusionDB Open In Colab Python LLM Medium
Multimodal CLIP: Youtube videos Open In Colab Python LLM Medium
Multimodal Image + Text Search Open In Colab Python LLM Medium
TransformersJS Embedding example JS LLM
Movie Recommender Open In Colab Python
Product Recommender Open In Colab Python
Audio Search Open In Colab Python LLM
Arxiv paper recommender Open In Colab Python LLM
Multi-lingual search Open In Colab Python LLM
Instruct-Multitask Open In Colab Python LLM Medium
Improve RAG with Re-ranking Open In Colab LLM Medium
Improve RAG with FLARE Open In Colab Python LLM Medium
Improve RAG with HyDE Open In Colab LLM Medium
Improve RAG with LOTR Open In Colab LLM Medium
Advanced RAG: Parent Document Retriever Open In Colab LLM Medium
RAG Fusion Open In Colab LLM
Hybrid search BM25 & lancedb Open In Colab LLM Medium
Evaluating Prompts with Prompttools Open In Colab LLM local LLM
NER powered with Semantic Search Open In Colab local LLM Medium
Sentiment Analysis : Analysing Hotel Reviews Open In Colab local LLM Medium
Facial Recognition Open In Colab
Accelerate Vector Search Applications Using OpenVINO Open In Colab local LLM Medium
Search Within Images Open In Colab local LLM Medium
Contextual-Compression-with-RAG Open In Colab local LLM Medium
Imagebind demo app hf spaces

Projects & Applications

These are ready to use applications built using LanceDB serverless vector database. You can explore these open source projects, use parts of them in your projects or build your applications on top of these.

Project Name Description Screenshot
YOLOExplorer Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds YOLOExplorer
Website Chatbot (Deployable Vercel Template) Create a chatbot from the sitemap of any website/docs of your choice. Built using vectorDB serverless native javascript package. Chatbot
Multi-Modal Search Engine Create a Multi-modal search engine app, to search images using both images or text Search
Chat with multiple URL/website Conversational AI for Any Website with Mistral,Bge Embedding & LanceDB webui_aa
Hr chatbot Hr chatbot - ask your personal query using zero-shot React agent & tools image
Talk with Youtube Video using GPT4 Vision API Talk with Youtube Video using GPT4 Vision API and Langchain demo
Talk with Podcast Talk with Youtube Podcast using Ollama and insanely-fast-whisper demo
Talk with Wikipedia Talk with Wikipedia Pages demo
Talk with Github Talk with Github Codespaces using Qwen1.5 demo
Document Chat with Langroid Talk with your Documents using Langroid demo
Fastapi RAG template FastAPI based RAG template with Websocket support image
GTE MLX RAG mlx based RAG model using lancedb api support image
Multilingual-RAG Multilingual RAG with cohere embedding & support 100+ languages image

Tutorials

Looking to get started with LLMs, vectorDBs, and the world of Generative AI? These in-depth tutorials and courses cover these concepts with practical follow along colabs where possible.

Tutorial Interactive Environment Blog Link
Corrective RAG with Langgraph Open In Colab LLM
Product Quantization: Compress High Dimensional Vectors Medium
LLMs, RAG, & the missing storage layer for AI Medium
Fine-Tuning LLM using PEFT & QLoRA Open In Colab local LLM Medium
Context-Aware Chatbot using Llama 2 & LanceDB Open In Colab local LLM Medium
A Primer on Text Chunking and its Types Open In Colab Medium
NER powered Semantic Search Open In Colab local LLM Medium
Better RAG with FLARE Open In Colab local LLM LLM Medium
Accelerate Vector Search Applications Using OpenVINO Open In Colab local LLM Medium

🌟 New! 🌟 Applied GenAI and VectorDB course on Udacity Learn about GenAI and vectorDBs using LanceDB in the recently launched Udacity Course

Contributing Examples

If you're working on some cool applications that you'd like to add to this repo, please open a PR!