/evadb

fork

Primary LanguagePythonApache License 2.0Apache-2.0

EvaDB

Bring AI inside your database system and build AI-powered apps

EvaDB forks EvaDB stars EvaDB pull-requests EvaDB Commits

Follow EvaDB

Join EvaDB Slack Community Follow evadb_ai EvaDB on Medium EvaDB Website

Share EvaDB

Follow _superAGI Share on Telegram Share on Reddit

Launch EvaDB on Colab Roadmap Python Versions Supported License Coverage Status
Open in Gitpod

EvaDB enables software developers to build AI apps in a few lines of code. Its powerful SQL API simplifies AI app development for both structured and unstructured data. EvaDB's benefits include:

  • 🔮 Easy to connect EvaDB with your SQL database system and build AI-powered apps with SQL queries
  • 🤝 Query your data with a pre-trained AI model from Hugging Face, OpenAI, YOLO, PyTorch, and other AI frameworks
  • ⚡️ Faster queries thanks to AI-centric query optimization
  • 💰 Save money spent on running models by efficient CPU/GPU use
  • 🔧 Fine-tune your AI models to achieve better results

👋 Hey! If you're excited about our vision of bringing AI inside database systems, show some ❤️ by:

Quick Links

Documentation

You can find the complete documentation of EvaDB at evadb.ai/docs 📚✨🚀

Why EvaDB

In the world of AI, we've reached a stage where many AI tasks that were traditionally handled by AI or ML engineers can now be automated. EvaDB enables software developers with the ability to perform advanced AI tasks without needing to delve into the intricate details.

EvaDB covers many AI applications, including regression, classification, image recognition, question answering, and many other generative AI applications. EvaDB targets 99% of AI problems that are often repetitive and can be automated with a simple function call in an SQL query. Until now, there is no comprehensive open-source framework for bringing AI into an existing SQL database system with a principled AI optimization framework, and that's where EvaDB comes in.

Our target audience is software developers who may not necessarily have a background in AI but require AI capabilities to solve specific problems. We target programmers who write simple SQL queries inside their CRUD apps. With EvaDB, it is possible to easily add AI features to these apps by calling built-in AI functions in the queries.

How does EvaDB work

  • Connect EvaDB to your database system with the `CREATE DATABASE` statement.
  • Write SQL queries with AI functions to get inference results:
    • Pick a pre-trained AI model from Hugging Face, Open AI, Ultralytics, PyTorch, and built-in AI frameworks for generative AI, NLP, and vision applications;
    • or pick from a variety of state-of-the-art ML engines for classic ML use-cases (classification, regression, etc.);
    • or bring your custom model built with any AI/ML framework using `CREATE FUNCTION`.

Follow the getting started guide to get on-boarded as fast as possible.

Illustrative Queries

  • Run the MNIST Image Classification model to obtain digit labels for each frame in the video.
SELECT MnistImageClassifier(data).label FROM mnist_video;
  • Build a vector index on the feature embeddings returned by the SIFT Feature Extractor on a collection of Reddit images.
CREATE INDEX reddit_sift_image_index
    ON reddit_dataset (SiftFeatureExtractor(data))
    USING FAISS
  • Retrieve the top-5 most similar images for the given image using the index.
SELECT name FROM reddit_dataset ORDER BY
    Similarity(
        SiftFeatureExtractor(Open('reddit-images/g1074_d4mxztt.jpg')),
        SiftFeatureExtractor(data)
    )
    LIMIT 5

Illustrative Apps

Here are some illustrative AI apps built using EvaDB (each notebook can be opened on Google Colab):

More Illustrative Queries

  • Store the text returned by a Speech Recognition model on the audio component of a video in a table.
CREATE TABLE text_summary AS
    SELECT SpeechRecognizer(audio) FROM ukraine_video;
  • Run ChatGPT on the text column in a table.
SELECT ChatGPT('Is this video summary related to Ukraine russia war', text)
    FROM text_summary;
  • Train an ML model using the Ludwig AI engine to predict a column in a table.
CREATE FUNCTION IF NOT EXISTS PredictHouseRent FROM
( SELECT * FROM HomeRentals )
TYPE Ludwig
PREDICT 'rental_price'
TIME_LIMIT 120;

Architecture of EvaDB

EvaDB's AI-centric query optimizer takes a query as input and generates a query plan. The query engine takes the query plan and hits the relevant backends to efficiently process the query: 1. SQL Database Systems (Structured Data) 2. AI Frameworks (Transform Unstructured Data to Structured Data, Unstructured data includes PDFs, images, podcasts, etc. stored on cloud buckets or local filesystem) 3. Vector Database Systems (Feature Embeddings)

Architecture Diagram

Community and Support

If you run into any bugs or have any comments, you can reach us on our Slack Community 📟 or create a Github Issue 🐛.

Here is EvaDB's public roadmap 🛤️. We prioritize features based on user feedback, so we'd love to hear from you!

Contributing

We are a lean team on a mission to bring AI inside database systems! All kinds of contributions to EvaDB are appreciated 🙌 If you'd like to get involved, here's information on where we could use your help: contribution guide 🤗

CI Status:

CI Status Documentation Status

Star History

EvaDB Star History Chart

License

Copyright (c) Georgia Tech Database Group. Licensed under an Apache License.