/mini_search

Primary LanguagePython

Simple Query Engine with LanceDB and OpenAI GPT

First Step

Data comes from Kaggle and the format is pretty weird.

    pip install -r requirements.txt

Ingest Data

To create the database and ingest the data, run the following command. This will create a vector databse in the local directory.

    python src/ingest.py

Query Data

This will query the openai endpoint and return an answer. Requires the env variable os.environ['OPENAI_API_KEY'] to be set to the OpenAI API key.

    python src/query.py --query "What are Soft-tissue Sarcomas?"