This repository includes scripts for downloading Langchain documents, ingesting them into Pinecone, and running a Streamlit app to explore the data.
Follow these steps to set up and run the project:
-
Clone the Repository:
git clone https://github.com/serkanyasr/langchain-chatbot-explorer.git cd langchain-chatbot-explorer
-
Install Dependencies:
pip install -r requirements.txt
-
Download Langchain Documents: Run the following command to download Langchain documents.
python download_docs.py
or
wget -r -A.html -P langchain-docs https://api.python.langchain.com//latest/api_reference.html
-
Update Pinecone API Key and Environment: Open
ingest_docs.py
and replace the placeholders in theinit_pinecone
function with your Pinecone API key and environment. -
Ingest Documents into Pinecone: Run the following command to process and send Langchain documents to Pinecone.
python ingest_docs.py
-
Run the Streamlit App: Run the Streamlit app to explore the data.
streamlit run app.py
langchain-chatbot.mp4
download_docs.py
: Script to download Langchain documents.ingest_docs.py
: Script to ingest documents into Pinecone.app.py
: Streamlit app to explore Langchain data.
- Update Pinecone API key and environment in
ingest_docs.py
. - Configure Streamlit app in
app.py
if necessary.
If you'd like to contribute to this project, please follow these guidelines:
- Fork the repository.
- Create a new branch for your feature:
git checkout -b feature-name
. - Make your changes and commit them:
git commit -m 'Add some feature'
. - Push to the branch:
git push origin feature-name
. - Submit a pull request.
This project is licensed under the MIT License.