🎓 KONSPECTO - LLM Agent for Note Management

📋 Project Description

KONSPECTO is an intelligent agent based on a local LLM model, offering the following capabilities:

🔍 Search Through Notes

Semantic search across the notes database
Generation of structured responses based on the retrieved information
Ability to view original documents

🎥 Video Processing

Extraction of keyframes from YouTube videos
Creation of DOCX documents with images
Filtering of similar frames

🎤 Voice Input

Transcription of voice messages using Whisper
Support for the Russian language
Ability to combine voice and text input

🛠 Tech Stack

Frontend

⚛️ React + Vite
🎨 TailwindCSS
🔄 React Router
✨ React Icons

Backend

🚀 FastAPI
🤖 LangChain
🔍 LlamaIndex
📝 Whisper
🎥 OpenCV
🗄️ Redis Stack

📦 Installation

Prerequisites

Docker and Docker Compose
Node.js 18+
Python 3.11+
Poetry
pre-commit

1️⃣ Clone the Repository

git clone https://github.com/RomiconEZ/KONSPECTO
cd KONSPECTO

2️⃣ Configure Settings

Create configuration files in the backend/app/config/ directory:

.env

FOLDER_ID=your_google_drive_folder_id
GOOGLE_SERVICE_ACCOUNT_KEY_PATH=config/service_account_key.json

TRANSCRIPTION_MODEL=whisper
WHISPER_MODEL_SIZE=large-v3

LLM_STUDIO_BASE_URL=http://localhost:1234/v1

EMBEDDING_MODEL_NAME="intfloat/multilingual-e5-large"
EMBEDDING_BATCH_SIZE=16
EMBEDDING_DIMENSION=1024

service_account_key.json

{
  // Your Google service account credentials
  // Obtain them from the Google Cloud Console
}

3️⃣ Install Dependencies

Frontend:

cd frontend
npm install

Backend:

cd backend
poetry install

4️⃣ Set Up pre-commit Hooks

pre-commit install --install-hooks
pre-commit run --all-files

5️⃣ Run Tests

Frontend tests:

cd frontend
npm run test

Backend tests:

cd backend
bash tests/run_tests.sh

6️⃣ Launch the Application

docker compose up --build

The application will be available at the following addresses:

🔄 Workflow

Information Search
- The user sends a request through the UI
- The agent analyzes the request and determines the necessary tools
- A search is performed across the knowledge base and a response is generated
Video Processing
- Uploading a YouTube video
- Extracting frames every 5 seconds
- Filtering similar images
- Creating a DOCX document
Voice Input
- Recording audio via the browser
- Transcription using Whisper
- Adding the text to the current query

📜 License

Apache License

⭐️ Support the Project

If you like the project, give it a star on GitHub!