🚀 Devika - Agentic AI Software Engineer 👩‍💻

Important

This project is currently in a very early development/experimental stage. There are a lot of unimplemented/broken features at the moment. Contributions are welcome to help out with the progress!

About
Key Features
System Architecture
Quick Start
Installation
Getting Started
Configuration
Under The Hood
Community Discord Server
Contributing
License

About

Devika is an advanced AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika utilizes large language models, planning and reasoning algorithms, and web browsing abilities to intelligently develop software.

Devika aims to revolutionize the way we build software by providing an AI pair programmer who can take on complex coding tasks with minimal human guidance. Whether you need to create a new feature, fix a bug, or develop an entire project from scratch, Devika is here to assist you.

Note

Devika is modeled after Devin by Cognition AI. This project aims to be an open-source alternative to Devin with an "overly ambitious" goal to meet the same score as Devin in the SWE-bench Benchmarks... and eventually beat it?

Demos

devika-pygame-demo.mp4

Key Features

🤖 Supports Claude 3, GPT-4, GPT-3.5, and Local LLMs via Ollama. For optimal performance: Use the Claude 3 family of models.
🧠 Advanced AI planning and reasoning capabilities
🔍 Contextual keyword extraction for focused research
🌐 Seamless web browsing and information gathering
💻 Code writing in multiple programming languages
📊 Dynamic agent state tracking and visualization
💬 Natural language interaction via chat interface
📂 Project-based organization and management
🔌 Extensible architecture for adding new features and integrations

System Architecture

Devika's system architecture consists of the following key components:

User Interface: A web-based chat interface for interacting with Devika, viewing project files, and monitoring the agent's state.
Agent Core: The central component that orchestrates the AI planning, reasoning, and execution process. It communicates with various sub-agents and modules to accomplish tasks.
Large Language Models: Devika leverages state-of-the-art language models like Claude, GPT-4, and Local LLMs via Ollama for natural language understanding, generation, and reasoning.
Planning and Reasoning Engine: Responsible for breaking down high-level objectives into actionable steps and making decisions based on the current context.
Research Module: Utilizes keyword extraction and web browsing capabilities to gather relevant information for the task at hand.
Code Writing Module: Generates code based on the plan, research findings, and user requirements. Supports multiple programming languages.
Browser Interaction Module: Enables Devika to navigate websites, extract information, and interact with web elements as needed.
Knowledge Base: Stores and retrieves project-specific information, code snippets, and learned knowledge for efficient access.
Database: Persists project data, agent states, and configuration settings.

Read ARCHITECTURE.md for the detailed documentation.

Quick Start

The easiest way to run the project locally:

Install uv - Python Package manager (https://github.com/astral-sh/uv)
Install bun - JavaScript runtime (https://bun.sh/docs/installation)
Install and setup Ollama (https://ollama.com/) (if you want don't want to use the local models then you can skip this step)

For ollama you need to install the models
For API models, configure the API keys via setting page in UI.

Then execute the following set of command:

ollama serve
git clone https://github.com/stitionai/devika.git
cd devika/
uv venv
source .venv/bin/activate
uv pip install -r requirements.txt
playwright install --with-deps
cd ui/
bun install
bun run dev
cd ..
python3 devika.py

Docker images will be released soon. 🙌

Installation

Devika requires the following things as dependencies:

Ollama (follow the instructions here to install it: https://ollama.com/)
Bun (follow the instructions here to install it: https://bun.sh/)

To install Devika, follow these steps:

Clone the Devika repository:

git clone https://github.com/stitionai/devika.git

Navigate to the project directory:
```
cd devika
```
Create a virtual environment and install the required dependencies:
```
uv venv
uv pip install -r requirements.txt
```

Install the required dependencies:

pip install -r requirements.txt
playwright install --with-deps # installs browsers in playwright (and their deps) if required

Set up the necessary API keys and Configuration
Start the Devika server:
```
python devika.py
```
Compile and run the UI server:
```
cd ui/
bun install
bun run dev
```
Access the Devika web interface by opening a browser and navigating to http://127.0.0.1:3000.

Getting Started

To start using Devika, follow these steps:

Open the Devika web interface in your browser.
Create a new project by clicking on the "New Project" button and providing a name for your project.
Select the desired programming language and model configuration for your project.
In the chat interface, provide a high-level objective or task description for Devika to work on.
Devika will process your request, break it down into steps, and start working on the task.
Monitor Devika's progress, view generated code, and provide additional guidance or feedback as needed.
Once Devika completes the task, review the generated code and project files.
Iterate and refine the project as desired by providing further instructions or modifications.

Configuration

Devika requires certain configuration settings and API keys to function properly:

when you first time run Devika, it will create a config.toml file for you in the root directory. You can configure the following settings in the settings page via UI:

STORAGE
- SQLITE_DB: The path to the SQLite database file for storing Devika's data.
- SCREENSHOTS_DIR: The directory where screenshots captured by Devika will be stored.
- PDFS_DIR: The directory where PDF files processed by Devika will be stored.
- PROJECTS_DIR: The directory where Devika's projects will be stored.
- LOGS_DIR: The directory where Devika's logs will be stored.
- REPOS_DIR: The directory where Git repositories cloned by Devika will be stored.
- WEB_SEARCH: This determines the default web search method for browsing the web. Accepted values are: google, bing, or ddgs.
API KEYS
- BING: Your Bing Search API key for web searching capabilities.
- GOOGLE_SEARCH: Your Google Search API key for web searching capabilities.
- GOOGLE_SEARCH_ENGINE_ID: Your Google Search Engine Id for web searching using google.
- OPENAI: Your OpenAI API key for accessing GPT models.
- GEMINI: Your Gemini API key for accessing Gemini models.
- CLAUDE: Your Anthropic API key for accessing Claude models.
- MISTRAL: Your Mistral API key for accessing Mistral models.
- GROQ: Your Groq API key for accessing Groq models.
- NETLIFY: Your Netlify API key for deploying and managing web projects.

Make sure to keep your API keys secure and do not share them publicly.

Configuring web search method

Devika currently supports Bing, Google, and DuckDuckGo for web searches. You can configure the web search method via UI.

Under The Hood

Let's dive deeper into some of the key components and techniques used in Devika:

AI Planning and Reasoning

Devika employs advanced AI planning and reasoning algorithms to break down high-level objectives into actionable steps. The planning process involves the following stages:

Objective Understanding: Devika analyzes the given objective or task description to understand the user's intent and requirements.
Context Gathering: Relevant context is collected from the conversation history, project files, and knowledge base to inform the planning process.
Step Generation: Based on the objective and context, Devika generates a sequence of high-level steps to accomplish the task.
Refinement and Validation: The generated steps are refined and validated to ensure their feasibility and alignment with the objective.
Execution: Devika executes each step in the plan, utilizing various sub-agents and modules as needed.

The reasoning engine constantly evaluates the progress and makes adjustments to the plan based on new information or feedback received during execution.

Keyword Extraction

To enable focused research and information gathering, Devika employs keyword extraction techniques. The process involves the following steps:

Preprocessing: The input text (objective, conversation history, or project files) is preprocessed by removing stop words, tokenizing, and normalizing the text.
Keyword Identification: Devika uses the BERT (Bidirectional Encoder Representations from Transformers) model to identify important keywords and phrases from the preprocessed text. BERT's pre-training on a large corpus allows it to capture semantic relationships and understand the significance of words in the given context.
Keyword Ranking: The identified keywords are ranked based on their relevance and importance to the task at hand. Techniques like TF-IDF (Term Frequency-Inverse Document Frequency) and TextRank are used to assign scores to each keyword.
Keyword Selection: The top-ranked keywords are selected as the most relevant and informative for the current context. These keywords are used to guide the research and information gathering process.

By extracting contextually relevant keywords, Devika can focus its research efforts and retrieve pertinent information to assist in the task completion.

Browser Interaction

Devika incorporates browser interaction capabilities to navigate websites, extract information, and interact with web elements. The browser interaction module leverages the Playwright library to automate web interactions. The process involves the following steps:

Navigation: Devika uses Playwright to navigate to specific URLs or perform searches based on the keywords or requirements provided.
Element Interaction: Playwright allows Devika to interact with web elements such as clicking buttons, filling forms, and extracting text from specific elements.
Page Parsing: Devika parses the HTML structure of the web pages visited to extract relevant information. It uses techniques like CSS selectors and XPath to locate and extract specific data points.
JavaScript Execution: Playwright enables Devika to execute JavaScript code within the browser context, allowing for dynamic interactions and data retrieval.
Screenshot Capture: Devika can capture screenshots of the web pages visited, which can be useful for visual reference or debugging purposes.

The browser interaction module empowers Devika to gather information from the web, interact with online resources, and incorporate real-time data into its decision-making and code generation processes.

Code Writing

Devika's code writing module generates code based on the plan, research findings, and user requirements. The process involves the following steps:

Language Selection: Devika identifies the programming language specified by the user or infers it based on the project context.
Code Structure Generation: Based on the plan and language-specific patterns, Devika generates the high-level structure of the code, including classes, functions, and modules.
Code Population: Devika fills in the code structure with specific logic, algorithms, and data manipulation statements. It leverages the research findings, code snippets from the knowledge base, and its own understanding of programming concepts to generate meaningful code.
Code Formatting: The generated code is formatted according to the language-specific conventions and best practices to ensure readability and maintainability.
Code Review and Refinement: Devika reviews the generated code for syntax errors, logical inconsistencies, and potential improvements. It iteratively refines the code based on its own analysis and any feedback provided by the user.

Devika's code writing capabilities enable it to generate functional and efficient code in various programming languages, taking into account the specific requirements and context of each project.

Community Discord Server

We have a Discord server for the Devika community, where you can connect with other users, share your experiences, ask questions, and collaborate on the project. To join the server, please follow these guidelines:

Be respectful: Treat all members of the community with kindness and respect. Harassment, hate speech, and other forms of inappropriate behavior will not be tolerated.
Contribute positively: Share your ideas, insights, and feedback to help improve Devika. Offer assistance to other community members when possible.
Maintain privacy: Respect the privacy of others and do not share personal information without their consent.

To join the Devika community Discord server, click here.

Contributing

We welcome contributions to enhance Devika's capabilities and improve its performance. To contribute, please see the CONTRIBUTING.md file for steps.

License

Devika is released under the MIT License. See the LICENSE file for more information.

Star History

We hope you find Devika to be a valuable tool in your software development journey. If you have any questions, feedback, or suggestions, please don't hesitate to reach out. Happy coding with Devika!

harshakhmk/devikaAI