/devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Primary LanguagePythonMIT LicenseMIT

Devika Logo

🚀 Devika - Agentic AI Software Engineer 👩‍💻

devika screenshot

Important

This project is currently in a very early development/experimental stage. There are a lot of unimplemented/broken features at the moment. Contributions are welcome to help out with the progress!

Table of Contents

About

Devika is an advanced AI software engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika utilizes large language models, planning and reasoning algorithms, and web browsing abilities to intelligently develop software.

Devika aims to revolutionize the way we build software by providing an AI pair programmer who can take on complex coding tasks with minimal human guidance. Whether you need to create a new feature, fix a bug, or develop an entire project from scratch, Devika is here to assist you.

Note

Devika is modeled after Devin by Cognition AI. This project aims to be an open-source alternative to Devin with an "overly ambitious" goal to meet the same score as Devin in the SWE-bench Benchmarks... and eventually beat it?

Demoes

devika-pygame-demo.mp4

Key Features

  • 🧠 Advanced AI planning and reasoning capabilities
  • 🔍 Contextual keyword extraction for focused research
  • 🌐 Seamless web browsing and information gathering
  • 💻 Code writing in multiple programming languages
  • 📊 Dynamic agent state tracking and visualization
  • 💬 Natural language interaction via chat interface
  • 📂 Project-based organization and management
  • 🔌 Extensible architecture for adding new features and integrations

System Architecture

Devika's system architecture consists of the following key components:

  1. User Interface: A web-based chat interface for interacting with Devika, viewing project files, and monitoring the agent's state.
  2. Agent Core: The central component that orchestrates the AI planning, reasoning, and execution process. It communicates with various sub-agents and modules to accomplish tasks.
  3. Large Language Models: Devika leverages state-of-the-art language models like Claude, GPT-4, and Local LLMs via Ollama for natural language understanding, generation, and reasoning.
  4. Planning and Reasoning Engine: Responsible for breaking down high-level objectives into actionable steps and making decisions based on the current context.
  5. Research Module: Utilizes keyword extraction and web browsing capabilities to gather relevant information for the task at hand.
  6. Code Writing Module: Generates code based on the plan, research findings, and user requirements. Supports multiple programming languages.
  7. Browser Interaction Module: Enables Devika to navigate websites, extract information, and interact with web elements as needed.
  8. Knowledge Base: Stores and retrieves project-specific information, code snippets, and learned knowledge for efficient access.
  9. Database: Persists project data, agent states, and configuration settings.

Read ARCHITECTURE.md for the detailed documentation.

Installation

To install Devika, follow these steps:

  1. Clone the Devika repository:
    git clone https://github.com/stitionai/devika.git
    
  2. Navigate to the project directory:
    cd devika
    
  3. Install the required dependencies:
    pip install -r requirements.txt
    
  4. Set up the necessary API keys and configuration (see Configuration section).
  5. Start the Devika server:
    python devika.py
    
  6. Compile and run the UI server:
    cd ui/
    bun run dev
    
  7. Access the Devika web interface by opening a browser and navigating to http://127.0.0.1:3000.

Getting Started

To start using Devika, follow these steps:

  1. Open the Devika web interface in your browser.
  2. Create a new project by clicking on the "New Project" button and providing a name for your project.
  3. Select the desired programming language and model configuration for your project.
  4. In the chat interface, provide a high-level objective or task description for Devika to work on.
  5. Devika will process your request, break it down into steps, and start working on the task.
  6. Monitor Devika's progress, view generated code, and provide additional guidance or feedback as needed.
  7. Once Devika completes the task, review the generated code and project files.
  8. Iterate and refine the project as desired by providing further instructions or modifications.

Configuration

Devika requires certain configuration settings and API keys to function properly. Update the config.toml file with the following information:

  • OPENAI_API_KEY: Your OpenAI API key for accessing GPT models.
  • CLAUDE_API_KEY: Your Anthropic API key for accessing Claude models.
  • BING_API_KEY: Your Bing Search API key for web searching capabilities.
  • DATABASE_URL: The URL for your database connection.
  • LOG_DIRECTORY: The directory where Devika's logs will be stored.
  • PROJECT_DIRECTORY: The directory where Devika's projects will be stored.

Make sure to keep your API keys secure and do not share them publicly.

Under The Hood

Let's dive deeper into some of the key components and techniques used in Devika:

AI Planning and Reasoning

Devika employs advanced AI planning and reasoning algorithms to break down high-level objectives into actionable steps. The planning process involves the following stages:

  1. Objective Understanding: Devika analyzes the given objective or task description to understand the user's intent and requirements.
  2. Context Gathering: Relevant context is collected from the conversation history, project files, and knowledge base to inform the planning process.
  3. Step Generation: Based on the objective and context, Devika generates a sequence of high-level steps to accomplish the task.
  4. Refinement and Validation: The generated steps are refined and validated to ensure their feasibility and alignment with the objective.
  5. Execution: Devika executes each step in the plan, utilizing various sub-agents and modules as needed.

The reasoning engine constantly evaluates the progress and makes adjustments to the plan based on new information or feedback received during execution.

Keyword Extraction

To enable focused research and information gathering, Devika employs keyword extraction techniques. The process involves the following steps:

  1. Preprocessing: The input text (objective, conversation history, or project files) is preprocessed by removing stop words, tokenizing, and normalizing the text.
  2. Keyword Identification: Devika uses the BERT (Bidirectional Encoder Representations from Transformers) model to identify important keywords and phrases from the preprocessed text. BERT's pre-training on a large corpus allows it to capture semantic relationships and understand the significance of words in the given context.
  3. Keyword Ranking: The identified keywords are ranked based on their relevance and importance to the task at hand. Techniques like TF-IDF (Term Frequency-Inverse Document Frequency) and TextRank are used to assign scores to each keyword.
  4. Keyword Selection: The top-ranked keywords are selected as the most relevant and informative for the current context. These keywords are used to guide the research and information gathering process.

By extracting contextually relevant keywords, Devika can focus its research efforts and retrieve pertinent information to assist in the task completion.

Browser Interaction

Devika incorporates browser interaction capabilities to navigate websites, extract information, and interact with web elements. The browser interaction module leverages the Playwright library to automate web interactions. The process involves the following steps:

  1. Navigation: Devika uses Playwright to navigate to specific URLs or perform searches based on the keywords or requirements provided.
  2. Element Interaction: Playwright allows Devika to interact with web elements such as clicking buttons, filling forms, and extracting text from specific elements.
  3. Page Parsing: Devika parses the HTML structure of the web pages visited to extract relevant information. It uses techniques like CSS selectors and XPath to locate and extract specific data points.
  4. JavaScript Execution: Playwright enables Devika to execute JavaScript code within the browser context, allowing for dynamic interactions and data retrieval.
  5. Screenshot Capture: Devika can capture screenshots of the web pages visited, which can be useful for visual reference or debugging purposes.

The browser interaction module empowers Devika to gather information from the web, interact with online resources, and incorporate real-time data into its decision-making and code generation processes.

Code Writing

Devika's code writing module generates code based on the plan, research findings, and user requirements. The process involves the following steps:

  1. Language Selection: Devika identifies the programming language specified by the user or infers it based on the project context.
  2. Code Structure Generation: Based on the plan and language-specific patterns, Devika generates the high-level structure of the code, including classes, functions, and modules.
  3. Code Population: Devika fills in the code structure with specific logic, algorithms, and data manipulation statements. It leverages the research findings, code snippets from the knowledge base, and its own understanding of programming concepts to generate meaningful code.
  4. Code Formatting: The generated code is formatted according to the language-specific conventions and best practices to ensure readability and maintainability.
  5. Code Review and Refinement: Devika reviews the generated code for syntax errors, logical inconsistencies, and potential improvements. It iteratively refines the code based on its own analysis and any feedback provided by the user.

Devika's code writing capabilities enable it to generate functional and efficient code in various programming languages, taking into account the specific requirements and context of each project.

Contributing

We welcome contributions to enhance Devika's capabilities and improve its performance. To contribute, please follow these steps:

  1. Fork the Devika repository on GitHub.
  2. Create a new branch for your feature or bug fix.
  3. Make your changes and ensure that the code passes all tests.
  4. Submit a pull request describing your changes and their benefits.

Please adhere to the coding conventions, maintain clear documentation, and provide thorough testing for your contributions.

License

Devika is released under the MIT License. See the LICENSE file for more information.


We hope you find Devika to be a valuable tool in your software development journey. If you have any questions, feedback, or suggestions, please don't hesitate to reach out. Happy coding with Devika!