/Kevin

⚡ Kevin: Code Quick, Create Fast

Primary LanguagePythonMIT LicenseMIT

The easiest way to run Kevin is to Open in GitHub Codespaces 🔥🔥🔥

The vision is to leverage SLMs effectively and work towards solving most of the issues on the SWE-Bench Lite evaluation.

Sample Examples using groq/llama3-8b-8192

  1. Create an application in flask that can convert AWS cloudformation stack format from json to yaml and use curl to test ; don't do anything extra please. Event history link

SWE-Bench Verified Results:

ID: astropy__astropy-12907

gemini-1.5-pro-latest event history link

gemini-1.5-flash-latest event history link

Kevin Changelogs:

  1. Added Auto Mode 🔥🔥🔥
  2. Restarted Jupyter kernel if package installed via bash too 👍
  3. Cleaned Browser Observattions 🧹
  4. Showed relevant error in UI 🚨
  5. Added Event History Condenser 📜
  6. Feat: Persist sandbox for Event Runtime 🥳🥳
  7. Parsed pip output and restarted kernel automatically (for bash too) 📦
  8. Added editable address bar in browser tab 🌐
  9. Include workspace contents if any at first step only. 📂
  10. Add start and kill modes in Makefile 📳
  11. Process interactive commands and stream output in logs 📜
  12. Use execute tags for browsing agent too 🏷️
  13. Feat: Regenerate message 🔄
  14. Feat: Editable Notebook 📝
  15. Feat: Add docker to sandbox 🐳
  16. UI: Enable right click to paste in terminal 🖱️
  17. Added override UI settings configuration 🛠️
  18. UI: Show Step Count 📊
  19. Import the event history upto specific steps 📜
  20. Add litellm caching 📦

Bug Fixes:

  1. Fixed GroqException - content must be a string for role system & assisstant 🛠️
  2. Fixed GroqException - condense' is unsupported 🛠️
  3. Clear history when starting a new task 🧹
  4. Add miniforge path to synchronize bash and notebook 🛣️
  5. Fixed frontend terminal prompt 🛠️
  6. Set TERM variable in bash 🛠️

Minor Changes:

  1. Notify after task is finished 📢

Separate Feature Branches:

  1. Added Tutor Agent 🧑‍🏫

Logo

OpenHands: Code Less, Make More

Contributors Stargazers CodeCov MIT License
Join our Slack community Join our Discord community Credits
Check out the documentation Paper on Arxiv Evaluation Benchmark Score

Welcome to OpenHands (formerly OpenDevin), a platform for software development agents powered by AI.

OpenHands agents can do anything a human developer can: modify code, run commands, browse the web, call APIs, and yes—even copy code snippets from StackOverflow.

Learn more at docs.all-hands.dev, or jump to the Quick Start.

App screenshot

⚡ Quick Start

The easiest way to run OpenHands is in Docker. You can change WORKSPACE_BASE below to point OpenHands to existing code that you'd like to modify.

See the Getting Started guide for system requirements and more information.

export WORKSPACE_BASE=$(pwd)/workspace

docker run -it --pull=always \
    -e SANDBOX_RUNTIME_CONTAINER_IMAGE=ghcr.io/all-hands-ai/runtime:0.9-nikolaik \
    -e SANDBOX_USER_ID=$(id -u) \
    -e WORKSPACE_MOUNT_PATH=$WORKSPACE_BASE \
    -v $WORKSPACE_BASE:/opt/workspace_base \
    -v /var/run/docker.sock:/var/run/docker.sock \
    -p 3000:3000 \
    --add-host host.docker.internal:host-gateway \
    --name openhands-app-$(date +%Y%m%d%H%M%S) \
    ghcr.io/all-hands-ai/openhands:0.9

You'll find OpenHands running at http://localhost:3000!

You can also run OpenHands in a scriptable headless mode, or as an interactive CLI.

Visit Getting Started for more information and setup instructions.

If you want to modify the Kevin source code, check out Development.md.

Having issues? The Troubleshooting Guide can help.

📖 Documentation

To learn more about the project, and for tips on using OpenHands, check out our documentation.

There you'll find resources on how to use different LLM providers, troubleshooting resources, and advanced configuration options.

🤝 How to Contribute

OpenHands is a community-driven project, and we welcome contributions from everyone. Whether you're a developer, a researcher, or simply enthusiastic about advancing the field of software engineering with AI, there are many ways to get involved:

  • Code Contributions: Help us develop new agents, core functionality, the frontend and other interfaces, or sandboxing solutions.
  • Research and Evaluation: Contribute to our understanding of LLMs in software engineering, participate in evaluating the models, or suggest improvements.
  • Feedback and Testing: Use the OpenHands toolset, report bugs, suggest features, or provide feedback on usability.

For details, please check CONTRIBUTING.md.

🤖 Join Our Community

Whether you're a developer, a researcher, or simply enthusiastic about OpenHands, we'd love to have you in our community. Let's make software engineering better together!

  • Slack workspace - Here we talk about research, architecture, and future development.
  • Discord server - This is a community-run server for general discussion, questions, and feedback.

📈 Progress

Star History Chart

📜 License

Distributed under the MIT License. See LICENSE for more information.

🙏 Acknowledgements

OpenHands is built by a large number of contributors, and every contribution is greatly appreciated! We also build upon other open source projects, and we are deeply thankful for their work.

For a list of open source projects and licenses used in OpenHands, please see our CREDITS.md file.

📚 Cite

@misc{opendevin,
      title={{OpenDevin: An Open Platform for AI Software Developers as Generalist Agents}},
      author={Xingyao Wang and Boxuan Li and Yufan Song and Frank F. Xu and Xiangru Tang and Mingchen Zhuge and Jiayi Pan and Yueqi Song and Bowen Li and Jaskirat Singh and Hoang H. Tran and Fuqiang Li and Ren Ma and Mingzhang Zheng and Bill Qian and Yanjun Shao and Niklas Muennighoff and Yizhe Zhang and Binyuan Hui and Junyang Lin and Robert Brennan and Hao Peng and Heng Ji and Graham Neubig},
      year={2024},
      eprint={2407.16741},
      archivePrefix={arXiv},
      primaryClass={cs.SE},
      url={https://arxiv.org/abs/2407.16741},
}