saleiro's Stars
microsoft/LLMLingua
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
ranpox/awesome-computer-use
This is a collection of resources for computer-use agents, including videos, blogs, papers, and projects.
Nutlope/llama-ocr
Document to Markdown OCR library with Llama 3.2 vision
anthropics/courses
Anthropic's educational courses
DS4SD/docling
Get your documents ready for gen AI
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
ArcadeAI/arcade-ai
Arcade AI Python SDK and CLI
microsoft/UFO
A UI-Focused Agent for Windows OS Interaction.
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
microsoft/WindowsAgentArena
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
mistralai/mistral-inference
Official inference library for Mistral models
meta-llama/llama-stack
Composable building blocks to build Llama Apps
SylphAI-Inc/AdalFlow
AdalFlow: The library to build & auto-optimize LLM applications.
ServiceNow/BrowserGym
BrowserGym, a gym environment for web task automation in the Chromium browser.
mistralai/mistral-common
merveenoyan/smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
getomni-ai/zerox
PDF to Markdown with vision models
dottxt-ai/outlines
Structured Text Generation
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
microsoft/Trace
End-to-end Generative Optimization for AI Agents
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
eugeneyan/open-llms
📋 A list of open LLMs available for commercial use.
OthersideAI/self-operating-computer
A framework to enable multimodal models to operate a computer.
zou-group/textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
andrewyng/translation-agent
microsoft/promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
crewAIInc/crewAI
Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
danielmiessler/fabric
fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.