aidiary's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
microsoft/generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
OpenInterpreter/open-interpreter
A natural language interface for computers
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
OpenDevin/OpenDevin
🐚 OpenDevin: Code Less, Make More
mckaywrigley/chatbot-ui
AI chat for every model.
danny-avila/LibreChat
Enhanced ChatGPT Clone: Features Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. Actively in public development.
assafelovic/gpt-researcher
LLM based autonomous agent that does online comprehensive research on any given topic
BerriAI/litellm
Python SDK, Proxy Server to call 100+ LLM APIs using the OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
asweigart/pyautogui
A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.
Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Chainlit/chainlit
Build Conversational AI in minutes ⚡️
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
langchain-ai/opengpts
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
agiresearch/AIOS
AIOS: LLM Agent Operating System
NExT-GPT/NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Stability-AI/stable-audio-tools
Generative models for conditional audio generation
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
OpenCodeInterpreter/OpenCodeInterpreter
OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.
langchain-ai/streamlit-agent
Reference implementations of several LangChain agents as Streamlit apps
cloneofsimo/minDiffusion
Self-contained, minimalistic implementation of diffusion models with Pytorch.
litagin02/Style-Bert-VITS2
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
onlyphantom/llm-python
Large Language Models (LLMs) tutorials & sample scripts, ft. langchain, openai, llamaindex, gpt, chromadb & pinecone
seratch/ChatGPT-in-Slack
Swift demonstration of how to build a Slack app that enables end-users to interact with a ChatGPT bot
jiatastic/GPTInterviewer
GPT Interviewer - Practice interview with AI interviewer based on job descriptions and resume
theevann/streamlit-audiorecorder
Audio recorder for streamlit
tonnetonne814/PITS-44100-Ja
44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。