larsoncs's Stars
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
AppFlowy-IO/AppFlowy
Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Huanshere/VideoLingo
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
HumanAIGC/EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
meltylabs/melty
Chat first code editor. To download the packaged app:
arcee-ai/mergekit
Tools for merging pretrained large language models.
AILab-CVC/YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Codium-ai/cover-agent
QodoAI Cover-Agent: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
linyqh/NarratoAI
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
dvmazur/mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
dvlab-research/ControlNeXt
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
kwuking/TimeMixer
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
bklieger-groq/stockbot-on-groq
StockBot powered by Groq: Lightning Fast AI Chatbot that Responds With Live Interactive Stock Charts, Financials, News, Screeners, and More. Powered by Llama3-70b on Groq, Vercel AI SDK, and TradingView Widgets.
alipay/agentUniverse
agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.
TencentARC/SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
RedAIGC/StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
SYSU-STAR/RACER
Rapid Exploration with Multiple Unmanned Aerial Vehicles (UAV)
uzh-rpg/high_mpc
Policy Search for Model Predictive Control with Application to Agile Drone Flight
SYSU-STAR/H2-Mapping
H2-Mapping: Real-time Dense Mapping Using Hierarchical Hybrid Representation (2023 RAL Best Paper Award)
huangd1999/AgentCoder
This Repo is the official implementation of AgentCoder and AgentCoder+.
LiveCodeBench/LiveCodeBench
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
HKUST-Aerial-Robotics/FC-Planner
[ICRA'24 Best UAV Paper Award Finalist] An Efficient Global Planner for Aerial Coverage
SYSU-STAR/STAR-Searcher
Open-source code for the RA-L paper "Star-Searcher: An Efficient Aerial System for Target Search in Unknown Environments".
YangAn17/cooperativeTargetSearch_MPSO
UAV swarm, Cooperative search