YongLD

ROBOT, LMs

SCUT, Outer Ring West Road, Panyu District, Guangzhou, Guangdong Province, China

YongLD's Stars

OpenInterpreter/open-interpreter
A natural language interface for computers
Language: Python 57.1k 420 9784.9k
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Language: Python 22.4k 189 5122.2k
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Language: Python 17.9k 144 3.6k 1.9k
e2b-dev/awesome-ai-agents
A list of AI autonomous agents
11.7k 213 32874
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook 10k 99 667975
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language: Python 4.8k 313 127598
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
Language: Python 4.2k 26 553449
yisol/IDM-VTON
[ECCV2024] IDM-VTON: Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Language: Python 4k 57 158623
OpenBMB/BMTools
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Language: Python 2.9k 35 37273
PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM-related technologies as possible. We built a fine-tuning platform for large models that is easy for researchers to pick up and use, and we welcome open-source enthusiasts to open any meaningful PR!
Language: Jupyter Notebook 2.6k 36 100248
THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Language: Python 2.2k 28 142163
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
Language: Python 1.7k 22 8886
open-compass/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
Language: Python 1.4k 11 227195
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python 1.3k 21 29125
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
Language: Python 1.3k 47 5386
BAAI-DCAI/Bunny
A family of lightweight multimodal models.
Language: Python 940 18 12569
mli/transformers-benchmarks
real Transformer TeraFLOPS on various GPUs
Language: Jupyter Notebook 876 11 5109
datadreamer-dev/DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Language: Python 859 8 2644
LTH14/rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
Language: Python 845 7 3940
dvlab-research/LLaMA-VID
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
Language: Python 744 14 10944
Event-AHU/Mamba_State_Space_Model_Paper_List
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications
624 13 634
EmulationAI/awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
610 23 333
Genesis-Embodied-AI/RoboGen
A generative and self-guided robotic agent that endlessly proposes and masters new skills.
Language: Python 605 12 2650
LeapLabTHU/Agent-Attention
Official repository of Agent Attention (ECCV2024)
Language: Python 545 4 4537
showlab/DragAnything
[ECCV 2024] DragAnything: Motion Control for Anything using Entity Representation
Language: Python 437 16 2415
robopen/roboagent
Repository to train and evaluate RoboAgent
Language: Python 306 28 2225
Marker-Inc-Korea/RAGchain
Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on...
Language: Python 279 2 26427
yudianzheng/SketchVideo
[EG 2023] Sketch Video Synthesis
Language: Jupyter Notebook 205 7 820
codefuse-ai/CodeFuse-MFT-VLM
Language: Python 34 1 58
thecharm/BDoG
Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning"
Language: Python 9 1 01
