pingguomaggie's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
openai/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by the OpenAI Solutions team.
modelscope/agentscope
Start building LLM-empowered multi-agent applications more easily.
comet-ml/opik
Open-source end-to-end LLM Development Platform
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest Advances on Multimodal Large Language Models
SmartFlowAI/TheGodOfCookery
opea-project/GenAIExamples
Generative AI Examples is a collection of GenAI examples, such as ChatQnA and Copilot, that illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.
luogen1996/LaVIN
[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"
Institute4FutureHealth/CHA
Conversational Health Agents: A Personalized LLM-powered Agent Framework
Atomic-man007/Awesome_Multimodel_LLM
Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLMs). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancements.
InternLM/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
SCUTlihaoyu/open-chat-video-editor
An open-source tool for automatic short-video generation.
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
InterDigitalInc/CompressAI
A PyTorch library and evaluation platform for end-to-end compression research
HoangTrinh/ROI_Online_Meeting_Codec
The official source code for RCLC: ROI-based joint conventional and learning video compression
showlab/Image2Paragraph
[A toolbox for fun.] Transform Image into Unique Paragraph with ChatGPT, BLIP2, OFA, GRIT, Segment Anything, ControlNet.
ExponentialML/Video-BLIP2-Preprocessor
A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
gorjanradevski/text2atlas
Codebase for "Learning to ground medical text in a 3D human atlas (CoNLL 2020)".
cambridgeltl/visual-med-alpaca
Visual Med-Alpaca is an open-source, multi-modal foundation model designed specifically for the biomedical domain, built on LLaMA-7B.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment, and Generate Anything
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
deepaknlp/MedVidQACL
Implementation of the Benchmark Approaches for Medical Instructional Video Classification (MedVidCL) and Medical Video Question Answering (MedVidQA)
baaivision/Painter
Painter & SegGPT Series: Vision Foundation Models from BAAI
mrebol/Gestures-From-Speech
openhuman-ai/awesome-gesture_generation
Awesome Gesture Generation
ShenhanQian/SpeechDrivesTemplates
[ICCV 2021] The official repo for the paper "Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates".
alvinliu0/HA2G
[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"
Advocate99/DiffGesture
[CVPR'2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation