hcwei13's Stars
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models with support for multiple inference backends.
OpenLMLab/MOSS
An open-source tool-augmented conversational language model from Fudan University
OpenBMB/XAgent
An Autonomous LLM Agent for Complex Task Solving
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
salesforce/CodeGen
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
xlang-ai/OpenAgents
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
amazon-science/mm-cot
Official implementation of "Multimodal Chain-of-Thought Reasoning in Language Models" (more updates to come)
Breakthrough/PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
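For PySceneDetect, a minimal usage sketch of content-aware scene cut detection; this assumes PySceneDetect 0.6 or later and a hypothetical local input file `video.mp4`:

```python
# Minimal sketch: content-aware scene cut detection with PySceneDetect.
# Assumes PySceneDetect >= 0.6 and a hypothetical local file "video.mp4".
from scenedetect import detect, ContentDetector

# detect() returns a list of (start, end) FrameTimecode pairs, one per scene.
scenes = detect("video.mp4", ContentDetector())
for i, (start, end) in enumerate(scenes, start=1):
    print(f"Scene {i}: {start.get_timecode()} - {end.get_timecode()}")
```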
princeton-vl/RAFT
DSXiangLi/DecryptPrompt
A summary of Prompt & LLM papers, open-source data & models, and AIGC applications
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
OFA-Sys/ONE-PEACE
A general representation model across vision, audio, and language modalities. Paper: "ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities"
piergiaj/pytorch-i3d
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
PKU-YuanGroup/LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
LLaVA-VL/LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
Azure/MS-AMP
Microsoft Automatic Mixed Precision Library
jy0205/LaVIT
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
DAMO-DI-ML/NeurIPS2023-One-Fits-All
The official code for "One Fits All: Power General Time Series Analysis by Pretrained LM (NeurIPS 2023 Spotlight)"
Luodian/RelateAnything
The Relate Anything Model takes an image as input and uses SAM to identify the corresponding mask within the image.
persimmon-ai-labs/adept-inference
Inference code for Persimmon-8B
LLaVA-VL/LLaVA-Interactive-Demo
baaivision/CapsFusion
[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale
snap-research/MMVID
[CVPR 2022] Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
LeapLabTHU/Pseudo-Q
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
dhg-wei/DeCap
[ICLR 2023] DeCap: Decoding CLIP Latents for Zero-shot Captioning
Jingkang50/FunQA
FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, and beyond.
gydpku/PPTC
PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
PengZai/ARIC
Aesthetically Relevant Image Captioning