tworuler's Stars
xtekky/gpt4free
The official gpt4free repository | a varied collection of powerful language models
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
mli/paper-reading
Paragraph-by-paragraph close readings of classic and recent deep learning papers
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
LlamaFamily/Llama-Chinese
Llama Chinese community: the Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are aggregated in real time, and all code has been updated for Llama3; building the best Chinese Llama large model, fully open source and available for commercial use
KwaiVGI/LivePortrait
Bring portraits to life!
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles: Latest Advances on Multimodal Large Language Models
openai/shap-e
Generate 3D objects conditioned on text or images
ggerganov/ggml
Tensor library for machine learning
wolfpld/tracy
Frame profiler
modelscope/facechain
FaceChain is a deep-learning toolchain for generating your digital twin.
voxel51/fiftyone
Refine high-quality datasets and visual AI models
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
THUDM/CogVLM
A state-of-the-art open visual language model | multimodal pretrained model
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
OpenGVLab/InternGPT
InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, GPT-4-style multimodal chat, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)
InternLM/InternLM-XComposer
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
VainF/Awesome-Anything
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
baaivision/Emu
Emu Series: Generative Multimodal Models from BAAI
IDEA-Research/awesome-detection-transformer
A collection of papers on transformers for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
Ma-Lab-Berkeley/CRATE
Code for CRATE (Coding RAte reduction TransformEr).
cvdfoundation/open-images-dataset
Open Images is a dataset of ~9 million images that have been annotated with image-level labels and bounding boxes spanning thousands of classes.
mbzuai-oryx/groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
ziqihuangg/Collaborative-Diffusion
[CVPR 2023] Collaborative Diffusion
mayuelala/FollowYourEmoji
[SIGGRAPH Asia 2024] The official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
cosmicman-cvpr2024/CosmicMan
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
FacePerceiver/LAION-Face
The human face subset of LAION-400M for large-scale face pretraining.
diffusion-facex/FaceX
kyegomez/Kosmos2.5
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"