maybelu9's Stars

AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Language: Python 144k 1.1k 7.7k 27k
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language: Jupyter Notebook 95.4k 692 7.9k 15.5k
facebookresearch/llama
Inference code for LLaMA models
Language: Python 50.9k 499 8728.7k
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook 47.8k 308 6745.7k
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Language: Python 37.1k 431 1.6k 3.2k
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Language: Python 34.2k 178 5k 2.6k
lllyasviel/ControlNet
Let us control diffusion models!
Language: Python 30.5k 218 5562.7k
microsoft/JARVIS
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Language: Python 23.7k 381 1812k
yoheinakajima/babyagi
Language: Python 20.5k 302 1512.7k
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language: Jupyter Notebook 18.7k 154 4692.2k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language: Python 14.4k 120 1.1k 1.4k
mlfoundations/open_clip
An open source implementation of CLIP.
Language: Python 10.4k 79 496991
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook 10k 99 667974
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language: Jupyter Notebook 9.5k 141 4521.5k
nebuly-ai/nebuly
The user analytics platform for LLMs
Language: Python 8.4k 93 202644
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Language: Jupyter Notebook 4.8k 34 198646
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
Language: Python 3.8k 56 54315
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
Language: Python 3.8k 48 176285
mukulpatnaik/researchgpt
An LLM-based research assistant that allows you to have a conversation with a research paper
Language: Python 3.6k 41 61338
google/prompt-to-prompt
Language: Jupyter Notebook 3.2k 25 84296
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Language: Python 2.2k 19 58138
lucidrains/toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
Language: Python 2k 38 16123
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
Language: Python 1.8k 16 81202
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language: Python 1.4k 28 19388
juncongmoo/chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable on a single GPU. 15x faster training process than ChatGPT
Language: Python 1.2k 20 8138
microsoft/MM-REACT
Official repo for MM-REACT
Language: Python 937 19 1069
JonasSchult/Mask3D
Mask3D predicts accurate 3D semantic instances, achieving state-of-the-art results on ScanNet, ScanNet200, S3DIS and STPLS3D.
Language: Python 554 8 168111
LAION-AI/aesthetic-predictor
A linear estimator on top of CLIP to predict the aesthetic quality of pictures
Language: Jupyter Notebook 490 13 720
davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
Language: Python 185 3 5529
HenryJunW/TAG
Language: Python 21 1 70