maybelu9's Stars
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
facebookresearch/llama
Inference code for LLaMA models
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
LAION-AI/Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
lllyasviel/ControlNet
Let us control diffusion models!
microsoft/JARVIS
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
yoheinakajima/babyagi
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
mlfoundations/open_clip
An open source implementation of CLIP.
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
nebuly-ai/nebuly
The user analytics platform for LLMs
salesforce/BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
amazon-science/mm-cot
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
mukulpatnaik/researchgpt
An LLM-based research assistant that allows you to have a conversation with a research paper
google/prompt-to-prompt
fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
lucidrains/toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
KaiyangZhou/CoOp
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
juncongmoo/chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable on a single GPU. 15x faster training process than ChatGPT
microsoft/MM-REACT
Official repo for MM-REACT
JonasSchult/Mask3D
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
LAION-AI/aesthetic-predictor
A linear estimator on top of CLIP to predict the aesthetic quality of pictures
davidnvq/grit
GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)
HenryJunW/TAG