hailin-shi's Stars
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
UX-Decoder/Semantic-SAM
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
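A minimal sketch of LAVIS's documented load-and-caption pattern; the `blip_caption` model name follows the library's model zoo, while the local image path is a placeholder:

```python
import torch
from PIL import Image
from lavis.models import load_model_and_preprocess

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Load a BLIP captioning model plus its matching image processors
# (any model registered in the LAVIS model zoo can be swapped in).
model, vis_processors, _ = load_model_and_preprocess(
    name="blip_caption", model_type="base_coco", is_eval=True, device=device
)

raw_image = Image.open("photo.jpg").convert("RGB")  # placeholder image path
image = vis_processors["eval"](raw_image).unsqueeze(0).to(device)

print(model.generate({"image": image}))  # a list with one caption string
```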
apple/ml-ferret
Surrey-UP-Lab/RegionSpot
Recognize Any Regions
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
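A sketch of the prompted-inference flow the repo documents, assuming the ViT-H checkpoint has already been downloaded; the image path and click coordinates are placeholders:

```python
import cv2
import numpy as np
from segment_anything import SamPredictor, sam_model_registry

# Load a downloaded checkpoint (filename from the repo's model zoo).
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("photo.jpg"), cv2.COLOR_BGR2RGB)  # HWC uint8 RGB
predictor.set_image(image)

# One foreground click (x, y) as the prompt; label 1 marks foreground.
masks, scores, _ = predictor.predict(
    point_coords=np.array([[500, 375]]),
    point_labels=np.array([1]),
    multimask_output=True,  # return three candidate masks
)
print(masks.shape, scores)  # (3, H, W) boolean masks with confidence scores
```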
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools so that you can focus on what matters.
baichuan-inc/Baichuan-7B
A large-scale 7B pretrained language model developed by Baichuan Inc.
Luodian/Otter
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
meta-llama/llama
Inference code for Llama models
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment.
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models and generate the data.
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
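Besides the desktop app, Netron ships a small Python API for launching the viewer from a script; a sketch assuming a local ONNX file:

```python
import netron

# Serve the visualizer for a local model file (ONNX, TFLite, Core ML, ...)
# and open it in the default browser; "model.onnx" is a placeholder path.
netron.start("model.onnx")
```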
CompVis/stable-diffusion
A latent text-to-image diffusion model
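The repo ships its own txt2img sampling scripts; as an alternative sketch (an assumption, not this repo's entry point), the same CompVis v1.4 weights can be sampled through Hugging Face diffusers:

```python
import torch
from diffusers import StableDiffusionPipeline

# Pull the CompVis v1.4 weights from the Hugging Face Hub and sample once.
pipe = StableDiffusionPipeline.from_pretrained(
    "CompVis/stable-diffusion-v1-4", torch_dtype=torch.float16
).to("cuda")

image = pipe("a photograph of an astronaut riding a horse").images[0]
image.save("astronaut.png")
```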
zalandoresearch/fashion-mnist
An MNIST-like fashion product database and benchmark.
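Because the files follow the MNIST format, any MNIST loader works unchanged; a sketch via torchvision's built-in dataset class:

```python
import torchvision
from torch.utils.data import DataLoader

# Drop-in replacement for MNIST: same 28x28 grayscale layout, 10 classes.
train_set = torchvision.datasets.FashionMNIST(
    root="data", train=True, download=True,
    transform=torchvision.transforms.ToTensor(),
)
loader = DataLoader(train_set, batch_size=64, shuffle=True)

images, labels = next(iter(loader))
print(images.shape, labels[:5])  # torch.Size([64, 1, 28, 28]) and 5 class ids
```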
cvlab-epfl/EPnP
EPnP: Efficient Perspective-n-Point Camera Pose Estimation
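EPnP is also available off the shelf in OpenCV (a separate reimplementation, not this repo's code); a sketch with made-up 3D-2D correspondences and intrinsics:

```python
import cv2
import numpy as np

# Four or more known 3D points and their 2D projections (placeholder values).
object_points = np.array(
    [[0, 0, 0], [1, 0, 0], [0, 1, 0], [1, 1, 0], [0.5, 0.5, 1]], dtype=np.float64
)
image_points = np.array(
    [[320, 240], [420, 240], [320, 340], [420, 340], [370, 290]], dtype=np.float64
)
# Placeholder pinhole intrinsics: focal length 800 px, principal point (320, 240).
K = np.array([[800, 0, 320], [0, 800, 240], [0, 0, 1]], dtype=np.float64)

ok, rvec, tvec = cv2.solvePnP(
    object_points, image_points, K, None, flags=cv2.SOLVEPNP_EPNP
)
print(ok, rvec.ravel(), tvec.ravel())  # rotation (Rodrigues vector) + translation
```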
lucasjinreal/yolov7_d2
(An earlier YOLOv7, not the official one) YOLO with Transformers and instance segmentation, with TensorRT acceleration.
Oneflow-Inc/libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
YehLi/xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
SmallStoneSK/github-star-trend
A Chrome extension for viewing a project's star growth trend.
JDAI-CV/CoTNet
An official implementation of "Contextual Transformer Networks for Visual Recognition".
JDAI-CV/fast-reid
SOTA Re-identification Methods and Toolbox
JDAI-CV/centerX
This repo implements CenterNet on top of detectron2.