chengzhengxin's Stars
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
facebook/folly
An open-source C++ library developed and used at Facebook.
Vision-CAIR/MiniGPT-4
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
sczhou/CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
mlfoundations/open_clip
An open source implementation of CLIP.
pengxiao-song/LaWGPT
🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese legal knowledge. A large language model based on Chinese legal knowledge.
OFA-Sys/Chinese-CLIP
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
ceres-solver/ceres-solver
A large scale non-linear optimization library
mlfoundations/open_flamingo
An open-source framework for training large multimodal models.
fundamentalvision/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
CVI-SZU/Linly
Chinese-LLaMA 1 & 2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets
open-mmlab/Multimodal-GPT
Multimodal-GPT
KidsWithTokens/Medical-SAM-Adapter
Adapting Segment Anything Model for Medical Image Segmentation
NVlabs/FB-BEV
Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception
OpenGVLab/VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
MCG-NJU/MixFormer
[CVPR 2022 Oral & TPAMI 2024] MixFormer: End-to-End Tracking with Iterative Mixed Attention
wh200720041/ssl_slam2
SSL_SLAM2: Lightweight 3-D Localization and Mapping for Solid-State LiDAR (mapping and localization separated) ICRA 2021
JamesQFreeman/LoRA-ViT
Low rank adaptation for Vision Transformer
wangwt/shadowsocks
The latest address for shadowsocks
MCG-NJU/SparseBEV
[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
urbste/OpenImuCameraCalibrator
Camera calibration tool
ryanphilly/IOS-PointCloud
Create, save, and export point clouds with LiDAR-equipped iPhones
hero-y/BHRL
[CVPR 2022] Balanced and Hierarchical Relation Learning for One-shot Object Detection
eupenik/PoindCloudRenderer
Simplest app to render point cloud in iOS using SceneKit
zhxt/cyber-rt
Redistributed Apollo CyberRT, built with CMake.