huiyiygy's Stars
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
lllyasviel/ControlNet
Let us control diffusion models!
gedoor/legado
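Legado 3.0 Book Reader with powerful controls & full functions ❤️ Legado (阅读) 3.0 is a tool for reading web content from user-customizable sources, giving web literature fans a convenient, fast, and comfortable reading experience.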
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
listen1/listen1_desktop
One for all free music in China (Windows, Mac, Linux desktop)
lllyasviel/ControlNet-v1-1-nightly
Nightly release of ControlNet 1.1
UX-Decoder/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
lipku/metahuman-stream
Real time interactive streaming digital human
lllyasviel/Paints-UNDO
Understand Human Behavior to Align True Needs
CVCUDA/CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
mit-han-lab/bevfusion
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
ShoufaChen/DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
gligen/GLIGEN
Open-Set Grounded Text-to-Image Generation
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
tianweiy/CenterPoint
instantX-research/InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
HuangJunJie2017/BEVDet
Official code base of the BEVDet series.
wnlen/clash-for-linux
clash-for-linux
siyuanliii/masa
Official implementation of the CVPR24 highlight paper: Matching Anything by Segmenting Anything
Thinklab-SJTU/Awesome-LLM4AD
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
zju3dv/street_gaussians
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
ziyc/drivestudio
A 3DGS framework for omni urban scene reconstruction and simulation.
SY-007-Research/3dgs_render_python
ADLab-AutoDrive/BEVHeight
An official code release of our CVPR'23 paper, BEVHeight
microsoft/ReCo
ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023
omer11a/bounded-attention
carlinds/unisim
DaTongjie/BEVSpread