dogvane's Stars
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
MasterYip/ChatPaper2Xmind
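A tool that generates XMind notes from papers: it uses ChatGPT to convert a paper PDF into concise XMind notes with images and formulas, making paper reading more efficient.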
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
ymcui/Chinese-LLaMA-Alpaca-2
Phase two of the Chinese LLaMA-2 & Alpaca-2 LLM project, with 64K long-context models
modelscope/KAN-TTS
KAN-TTS is a speech-synthesis training framework. Please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech
ZHO-ZHO-ZHO/ComfyUI-ZHO-Chinese
Simplified Chinese version of ComfyUI
NVlabs/ffhq-dataset
Flickr-Faces-HQ Dataset (FFHQ)
lansinuote/Chinese_Speech_to_Text
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Breakthrough/PySceneDetect
🎥 Python and OpenCV-based scene cut/transition detection program & library.
LizhenWangT/StyleAvatar
Code of SIGGRAPH 2023 Conference paper: StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video
LizhenWangT/FaceVerse
FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset (CVPR2022)
amusi/CVPR2024-Papers-with-Code
A collection of CVPR 2024 papers and open-source projects
eiaserinnys/pdf2md
This project, pdf2md, transforms academic paper PDF files into digestible text files. By analyzing the layout of the PDF file, the application restructures paragraphs and translates desired content. The final result is a conveniently exported text file.
SysCV/sam-hq
Segment Anything in High Quality [NeurIPS 2023]
SysCV/sam-pt
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.
s0md3v/sd-webui-roop
roop extension for StableDiffusion web-ui
shuxueslpi/chatGLM-6B-QLoRA
Uses the peft library for efficient 4-bit QLoRA fine-tuning of chatGLM-6B/chatGLM2-6B, then merges the LoRA model with the base model and quantizes the result to 4 bits.
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
ztxz16/fastllm
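A pure C++ cross-platform LLM acceleration library that can be called from Python. chatglm-6B-class models can reach 10000+ tokens/s on a single GPU. Supports glm, llama, and moss base models, and runs smoothly on mobile devices.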
open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
tylearymf/UniHacker
Patch all versions of Unity3D and UnityHub for Windows, MacOS, Linux and Docker.
Hedlen/awesome-segment-anything
Tracking and collecting papers/projects/others related to Segment Anything.
ChaoningZhang/MobileSAM
This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!
nilaoda/BBDown
Bilibili Downloader. A command-line downloader for Bilibili.
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
Deci-AI/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
dme-compunet/YoloV8
Use YOLOv8 in real-time, for object detection, instance segmentation, pose estimation and image classification, via ONNX Runtime.
CASIA-IVA-Lab/FastSAM
Fast Segment Anything