microboym
Mai Congyi 麦从一 I am a student from Zhuhai No. 1 Middle School. 珠海市第一中学,高中生。
Zhuhai No. 1 Middle SchoolZhuhai, Guangdong, China
microboym's Stars
lyuwenyu/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
lucidrains/gigagan-pytorch
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
practical-tutorials/project-based-learning
Curated list of project-based tutorials
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
haoheliu/2021-ISMIR-MSS-Challenge-CWS-PResUNet
Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.
haoheliu/AudioLDM2
Text-to-Audio/Music Generation
facebookresearch/pyrobot
PyRobot: An Open Source Robotics Research Platform
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Lightning-AI/pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
PaddlePaddle/Paddle-Lite
PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)
Parskatt/DeDoDe
[3DV 2024 Oral] DeDoDe 🎶 Detect, Don't Describe --- Describe, Don't Detect, for Local Feature Matching
facebookresearch/ImageBind
ImageBind One Embedding Space to Bind Them All
OpenGVLab/DragGAN
Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (DragGAN 全功能实现,在线Demo,本地部署试用,代码、模型已全部开源,支持Windows, macOS, Linux)
baichuan-inc/Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
ZrrSkywalker/Personalize-SAM
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
Eugeny/tabby
A terminal for a more modern age
microsoft/AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
Weifeng-Chen/control-a-video
Official Implementation of "Control-A-Video: Controllable Text-to-Video Generation with Diffusion Models"
reflex-dev/reflex
🕸️ Web apps in pure Python 🐍
dailenson/SDT
This repository is the official implementation of Disentangling Writer and Character Styles for Handwriting Generation (CVPR 2023)
megvii-research/CREStereo
Official MegEngine implementation of CREStereo(CVPR 2022 Oral).
apple/ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
XingangPan/DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
abiosoft/colima
Container runtimes on macOS (and Linux) with minimal setup
zju3dv/deltar
Code for "DELTAR: Depth Estimation from a Light-weight ToF Sensor And RGB Image", ECCV 2022