BeneyKim's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
lllyasviel/ControlNet
Let us control diffusion models!
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
open-mmlab/mmdetection
OpenMMLab Detection Toolbox and Benchmark
facebookresearch/pytorch3d
PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
open-mmlab/mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
facebookresearch/pytorchvideo
A deep learning library for video understanding research.
open-mmlab/mmdeploy
OpenMMLab Model Deployment Framework
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
kakaobrain/nerf-factory
An awesome PyTorch NeRF library
acoustid/chromaprint
C library for generating audio fingerprints used by AcoustID
AddictedCS/soundfingerprinting
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
hkchengrex/STCN
[NeurIPS 2021] Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
SysCV/MaskFreeVIS
Mask-Free Video Instance Segmentation [CVPR 2023]
JorenSix/Olaf
Olaf: Overly Lightweight Acoustic Fingerprinting is a portable acoustic fingerprinting system.
object-detection-algorithm/R-CNN
Object detection: an implementation of the R-CNN algorithm
mimbres/neural-audio-fp
Official implementation of Neural Audio Fingerprint (ICASSP 2021)
bytedance/ParaGen
ParaGen is a PyTorch deep learning framework for parallel sequence generation.
omerferhatt/torch2tflite
PyTorch to TensorFlow Lite converter
unmade/audiomatch
Find similar audio files easily
TuBui/RoSteALS
RoSteALS: Robust Steganography using Autoencoder Latent Space
Lemok00/IDEAS
Official PyTorch implementation of the paper "Image Disentanglement Autoencoder for Steganography without Embedding" (CVPR 2022).
MC-E/LF-VSN
Accepted by CVPR 2023
deep-learning-algorithm/Evaluation-Metrics
Implementations of various algorithm evaluation metrics (mAP/Flops/params/fps/error-rate/accuracy)
JianhuaYang001/spatial-image-steganography
The source code of the paper "An Embedding Cost Learning Framework Using GAN"
deep-learning-algorithm/PyNet
NumPy implementation of deep learning
object-detection-algorithm/SSD
Object detection: an implementation of the SSD algorithm