Pinned Repositories
Best-websites-a-programmer-should-visit-zh
程序员应该访问的最佳网站中文版
Detectron2-Densepose-IUV2XYZ
Hybrid-model-for-human-action-adverb-recognition
Baseline Code for ADHA dataset
labelme
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Llama2-Chinese
Llama中文社区,最好的中文Llama大模型,完全开源可商用
MCM-Note
medooze-player
Demo of a bug in `Player` component of Medooze WebRTC Media Server for Node.js
MFCC
mfcc, mel, pcen. (librosa)
nanoDet-2
ncnn-android-projects
Android Demon of mobilev2-yolo5s and retinaface
mathpopo's Repositories
mathpopo/Llama2-Chinese
Llama中文社区,最好的中文Llama大模型,完全开源可商用
mathpopo/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
mathpopo/Awesome-Linux-Software-zh_CN
🐧 一个 Linux 上超赞的应用,软件,工具以及其它资源的集中地。
mathpopo/codellama
Inference code for CodeLlama models
mathpopo/cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
mathpopo/CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
mathpopo/CVCUDA_FaceStoreHelper-release
Psyche AI Inc release source "CVCUDA_FaceStoreHelper"
mathpopo/DTLN
Tensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
mathpopo/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
mathpopo/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
mathpopo/EMO
mathpopo/Experiments-with-Gemma-2B
I’ll be testing different Gemma models and sharing the results here and on my Hugging Face space. Stay tuned for updates!
mathpopo/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
mathpopo/gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
mathpopo/infinigen
Infinite Photorealistic Worlds using Procedural Generation
mathpopo/llama
Inference code for LLaMA models
mathpopo/magic-avatar
MagicAvatar: Multimodal Avatar Generation and Animation
mathpopo/MetaTransformer
Meta-Transformer for Unified Multimodal Learning
mathpopo/nvm
Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions
mathpopo/pandas-llm
Pandas-LLM
mathpopo/project-based-learning
Curated list of project-based tutorials
mathpopo/python-docs-samples
Code samples used on cloud.google.com
mathpopo/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
mathpopo/Real-Gemini
Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本、语音、图像和视频和这是世界进行问答和交流。
mathpopo/recognize-anything
Code for the Recognize Anything Model (RAM) and Tag2Text Model
mathpopo/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
mathpopo/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
mathpopo/torchexplorer
Interactively inspect module inputs, outputs, parameters, and gradients.
mathpopo/Waifu2x-Extension-GUI
Video, Image and GIF upscale/enlarge(Super-Resolution) and Video frame interpolation. Achieved with Waifu2x, Real-ESRGAN, Real-CUGAN, RTX Video Super Resolution VSR, SRMD, RealSR, Anime4K, RIFE, IFRNet, CAIN, DAIN, and ACNet.
mathpopo/yolo-world-with-efficientvit-sam
YOLO-World + EfficientViT SAM