Pinned Repositories
Automatic-Focusing
AF experiments on ROI parameters, peak search algorithms, contrast measure operators, and efficiency in various scenarios.
caffe-int8-convert-tools
Generate a quantization parameter file for ncnn framework int8 inference
crnn_plate_wave
GameAISDK
Image-based game AI automation framework
KCF_detail
🚀 A way to learn KCF's principles and implement it on TI's C6678 DSP. 😃
kill-the-bits
Code for: "And the bit goes down: Revisiting the quantization of neural networks"
rknn-v5
SiamTrackers
(2020) The PyTorch version of Siamese, SiamFC, SiamRPN, DaSiamRPN, UpdateNet, SiamDW, SiamRPN++, SiamMask, and SiamFC++; visual object tracking based on deep learning
yolov10-rk
YOLOv10: Real-Time End-to-End Object Detection
yolov5-car-ShuffleV2
wavelet2008's Repositories
wavelet2008/yolov10-rk
YOLOv10: Real-Time End-to-End Object Detection
wavelet2008/011_Hisi3516AV200_source_code
wavelet2008/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)
wavelet2008/DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
wavelet2008/finetune-Qwen2-VL
finetune-Qwen2-VL 2b/7b and LoRA
wavelet2008/florence2-finetuning
Quick exploration into fine-tuning Florence-2
wavelet2008/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
wavelet2008/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
wavelet2008/GOT-OCR-Inference
Research on accelerating real-world deployment of the GOT-OCR project, with no restriction on language
wavelet2008/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
wavelet2008/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
wavelet2008/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
wavelet2008/LLaVA-JP
LLaVA-JP is a Japanese VLM trained with the LLaVA method
wavelet2008/MinerU
A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
wavelet2008/MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
wavelet2008/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
wavelet2008/new-pac
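Bypassing the firewall: censorship circumvention, free internet access, free circumvention tools, YouTube, fanqiang, software, VPN, one-click circumvention browser, one-click VPS scripts/tutorials for setting up a circumvention server, free shadowsocks/ss/ssr/v2ray/goflyway accounts/nodes, circumvention proxies, circumvention for computers, phones, iOS, Android, Windows, Mac, Linux, and routers, YouTube video download, shared US-region Apple ID accounts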
wavelet2008/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
wavelet2008/OWLVIT-RKNN
wavelet2008/PDF-Extract-Kit
A Comprehensive Toolkit for High-Quality PDF Content Extraction
wavelet2008/prompt-lookup-decoding
wavelet2008/Qwen-Audio
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
wavelet2008/sdxl-lightning-demo-app
A demo application using fal.realtime and the lightning fast SDXL API provided by fal
wavelet2008/Segment-Everything-Everywhere-All-At-Once
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
wavelet2008/T-Rex
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
wavelet2008/VideoLLaMA2
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
wavelet2008/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
wavelet2008/VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.
wavelet2008/YOLO-World
Real-Time Open-Vocabulary Object Detection
wavelet2008/yolov5-dual
A model that achieves dual detection (Infrared + RGB) with rotation