smallflyingpig
B.S of Huazhong University of Science and Technology (HUST). Ph.D of University of Chinese Academy of Science (UCAS).
Pinned Repositories
Generative_Adversarial_Networks_PyTorch
(PyTorch) Implementations of GAN, Improved GAN, DCGAN, LAPGAN, and InfoGAN in PyTorch
learning-to-fool-the-speaker-recognition
code for paper "learning to fool the speaker recognition"
projects
some projects for course
pytorch_examples
some examples by pytorch
pytorch_video_caption
some models for video caption implemented by pytorch. (S2VT)
SoundNet_Pytorch
converting the pretrained tensorflow SoundNet model to pytorch
speech-to-image-translation-without-text
Code for paper "direct speech-to-image translation"
Surround360
Surround360 is Facebook's open source hardware and software for capturing stereoscopic 3D 360 video for VR. The repo contains hardware designs, as well as software for camera control and rendering.
universal_adversarial_perturbation_generative_network_for_speaker_recognition
code for paper "Universal Adversarial Perturbations Generative Network for Speaker Recognition"
xavs2
xavs2 is an open-source encoder of Chinese AVS2 video coding standard.
smallflyingpig's Repositories
smallflyingpig/awesome-generative-compression
A collection of resources and papers on generative compression (including neural image/video/audio compression, generation-based image/video/audio compression)
smallflyingpig/smallflyingpig.github.io
smallflyingpig/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
smallflyingpig/BMPrinciples
A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future
smallflyingpig/CompressAI
A PyTorch library and evaluation platform for end-to-end compression research
smallflyingpig/ControlNet
Let us control diffusion models
smallflyingpig/cross_modal_compression_mainpage
project mainpage for ACM MM 2021 paper: Cross Modal Compresion: Towards Human-comprehensive Semantic Compression
smallflyingpig/data_processing_MNBVC
data processing for MNBVC
smallflyingpig/deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
smallflyingpig/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
smallflyingpig/DynaSLAM
DynaSLAM is a SLAM system robust in dynamic environments for monocular, stereo and RGB-D setups
smallflyingpig/evo
Python package for the evaluation of odometry and SLAM
smallflyingpig/fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
smallflyingpig/gpt4free
decentralising the Ai Industry, just some language model api's...
smallflyingpig/HuggingFaceModelDownloader
Simple go utility to download HuggingFace Models and Datasets
smallflyingpig/langchain
⚡ Building applications with LLMs through composability ⚡
smallflyingpig/langchain-ChatGLM
langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识库的 ChatGLM 问答
smallflyingpig/LibreChat
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
smallflyingpig/LibtorchSegmentation
A c++ trainable semantic segmentation library based on libtorch (pytorch c++). Backbone: VGG, ResNet, ResNext. Architecture: FPN, U-Net, PAN, LinkNet, PSPNet, DeepLab-V3, DeepLab-V3+ by now.
smallflyingpig/Loc-NeRF
Monte Carlo Localization using Neural Radiance Fields
smallflyingpig/Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
smallflyingpig/ORB_SLAM2
Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities
smallflyingpig/ORB_SLAM3
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM
smallflyingpig/pytorch-cpp
C++ Implementation of PyTorch Tutorials for Everyone
smallflyingpig/resume
个人中文简历 Latex 源码 https://hijiangtao.github.io/
smallflyingpig/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
smallflyingpig/STF
Pytorch implementation of the paper "The Devil Is in the Details: Window-based Attention for Image Compression".
smallflyingpig/TTUR
Two time-scale update rule for training GANs
smallflyingpig/video_stitching_mainpage
smallflyingpig/VQ-Diffusion
Official implementation of VQ-Diffusion