Pinned Repositories
accv
AlphaPose
Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
AlphaPose_TRT
TensorRT acceleration based on AlphaPose
Amodal3Det
awesome-model-compression-and-acceleration
a list of awesome papers on deep model compression and acceleration
caffe
Caffe: a fast open framework for deep learning.
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
flash-attention-numpy
Reimplementation of the flash attention algorithm using NumPy
pointcloud
unet-tensorrt
oreo-lp's Repositories
oreo-lp/flash-attention-numpy
Reimplementation of the flash attention algorithm using NumPy
oreo-lp/caffe
Caffe: a fast open framework for deep learning.
oreo-lp/ControlNet4TRT
Use TensorRT to accelerate ControlNet
oreo-lp/CThreadPool
A simple, easy-to-use, high-performance, cross-platform C++ thread pool. Stars and forks are welcome.
oreo-lp/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
oreo-lp/DeepFaceLive
Real-time face swap for PC streaming or video calls
oreo-lp/FasterTransformer
Transformer related optimization, including BERT, GPT
oreo-lp/how-to-optim-algorithm-in-cuda
How to optimize some algorithms in CUDA.
oreo-lp/InferLight
lightweight deep learning inference service framework
oreo-lp/InferLLM
a lightweight LLM model inference framework
oreo-lp/json
JSON for Modern C++
oreo-lp/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
oreo-lp/llama-gpt
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device.
oreo-lp/llama.cpp
Port of Facebook's LLaMA model in C/C++
oreo-lp/llm.c
LLM training in simple, raw C/CUDA
oreo-lp/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
oreo-lp/Megatron-LM_
Ongoing research training transformer models at scale
oreo-lp/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
oreo-lp/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
oreo-lp/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
oreo-lp/pytorch-deep-learning
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
oreo-lp/pytriton_fastapi
oreo-lp/qwen-vllm
Demo of Tongyi Qianwen (Qwen) inference deployment with vLLM
oreo-lp/rag-from-scratch
oreo-lp/RenderPy
oreo-lp/stable-diffusion.cpp
Stable Diffusion in pure C/C++
oreo-lp/transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
oreo-lp/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
oreo-lp/whisper.cpp
Port of OpenAI's Whisper model in C/C++
oreo-lp/yolov5
YOLOv5 in PyTorch > ONNX > CoreML > TFLite