Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
ChatGPT-Next-Web
One-Click to deploy well-designed ChatGPT web UI on Vercel. 一键拥有你自己的 ChatGPT 网页服务。
course
移动应用软件开发课程后台框架
gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Labelme2YOLO
Help converting LabelMe Annotation Tool JSON format to YOLO text file format. If you've already marked your segmentation dataset by LabelMe, it's easy to use this tool to help converting to YOLO format dataset.
llm_kvcache_sparsity
Implement some method of LLM KV Cache Sparsity
MyWeChat
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
HarryWu99's Repositories
HarryWu99/llm_kvcache_sparsity
Implement some method of LLM KV Cache Sparsity
HarryWu99/ChatGPT-Next-Web
One-Click to deploy well-designed ChatGPT web UI on Vercel. 一键拥有你自己的 ChatGPT 网页服务。
HarryWu99/course
移动应用软件开发课程后台框架
HarryWu99/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
HarryWu99/Labelme2YOLO
Help converting LabelMe Annotation Tool JSON format to YOLO text file format. If you've already marked your segmentation dataset by LabelMe, it's easy to use this tool to help converting to YOLO format dataset.
HarryWu99/MyWeChat
HarryWu99/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs