Pinned Repositories
aurora
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
awesome-cheatsheets
👩‍💻👨‍💻 Awesome cheatsheets for popular programming languages, frameworks and development tools. They include everything you should know in one single file.
blislab
BLISlab: A Sandbox for Optimizing GEMM
ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app in one click.
Codeforces
AC
cs143
CS143 learning and assignments
gemini2openai
This project converts the Gemini Embedding API into a format compatible with OpenAI’s API and deploys it on Cloudflare, enabling free and seamless integration and usage with the OpenAI Python library.
mlx-examples
Examples in the MLX framework
qwen-fast
DongqiShen's Repositories
DongqiShen/qwen-fast
DongqiShen/gemini2openai
This project converts the Gemini Embedding API into a format compatible with OpenAI’s API and deploys it on Cloudflare, enabling free and seamless integration and usage with the OpenAI Python library.
DongqiShen/aurora
DongqiShen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
DongqiShen/awesome-cheatsheets
👩‍💻👨‍💻 Awesome cheatsheets for popular programming languages, frameworks and development tools. They include everything you should know in one single file.
DongqiShen/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app in one click.
DongqiShen/cs143
CS143 learning and assignments
DongqiShen/dongqishen.github.io
Dongqi's Leisure Time
DongqiShen/dongqishen.github.io_2
Dongqi's leisure time.
DongqiShen/mlx-examples
Examples in the MLX framework
DongqiShen/edgetunnel
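Modified from the original to display VLESS configuration information converted into subscription content. With this script, you can easily use your VLESS configuration in tools such as Clash or Singbox through online configuration conversion.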
DongqiShen/fastllm
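A pure C++, cross-platform LLM acceleration library that can be called from Python. ChatGLM-6B-class models can reach 10000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models, and runs smoothly on mobile devices.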
DongqiShen/FlashAttention-PyTorch
Implementation of FlashAttention in PyTorch
DongqiShen/ggml
Tensor library for machine learning
DongqiShen/gptpdf
Using GPT to parse PDF
DongqiShen/gradio
Create UIs for your machine learning model in Python in 3 minutes
DongqiShen/igemm
igemm tutorial
DongqiShen/iLLM
Implementing LLM from scratch. (Developing...)
DongqiShen/ips
Preferred IPs
DongqiShen/llama.cpp
Port of Facebook's LLaMA model in C/C++
DongqiShen/llm.c
LLM training in simple, raw C/CUDA
DongqiShen/llm_interview_note
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
DongqiShen/llvm-tutor
A collection of out-of-tree LLVM passes for teaching and learning
DongqiShen/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
DongqiShen/Notes
Notes from my study
DongqiShen/one-api
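OpenAI API management and distribution system, using a single API for all LLMs. Supports Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot, iFLYTEK Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, deploys in one click, works out of the box, and features an English UI.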
DongqiShen/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
DongqiShen/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
DongqiShen/tvm-cn
TVM documentation in Simplified Chinese
DongqiShen/WorkerVless2sub
A subscription generator that pairs Cloudflare Workers - VLESS with self-hosted preferred domains