Pinned Repositories
alpa
Training and serving large-scale neural networks with auto parallelization.
MInference
[NeurIPS'24 Spotlight] Speeds up long-context LLM inference by computing attention with approximate, dynamic sparsity, reducing pre-filling latency by up to 10x on an A100 while maintaining accuracy.
gateway
A blazing-fast AI gateway with integrated guardrails. Route to 200+ LLMs and 50+ AI guardrails through one fast, friendly API.
stylellm_models
StyleLLM writing-style models: text style transfer based on large language models. #text-embellishment #polishing #style-imitation
alpa
Auto parallelization for large-scale neural networks
fastT5
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with a ChatGPT-style Training Pipeline. Trains medical LLMs, implementing continued pretraining, supervised fine-tuning, RLHF (reward modeling and reinforcement-learning training), and DPO (direct preference optimization).
wgimperial's Repositories
wgimperial/alpa
Auto parallelization for large-scale neural networks
wgimperial/fastT5
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
wgimperial/MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with a ChatGPT-style Training Pipeline. Trains medical LLMs, implementing continued pretraining, supervised fine-tuning, RLHF (reward modeling and reinforcement-learning training), and DPO (direct preference optimization).