Pinned Repositories
3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
ACM-ICPC-Preparation
ACM-ICPC Preparation Guide
ailab
Experience, Learn and Code the latest breakthrough innovations with Microsoft AI
ALAE
[CVPR2020] Adversarial Latent Autoencoders
alpaca-lora
Instruct-tune LLaMA on consumer hardware
BERT-Relation-Extraction
PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper
EfficientNets-PyTorch
A PyTorch implementation of " EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks."
fastai
The fastai deep learning library, plus lessons and tutorials
FlexGen
Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput generation.
gnn_tutorial
xiongjun19's Repositories
xiongjun19/FlexGen
Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput generation.
xiongjun19/alpaca-lora
Instruct-tune LLaMA on consumer hardware
xiongjun19/EfficientNets-PyTorch
A PyTorch implementation of " EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks."
xiongjun19/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
xiongjun19/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
xiongjun19/ColossalAI
Making large AI models cheaper, faster and more accessible
xiongjun19/common_tools
一些通用的处理函数
xiongjun19/DeepSpeedExamples
Example models using DeepSpeed
xiongjun19/docling
Get your documents ready for gen AI
xiongjun19/FasterTransformer
Transformer related optimization, including BERT, GPT
xiongjun19/fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
xiongjun19/grok-1
Grok open release
xiongjun19/hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
xiongjun19/InstructGLM
ChatGLM-6B 指令学习|指令数据|Instruct
xiongjun19/markitdown
Python tool for converting files and office documents to Markdown.
xiongjun19/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
xiongjun19/mermaid
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
xiongjun19/MOSS
An open-source tool-augmented conversational language model from Fudan University
xiongjun19/profile-data
xiongjun19/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
xiongjun19/RL4LMs
A modular RL library to fine-tune language models to human preferences
xiongjun19/RRHF
RRHF & Wombat
xiongjun19/TencentPretrain
Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo
xiongjun19/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
xiongjun19/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
xiongjun19/train_test
to add some scripts for test training problem
xiongjun19/training
Reference implementations of MLPerf™ training benchmarks
xiongjun19/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
xiongjun19/verl
veRL: Volcano Engine Reinforcement Learning for LLM
xiongjun19/vision
Datasets, Transforms and Models specific to Computer Vision