wangxidong06
Towards (Medical) LLMs’ interactivity
PhD @ The Chinese University of Hong Kong, Shenzhen; BA @ Beijing Institute of Technology; xidongwang1@link.cuhk.edu.cn
Pinned Repositories
Apollo
Multilingual Medicine: Model, Dataset, Benchmark, Code
CMB
CMB, A Comprehensive Medical Benchmark in Chinese
FastLLM
Fast LLM training codebase with dynamic strategy selection [DeepSpeed + Megatron + FlashAttention + CUDA fusion kernels + compiler]
Huatuo-26M
The largest-scale Chinese medical QA dataset, with 26,000,000 question-answer pairs.
LongLLaVA
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture
Medical_NLP
Medical NLP Competition, dataset, large models, paper
acl-2023
Repository for the ACL 2023 conference website
BLAS_testbench
Basic Linear Algebra Subprograms testbench
Notes-and-Assigns-for-CS224N
Homework and Notes of CS224N
Optimized-LLM.cpp
Optimized LLM.cpp code (llama.cpp, bloomz.cpp, whisper.cpp) with matrix multiplication implemented via BLIS
wangxidong06's Repositories
wangxidong06/Notes-and-Assigns-for-CS224N
Homework and Notes of CS224N
wangxidong06/BLAS_testbench
Basic Linear Algebra Subprograms testbench
wangxidong06/Optimized-LLM.cpp
Optimized LLM.cpp code (llama.cpp, bloomz.cpp, whisper.cpp) with matrix multiplication implemented via BLIS
wangxidong06/acl-2023
Repository for the ACL 2023 conference website
wangxidong06/DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
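The core idea of DoLa — contrasting next-token distributions from an early ("premature") layer and the final ("mature") layer — can be sketched in a few lines of Hugging Face code. This is an illustrative sketch only, not the paper's official implementation; the GPT-2 backbone and the fixed layer index are arbitrary choices for demonstration.

```python
# Illustrative sketch of DoLa-style layer contrast (not the official implementation).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")             # small backbone, just for the demo
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

inputs = tok("The capital of France is", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_hidden_states=True)

# Project a "premature" early layer and the "mature" final layer onto the vocabulary,
# then contrast the two log-distributions to down-weight tokens the early layer already prefers.
early_h = model.transformer.ln_f(out.hidden_states[6][:, -1])   # layer 6 is an arbitrary pick
final_h = out.hidden_states[-1][:, -1]                          # final layer, already normalized
contrast = model.lm_head(final_h).log_softmax(-1) - model.lm_head(early_h).log_softmax(-1)
print(tok.decode(contrast.argmax(-1)))
```

The full method also selects the premature layer dynamically and applies a plausibility constraint; both are omitted here for brevity.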
wangxidong06/emnlp-2023
Repository containing the website for the EMNLP 2023 conference
wangxidong06/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
wangxidong06/Firefly
Firefly (流萤): a Chinese conversational large language model (full-parameter fine-tuning + QLoRA), supporting fine-tuning of Baichuan2, CodeLlama, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya, Bloom, and other large models
wangxidong06/flash-attention
Fast and memory-efficient exact attention
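For reference, a minimal usage sketch of the upstream flash_attn package is shown below (assuming a CUDA GPU and half-precision tensors); this is generic library usage, not code specific to this fork.

```python
# Minimal flash_attn usage sketch (requires a CUDA GPU; fp16 or bf16 inputs).
import torch
from flash_attn import flash_attn_func

# q, k, v have shape (batch, seqlen, num_heads, head_dim)
q = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
k = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")
v = torch.randn(2, 1024, 8, 64, dtype=torch.float16, device="cuda")

out = flash_attn_func(q, k, v, causal=True)   # exact attention without materializing the N x N matrix
print(out.shape)                              # torch.Size([2, 1024, 8, 64])
```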
wangxidong06/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
wangxidong06/llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
wangxidong06/llama.cpp
Port of Facebook's LLaMA model in C/C++
wangxidong06/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
wangxidong06/LLMSFT_template
Scripts and code for various SFT acceleration frameworks
wangxidong06/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
wangxidong06/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
wangxidong06/neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
wangxidong06/OpenAIAPI
Use the OpenAI API stably and quickly
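"Stably" usually comes down to retrying transient failures; a minimal sketch with the current openai Python SDK is below. The wrapper name, model name, and retry policy are illustrative assumptions, not this repository's actual interface.

```python
# Hedged sketch: call the OpenAI API with exponential-backoff retries.
# The wrapper name, model, and retry policy are illustrative, not this repo's API.
import time
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def chat_with_retry(messages, model="gpt-4o-mini", max_retries=5):
    for attempt in range(max_retries):
        try:
            resp = client.chat.completions.create(model=model, messages=messages)
            return resp.choices[0].message.content
        except Exception:                     # in practice, catch openai.RateLimitError / APIError
            if attempt == max_retries - 1:
                raise
            time.sleep(2 ** attempt)          # back off: 1s, 2s, 4s, ...

print(chat_with_retry([{"role": "user", "content": "Say hello."}]))
```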
wangxidong06/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (LLaMA, LLaMA2, ChatGLM2, ChatGPT, Claude, etc.) over 50+ datasets.
wangxidong06/OpenRLHF
A Ray-based high-performance RLHF framework (7B on an RTX 4090, 34B on an A100)
wangxidong06/PromethAI-Memory
Memory management for AI applications and AI agents
wangxidong06/TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
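As a reference point, building an FP16 engine from an ONNX model with the TensorRT Python API looks roughly like the sketch below (file names are placeholders; API as in TensorRT 8.x).

```python
# Rough sketch: ONNX -> serialized TensorRT engine with FP16 enabled (TensorRT 8.x Python API).
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

with open("model.onnx", "rb") as f:            # placeholder path
    if not parser.parse(f.read()):
        raise RuntimeError(parser.get_error(0))

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)          # enable half-precision kernels
engine_bytes = builder.build_serialized_network(network, config)

with open("model.engine", "wb") as f:          # placeholder output path
    f.write(engine_bytes)
```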
wangxidong06/TinyLlama
wangxidong06/torchtitan
A native PyTorch Library for large model training
wangxidong06/UltraFastBERT
The repository for the code of the UltraFastBERT paper