ssssmark's Stars
SkyworkAI/MoH
MoH: Multi-Head Attention as Mixture-of-Head Attention
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
xgqdut2016/cuda_code
easy cuda code
HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
siboehm/SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
CalvinXKY/BasicCUDA
A tutorial for CUDA&PyTorch
FasterDecoding/TEAL
thunlp/MoEfication
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
gangweiX/Stoch-predict-with-Tranformer-LSTM
stock predict with MLP,CNN,RNN,LSTM,Transformer and Transformer-LSTM
THUDM/CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
CS-BAOYAN/CSYuTuiMian2024
2024年计算机保研预推免通知
modelscope/ms-swift
Use PEFT or Full-parameter to finetune 350+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
ssbuild/qwen_vl_finetuning
Lordog/dive-into-llms
《动手学大模型Dive into LLMs》系列编程实践教程
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
datawhalechina/daily-interview
Datawhale成员整理的面经,内容包括机器学习,CV,NLP,推荐,开发等,欢迎大家star
CyC2018/CS-Notes
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
tnfe/tntweb-admin
react admin management system template
modelscope/agentscope
Start building LLM-empowered multi-agent applications in an easier way.
yuweihao/MambaOut
MambaOut: Do We Really Need Mamba for Vision?
zuiidea/antd-admin
An excellent front-end solution for enterprise applications built upon Ant Design and UmiJS
InternLM/InternLM
Official release of InternLM2.5 base and chat models. 1M context support
TongjiFinLab/CFGPT
Chinese Financial Assistant with Large Language Model
AccumulateMore/CV
✔(已完结)最全面的 深度学习 笔记【土堆 Pytorch】【李沐 动手学深度学习】【吴恩达 深度学习】
redotvideo/mamba-chat
Mamba-Chat: A chat LLM based on the state-space model architecture 🐍
wdndev/llm_interview_note
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
Harhao/react-admin-system
基于React开发后台管理系统模板(Ant Design)
Oxen-AI/mamba-dive
This is the code that went into our practical dive using mamba as information extraction
state-spaces/mamba
Mamba SSM architecture