GHGmc2's Stars
binary-husky/gpt_academic
Provides a practical interactive interface for LLMs such as GPT/GLM, with special optimization for paper reading, polishing, and writing. Modular design with support for custom shortcut buttons & function plugins, project analysis & self-translation for Python, C++, and other codebases, PDF/LaTeX paper translation & summarization, and parallel queries to multiple LLMs, including local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and more.
karpathy/LLM101n
LLM101n: Let's build a Storyteller
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
fchollet/ARC-AGI
The Abstraction and Reasoning Corpus
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
dabochen/spreadsheet-is-all-you-need
A nanoGPT pipeline packed in a spreadsheet
fengbintu/Neural-Networks-on-Silicon
This was originally a collection of papers on neural network accelerators. It has since become more of my selection of research on deep learning and computer architecture.
cambrian-mllm/cambrian
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLMs at various context lengths to measure accuracy
laekov/fastmoe
A fast MoE implementation for PyTorch
HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
datamllab/LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
hemingkx/SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
minyoungg/platonic-rep
swc-17/SparseDrive
SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
feifeibear/long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long-Context Transformer Training and Inference
Tim-Salzmann/l4casadi
Use PyTorch models with CasADi for data-driven optimization or learning-based optimal control. Supports Acados.
FlagOpen/FlagGems
FlagGems is an operator library for large language models implemented in the Triton language.
HKUNLP/ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
HorizonRobotics/Sparse4D
imbue-ai/cluster-health
HuaiyuanXu/3D-Occupancy-Perception
[Information Fusion 2024] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
google/aqt
bytedance/flux
A fast communication-overlapping library for tensor parallelism on GPUs.
Mellanox/nccl-rdma-sharp-plugins
RDMA and SHARP plugins for the NCCL library
google-deepmind/language_modeling_is_compression
fanshiqing/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
madsys-dev/deepseekv2-profile
feifeibear/Odysseus-Transformer
Odysseus: Playground of LLM Sequence Parallelism
SC-SGS/hardware_sampling
The Hardware Sampling (hws) library can be used to track hardware metrics such as clock frequency, memory usage, temperature, and power draw.