Pinned Repositories
aaa
abc
alpaca-lora
Instruct-tune LLaMA on consumer hardware
d2l-zh
Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open for discussion. The Chinese and English editions are used for teaching at more than 400 universities in over 60 countries.
gitdemo
1
lm-evaluation-harness
A framework for few-shot evaluation of language models.
mergekit
Tools for merging pretrained large language models.
MergeLM
Codebase for Merging Language Models
ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.
pr
sasgkhgw's Repositories
sasgkhgw/aaa
sasgkhgw/abc
sasgkhgw/alpaca-lora
Instruct-tune LLaMA on consumer hardware
sasgkhgw/d2l-zh
Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open for discussion. The Chinese and English editions are used for teaching at more than 400 universities in over 60 countries.
sasgkhgw/gitdemo
1
sasgkhgw/lm-evaluation-harness
A framework for few-shot evaluation of language models.
sasgkhgw/mergekit
Tools for merging pretrained large language models.
sasgkhgw/MergeLM
Codebase for Merging Language Models
sasgkhgw/ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.
sasgkhgw/pr
sasgkhgw/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
sasgkhgw/sparsegpt
Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
sasgkhgw/stanford_alpaca
Code and documentation to train Stanford's Alpaca models and generate the data.
sasgkhgw/TransportationNetworks
Transportation Networks for Research
sasgkhgw/wanda
A simple and effective LLM pruning approach.