Pinned Repositories
Adversarial-Large-Character-Set-CAPTCHA-Generation
alpaca-lora
Instruct-tune LLaMA on consumer hardware
bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
caselab
demix
DEMix Layers for Modular Language Modeling
diffusion-lora
easy-llm-finetuner
Easy Environment Configuration for LLM Finetuning
falcontune
Tune any Falcon model in 4-bit
lm-evaluation-harness
A framework for few-shot evaluation of language models.
MedVicuna
s1ghhh's Repositories
s1ghhh/MedVicuna
s1ghhh/lm-evaluation-harness
A framework for few-shot evaluation of language models.
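For context, a minimal sketch of driving the harness programmatically, assuming the simple_evaluate entry point from recent (0.4.x) releases; the model and task names here are placeholders, not the repo's defaults:

import lm_eval

# Evaluate a small Hugging Face model on one task with 5-shot prompting.
results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=EleutherAI/pythia-160m",
    tasks=["hellaswag"],
    num_fewshot=5,
)
print(results["results"])  # per-task metric dictionary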
s1ghhh/Adversarial-Large-Character-Set-CAPTCHA-Generation
s1ghhh/alpaca-lora
Instruct-tune LLaMA on consumer hardware
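A minimal LoRA instruct-tuning sketch in the spirit of this repo, using Hugging Face peft; the model name and hyperparameters are illustrative, not the repo's exact configuration:

import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Load the frozen base model in half precision.
base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", torch_dtype=torch.float16, device_map="auto"
)
config = LoraConfig(
    r=8,                                  # low-rank adapter dimension
    lora_alpha=16,                        # adapter scaling factor
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # only the small adapter matrices train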
s1ghhh/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
s1ghhh/caselab
s1ghhh/demix
DEMix Layers for Modular Language Modeling
s1ghhh/diffusion-lora
s1ghhh/easy-llm-finetuner
Easy Environment Configuration for LLM Finetuning
s1ghhh/falcontune
Tune any Falcon model in 4-bit
s1ghhh/FasterTransformer
Transformer-related optimization, including BERT and GPT
s1ghhh/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Supports LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
s1ghhh/m-bbox
s1ghhh/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
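A sketch of the QLoRA loading step, assuming the transformers/bitsandbytes integration; the model name is a placeholder:

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",           # 4-bit NormalFloat from the paper
    bnb_4bit_use_double_quant=True,      # also quantize the quantization constants
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", quantization_config=bnb, device_map="auto"
)
# LoRA adapters (as in alpaca-lora above) are then attached to the frozen 4-bit weights.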
s1ghhh/s1ghhh.github.io
GitHub Pages
s1ghhh/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
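The library's one-line entry point, for reference (the model name is a placeholder):

from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")
print(generator("Hello, I'm a language model,", max_new_tokens=20)[0]["generated_text"])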
s1ghhh/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
s1ghhh/LLM-Drop
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
s1ghhh/mergekit
Tools for merging pretrained large language models.
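For intuition only, a naive linear merge of two finetunes of the same base architecture; this is not mergekit's API, and the model names are placeholders (mergekit itself implements richer methods such as SLERP and TIES):

import torch
from transformers import AutoModelForCausalLM

a = AutoModelForCausalLM.from_pretrained("finetune-a")
b = AutoModelForCausalLM.from_pretrained("finetune-b")

# Interpolate every parameter 50/50; both models must share an architecture.
merged = a.state_dict()
for name, param_b in b.state_dict().items():
    merged[name] = 0.5 * merged[name] + 0.5 * param_b
a.load_state_dict(merged)
a.save_pretrained("finetune-merged")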
s1ghhh/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
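A toy sketch of one representation-reading idea from this line of work: estimate a concept direction as the difference of mean hidden states over contrasting prompts. The model, layer, and prompts are illustrative; the paper's pipeline is considerably richer (e.g. PCA over many contrast pairs):

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2", output_hidden_states=True)

def mean_last_token_state(prompts, layer=6):
    states = []
    with torch.no_grad():
        for p in prompts:
            out = model(**tok(p, return_tensors="pt"))
            states.append(out.hidden_states[layer][0, -1])  # last-token hidden state
    return torch.stack(states).mean(0)

honest = ["Pretend you are an honest person describing your day."]
dishonest = ["Pretend you are a dishonest person describing your day."]
direction = mean_last_token_state(honest) - mean_last_token_state(dishonest)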
s1ghhh/SVD-LLM
Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"
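A plain (not truncation-aware) SVD baseline for compressing one linear layer, to show the operation the paper refines; everything here is a generic sketch:

import torch

def low_rank_factor(linear: torch.nn.Linear, r: int) -> torch.nn.Sequential:
    # Factor W (out x in) into A (out x r) @ B (r x in), keeping the top-r
    # singular values, then rebuild the layer as two smaller Linears.
    W = linear.weight.data
    U, S, Vh = torch.linalg.svd(W, full_matrices=False)
    A = (U[:, :r] * S[:r]).contiguous()
    B = Vh[:r].contiguous()
    down = torch.nn.Linear(W.shape[1], r, bias=False)
    up = torch.nn.Linear(r, W.shape[0], bias=linear.bias is not None)
    down.weight.data = B
    up.weight.data = A
    if linear.bias is not None:
        up.bias.data = linear.bias.data
    return torch.nn.Sequential(down, up)  # drop-in replacement for the layer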
s1ghhh/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
s1ghhh/wanda
A simple and effective LLM pruning approach.
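The core scoring rule is compact enough to sketch: each weight is ranked by its magnitude times the L2 norm of the matching input feature, and the lowest-scoring weights in each output row are zeroed. The calibration activations X are a placeholder; this is a simplified reading of the method, not the repo's code:

import torch

def wanda_prune(weight: torch.Tensor, X: torch.Tensor, sparsity: float = 0.5) -> torch.Tensor:
    # weight: (out_features, in_features); X: (num_tokens, in_features)
    score = weight.abs() * X.norm(p=2, dim=0)            # S_ij = |W_ij| * ||X_j||_2
    k = int(weight.shape[1] * sparsity)
    idx = torch.topk(score, k, dim=1, largest=False).indices
    mask = torch.ones_like(weight, dtype=torch.bool)
    mask.scatter_(1, idx, False)                         # drop the k weakest per row
    return weight * mask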