Pinned Repositories
bloom_lora_ja
dedup_sentence
lit-llama-ja
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.
makeSentens
quote_test
RWKV-LM-LoRA-ja
if001's Repositories
if001/dedup_sentence
if001/lit-llama-ja
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.
if001/alpaca-lora
Instruct-tuning LLaMA on consumer hardware.
if001/arXiv_translate
if001/book_impressions
if001/chat_face
if001/FLAN
if001/gpt-neox
An implementation of model-parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
if001/grok
if001/harbor
Simple and minimal personal blog theme.
if001/HojiChar_OSCAR_sample
if001/if-blog-hugo
if001/litgpt
Pretrain, finetune, and deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit quantization, LoRA, and more.
if001/llm-jp-eval
if001/llm_bunpo
if001/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
if001/matmulfreellm
Implementation for MatMul-free LM.
if001/Megatron-DeepSpeed
Ongoing research on training transformer language models at scale, including BERT and GPT-2.
if001/mergekit
Tools for merging pretrained large language models.
if001/mergoo
A library for easily merging multiple LLM experts and efficiently training the merged LLM.
if001/Pai-Megatron-Patch
The official repository of Pai-Megatron-Patch for large-scale LLM & VLM training, developed by Alibaba Cloud.
if001/qwen2_sft
if001/rinna_3b_instructions
if001/rinna_4b_multi_instructions
if001/spm_tokenizer_ja
if001/style-bert-vits_sample
if001/ucllm_nedo_prod
if001/wanda
A simple and effective LLM pruning approach.
if001/wiki_analysis
if001/wiki_classification