TearGosling

Pinned Repositories

PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
Language:Python31
FLAN
Language:Python10
LaMDA-pytorch
Open-source pre-training implementation of Google's LaMDA in PyTorch.
Language:Python12
OGEYRRAT
Open-source Gloriously Extensive Yaml-configuration Repository for Reimplementing Architectures of Transformers
Language:Python10
augmenting-conv-data
Just testing
Language:Python00
axolotl
Go ahead and axolotl questions
Language:Python00
convogpt
Conversational Language model toolkit for training against human preferences.
Language:Python00
DeepSeek-MoE
Language:Python00
dreamer-for-text
An experiment
00
json2binidx_tool
Language:Python00

TearGosling's Repositories

TearGosling/OWARIDA
Open reWritten, Augmented and (sometimes) Reversed Instruction Data for All
Language:Python
TearGosling/augmenting-conv-data
Just testing
Language:Python
TearGosling/dreamer-for-text
An experiment
TearGosling/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language:C
TearGosling/nanoGPT_aru
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python
TearGosling/DeepSeek-MoE
TearGosling/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:Python
TearGosling/RWKV-infctx-trainer
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!
Language:Jupyter Notebook
TearGosling/learnable-merging
TearGosling/LyCORIS
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
Language:Python
TearGosling/axolotl
Go ahead and axolotl questions
Language:Python
TearGosling/json2binidx_tool
TearGosling/mezo-pretraining-test
Language:Python
TearGosling/MeetInTheMiddle-Finetuning
Meet-in-the-Middle training method, but for fine-tuning
TearGosling/OGEYRRAT
Open-source Gloriously Extensive Yaml-configuration Repository for Reimplementing Architectures of Transformers
Language:Python1
TearGosling/U-ViT_JEPA
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
TearGosling/FLAN
Language:Python1
TearGosling/VNDataConverter
Grabbing parsed VNs of many different formats and converting it into a universal format into a universal
TearGosling/convogpt
Conversational Language model toolkit for training against human preferences.
Language:Python
TearGosling/LaMDA-pytorch
Open-source pre-training implementation of Google's LaMDA in PyTorch.
Language:Python12
TearGosling/smoothquant
SHORKIN ALL THE BUGSSSS AWAY
Language:Python
TearGosling/torch-int
AGHHHHH
Language:Python
TearGosling/PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
Language:Python31