Pinned Repositories
ATOMIC_ROBERTA
A repo for training a Roberta language model base architecture, atomically sized because the models have sub 10million parameters.
DatasetsMergeSplit
Some wrappers on Huggingface for complex dataset splitting and merging operations.
gantrithor_installer
This repo is for constructing installers for the gantrithor
HonorsThesis
sentence_encoder_distillation
This is a script for distilling the following model = SentenceTransformer("all-MiniLM-L6-v2") into smaller dimension models.
SpanMarkerNER_HF
SpanMarker for Named Entity Recognition, HuggingFace version
dorenwick's Repositories
dorenwick/ATOMIC_ROBERTA
A repo for training a Roberta language model base architecture, atomically sized because the models have sub 10million parameters.
dorenwick/DatasetsMergeSplit
Some wrappers on Huggingface for complex dataset splitting and merging operations.
dorenwick/HonorsThesis
dorenwick/sentence_encoder_distillation
This is a script for distilling the following model = SentenceTransformer("all-MiniLM-L6-v2") into smaller dimension models.
dorenwick/SpanMarkerNER_HF
SpanMarker for Named Entity Recognition, HuggingFace version
dorenwick/gantrithor_installer
This repo is for constructing installers for the gantrithor