Pinned Repositories
gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
stackexchange-dataset
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
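The StackExchange dumps ship as per-site XML archives whose Posts.xml holds one `<row>` element per post. A minimal sketch of streaming such a dump into plain text is below; the file names and question-only filtering are illustrative assumptions, not the repository's actual interface.

```python
# A minimal sketch of turning a StackExchange Posts.xml dump into plain text.
# "Posts.xml" and the qa_pairs.txt output path are illustrative placeholders.
import html
import re
import xml.etree.ElementTree as ET

TAG_RE = re.compile(r"<[^>]+>")  # crude HTML tag stripper

def clean(body: str) -> str:
    """Unescape entities and drop markup from a post body."""
    return TAG_RE.sub("", html.unescape(body)).strip()

def iter_posts(path: str):
    """Stream <row> elements without loading the whole file into memory."""
    for _, elem in ET.iterparse(path, events=("end",)):
        if elem.tag == "row":
            yield elem.attrib
            elem.clear()  # free parsed elements as we go

with open("qa_pairs.txt", "w", encoding="utf-8") as out:
    for post in iter_posts("Posts.xml"):
        if post.get("PostTypeId") == "1":  # "1" marks a question post
            out.write(f"Q: {post.get('Title', '')}\n{clean(post.get('Body', ''))}\n\n")
```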
the-pile
An 800GB dataset of diverse text for language modelling
lm_dataloader
Dataloader tools for language modelling
Opensubtitles_dataset
Downloads and parses the subtitle dataset from opensubtitles.org
PDFextract
Extracts text from PDFs using pdfminer.six and PyPDF2
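For the extraction step itself, pdfminer.six exposes a one-call high-level API. A rough sketch of a batch conversion loop follows; the directory layout is an assumption for illustration, not this repo's CLI.

```python
# A minimal sketch of pulling plain text out of PDFs with pdfminer.six;
# the "pdfs" input and "out" output directories are placeholder assumptions.
from pathlib import Path

from pdfminer.high_level import extract_text

Path("out").mkdir(exist_ok=True)
for pdf_path in Path("pdfs").glob("*.pdf"):
    try:
        text = extract_text(str(pdf_path))
    except Exception as exc:  # malformed PDFs are common in bulk scrapes
        print(f"skipping {pdf_path.name}: {exc}")
        continue
    Path("out", pdf_path.stem + ".txt").write_text(text, encoding="utf-8")
```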
stylegan2
StyleGAN2 - Official TensorFlow Implementation
youtube_subtitle_dataset
YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training
sdtblck's Repositories
sdtblck/youtube_subtitle_dataset
YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training
sdtblck/Opensubtitles_dataset
Downloads and parses the subtitle dataset from opensubtitles.org
sdtblck/stylegan2
StyleGAN2 - Official TensorFlow Implementation
sdtblck/PDFextract
Extracts text from PDFs using pdfminer.six and PyPDF2
sdtblck/lm_dataloader
Dataloader tools for language modelling
sdtblck/image-dl
A fast and simple image downloader in Python
sdtblck/pbar-pool
A straightforward, dependency-free way to update multiple progress bars with Python's multiprocessing library.
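One dependency-free way to do this, sketched below purely as an illustration of the general idea rather than the pbar-pool API, is to have each worker report progress over a Queue while the parent redraws one line per bar using ANSI cursor movement.

```python
# A hedged sketch of driving several progress bars from multiprocessing
# workers with no third-party dependencies; not the pbar-pool interface.
import multiprocessing as mp
import sys
import time

def worker(task_id: int, queue: mp.Queue) -> None:
    """Simulate work, reporting fractional progress back to the parent."""
    for step in range(1, 101):
        time.sleep(0.01)
        queue.put((task_id, step / 100))

def render(progress: list) -> None:
    """Redraw one bar per worker, then move the cursor back up."""
    for i, frac in enumerate(progress):
        filled = int(frac * 30)
        sys.stdout.write(f"task {i}: [{'#' * filled}{'.' * (30 - filled)}] {frac:4.0%}\n")
    sys.stdout.write(f"\x1b[{len(progress)}A")  # ANSI: cursor up N lines
    sys.stdout.flush()

if __name__ == "__main__":
    queue = mp.Queue()
    procs = [mp.Process(target=worker, args=(i, queue)) for i in range(4)]
    for p in procs:
        p.start()
    progress = [0.0] * len(procs)
    while any(f < 1.0 for f in progress):
        task_id, frac = queue.get()
        progress[task_id] = frac
        render(progress)
    sys.stdout.write("\n" * len(progress))  # leave the finished bars on screen
    for p in procs:
        p.join()
```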
sdtblck/tputils
Utilities for TPUs
sdtblck/benchmarking
Tools for benchmarking clusters
sdtblck/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
sdtblck/example-mkdocs-basic
A basic MkDocs project for Read the Docs
sdtblck/example-sphinx-basic
A basic Sphinx project for Read the Docs
sdtblck/fish
An independent replication of `Training Neural Networks with Fixed Sparse Masks` by Sung et al.
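The paper's core idea, fixing a sparse update mask chosen by empirical Fisher information, can be sketched roughly as follows; the model, data loader, and keep ratio below are placeholders, not anything taken from this replication.

```python
# A rough sketch of the FISH-mask idea from Sung et al.: score parameters by
# empirical Fisher information (squared gradients) and only update the top-k.
# Model, loader, and the 0.5% keep ratio are placeholder assumptions.
import torch

def fisher_mask(model, loader, loss_fn, keep_ratio=0.005, num_batches=16):
    """Return a {name: bool tensor} mask selecting high-Fisher parameters."""
    scores = {n: torch.zeros_like(p) for n, p in model.named_parameters()}
    for i, (x, y) in enumerate(loader):
        if i >= num_batches:
            break
        model.zero_grad()
        loss_fn(model(x), y).backward()
        for n, p in model.named_parameters():
            if p.grad is not None:
                scores[n] += p.grad.detach() ** 2  # empirical Fisher estimate
    flat = torch.cat([s.flatten() for s in scores.values()])
    k = max(1, int(keep_ratio * flat.numel()))
    threshold = flat.topk(k).values.min()  # global top-k cutoff
    return {n: s >= threshold for n, s in scores.items()}

def apply_mask_to_grads(model, mask):
    """Zero out gradients outside the fixed sparse mask before optimizer.step()."""
    for n, p in model.named_parameters():
        if p.grad is not None:
            p.grad.mul_(mask[n].to(p.grad.dtype))
```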
sdtblck/flash-attention
Fast and memory-efficient exact attention
sdtblck/guesslang
Detects the programming language of source code
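Usage of the upstream library comes down to a single call on its Guess class; a minimal sketch, where the snippet being classified is just an example input:

```python
# A short usage sketch of the upstream guesslang API (Guess.language_name);
# the snippet being classified is only an example input.
from guesslang import Guess

guess = Guess()
snippet = "def greet(name):\n    return f'hello {name}'"
print(guess.language_name(snippet))  # expected to print "Python"
```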
sdtblck/lm_dataformat
sdtblck/mapillary_scraper
sdtblck/Megatron-LM
Ongoing research training transformer models at scale
sdtblck/mesh
Mesh TensorFlow: Model Parallelism Made Easier
sdtblck/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
sdtblck/mixtral-offloading
Run Mixtral-8x7B models in Colab or on consumer desktops
sdtblck/mojo
The Mojo Programming Language
sdtblck/mup
Maximal update parametrization (µP)
sdtblck/pypi
sdtblck/RealFakeAugment
Image augmentation functions for GAN training
sdtblck/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
sdtblck/transformers-bloom-inference
Fast Inference Solutions for BLOOM
sdtblck/Yandex-Image-Scraper
Tools for scraping images from Yandex image search