StellaAthena

Democratizing language models and understanding how they work

Booz Allen Hamilton, EleutherAI

Pinned Repositories

gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Language:Python8.2k 178 139939
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Language:Python6.6k 120 427967
pythia
The hub for EleutherAI's work on interpretability and learning dynamics
Language:Jupyter Notebook2.1k 33 99152
the-pile
Language:Python1.4k 30 100119
egnn-pytorch
Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch
Language:Python1 0 00
fractal-ml
Fun stuff with fractal machine learning
Language:Jupyter Notebook3 1 05
gpt-neo
An implementation of model parallel GPT2& GPT3-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library.
Language:Python2 1 00
OpenPrompt
An Open-Source Framework for Prompt-Learning.
Language:Python2 0 01
starter-hugo-academic
Language:Jupyter Notebook2 1 00
transformer-memorization
Language:C++6 3 04

StellaAthena's Repositories

StellaAthena/transformer-memorization
Language:C++6 3 04
StellaAthena/OpenPrompt
An Open-Source Framework for Prompt-Learning.
Language:Python2 0 01
StellaAthena/starter-hugo-academic
Language:Jupyter Notebook2 1 00
StellaAthena/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
Language:Python1 0 00
StellaAthena/mesh-transformer-jax
Model parallel transformers in JAX and Haiku
Language:Python1 0 00
StellaAthena/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
Language:Python1 0 0
StellaAthena/StellaAthena
GitHub README
1 1 01
StellaAthena/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python1 0 0
StellaAthena/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Language:Python0 0
StellaAthena/city-circuits
Language:Jupyter Notebook0 0
StellaAthena/client
🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
Language:Python0 0
StellaAthena/DeeperSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Language:Python0 0
StellaAthena/eleuther.ai
Language:JavaScript0 0
StellaAthena/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language:Python0 0
StellaAthena/llama
Inference code for LLaMA models
Language:Python0 01
StellaAthena/magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language
Language:Python0 0
StellaAthena/metaseq
Repo for external large-scale work
Language:Python0 0
StellaAthena/ML_SageMaker_Studies
Case studies, examples, and exercises for learning to deploy ML models using AWS SageMaker.
Language:Jupyter Notebook0 0
StellaAthena/moss.rb
A plagiarism detection engine based on Stanford's MOSS(Measure of Software Similarity)
Language:Ruby0 0
StellaAthena/mtg
Collection of data science and machine learning projects for Magic: the Gathering
Language:Python0 01
StellaAthena/point-transformer-pytorch
Implementation of the Point Transformer layer, in Pytorch
Language:Python0 0
StellaAthena/promptsource
Toolkit for collecting and applying templates of prompting instances
Language:Python0 0
StellaAthena/speak-memory
Code and data to support "Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4"
Language:Python0 0
StellaAthena/t-zero
Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
Language:Python0 0
StellaAthena/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
Language:Jupyter Notebook0 0
StellaAthena/TopographicVAE
Official implementation of the paper "Topographic VAEs learn Equivariant Capsules"
Language:Python0 0
StellaAthena/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python0 0
StellaAthena/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
StellaAthena/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
Language:Python0 0
StellaAthena/women-tech-speakers-organizers
A list of women tech speakers & organizers. Add yourself or others by submitting a PR! PS if you do add someone, make sure to tell them! :) #fempire
0 0