floatingbigcat

I enjoy building interesting and useful things

Pinned Repositories

android-kotlin-fundamentals-starter-apps
Language:Kotlin 00
asdfghjkl
ASDL: Automatic Second-order Differentiation (for Fisher, Gradient covariance, Hessian, Jacobian, and Kernel) Library
Language:Python 0 0 00
ASDL_train
Language:Jupyter Notebook 00
chess-to-text
Language:Jupyter Notebook 0 0 00
Courseware
0 1 00
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python 0 0 00
evojax
Language:Jupyter Notebook 0 0 00
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language:Python 1 0 00
transformer_layers_as_painters
Language:Python 50
self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
Language:Python 1k 16 12115

floatingbigcat's Repositories

floatingbigcat/transformer_layers_as_painters
Language:Python 50
floatingbigcat/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language:Python 1 0 00
floatingbigcat/android-kotlin-fundamentals-starter-apps
Language:Kotlin 00
floatingbigcat/asdfghjkl
ASDL: Automatic Second-order Differentiation (for Fisher, Gradient covariance, Hessian, Jacobian, and Kernel) Library
Language:Python 0 0 00
floatingbigcat/ASDL_train
Language:Jupyter Notebook 00
floatingbigcat/chess-to-text
Language:Jupyter Notebook 0 0 00
floatingbigcat/Courseware
0 1 00
floatingbigcat/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python 0 0 00
floatingbigcat/evojax
Language:Jupyter Notebook 0 0 00
floatingbigcat/floatingbigcat.github.io
Language:HTML 0 1 00
floatingbigcat/Guider
Language:Python 0 1 00
floatingbigcat/IHPCSS-Programming-challenge-2024
Language:Jupyter Notebook 0 0
floatingbigcat/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
floatingbigcat/korean_audios
floatingbigcat/lm-evaluation-harness
A framework for few-shot evaluation of language models.
floatingbigcat/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python 0 0
floatingbigcat/magicoder
Magicoder: Source Code Is All You Need
Language:Python 0 0
floatingbigcat/magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
Language:Python 0 0
floatingbigcat/mergekit
Tools for merging pretrained large language models.
Language:Python
floatingbigcat/mm_builder
Language:Python 1 81
floatingbigcat/MMLU-Pro
The scripts for MMLU-Pro
Language:Python
floatingbigcat/PictureBed
1 0
floatingbigcat/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Language:Python 0 0
floatingbigcat/slimevolleygym
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
Language:Python 0 0
floatingbigcat/synthetic_data
Language:Python
floatingbigcat/visualwebarena
VisualWebArena is a benchmark for multimodal agents.
Language:Python
floatingbigcat/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python 0 0
floatingbigcat/VLM
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
floatingbigcat/WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
Language:Python 0 0