Pinned Repositories
android-kotlin-fundamentals-starter-apps
asdfghjkl
ASDL: Automatic Second-order Differentiation (for Fisher, Gradient covariance, Hessian, Jacobian, and Kernel) Library
ASDL_train
chess-to-text
Courseware
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
evojax
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
transformer_layers_as_painters
self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
floatingbigcat's Repositories
floatingbigcat/transformer_layers_as_painters
floatingbigcat/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
floatingbigcat/android-kotlin-fundamentals-starter-apps
floatingbigcat/asdfghjkl
ASDL: Automatic Second-order Differentiation (for Fisher, Gradient covariance, Hessian, Jacobian, and Kernel) Library
floatingbigcat/ASDL_train
floatingbigcat/chess-to-text
floatingbigcat/Courseware
floatingbigcat/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
floatingbigcat/evojax
floatingbigcat/floatingbigcat.github.io
floatingbigcat/Guider
floatingbigcat/IHPCSS-Programming-challenge-2024
floatingbigcat/img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
floatingbigcat/korean_audios
floatingbigcat/lm-evaluation-harness
A framework for few-shot evaluation of language models.
floatingbigcat/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
floatingbigcat/magicoder
Magicoder: Source Code Is All You Need
floatingbigcat/magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
floatingbigcat/mergekit
Tools for merging pretrained large language models.
floatingbigcat/mm_builder
floatingbigcat/MMLU-Pro
The scripts for MMLU-Pro
floatingbigcat/PictureBed
floatingbigcat/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
floatingbigcat/slimevolleygym
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
floatingbigcat/synthetic_data
floatingbigcat/visualwebarena
VisualWebArena is a benchmark for multimodal agents.
floatingbigcat/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
floatingbigcat/VLM
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
floatingbigcat/WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath