wanderingai
A wandering adventurer, writing about large language models, on a mage's journey to source the gems of AI.
Together AI · Seattle, WA
wanderingai's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
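A minimal sketch of what "dynamic neural networks with strong GPU acceleration" means in practice: PyTorch builds the computation graph as the code runs, so a plain Python expression can be differentiated with autograd.

```python
import torch

# Build a tiny computation on the fly; requires_grad tells autograd
# to track operations on x.
x = torch.ones(2, 2, requires_grad=True)
y = (3 * x).sum()   # y depends on every element of x

# Backpropagate: d(y)/d(x_ij) = 3 for each element.
y.backward()
print(x.grad)
```

On a CUDA machine the same code runs on the GPU by adding `device="cuda"` when creating the tensor.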
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
phidatahq/phidata
Build AI Assistants with memory, knowledge and tools.
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
nebuly-ai/optimate
A collection of libraries to optimise AI model performance
kellyjonbrazil/jc
CLI tool and Python library that converts the output of popular command-line tools, file types, and common strings to JSON, YAML, or dictionaries, allowing output to be piped to tools like jq and simplifying automation scripts.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
jina-ai/reader
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
pytorch-labs/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
pytorch/torchtune
A Native-PyTorch Library for LLM Fine-tuning
InternLM/xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
agiresearch/AIOS
AIOS: LLM Agent Operating System
Tinche/aiofiles
File support for asyncio
google-research/t5x
peak/s5cmd
Parallel S3 and local filesystem execution tool.
stochasticai/xTuring
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our Discord community: https://discord.gg/TgHXuSJEk6
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
lcompilers/lpython
Python compiler
huggingface/nanotron
Minimalistic large language model 3D-parallelism training
Liuhong99/Sophia
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
datadreamer-dev/DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Ananto30/zero
Zero: A simple and fast Python RPC framework
brentyi/tyro
CLI interfaces & config objects, from types
zeux/calm
CUDA/Metal accelerated language model inference
HazyResearch/flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
NVIDIA/cuda-checkpoint
CUDA checkpoint and restore utility