Pinned Repositories
BentoVLLM
Self-host LLMs with vLLM and BentoML
autogguf
Easily convert HuggingFace models to GGUF-format for llama.cpp
uniflow-llm-based-pdf-extraction-text-cleaning-data-clustering
LLM-based text extraction from unstructured data such as PDFs, Word documents, and HTML pages. Transform and cluster the text into your desired format. Less information loss, more interpretation, and faster R&D!
hello-world
lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Simple-DevOps-Project
doccano_spacy
Doccano annotation server together with a spaCy backend
litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
mistral-finetune
Chasapas's Repositories
Chasapas/hello-world
Chasapas/lit-gpt
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Chasapas/Simple-DevOps-Project