ProbablyFaiz's Stars
upscayl/upscayl
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
google/python-fire
Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
daybrush/moveable
Moveable! Draggable! Resizable! Scalable! Rotatable! Warpable! Pinchable! Groupable! Snappable!
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
huggingface/text-generation-inference
Large Language Model Text Generation Inference
outlines-dev/outlines
Structured Text Generation
abetlen/llama-cpp-python
Python bindings for llama.cpp
gristlabs/grist-core
Grist is the evolution of spreadsheets.
OpenAccess-AI-Collective/axolotl
Go ahead and axolotl questions
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
XuehaiPan/nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
turboderp/exllamav2
A fast inference library for running LLMs locally on modern consumer-class GPUs
OpenNMT/CTranslate2
Fast inference engine for Transformer models
turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Filimoa/open-parse
Improved file parsing for LLM’s
jasonjmcghee/rem
An open source approach to locally record and enable searching everything you view on your Mac.
unum-cloud/usearch
Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
go-python/gopy
gopy generates a CPython extension module from a go package.
S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
ELS-RD/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
MeetKai/functionary
Chat language model that can use tools and interpret the results
ashvardanian/SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
mcamac/react-text-annotate
React components for interactively highlighting parts of text.
jllllll/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
WuNein/vllm4mteb
vLLM for embedding tasks using Original LLMs (Qwen2, LLaMA)