kyoungrok0517's Stars
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
open-webui/open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
microsoft/autogen
A programming framework for agentic AI 🤖
astral-sh/uv
An extremely fast Python package and project manager, written in Rust.
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
AnswerDotAI/fasthtml
The fastest way to create an HTML app
xjdr-alt/entropix
Entropy Based Sampling and Parallel CoT Decoding
verazuo/jailbreak_llms
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
mintisan/awesome-kan
A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold Network field.
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
JumpCrypto/crypto-reading-list
AnswerDotAI/rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
PrithivirajDamodaran/FlashRank
Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.
jawah/charset_normalizer
Truly universal encoding detector in pure Python
texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
jbloomAus/SAELens
Training Sparse Autoencoders on Language Models
ndif-team/nnsight
The nnsight package enables interpreting and manipulating the internals of deep learned models.
TransformerLensOrg/CircuitsVis
Mechanistic Interpretability Visualizations using React
permaweb/ao
The ao component and tools Monorepo - 🐰 🕳️ 👈
saprmarks/dictionary_learning
SixdegreeLab/MasteringChainAnalytics
xjdr-alt/entropix-local
smol models are fun too
kraibse/obsidian-table-sorting
This essential plugin will finally allow you to organize your tables non-destructively right within Obsidian. Sorting by multiple columns is supported!
DevSinghSachan/art
Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"
xjdr-alt/entropix-trainer
train entropix like a champ!