inkoon's Stars
OpenInterpreter/open-interpreter
A natural language interface for computers
xai-org/grok-1
Grok open release
microsoft/autogen
A programming framework for agentic AI π€
astral-sh/ruff
An extremely fast Python linter and code formatter, written in Rust.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
eugeneyan/applied-ml
π Papers & tech blogs by companies sharing their work on data science & machine learning in production.
meta-llama/llama3
The official Meta Llama 3 GitHub site
unclecode/crawl4ai
π₯π·οΈ Crawl4AI: Crawl Smarter, Faster, Freely. For AI.
ItzCrazyKns/Perplexica
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
microsoft/BitNet
Official inference framework for 1-bit LLMs
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
EricLBuehler/mistral.rs
Blazingly fast LLM inference.
orioncactus/pretendard
μ΄λ νλ«νΌμμλ μ¬μ©ν μ μλ system-ui λ체 κΈκΌ΄ | A system-ui alternative font for all cross-platform
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
McGill-NLP/llm2vec
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
SakanaAI/evolutionary-model-merge
Official repository of Evolutionary Optimization of Model Merging Recipes
pemistahl/lingua-py
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
facebookresearch/MobileLLM
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
vllm-project/llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
ContextualAI/gritlm
Generative Representational Instruction Tuning
texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
RAIVNLab/MRL
Code repository for the paper - "Matryoshka Representation Learning"
microsoft/MS-MARCO-Web-Search
A large-scale information-rich web dataset, featuring millions of real clicked query-document labels
cfahlgren1/qwen-2.5-code-interpreter
Qwen 2.5 Coder 1.5B with Code Interpreter
project-miracl/miracl
A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.
MLP-Lab/Bllossom
DunZhang/Stella
rladmstn1714/CLIcK
CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean
ielab/PromptReps
Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval