meshidenn's Stars
ashvardanian/StringZilla
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc 🦖
tensorchord/Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
cynthia/sse
Simple tool to allow an annotator to look at a source sentence and pick the most similar sentence out of a set of sentences.
ChenghaoMou/text-dedup
All-in-one text de-duplication
Stability-AI/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
fungtion/DANN
pytorch implementation of Domain-Adversarial Training of Neural Networks
ggerganov/llama.cpp
LLM inference in C/C++
hollobit/GenAI_LLM_timeline
ChatGPT, GenerativeAI and LLMs Timeline
Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
google/paxml
Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.
yaodongC/awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
moisutsu/trans-arxiv-bot
Twitter bot that tweets translated arXiv paper summaries
togethercomputer/RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
rioyokotalab/hpsc-2023
microsoft/msmarco
website for MS Marco
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
hical/HiCAL
HiCAL is a system for efficient high-recall retrieval with an adaptable assessing interface.
skypilot-org/skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Kei18/awesome_cs-ja_phd_life
collection of articles about PhD life written in 🇯🇵
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Lightning-AI/lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
nomic-ai/gpt4all
GPT4All: Chat with Local LLMs on Any Device
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
BlinkDL/ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
vdaas/vald
Vald. A Highly Scalable Distributed Vector Search Engine
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications