raphaelsty's Stars
ggerganov/ggml
Tensor library for machine learning
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
pytorch/torchtune
PyTorch native finetuning library
tconbeer/harlequin
The SQL IDE for Your Terminal.
apache/age
Graph database optimized for fast analysis and real-time data processing. It is provided as an extension to PostgreSQL.
stanford-futuredata/ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
astanin/python-tabulate
Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
zou-group/textgrad
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
zml/zml
High performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild
dora-rs/dora
DORA (Dataflow-Oriented Robotic Architecture) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
urchade/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
rspeer/wordfreq
Access a database of word frequencies, in various natural languages.
dleemiller/WordLlama
Things you can do with the token embeddings of an LLM
illuin-tech/colpali
The code used to train and run inference with the ColPali architecture.
py-pdf/fpdf2
Simple PDF generation for Python
xhluca/bm25s
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
zakird/crux-top-lists
Downloadable snapshots of the Chrome Top Million Websites pulled from public CrUX data in Google BigQuery.
facebookresearch/fairseq2
FAIR Sequence Modeling Toolkit 2
AnacletoLAB/grape
🍇 GRAPE is a Rust/Python Graph Representation Learning library for Predictions and Evaluations
ing-bank/sparse_dot_topn
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
jndean/LossRider
A plotting tool that outputs Line Rider maps, so you can watch a man on a sled scoot down your loss curves. 🎿
cmavro/GNN-RAG
GNN-RAG: Graph Neural Retrieval for Large Language Modeling Reasoning
jax-ml/ml_dtypes
A stand-alone implementation of several NumPy dtype extensions used in machine learning.
lightonai/pylate
Late Interaction Models Training & Retrieval
Unstructured-IO/unstructured-inference
TorchDR/TorchDR
TorchDR - PyTorch Dimensionality Reduction
carbonfact/icanexplain
📊 Explain why metrics change by unpacking them
mattt/hype
Write Python functions. Use them everywhere. HTTP, CLI, GUI, LLM (OMG)