ClemDoum
NLP Scientist/Engineer at ICIJ (ex-Sonos Voice Experience, ex-Snips) https://www.linkedin.com/in/clementdoumouro/
@snipsco France
ClemDoum's Stars
wagoodman/dive
A tool for exploring each layer in a docker image
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
slimtoolkit/slim
Slim(toolkit): Don't change anything in your container image and minify it by up to 30x (and for compiled languages even more) making it secure too! (free and open source)
pqrs-org/Karabiner-Elements
Karabiner-Elements is a powerful tool for customizing keyboards on macOS
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
gristlabs/grist-core
Grist is the evolution of spreadsheets.
pressly/goose
A database migration tool. Supports SQL migrations and Go functions.
amacneil/dbmate
🚀 A lightweight, framework-agnostic database migration tool.
erikbern/ann-benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
SciPhi-AI/R2R
The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a RESTful API.
ultrajson/ultrajson
Ultra fast JSON decoder and encoder written in C with Python bindings
jorisschellekens/borb
borb is a library for reading, creating and manipulating PDF files in python.
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
microsoft/Phi-3CookBook
This is a book for getting started with the Phi family of SLMs. Phi is a family of open-source AI models developed by Microsoft. Phi models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and the next size up across a variety of language, reasoning, coding, and math benchmarks.
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Azure-Samples/graphrag-accelerator
One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure
explosion/spacy-models
💫 Models for the spaCy Natural Language Processing (NLP) library
aio-libs/aiocache
Asyncio cache manager for redis, memcached and memory
piskvorky/sqlitedict
Persistent dict, backed by sqlite3 and pickle, multithread-safe.
speedb-io/speedb
A RocksDB compliant high performance scalable embedded key-value store
pycountry/pycountry
A Python library to access ISO country, subdivision, language, currency and script definitions and their translations.
WebTools-NG/WebTools-NG
WebTools Next Generation for Plex
maelstrom-software/maelstrom
Maelstrom is a fast Rust, Go, and Python test runner that runs every test in its own container. Tests are either run locally or distributed to a clustered job runner.
JiaquanYe/TableMASTER-mmocr
2nd-place solution of the ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
cv-small-snails/Awesome-Table-Recognition
A curated list of resources dedicated to table recognition
SapienzaNLP/relik
Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024)
harsha-simhadri/big-ann-benchmarks
Framework for evaluating ANNS algorithms on billion scale datasets.
tomasonjo/diffbot-kg-chatbot
Knowledge graph construction and RAG demo using Diffbot and Neo4j
predlico/ARAGOG
ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on an AI research papers dataset. Includes modular code for easy experimentation and reusability.
xyb/rocksdb3
Python bindings for rocksdb