nicolaygerold's Stars
rasbt/deeplearning-models
A collection of various deep learning architectures, models, and tips
Lissy93/web-check
๐ต๏ธโโ๏ธ All-in-one OSINT tool for analysing any website
meilisearch/charabia
Library used by Meilisearch to tokenize queries and documents
cohere-ai/BinaryVectorDB
Efficient vector database for hundred millions of embeddings.
searchhub/search-collector
A fast and simple JavaScript library specifically targeted at collecting search and search related browser events.
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
EleutherAI/tokengrams
Efficiently computing & storing token n-grams from large corpora
MarginaliaSearch/MarginaliaSearch
Internet search engine for text-oriented websites. Indexing the small, old and weird web.
mixedbread-ai/baguetter
Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, implementing, and testing new search methods. Baguetter supports sparse (traditional), dense (semantic), and hybrid retrieval methods.
TomWright/dasel
Select, put and delete data from JSON, TOML, YAML, XML and CSV files with a single tool. Supports conversion between formats and can be used as a Go package.
probably-nothing-labs/denormalized
Embeddable stream processing engine based on Apache DataFusion
neuml/txtai
๐ก All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
TheAlgorithms/Go
Algorithms and Data Structures implemented in Go for beginners, following best practices.
cgzirim/seek-tune
An implementation of Shazam's song recognition algorithm.
jodigiordano/gg
The diagramming tool for flowcharts, mindmaps, user flows, network & cloud diagrams, and more!
urchade/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
creyesp/Awesome-recsys
Curated list of recommnedation system topics
GokuMohandas/Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
labmlai/annotated_deep_learning_paper_implementations
๐งโ๐ซ 60+ Implementations/tutorials of deep learning papers with side-by-side notes ๐; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ๐ฎ reinforcement learning (ppo, dqn), capsnet, distillation, ... ๐ง
binhnguyennus/awesome-scalability
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
swyxio/spark-joy
โจ๐ 2000+ ways to add design flair, user delight, and whimsy to your product.
AnswerDotAI/RAGatouille
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
eugeneyan/applied-ml
๐ Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Canner/WrenAI
๐ An open-source SQL AI (Text-to-SQL) Agent that empowers data, product teams to chat with their data. ๐ค
anthropics/anthropic-cookbook
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
fmind/mlops-python-package
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
ByteByteGoHq/system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
donnemartin/system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
google-github-actions/deploy-cloudrun
A GitHub Action for deploying services to Google Cloud Run.