miniaturelle's Stars
openai/openai-cookbook
Examples and guides for using the OpenAI API
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
norvig/pytudes
Python programs, usually short, of considerable difficulty, to perfect particular skills.
deepset-ai/haystack
🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
DataTalksClub/mlops-zoomcamp
Free MLOps course from DataTalks.Club
kedro-org/kedro
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
microsoft/vscode-dev-containers
NOTE: Most of the contents of this repository have been migrated to the new devcontainers GitHub org (https://github.com/devcontainers). See https://github.com/devcontainers/template-starter and https://github.com/devcontainers/feature-starter for information on creating your own!
ThilinaRajapakse/simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
JohnSnowLabs/spark-nlp
State of the Art Natural Language Processing
GokuMohandas/mlops-course
Learn how to design, develop, deploy and iterate on production-grade ML applications.
jupyter-naas/awesome-notebooks
A powerful data & AI notebook templates catalog: prompts, plugins, models, workflow automation, analytics, code snippets - following the IMO framework to be searchable and reusable in any context.
whylabs/whylogs
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
chiphuyen/dmls-book
Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)
LIAAD/yake
Single-document unsupervised keyword extraction
explosion/sense2vec
🦆 Contextually-keyed word vectors
boudinfl/pke
Python Keyphrase Extraction module
frutik/awesome-search
Awesome Search: all about search (e-commerce, but not only) and its awesomeness
NVIDIA-Merlin/Transformers4Rec
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.
ExtensityAI/symbolicai
Compositional Differentiable Programming Library
jacopotagliabue/you-dont-need-a-bigger-boat
An end-to-end implementation of intent prediction with Metaflow and other cool tools
NTMC-Community/awesome-neural-models-for-semantic-match
A curated list of papers dedicated to neural text (semantic) matching.
parrt/random-forest-importances
Code to compute permutation and drop-column importances in Python scikit-learn models
xLaszlo/datascience-fails
Collection of articles listing reasons why data science projects fail.
changyaochen/rbo
Implementation of Rank-biased Overlap
terrier-org/cikm2021tutorial
wikimedia/search-ltr
GitHub mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)