martijndijkshoorn's Stars
strapi/strapi
🚀 Strapi is the leading open-source headless CMS. It’s 100% JavaScript/TypeScript, fully customizable, and developer-first.
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
n8n-io/n8n
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
gto76/python-cheatsheet
Comprehensive Python Cheatsheet
shap/shap
A game-theoretic approach to explain the output of any machine learning model.
deepset-ai/haystack
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
thingsboard/thingsboard
Open-source IoT Platform - Device management, data collection, processing and visualization.
graykode/nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
iterative/dvc
🦉 Data Versioning and ML Experiments
marcotcr/lime
Lime: Explaining the predictions of any machine learning classifier
OpenMined/PySyft
Perform data science on data that remains in someone else's server
cortexlabs/cortex
Production infrastructure for machine learning at scale
interpretml/interpret
Fit interpretable models. Explain blackbox machine learning.
pytorch/text
Models, data loaders and abstractions for language processing, powered by PyTorch
TeamHG-Memex/eli5
A library for debugging/inspecting machine learning classifiers and explaining their predictions
timoschick/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
kavgan/nlp-in-practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
goru001/inltk
Natural Language Toolkit for Indic Languages aims to provide out-of-the-box support for various NLP tasks that an application developer might need.
mdipietro09/DataScience_ArtificialIntelligence_Utils
Examples of Data Science projects and Artificial Intelligence use-cases
GuansongPang/ADRepository-Anomaly-detection-datasets
ADRepository: Real-world anomaly detection datasets, including tabular data (categorical and numerical data), time series data, graph data, image data, and video data.
drivendataorg/deon
A command line tool to easily add an ethics checklist to your data science projects.
jmugan/modern_practical_nlp
This course covers how you can use NLP to do stuff.
nyu-mll/multiNLI
ashokc/Word-Embeddings-and-Document-Vectors
An evaluation of word-embeddings for classification
patil-suraj/distillbart-mnli
No Teacher BART distillation experiment for NLI tasks
AI4LAM/fastai4GLAMS
A study group for v4 of the fastai introduction to deep learning course with a focus on applications in GLAM settings
apanimesh061/Term_Doc_Matrix_ES
This is a tutorial on how to create a Term-Document Matrix from Elasticsearch.
ashokc/BoW-vs-BERT-Classification
Comparing traditional classifiers with bag-of-words approach to BERT for text classification
Amsterdam-AI-Team/Geolocalization_of_Street_Objects
In this repository, an approach is implemented to automatically detect and geolocate public objects based solely on publicly available panoramic images. The objects of interest are assumed to be stationary, compact, and each observable from several locations. In this project, the objects being detected are bicycle symbols. NOTE: Panorama API offline.