Pinned Repositories
pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
haystack
🔍 Haystack is an open source NLP framework that leverages pre-trained Transformer models. It enables developers to quickly implement production-ready semantic search, question answering, summarization and document ranking for a wide range of NLP applications.
MLND_capstone
Capstone project implementation, report, and proposal for Udacity Machine Learning Engineer Nanodegree
mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
notebooks
Jupyter notebooks for the Natural Language Processing with Transformers book
search_fundamentals_course
Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/search-fundamentals?utm_source=daniel.
search_with_machine_learning_course
Public repository for the Search with Machine Learning course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/course/search-with-machine-learning?utm_source=daniel.
shandou's Repositories
shandou/search_with_machine_learning_course
Public repository for the Search with Machine Learning course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/course/search-with-machine-learning?utm_source=daniel.
shandou/haystack
🔍 Haystack is an open source NLP framework that leverages pre-trained Transformer models. It enables developers to quickly implement production-ready semantic search, question answering, summarization and document ranking for a wide range of NLP applications.
shandou/notebooks
Jupyter notebooks for the Natural Language Processing with Transformers book
shandou/search_fundamentals_course
Public repository for the Search Fundamentals course taught by Daniel Tunkelang and Grant Ingersoll. Available at https://corise.com/search-fundamentals?utm_source=daniel.
shandou/test_pytorch_gpu
shandou/Clean-Code-in-Python
Clean Code in Python, published by Packt
shandou/conda
OS-agnostic, system-level binary package manager and ecosystem
shandou/DeepCTR-Torch
【PyTorch】Easy-to-use, modular, and extensible package of deep-learning-based CTR models.
shandou/dssm
An industrial-grade implementation of DSSM
shandou/feature-engineering-for-machine-learning
Code Repository for the online course Feature Engineering for Machine Learning
shandou/fuzzywuzzy
Fuzzy String Matching in Python
shandou/generative-ai-for-beginners
12 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
shandou/gensim
Topic Modelling for Humans
shandou/gitpod_conda
Stores gitpod configs for using conda
shandou/langchain-academy
shandou/llama_index
LlamaIndex is a data framework for your LLM applications
shandou/ml-design-patterns
Software Architecture for ML engineers
shandou/openai-cookbook
Examples and guides for using the OpenAI API
shandou/pdpipe
Easy pipelines for pandas DataFrames.
shandou/pecos
PECOS - Prediction for Enormous and Correlated Spaces
shandou/pyjanitor
Clean APIs for data cleaning. Python implementation of R package Janitor
shandou/python-patterns
A collection of design patterns/idioms in Python
shandou/pytorch-widedeep
A flexible package for multimodal deep learning to combine tabular data with text and images using Wide and Deep models in PyTorch
shandou/qdrant
Qdrant - Vector Search Engine and Database for the next generation of AI applications. Also available in the cloud https://qdrant.to/cloud
shandou/quantulum3
Library for unit extraction under active development - fork of quantulum
shandou/scikit-learn
scikit-learn: machine learning in Python
shandou/sentence-transformers
Multilingual Sentence & Image Embeddings with BERT
shandou/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
shandou/SpanBERT
Code for using and evaluating SpanBERT.
shandou/swarm
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.