fatemerhmi's Stars
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
axa-group/Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
facebookresearch/detectron2
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
taasmoe/BIO-to-BIOLU
Changes the encoding of CoNLL-03 NER datasets from BIO to BIOLU
rmunro/pytorch_active_learning
PyTorch Library for Active Learning to accompany Human-in-the-Loop Machine Learning book
mikeqfu/pydriosm
PyDriosm: an open-source tool for downloading, reading and PostgreSQL-based I/O of OpenStreetMap data
juand-r/entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
fastai/fastbook
The fastai book, published as Jupyter Notebooks
JohnSnowLabs/nlu
One line of code for thousands of state-of-the-art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.
joke2k/faker
Faker is a Python package that generates fake data for you.
mahmoodlab/CLAM
Open source tools for computational pathology - Nature BME
daviddrysdale/python-phonenumbers
Python port of Google's libphonenumber
obsei/obsei
Obsei is a low-code, AI-powered automation tool. It can be used in various business flows such as social listening, AI-based alerting, brand image analysis, comparative study and more.
doccano/doccano
Open source annotation tool for machine learning practitioners.
openvenues/libpostal
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
microsoft/presidio-research
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
john-kurkowski/tldextract
Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).
GokuMohandas/Made-With-ML
Learn how to design, develop, deploy and iterate on production-grade ML applications.
prakhar1989/docker-curriculum
🐬 A comprehensive tutorial on getting started with Docker!
ivan-bilan/The-Microservices-Pandect
A comprehensive reference for all topics related to building and maintaining microservices
dair-ai/nlp_paper_summaries
✍️ A carefully curated list of NLP paper summaries
ivan-bilan/The-NLP-Pandect
A comprehensive reference for all topics related to Natural Language Processing
MLforHealth/MIMIC_Extract
MIMIC-Extract: A Data Extraction, Preprocessing, and Representation Pipeline for MIMIC-III
nedap/deidentify
A Python library to de-identify medical records with state-of-the-art NLP methods.
nickdavidhaynes/spacy-cld
Language detection extension for spaCy 2.0+
microsoft/presidio
Context-aware, pluggable and customizable data protection and de-identification SDK for text and images
ahmedbesbes/anonymizer
Text anonymization app with Streamlit and spaCy
caufieldjh/awesome-bioie
🧫 A curated list of resources relevant to doing Biomedical Information Extraction (including BioNLP)