nlp-thirdgate

A hub of third-party NLP providers and tutorials that help you quickly process a data iterator with no-string dependency apps.

The purpose of this project is to share third-party providers that can be combined into a single pipeline.

Third-Party Providers

LLM / Mistral.AI [🤖 models]
LLM / OpenRouter.AI [🤖 models]
LLM / Replicate.IO [🤖 models]
LLM / OpenAI / ChatGPT
NER / DeepPavlov [📙 notebook]
NER / Flair [bash-script] [🤖 models]
NER / Spacy [bash-script] [🤖 models]
Translation / GoogleTranslator [📙 notebook]

Individual Models / Others / Miscellaneous

LLM / OpenAI / o1
LLM / OpenAI / Qwen-2.5-Max
LLM / OpenAI / DeepSeek-R1-distill-7b [📙 qwen-notebook] [📙 llama3-notebook]
LLM / Transformers / LLaMA-3
LLM / Transformers / Qwen-2
LLM / Transformers / Phi-4
LLM / Transformers / Gemma-3 [📙 notebook]
LLM / Transformers / Flan-T5
LLM / Transformers / Mistral

Data Iterators

In this project, each provider is a wrapper over a third-party app that handles an iterator of data. Each record of the data is represented as a Python dict.
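The convention above can be illustrated with a minimal sketch. The function name `uppercase_provider` and the `text` / `text_upper` keys are hypothetical and not part of this project; the uppercase transform simply stands in for a call to a real third-party app.

```python
from typing import Any, Dict, Iterable, Iterator

Record = Dict[str, Any]


def uppercase_provider(
    records: Iterable[Record],
    src: str = "text",
    dst: str = "text_upper",
) -> Iterator[Record]:
    """Hypothetical provider: wraps a (fake) third-party call over an iterator of dicts."""
    for record in records:
        # Copy the record so the caller's data is not mutated.
        out = dict(record)
        # Placeholder for the actual third-party call (LLM, NER, translation, ...).
        out[dst] = str(record[src]).upper()
        yield out


# Any iterator of dict records can be passed in; results are produced lazily.
data = iter([{"text": "hello"}, {"text": "world"}])
results = list(uppercase_provider(data))
```

Because the provider is a generator, records are processed one at a time and the input never has to fit in memory as a whole.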

Pipeline Formation

If you wish to use several third-party providers together on a data iterator, it is recommended to adopt the AREkit framework as a no-string solution for deploying pipelines that support batching mode.
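To make the idea of a batched pipeline concrete, here is a plain-Python sketch that chains several handlers over batches of dict records. It deliberately does not use the AREkit API; `batched`, `run_pipeline`, `add_length`, and `add_label` are hypothetical names chosen for illustration only.

```python
from itertools import islice
from typing import Any, Callable, Dict, Iterable, Iterator, List

Record = Dict[str, Any]
BatchHandler = Callable[[List[Record]], List[Record]]


def batched(records: Iterable[Record], size: int) -> Iterator[List[Record]]:
    """Group an iterator of records into lists of at most `size` items."""
    it = iter(records)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


def run_pipeline(
    records: Iterable[Record],
    handlers: List[BatchHandler],
    batch_size: int = 2,
) -> Iterator[Record]:
    """Pass each batch through every handler in order, then yield its records."""
    for batch in batched(records, batch_size):
        for handler in handlers:
            batch = handler(batch)
        yield from batch


# Two hypothetical handlers standing in for third-party providers.
def add_length(batch: List[Record]) -> List[Record]:
    return [{**r, "length": len(r["text"])} for r in batch]


def add_label(batch: List[Record]) -> List[Record]:
    return [{**r, "is_long": r["length"] > 5} for r in batch]


data = [{"text": "short"}, {"text": "a longer text"}, {"text": "mid"}]
pipeline_output = list(run_pipeline(data, [add_length, add_label], batch_size=2))
```

Each handler receives a whole batch, which is where a real provider could issue a single batched request instead of one call per record.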

No-string Application