agombert's Stars
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
worldbank/econberta-econie
Repository hosting the large language model EconBERTa and the annotated dataset EconIE
outlines-dev/outlines
Structured Text Generation
modal-labs/modal-examples
Examples of programs built using Modal
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
kyrolabs/awesome-langchain
😎 Awesome list of tools and projects with the awesome LangChain framework
AI4Finance-Foundation/FinGPT
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
huggingface/setfit
Efficient few-shot learning with Sentence Transformers
serpapi/google-search-results-python
Google Search Results via SERP API pip Python Package
kayoyin/interpret-lm
Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)
JustAnotherArchivist/snscrape
A social networking service scraper in Python
sohampoddar26/caves-data
CAVES-dataset accepted at SIGIR'22
crabcamp/lexrank
LexRank algorithm for text summarization
Giskard-AI/giskard
🐢 Open-Source Evaluation & Testing for ML models & LLMs
MilaNLProc/twitter-demographer
A python package to enrich Twitter Data
jamesturk/jellyfish
🪼 a python library for doing approximate and phonetic matching of strings.
snorkel-team/snorkel
A system for quickly generating training data with weak supervision
cardiffnlp/xlm-t
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
dinobby/ZS-BERT
Official implementation of the paper "Towards Zero-Shot Relation Extraction with Attribute Representation Learning."
ml-tooling/best-of-ml-python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
Saravananslb/py-googletranslation
pygoogletranslation: Free and Unlimited Google translate API for Python. Translates totally free of charge.
makcedward/nlpaug
Data augmentation for NLP
twintproject/twint
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.