maiiabocharova
Machine Learning Engineer (focused on NLP) with interest in web-scraping.
DaXtraUkraine, Odessa
maiiabocharova's Stars
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
sqlalchemy/sqlalchemy
The Database Toolkit for Python
sqlite/sqlite
Official Git mirror of the SQLite source tree
roniemartinez/dude
dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
HumanSignal/label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
codingforentrepreneurs/30-Days-of-Python
Learn Python for the next 30 (or so) Days.
pzelasko/daseg
Dialog Acts SEGmentation: Tools for dialog act research
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
huggingface/optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools
aio-libs/aiohttp
Asynchronous HTTP client/server framework for asyncio and Python
FlareSolverr/FlareSolverr
Proxy server to bypass Cloudflare protection
PaddlePaddle/PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
hiyouga/Dual-Contrastive-Learning
Code for our paper "Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation"
declare-lab/awesome-emotion-recognition-in-conversations
A comprehensive reading list for Emotion Recognition in Conversations
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
infinitylogesh/NLP_paper_notes
My Notes and observations of Interesting NLP papers. My interests are representation learning, Metric learning and Information retrieval
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
nreimers/se-benchmark
facebookresearch/anli
Adversarial Natural Language Inference Benchmark
declare-lab/dialogue-understanding
This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
lmcinnes/umap
Uniform Manifold Approximation and Projection
andreas-vester/df2img
Save a Pandas DataFrame as image