data-augmentation
There are 991 repositories under data-augmentation topic.
snorkel-team/snorkel
A system for quickly generating training data with weak supervision
NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
ZhaoJ9014/face.evoLVe
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
QData/TextAttack
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
fepegar/torchio
Medical imaging toolkit for deep learning
iver56/audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
AgaMiko/data-augmentation-review
List of useful data augmentation resources. You will find here some not common techniques, libraries, links to GitHub repos, papers, and others.
jasonwei20/eda_nlp
Data augmentation for NLP, presented at EMNLP 2019
yongzhuo/nlp_xiaojiang
自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用
visual-layer/fastdup
fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.
zhanlaoban/EDA_NLP_for_Chinese
An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。
LirongWu/awesome-graph-self-supervised-learning
Code for TKDE paper "Self-supervised learning on graphs: Contrastive, generative, or predictive"
Paperspace/DataAugmentationForObjectDetection
Data Augmentation For Object Detection
asteroid-team/torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
styfeng/DataAug4NLP
Collection of papers and resources for data augmentation for NLP.
goru001/inltk
Natural Language Toolkit for Indic Languages aims to provide out of the box support for various NLP tasks that an application developer might need
zhunzhong07/Random-Erasing
Random Erasing Data Augmentation. Experiments on CIFAR10, CIFAR100 and Fashion-MNIST
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
textflint/textflint
Unified Multilingual Robustness Evaluation Toolkit for Natural Language Processing
YuliangXiu/MobilePose
Light-weight Single Person Pose Estimator
Westlake-AI/openmixup
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
firmai/deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
conradry/copy-paste-aug
Copy-paste augmentation for segmentation and detection tasks
quqxui/Awesome-LLM4IE-Papers
Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)
arcelien/pba
Efficient Learning of Augmentation Policy Schedules
MTG/DeepConvSep
Deep Convolutional Neural Networks for Musical Source Separation
hongyi-zhang/mixup
Implementation of the mixup training method
codebox/image_augmentor
Data augmentation tool for images
amanchadha/coursera-gan-specialization
Programming assignments and quizzes from all courses within the GANs specialization offered by deeplearning.ai
denisyarats/drq
DrQ: Data regularized Q
tigerlab-ai/tiger
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
bethgelab/imagecorruptions
Python package to corrupt arbitrary images.
sshuair/torchsat
🔥TorchSat 🌏 is an open-source deep learning framework for satellite imagery analysis based on PyTorch.
vanderschaarlab/synthcity
A library for generating and evaluating synthetic tabular data for privacy, fairness and data augmentation.