data-generation
There are 256 repositories under the data-generation topic.
IDEA-Research/Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
sdv-dev/SDV
Synthetic data generation for tabular data
benkeen/generatedata
A powerful, feature-rich, random test data generator.
AgaMiko/data-augmentation-review
List of useful data augmentation resources. Here you will find some uncommon techniques, libraries, links to GitHub repos, papers, and more.
neomatrix369/awesome-ai-ml-dl
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources on these topics.
shuttle-hq/synth
The Declarative Data Generator
sdv-dev/CTGAN
Conditional GAN for generating synthetic tabular data.
whatyouhide/stream_data
Data generation and property-based testing for Elixir. 🔮
Westlake-AI/openmixup
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
sdv-dev/Copulas
A library to model multivariate data using copulas.
nomemory/mockneat
MockNeat - the modern faker lib.
tom-lord/regexp-examples
Generate strings that match a given regular expression
MTG/DeepConvSep
Deep Convolutional Neural Networks for Musical Source Separation
databrickslabs/dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated or synthetic data sets for tests, POCs, and other uses in Databricks environments, including in Delta Live Tables pipelines.
cieslarmichal/faker-cxx
C++ Faker library for generating fake (but realistic) data.
microsoft/genalog
Genalog is an open-source, cross-platform Python package for generating synthetic document images with custom degradations and text alignment capabilities.
tirthajyoti/pydbgen
Random dataframe and database table generator
kathrinse/be_great
A novel approach for synthesizing tabular data using pretrained large language models
trinker/wakefield
Generate random data sets
worldbank/REaLTabFormer
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
rapiddweller/rapiddweller-benerator-ce
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
finos/datahelix
The DataHelix generator allows you to quickly create data for testing and validation, based on a JSON profile that defines fields and the relationships between them.
gretelai/awesome-synthetic-data
📖 A curated list of resources dedicated to synthetic data
UnrealZoo/unrealzoo-gym
Large-scale photo-realistic virtual worlds for embodied AI
tinybirdco/mockingbird
Mockingbird is a mock streaming data generator
sdv-dev/DeepEcho
Synthetic Data Generation for mixed-type, multivariate time series.
zhaohengyuan1/Genixer
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
louisYen/Gen4Gen
🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"
mjkvaak/ImageDataAugmentor
Custom image data generator for TF Keras that supports the modern augmentation module albumentations
kgoldfeld/simstudy
simstudy: Illuminating research methods through data generation
leezythu/FlexKBQA
FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering
ykang/gratis
GRATIS: GeneRAting TIme Series with diverse and controllable characteristics
tosiron/jazznet
jazznet dataset of piano patterns for music audio machine learning research
br0kej/bin2ml
A command line tool for extracting machine learning ready data from software binaries powered by Radare2
manumerous/wb_humanoid_mpc
Whole-Body Nonlinear MPC for Realtime Humanoid Loco-Manipulation Planning and Control
dmey/synthia
📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python