dataset-generator
There are 47 repositories under the dataset-generator topic.
Dev-Tarek/sketched-webpages-generator
Customizable open-source software to generate randomized sketched web pages.
hetpandya/youtube_tts_data_generator
A Python library to generate speech datasets from YouTube videos
meyerls/PEGASUS
[IROS24] Official repository for "PEGASUS: Physically Enhanced Gaussian Splatting Simulation System for 6DoF Object Pose Dataset Generation"
alexppppp/synthetic-dataset-object-detection
How to Create a Synthetic Dataset for Computer Vision (Object Detection) (Article on Medium)
ATISLabs/SyntheticDatasets.jl
Collection of artificial data generators in Julia
msorkhpar/wiki-entity-summarization
This repository hosts a comprehensive suite for generating graph-based entity summarization datasets from user-selected Wikipedia pages. Using a series of interconnected modules, it draws on Wikidata and Wikipedia dumps to construct a dataset along with auto-generated ground truths.
tombax7/FLITC-application
A data-driven, deep-learning-based fault diagnosis application for radial, active distribution grids
Jaesung-Jun/Cut-And-Save-Faces
collect pictures
nileshprasad137/keystroke-dynamics-datagen
Generate datasets of keystroke timings for exploratory and research purposes.
iwangjian/pyloader
🐳 PyLoader: An asynchronous Python dataloader for loading big datasets, supporting PyTorch and TensorFlow 2.x.
fedecalendino/reddit-graph
Graph representation of Reddit
joshuaboud/gen-dataset
Command line tool to quickly generate a lot of files in a lot of directories
M-Farag/rawbuilder
An elegant dataset factory
OmarSamirz/ImageFromTextGenerator
IFTG (ImageFromTextGenerator) is a Python package that simplifies creating robust datasets for OCR models. Generate images from text, apply over 10 built-in noise effects, and customize fonts and layouts. IFTG supports all languages and offers endless noise combinations, including custom noise creation.
RahulSharmaNITT/MathWordProblem
Question generator for math word problems.
christiangarcia0311/data-exploration-analysis
Data Exploration is the initial step in data analysis, where users explore a large data set in an unstructured way to uncover initial patterns, characteristics and points of interest.
ElsevierSoftwareX/SOFTX-D-20-00055
Open-source software for generating synthetic web-based user interface and content datasets. To cite this Original Software Publication: https://www.sciencedirect.com/science/article/pii/S2352711022000073
JC-ProgJava/Handwritten-Digit-Dataset
A collection of 107,730 28x28 PNG files of digits from 0-9, with a dataset generator.
leomaurodesenv/travel-dataset-generator
A tool to generate a synthetic dataset of corporate travel
PatricioGuinle/CoffeMIDI
A MIDI Content-Based Recommendation System
RohitMidha23/youtube-video-scraper
Unleash the power of YouTube with this efficient scraper - download videos with just a search query!
rodrigo-barraza/inscriptor
BLIP-2 captioning, mass captioning, question answering, and other tools.
StarlangSoftware/DataGenerator-Py
Classification dataset generator library for high-level NLP tasks
yas-sim/simple-annotation-toolkit
The simplest ROI annotation toolkit for object detection tasks
dennis-barrett/dimdates-dot-com
Source code for the Kimball-style date dimension generator dimdates.com.
Huned-materwala/Ethereum-Transaction-Data-Generator-ETDG
Ethereum transaction data generator for producing high-quality, efficient datasets for fraud detection
realm-tech/docgen
A document generator used to fully create training and evaluation datasets for OCR applications
ZEKE320/llm-dataset-generator
The LLM Dataset Generator is an open source tool for generating text data compatible with various language models supported by LangChain. You can customize it to meet your specific needs, making it a valuable resource for researchers, developers, and organizations working on NLP applications.
filiptronicek/dataset-creator
Simple Flickr Image Scraper and compression script
mrelmi/persian-speech-recognition
Automatic generator of speech recognition datasets from movie subtitles
rbsathish/file_renamer_with_gui
Using this tool, you can rename the files in a selected directory. It is useful for ML dataset preparation and any other uses, e.g. Frame_01, Image01
red-shock/Mass-Image-Downloader
A browser extension which allows you to download all images on a page as well as aggregate them.
Spr-Aachen/Easy-DataSet-Creator-Tool-For-Image-Classification
A simple dataset creation tool for image classification, still under construction
adeiskandarzulkarnaen/AksaraGenerator-Flutter
Dataset Generator for CNN Handwriting Recognition
JoshWarn/Multi-Label-Shapes-Toy-Dataset-Generator
An easy-to-use multi-label image dataset generator.
VendenIX/YoutubeDatasetGenerator
This repository provides a tool to create a dataset of images from a YouTube video by capturing one image every 10 seconds in 480p resolution.