Fake entire data schemas. Easily.
Original repository: https://github.com/mariushelf/factory_boss
Factory Boss can help you whenever you need to mock data schemas, for example when you cannot work or develop with the original data for privacy, GDPR related issues and security concerns.
Factory Boss can mock entire data schemas, including relationships and dependencies between features and objects.
Schema specifications are read from a simple yaml format. The generated output is a list of dictionaries for each mocked entity, which can easily be written to a database, converted to pandas DataFrames etc.
This package is available on PyPI and can be installed with pip install factory_boss
.
Mocking a data schema consists of two steps:
- Specify the schema.
- Generate data.
Schemas are specified in yaml, including relationships between entities and mock rules.
See simple_schema.yaml for an example.
from pprint import pprint
import yaml
from factory_boss.generator import Generator
from factory_boss.spec_parser.parser import SpecParser
with open("examples/simple_schema.yaml", "r") as f:
schema = yaml.safe_load(f)
parser = SpecParser()
parsed_spec = parser.parse(schema)
generator = Generator(parsed_spec)
instances = generator.generate()
print("INSTANCES")
print("=========")
pprint(instances)
See factory_boss/scripts/generate.py for the full script.
Many much, above all documentation.
Here are biggest "milestones":
- documentation
- finalize the schema specification
- support dynamic fields (generate fields via a Python function with other fields as input)
In the issues section there are some more tickets.
I'm more than happy to accept help in the form of bug reports, feature requests, or pull requests, even though there is no formal "contribution guideline" yet.
If you want to help just reach out to me :)
This work wouldn't be possible without the amazing faker package.
Factory Boss is also heavily inspired by factory_boy, but has a different focus. While factory_boy excels at generating single objects and test fixtures, Factory Boss aims at faking entire data schemas. In that sense it offers both a subset and a superset of factory_boy's features.
MIT -- see LICENSE
Author: Marius Helf (helfsmarius@gmail.com)