The objective of this app is to
- Show how to load and use FeatureStore via the Vertex SDK
- Provide a performance test with the
run_experiment.py
application
This notebook downloads the movielens and imbdb datasets then creates a simple testing dataset of over 150k movies.
This notebook creates a movies featurestore and loads movies and user review entities into a featurestore. The notebook ends with making streaming calls to the featurestore.
This (along with Factorial-Design.ipynb
) tests featurestore's load and request latency. doepy
is used to construct the tests. Recommended to load via terminal background process or nohup
call.
Usage: for long running do a background process to run:
nohup python run_experiment.py --N_ITERATIONS <> --N_WORKERS <>
--N_MEASURES <> --N_REPEAT_MEASURES --N_PREDICTIONS <>
--PROJECT_ID <> --BUCKET <>
--BQ_DATASET movielens --REGION <> --FEATURESTORE_ID <>
Data will be generated for each run in data/
and will accumulate measurments
Parameters:
N_FEATURES
: number of features, incremented in chunks of 7 varaiblesN_PREDICTIONS
: Number of rows to load and requestN_WORKERS
: Number of workers configured on the featurestoreN_REPEAT_MEASURES
: Number of times to execute latency measurments per configFEATURESTORE_ID
: The name of the featurestore that will be recreated for the test
Outputs:
FS Load From BQ
: Number of seconds it takes to load the FS with the specified features and rowstotal_seconds
: Response in seconds for specified request
Review the analysis tab for: