Bellspringsteen/OurCamera

"raise ValueError('First step cannot be zero.')" in the training stage

Closed this issue · 3 comments

yh570 commented

In the training stage, running the command:
python models-master/research/object_detection/train.py --logtostderr
--pipeline_config_path=./data/faster_rcnn_resnet101_cars.config
--train_dir=./data/models/model/train
fails with the error message: "raise ValueError('First step cannot be zero.')".

eval.py and tensorboard --logdir=./data/models/model/ both work well.

I confirmed that all the paths in generate_tfrecord.py and faster_rcnn_resnet101_cars.config are correct, and test.record and train.record were generated successfully (I don't know whether their contents are correct; if there is a way to check them, please let me know). Could you please tell me what might cause this problem? Thank you.

danp commented

It looks like object detection was recently changed to add this error in tensorflow/models#3656. Maybe try using the commit before that was merged, such as with:

git clone https://github.com/tensorflow/models
cd models
git checkout -b before-object-detection-changes 2913cb24ecb16ba955006b41072adce45c5a0f62
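
Besides pinning an older commit, it may be worth checking whether the pipeline config contains a learning-rate schedule entry at step 0. This is an assumption based on the wording of the error and is not confirmed in this thread. The sketch below only scans the config text for `step: 0` entries; the helper name `find_zero_steps` is hypothetical and the script does not use the object detection API at all.

```python
import re

# Matches a `step: 0` entry, on its own line or inline in a schedule block.
# Assumption: such an entry is what triggers "First step cannot be zero."
STEP_ZERO = re.compile(r"\bstep\s*:\s*0(?![\d.])")


def find_zero_steps(config_text):
    """Return the 1-based line numbers that contain a `step: 0` entry."""
    return [
        number
        for number, line in enumerate(config_text.splitlines(), start=1)
        if STEP_ZERO.search(line)
    ]


# Example use (path taken from the command in the original report):
#   with open("./data/faster_rcnn_resnet101_cars.config") as handle:
#       print(find_zero_steps(handle.read()))
```

If this reports any lines, removing that schedule entry (and keeping its rate as the initial learning rate) might be enough to get past the check, but treat that as something to try rather than a confirmed fix.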

Hello,

Great that you created the .records. You create the empty train and eval folders yourself. Then, when you call train.py, it will fill those folders with checkpoints and frozen points for models that you can use later.
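
On the question of how to check the .record files: here is a minimal sketch that counts the records in a TFRecord file using only the Python standard library. It relies on the TFRecord framing (an 8-byte little-endian length, a 4-byte CRC of the length, the payload, and a 4-byte CRC of the payload). It does not verify the CRCs or decode the examples, so it only tells you how many records there are and whether the file is truncated. The function name `count_tfrecords` is hypothetical.

```python
import struct


def count_tfrecords(path):
    """Count records in a TFRecord file by walking its length-prefixed framing.

    CRC fields are skipped, not verified; only truncation is detected.
    """
    count = 0
    with open(path, "rb") as handle:
        while True:
            header = handle.read(12)  # length (8 bytes) + length CRC (4 bytes)
            if not header:
                return count  # clean end of file
            if len(header) < 12:
                raise ValueError("truncated header after %d records" % count)
            (length,) = struct.unpack("<Q", header[:8])
            body = handle.read(length + 4)  # payload + payload CRC (4 bytes)
            if len(body) < length + 4:
                raise ValueError("truncated record after %d records" % count)
            count += 1


# Example use:
#   print(count_tfrecords("train.record"))
#   print(count_tfrecords("test.record"))
```

If the counts match the number of annotated images you expect in each split, the files are at least structurally sound. With TensorFlow installed, each payload can additionally be decoded with `tf.train.Example.FromString` to inspect the stored features.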

danp commented

Looking at tensorflow/models#3794, maybe this is no longer valid?