Data loss: not an sstable (bad magic number)
wpq3142 opened this issue · 14 comments
tensorflow.python.framework.errors_impl.DataLossError: Unable to open table file /home/wpq/data/model.ckpt.data-00000-of-00001:
Data loss: not an sstable (bad magic number): perhaps your file is in a different file format and you
need to use a different restore operator?
I got the same error when running the object detection API.
It seems that the downloaded pre-trained checkpoint does not match the model, so the file format is inconsistent. See this post:
http://votec.top/2016/12/24/tensorflow-r12-tf-train-Saver/
Change slim.get_or_create_global_step() to tf.train.get_or_create_global_step().
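For reference, a minimal sketch of that swap in a TF 1.x training script (the optimizer line is only a hypothetical illustration of where the step gets used):

import tensorflow as tf
# import tensorflow.contrib.slim as slim

# Old contrib-slim helper:
# global_step = slim.get_or_create_global_step()

# Core TF 1.x replacement:
global_step = tf.train.get_or_create_global_step()

# The step is then passed to the optimizer as usual, e.g.:
# train_op = optimizer.minimize(loss, global_step=global_step)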
I apologize but I am having a hard time understanding what the problem is, where the problem is, and what version it affects. Please resubmit and pay attention to the issue template (https://github.com/tensorflow/tensorflow/issues/new) . Please provide all the information it asks. Thank you.
Particularly, telling us exactly what you did will help. There are many models in this repository, so I'm not quite sure which one you're talking about. Though from @scotthuang1989 it appears it's the object detection API?
Could you elaborate on the sequence of steps that reproduces the problem?
Exporting a trained model for inference
After your model has been trained, you should export it to a TensorFlow graph proto. A checkpoint will typically consist of three files:
model.ckpt-${CHECKPOINT_NUMBER}.data-00000-of-00001
model.ckpt-${CHECKPOINT_NUMBER}.index
model.ckpt-${CHECKPOINT_NUMBER}.meta
python object_detection/export_inference_graph.py \
--input_type image_tensor \
--pipeline_config_path /app/tf_object_detection_api/config/faster_rcnn_inception_v2_pets.config \
--trained_checkpoint_prefix /app/tf_object_detection_api/models/model.ckpt-306 \
--output_directory /app/tf_object_detection_api/models/faster_rcnn_inception_v2_pets
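After the export finishes, the output_directory should contain a frozen_inference_graph.pb. A minimal TF 1.x sketch of loading that frozen graph for inference (the tensor names are the ones the object detection exporter typically produces; the path follows the command above):

import tensorflow as tf

frozen_graph_path = "/app/tf_object_detection_api/models/faster_rcnn_inception_v2_pets/frozen_inference_graph.pb"

# Parse the serialized GraphDef and import it into a fresh graph.
detection_graph = tf.Graph()
with detection_graph.as_default():
    graph_def = tf.GraphDef()
    with tf.gfile.GFile(frozen_graph_path, "rb") as f:
        graph_def.ParseFromString(f.read())
    tf.import_graph_def(graph_def, name="")

with tf.Session(graph=detection_graph) as sess:
    image_tensor = detection_graph.get_tensor_by_name("image_tensor:0")
    boxes = detection_graph.get_tensor_by_name("detection_boxes:0")
    # boxes_out = sess.run(boxes, feed_dict={image_tensor: batch_of_images})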
@Aleafboat, thanks for your advice, it worked for me.
My original command was:
python export_inference_graph.py --input_type image_tensor --pipeline_config_path ./dataset_tools/voc2007/models/ssd_mobilenet_v1/ssd_mobilenet_v1_coco.config --trained_checkpoint_prefix ./dataset_tools/voc2007/output/model.ckpt-5060* --output_directory ./dataset_tools/voc2007/pb_model/
I just removed the '*' at the end of 5060.
I fixed the issue by replacing model.ckpt with model.ckpt-200000, where 200000 is your checkpoint number.
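Put differently, Saver.restore() wants the full checkpoint prefix, not the bare model.ckpt. A small sketch, assuming a TF 1.x graph has already been built and the checkpoint lives in a directory like the one from the original error:

import tensorflow as tf

checkpoint_dir = "/home/wpq/data"  # directory holding model.ckpt-200000.* and the 'checkpoint' file

# Build your model graph first, then create the Saver.
saver = tf.train.Saver()
with tf.Session() as sess:
    # latest_checkpoint() reads the 'checkpoint' file and returns the full prefix,
    # e.g. /home/wpq/data/model.ckpt-200000, so the bare model.ckpt is never passed.
    ckpt_prefix = tf.train.latest_checkpoint(checkpoint_dir)
    saver.restore(sess, ckpt_prefix)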
Solved in #7696
Hi (Apr/2019), I hit the same error, but the cause was an aborted training process. I re-trained the model and then tried again with:
$ cd bert-master/bert_output
$ python ./run_classifier.py --task_name=cola --do_train=true --do_eval=true --data_dir=./data --vocab_file=$BERT_BASE_DIR/vocab.txt --bert_config_file=$BERT_BASE_DIR/bert_config.json --init_checkpoint=$BERT_BASE_DIR/bert_model.ckpt --max_seq_length=128 --train_batch_size=32 --learning_rate=2e-5 --num_train_epochs=1.0 --output_dir=./bert_output/ --do_lower_case=False
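If you suspect the checkpoint itself was truncated by the aborted run, a quick sanity check (a sketch, assuming TF 1.x; the prefix below is a placeholder for your actual checkpoint) is to list its variables before restoring:

import tensorflow as tf

ckpt_prefix = "./bert_output/model.ckpt-0"  # hypothetical prefix; point it at your checkpoint

try:
    # list_variables() reads the index/data files; a truncated or corrupted
    # checkpoint typically fails here with the same DataLossError.
    for name, shape in tf.train.list_variables(ckpt_prefix):
        print(name, shape)
except tf.errors.DataLossError as e:
    print("Checkpoint is unreadable; re-train or re-download it:", e)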
Hello all, just follow the video below and export your own model within 10 seconds.
Got the same error while importing the model. Fixed it by providing the right path.
V2 Saver from TF generates 3 files:
model.ckpt-${CHECKPOINT_NUMBER}.data-00000-of-00001
model.ckpt-${CHECKPOINT_NUMBER}.index
model.ckpt-${CHECKPOINT_NUMBER}.meta
When restoring, give the path up to and including CHECKPOINT_NUMBER, like below:
saver.restore(sess, "model.ckpt-${CHECKPOINT_NUMBER}")
Hope that makes sense.
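And if you also need the graph itself, a sketch of a full TF 1.x restore from the .meta file (the prefix is a placeholder):

import tensorflow as tf

ckpt_prefix = "model.ckpt-200000"  # prefix only, no .data-00000-of-00001 / .index / .meta suffix

with tf.Session() as sess:
    # import_meta_graph() rebuilds the graph from the .meta file and returns a Saver.
    saver = tf.train.import_meta_graph(ckpt_prefix + ".meta")
    # restore() takes the prefix and locates the .index and .data shards itself.
    saver.restore(sess, ckpt_prefix)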
I was able to resolve this issue by saving a .h5 file directly.
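If you are on Keras, that route looks roughly like this (a sketch; the tiny model is only a placeholder for whatever you actually trained):

import tensorflow as tf

model = tf.keras.Sequential([tf.keras.layers.Dense(10, input_shape=(4,))])

# Saving to HDF5 writes one self-contained file instead of the
# .data/.index/.meta checkpoint trio, which avoids the bad-prefix problem entirely.
model.save("model.h5")

restored = tf.keras.models.load_model("model.h5")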
If you are trying to do this in 2024 and it doesn't work, try this:
root-path-to-model-checkpoint-storage-here/ckpt-n
Put the checkpoint number you want instead of n; works for me!
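A minimal TF 2.x sketch of that (the directory and checkpoint number are placeholders, and the model object stands in for whatever you trained):

import tensorflow as tf

model = tf.keras.Sequential([tf.keras.layers.Dense(10)])  # placeholder for your real model

ckpt = tf.train.Checkpoint(model=model)
# Pass the prefix "ckpt-<number>" with no file extension; TF locates the
# .index and .data-00000-of-00001 shards from it.
status = ckpt.restore("root-path-to-model-checkpoint-storage-here/ckpt-3")
status.expect_partial()  # silence warnings about objects (e.g. optimizer slots) not used here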