ZFTurbo/Keras-RetinaNet-for-Open-Images-Challenge-2018

My predict method is taking too much time even though I am using a GPU with 8 GB RAM. Why is it so slow?

DeveloperRachit opened this issue · 27 comments


I am using the pretrained model retinanet_resnet152_500_classes_0.4991.h5.

Can you post the exact time?

Yes, it takes 120 seconds to predict the objects in one image.

  1. You need to ensure the GPU is actually being used.
  2. A long time is expected for the first recognition because of model initialization. Try recognizing several images and check how much time each one requires.
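To separate that one-off initialization cost from steady-state inference, a small timing loop like this can help (a sketch with a hypothetical stand-in model that has a slow first call; swap in the real RetinaNet model to measure actual times):

```python
import time
import numpy as np

def time_predictions(model, images):
    """Time each predict call separately: the first call typically
    includes one-off initialization and is slower than the rest."""
    timings = []
    for img in images:
        t0 = time.time()
        model.predict(np.expand_dims(img, axis=0))
        timings.append(time.time() - t0)
    return timings

class _DummyModel:
    """Hypothetical stand-in with a one-off warm-up cost, mimicking
    graph/CUDA setup on the first inference."""
    def __init__(self):
        self._warm = False

    def predict(self, batch):
        if not self._warm:
            time.sleep(0.05)  # simulate one-time initialization
            self._warm = True
        return batch

timings = time_predictions(_DummyModel(), [np.zeros((8, 8, 3))] * 3)
```

If only `timings[0]` is large, the cost is initialization, not per-image inference.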

I am sure it's using the GPU; the GPU memory is full while it runs. Why is it still taking so much time?

It takes 120 seconds for every image; I tried many images.

Did you load the model before each image, or reuse the same one?

If you want, I can show you my code as well.

This is how I load the model:

model_path = "/data/sample-apps/deep_dive_demos/open_images_detection/preprocessing/retinanet_resnet152_lt.h5"
model = models.load_model(model_path, backbone_name='resnet152')

Loading takes too much time. Then I call

boxes, scores, labels = model.predict(np.expand_dims(image, axis=0))

which takes 60 seconds, so the whole process takes 120 seconds.

Yes, I load the model before each image. I am processing single images, not batches.

I made a simple user interface where the user uploads an image and clicks Detect. Each click first loads the model and then runs prediction; every uploaded image goes through the same process.

So after each upload, my API loads the model and then predicts the objects.

That's strange. It shouldn't take more than a second. Which TensorFlow and Keras versions do you use?

Did you try resnet101 and resnet50?

No, I only tried resnet152.

I am using these versions:
tensorflow-gpu==1.14.0
Keras==2.3.1

Looks fine. Do you use the model for inference?

Try the model based on resnet50 and check the timing.

I tried resnet50, but it also takes 120 seconds.

I am using the model for inference.

Yes, I tried.

During these 60 seconds of inference, can you check the GPU usage?

Thu Apr 23 15:11:45 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 410.93       Driver Version: 410.93       CUDA Version: 10.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Quadro P4000        Off  | 00000000:81:00.0 Off |                  N/A |
| 53%   55C    P0    35W / 105W |   7901MiB /  8119MiB |     11%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|    0      7759      C   /usr/bin/python3                            5779MiB |
|    0      9620      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10424      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10428      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10430      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10436      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10438      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10440      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10441      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10442      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10444      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     10447      C   /usr/local/AV/bin/ffmpeg                     251MiB |
|    0     11329      C   /usr/local/AV/bin/ffmpeg                     137MiB |
|    0     14191      C   /usr/local/AV/bin/ffmpeg                     204MiB |
|    0     25039      C   /usr/local/AV/bin/ffmpeg                     139MiB |
+-----------------------------------------------------------------------------+

It takes 40 seconds to load the model and the remaining 25 seconds to predict.

Is there any way to reduce the model loading time, or to load the model only once?

I don't think it's possible to reduce the model load time, but you can load the model once and keep it in memory while processing images.
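A minimal load-once sketch of that suggestion, where the hypothetical `slow_load` stands in for `models.load_model(model_path, backbone_name='resnet152')`: the expensive load is paid only on the first request, and later requests reuse the cached object.

```python
import time

_cache = {}

def slow_load():
    """Stand-in for the real model load (reported at ~40 s in this thread)."""
    time.sleep(0.1)
    return {"name": "retinanet"}

def get_model():
    """Return the cached model, loading it only on the first call."""
    if "model" not in _cache:
        _cache["model"] = slow_load()  # paid once, at the first request
    return _cache["model"]

t0 = time.time(); get_model(); first_call = time.time() - t0
t0 = time.time(); get_model(); second_call = time.time() - t0
```

In a web API, the same effect is achieved by loading the model at module import or server startup rather than inside the request handler.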

When I load the model once in my API, I get an error when I pass an image to the predict method:
ValueError: Tensor Tensor("filtered_detections/map/TensorArrayStack/TensorArrayGatherV3:0", shape=(?, 500, 4), dtype=float32) is not an element of this graph.

Could you help me load the model once into memory with a TF session?
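For reference, the usual fix for that ValueError under TensorFlow 1.x is to capture the default graph right after loading the model and re-enter it around every predict call. This is a sketch assuming tensorflow-gpu 1.14, Keras 2.3.1, and keras-retinanet as reported in this thread; the model path is hypothetical and it is not runnable without that GPU stack.

```python
import numpy as np
import tensorflow as tf
from keras_retinanet import models

# --- done ONCE at application startup --------------------------------
model_path = "retinanet_resnet152_lt.h5"  # hypothetical path
model = models.load_model(model_path, backbone_name='resnet152')
model._make_predict_function()  # build the predict function eagerly
graph = tf.get_default_graph()  # remember the graph the model lives in

# --- called from every API request handler ---------------------------
def detect(image):
    # Re-enter the model's graph; without this, a predict() call from a
    # different thread (e.g. a web-server worker) raises the
    # "... is not an element of this graph" ValueError.
    with graph.as_default():
        boxes, scores, labels = model.predict(np.expand_dims(image, axis=0))
    return boxes, scores, labels
```

The error occurs because the web framework serves each request in its own thread, where TensorFlow 1.x sees a fresh default graph instead of the one the model was loaded into.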