dmlc/ps-lite

throwing an instance of 'dmlc::Error

gccrpm opened this issue · 4 comments

terminate called after throwing an instance of 'dmlc::Error'
what(): [16:41:36] src/van.cc:229: Check failed: obj timeout (5 sec) to wait App 0 customer 0 ready at 0

terminate called after throwing an instance of 'dmlc::Error'
what(): [16:41:36] src/van.cc:229: Check failed: obj timeout (5 sec) to wait App 0 customer 0 ready at 0

Stack trace returned 7 entries:
[bt] (0) ./ps_test() [0x409add]
[bt] (1) ./ps_test() [0x40a3bb]
[bt] (2) ./ps_test() [0x42668f]
[bt] (3) ./ps_test() [0x4291f3]
[bt] (4) /usr/lib/x86_64-linux-gnu/libstdc++.so.6(+0xb8c80) [0x7fedfb86fc80]
[bt] (5) /lib/x86_64-linux-gnu/libpthread.so.0(+0x76ba) [0x7fedfbb406ba]
[bt] (6) /lib/x86_64-linux-gnu/libc.so.6(clone+0x6d) [0x7fedfafd541d]

is timeout too small?

What commit are you using and would you mind sharing the steps to reproduce?

I just download ps-lite and https://github.com/Zminghua/pserver-LR.git , and then follow the step complied . then run the test shell:
~/pserver-LR/test$ ./local.sh 1 1 ./ps_test
启动Worker rank = 0
[11:49:03] ..//ps-lite/include/dmlc/logging.h:277: [11:49:03] src/van.cc:229: Check failed: obj timeout (5 sec) to wait App 0 customer 0 ready at 0

Stack trace returned 7 entries:
[bt] (0) ./ps_test() [0x409add]
[bt] (1) ./ps_test() [0x40a3bb]
[bt] (2) ./ps_test() [0x42668f]
[bt] (3) ./ps_test() [0x4291f3]
[bt] (4) /home/liao/anaconda3/lib/libstdc++.so.6(+0xc8421) [0x7f669b094421]
[bt] (5) /lib/x86_64-linux-gnu/libpthread.so.0(+0x76ba) [0x7f669a6726ba]
[bt] (6) /lib/x86_64-linux-gnu/libc.so.6(clone+0x6d) [0x7f669a09f41d]

terminate called after throwing an instance of 'dmlc::Error'
what(): [11:49:03] src/van.cc:229: Check failed: obj timeout (5 sec) to wait App 0 customer 0 ready at 0

Stack trace returned 7 entries:
[bt] (0) ./ps_test() [0x409add]
[bt] (1) ./ps_test() [0x40a3bb]
[bt] (2) ./ps_test() [0x42668f]
[bt] (3) ./ps_test() [0x4291f3]
[bt] (4) /home/liao/anaconda3/lib/libstdc++.so.6(+0xc8421) [0x7f669b094421]
[bt] (5) /lib/x86_64-linux-gnu/libpthread.so.0(+0x76ba) [0x7f669a6726ba]
[bt] (6) /lib/x86_64-linux-gnu/libc.so.6(clone+0x6d) [0x7f669a09f41d]

I meet the same error, have you solve this problem, if yes I am appreciate for your sharing ~

you should change some code in test.cpp like this:
// start system Start(); StartServer(); // run worker nodes RunWorker();