stefancrain/folding-at-home

ERROR:exception: clWaitForEvents

hbontempo-br opened this issue · 3 comments

I need some help.
I'm getting errors when folding with the GPU.

19:32:53:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP108 [GeForce GT 1030] from 40.114.52.201

19:32:53:WU01:FS01:Connecting to 40.114.52.201:8080

19:32:53:ERROR:WU00:FS00:Exception: Server did not assign work unit

19:33:25:WU01:FS01:Downloading 29.70MiB

19:33:35:WU01:FS01:Download 2.95%

19:33:41:WU01:FS01:Download 22.73%

19:33:47:WU01:FS01:Download 39.14%

19:33:53:WU01:FS01:Download 54.29%

19:33:59:WU01:FS01:Download 82.49%

19:34:03:WU01:FS01:Download complete

19:34:03:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:11777 run:0 clone:3636 gen:15 core:0x22 unit:0x00000020287234c95e73c43565f3e52a

19:34:03:WU01:FS01:Starting

19:34:03:WU01:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /app/cores/cores.foldingathome.org/v7/lin/64bit/Core_22.fah/FahCore_22 -dir 01 -suffix 01 -version 705 -lifeline 6 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0

19:34:03:WU01:FS01:Started FahCore on PID 16

19:34:03:WU01:FS01:Core PID:20

19:34:03:WU01:FS01:FahCore 0x22 started

19:34:04:WU01:FS01:0x22:*********************** Log Started 2020-04-04T19:34:03Z ***********************

19:34:04:WU01:FS01:0x22:*************************** Core22 Folding@home Core ***************************

19:34:04:WU01:FS01:0x22:       Type: 0x22

19:34:04:WU01:FS01:0x22:       Core: Core22

19:34:04:WU01:FS01:0x22:    Website: https://foldingathome.org/

19:34:04:WU01:FS01:0x22:  Copyright: (c) 2009-2018 foldingathome.org

19:34:04:WU01:FS01:0x22:     Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora

19:34:04:WU01:FS01:0x22:             <rafal.wiewiora@choderalab.org>

19:34:04:WU01:FS01:0x22:       Args: -dir 01 -suffix 01 -version 705 -lifeline 16 -checkpoint 15

19:34:04:WU01:FS01:0x22:             -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device

19:34:04:WU01:FS01:0x22:             0 -gpu 0

19:34:04:WU01:FS01:0x22:     Config: <none>

19:34:04:WU01:FS01:0x22:************************************ Build *************************************

19:34:04:WU01:FS01:0x22:    Version: 0.0.2

19:34:04:WU01:FS01:0x22:       Date: Dec 6 2019

19:34:04:WU01:FS01:0x22:       Time: 21:20:17

19:34:04:WU01:FS01:0x22: Repository: Git

19:34:04:WU01:FS01:0x22:   Revision: f87d92b58abdf7e6bf2e173cfbc4dc3e837c7042

19:34:04:WU01:FS01:0x22:     Branch: core22

19:34:04:WU01:FS01:0x22:   Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)

19:34:04:WU01:FS01:0x22:    Options: -std=gnu++98 -O3 -funroll-loops

19:34:04:WU01:FS01:0x22:   Platform: linux2 4.9.87-linuxkit-aufs

19:34:04:WU01:FS01:0x22:       Bits: 64

19:34:04:WU01:FS01:0x22:       Mode: Release

19:34:04:WU01:FS01:0x22:************************************ System ************************************

19:34:04:WU01:FS01:0x22:        CPU: Intel(R) Core(TM)2 Quad CPU Q8300 @ 2.50GHz

19:34:04:WU01:FS01:0x22:     CPU ID: GenuineIntel Family 6 Model 23 Stepping 10

19:34:04:WU01:FS01:0x22:       CPUs: 4

19:34:04:WU01:FS01:0x22:     Memory: 7.79GiB

19:34:04:WU01:FS01:0x22:Free Memory: 5.66GiB

19:34:04:WU01:FS01:0x22:    Threads: POSIX_THREADS

19:34:04:WU01:FS01:0x22: OS Version: 4.15

19:34:04:WU01:FS01:0x22:Has Battery: false

19:34:04:WU01:FS01:0x22: On Battery: false

19:34:04:WU01:FS01:0x22: UTC Offset: 0

19:34:04:WU01:FS01:0x22:        PID: 20

19:34:04:WU01:FS01:0x22:        CWD: /app/work

19:34:04:WU01:FS01:0x22:         OS: Linux 4.15.0-91-generic x86_64

19:34:04:WU01:FS01:0x22:    OS Arch: AMD64

19:34:04:WU01:FS01:0x22:********************************************************************************

19:34:04:WU01:FS01:0x22:Project: 11777 (Run 0, Clone 3636, Gen 15)

19:34:04:WU01:FS01:0x22:Unit: 0x00000020287234c95e73c43565f3e52a

19:34:04:WU01:FS01:0x22:Reading tar file core.xml

19:34:04:WU01:FS01:0x22:Reading tar file integrator.xml

19:34:04:WU01:FS01:0x22:Reading tar file state.xml

19:34:04:WU01:FS01:0x22:Reading tar file system.xml

19:34:04:WU01:FS01:0x22:Digital signatures verified

19:34:04:WU01:FS01:0x22:Folding@home GPU Core22 Folding@home Core

19:34:04:WU01:FS01:0x22:Version 0.0.2

19:34:24:WU01:FS01:0x22:Completed 0 out of 2000000 steps (0%)

19:34:24:WU01:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900

19:34:26:WU01:FS01:0x22:ERROR:exception: clWaitForEvents

19:34:26:WU01:FS01:0x22:Saving result file ../logfile_01.txt

19:34:26:WU01:FS01:0x22:Saving result file checkpt.crc

19:34:26:WU01:FS01:0x22:Saving result file science.log

19:34:26:WU01:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT

19:34:57:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)

19:34:57:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:11777 run:0 clone:3636 gen:15 core:0x22 unit:0x00000020287234c95e73c43565f3e52a

Thank you for your work.

I'm glad to lend a hand, but there is a much bigger Folding@home audience out there on Reddit and the F@H forums.

BAD_WORK_UNIT seems to be linked to driver versions. 1 2 3

Can you post the output of this command?

docker run \
 --rm \
 --gpus all \
 --entrypoint="nvidia-smi" \
 stefancrain/folding-at-home:latest
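
The log above is long, so it may help to pull out just the failing lines. This is only a rough sketch: `fah_errors` is a made-up helper name, and it assumes the log keeps the colon-separated format shown above, where the severity appears as an `:ERROR:` or `:WARNING:` field.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of FAHClient or this image):
# reads a FAHClient log on stdin and prints only the lines that carry
# an :ERROR: or :WARNING: field. The match is case-sensitive, so
# status fields such as "error:NO_ERROR" are not picked up.
fah_errors() {
  grep -E ':(ERROR|WARNING):'
}
```

For example, `docker logs my-fah-container 2>&1 | fah_errors`, where `my-fah-container` is a placeholder for whatever your container is called. On the log in this issue that should leave the `clWaitForEvents` exception and the `BAD_WORK_UNIT` warning, plus the earlier "Server did not assign work unit" error.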

You were right, it was a driver issue. Thank you for your help.

P.S. Sorry about the late response.

no problem @hbontempo-br - happy folding!