ERROR:exception: clWaitForEvents
hbontempo-br opened this issue · 3 comments
hbontempo-br commented
I need some help.
I'm getting errors when folding with the GPU.
19:32:53:WU01:FS01:Requesting new work unit for slot 01: READY gpu:0:GP108 [GeForce GT 1030] from 40.114.52.201
19:32:53:WU01:FS01:Connecting to 40.114.52.201:8080
19:32:53:ERROR:WU00:FS00:Exception: Server did not assign work unit
19:33:25:WU01:FS01:Downloading 29.70MiB
19:33:35:WU01:FS01:Download 2.95%
19:33:41:WU01:FS01:Download 22.73%
19:33:47:WU01:FS01:Download 39.14%
19:33:53:WU01:FS01:Download 54.29%
19:33:59:WU01:FS01:Download 82.49%
19:34:03:WU01:FS01:Download complete
19:34:03:WU01:FS01:Received Unit: id:01 state:DOWNLOAD error:NO_ERROR project:11777 run:0 clone:3636 gen:15 core:0x22 unit:0x00000020287234c95e73c43565f3e52a
19:34:03:WU01:FS01:Starting
19:34:03:WU01:FS01:Running FahCore: /usr/bin/FAHCoreWrapper /app/cores/cores.foldingathome.org/v7/lin/64bit/Core_22.fah/FahCore_22 -dir 01 -suffix 01 -version 705 -lifeline 6 -checkpoint 15 -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device 0 -gpu 0
19:34:03:WU01:FS01:Started FahCore on PID 16
19:34:03:WU01:FS01:Core PID:20
19:34:03:WU01:FS01:FahCore 0x22 started
19:34:04:WU01:FS01:0x22:*********************** Log Started 2020-04-04T19:34:03Z ***********************
19:34:04:WU01:FS01:0x22:*************************** Core22 Folding@home Core ***************************
19:34:04:WU01:FS01:0x22: Type: 0x22
19:34:04:WU01:FS01:0x22: Core: Core22
19:34:04:WU01:FS01:0x22: Website: https://foldingathome.org/
19:34:04:WU01:FS01:0x22: Copyright: (c) 2009-2018 foldingathome.org
19:34:04:WU01:FS01:0x22: Author: John Chodera <john.chodera@choderalab.org> and Rafal Wiewiora
19:34:04:WU01:FS01:0x22: <rafal.wiewiora@choderalab.org>
19:34:04:WU01:FS01:0x22: Args: -dir 01 -suffix 01 -version 705 -lifeline 16 -checkpoint 15
19:34:04:WU01:FS01:0x22: -gpu-vendor nvidia -opencl-platform 0 -opencl-device 0 -cuda-device
19:34:04:WU01:FS01:0x22: 0 -gpu 0
19:34:04:WU01:FS01:0x22: Config: <none>
19:34:04:WU01:FS01:0x22:************************************ Build *************************************
19:34:04:WU01:FS01:0x22: Version: 0.0.2
19:34:04:WU01:FS01:0x22: Date: Dec 6 2019
19:34:04:WU01:FS01:0x22: Time: 21:20:17
19:34:04:WU01:FS01:0x22: Repository: Git
19:34:04:WU01:FS01:0x22: Revision: f87d92b58abdf7e6bf2e173cfbc4dc3e837c7042
19:34:04:WU01:FS01:0x22: Branch: core22
19:34:04:WU01:FS01:0x22: Compiler: GNU 4.8.2 20140120 (Red Hat 4.8.2-15)
19:34:04:WU01:FS01:0x22: Options: -std=gnu++98 -O3 -funroll-loops
19:34:04:WU01:FS01:0x22: Platform: linux2 4.9.87-linuxkit-aufs
19:34:04:WU01:FS01:0x22: Bits: 64
19:34:04:WU01:FS01:0x22: Mode: Release
19:34:04:WU01:FS01:0x22:************************************ System ************************************
19:34:04:WU01:FS01:0x22: CPU: Intel(R) Core(TM)2 Quad CPU Q8300 @ 2.50GHz
19:34:04:WU01:FS01:0x22: CPU ID: GenuineIntel Family 6 Model 23 Stepping 10
19:34:04:WU01:FS01:0x22: CPUs: 4
19:34:04:WU01:FS01:0x22: Memory: 7.79GiB
19:34:04:WU01:FS01:0x22:Free Memory: 5.66GiB
19:34:04:WU01:FS01:0x22: Threads: POSIX_THREADS
19:34:04:WU01:FS01:0x22: OS Version: 4.15
19:34:04:WU01:FS01:0x22:Has Battery: false
19:34:04:WU01:FS01:0x22: On Battery: false
19:34:04:WU01:FS01:0x22: UTC Offset: 0
19:34:04:WU01:FS01:0x22: PID: 20
19:34:04:WU01:FS01:0x22: CWD: /app/work
19:34:04:WU01:FS01:0x22: OS: Linux 4.15.0-91-generic x86_64
19:34:04:WU01:FS01:0x22: OS Arch: AMD64
19:34:04:WU01:FS01:0x22:********************************************************************************
19:34:04:WU01:FS01:0x22:Project: 11777 (Run 0, Clone 3636, Gen 15)
19:34:04:WU01:FS01:0x22:Unit: 0x00000020287234c95e73c43565f3e52a
19:34:04:WU01:FS01:0x22:Reading tar file core.xml
19:34:04:WU01:FS01:0x22:Reading tar file integrator.xml
19:34:04:WU01:FS01:0x22:Reading tar file state.xml
19:34:04:WU01:FS01:0x22:Reading tar file system.xml
19:34:04:WU01:FS01:0x22:Digital signatures verified
19:34:04:WU01:FS01:0x22:Folding@home GPU Core22 Folding@home Core
19:34:04:WU01:FS01:0x22:Version 0.0.2
19:34:24:WU01:FS01:0x22:Completed 0 out of 2000000 steps (0%)
19:34:24:WU01:FS01:0x22:Temperature control disabled. Requirements: single Nvidia GPU, tmax must be < 110 and twait >= 900
19:34:26:WU01:FS01:0x22:ERROR:exception: clWaitForEvents
19:34:26:WU01:FS01:0x22:Saving result file ../logfile_01.txt
19:34:26:WU01:FS01:0x22:Saving result file checkpt.crc
19:34:26:WU01:FS01:0x22:Saving result file science.log
19:34:26:WU01:FS01:0x22:Folding@home Core Shutdown: BAD_WORK_UNIT
19:34:57:WARNING:WU01:FS01:FahCore returned: BAD_WORK_UNIT (114 = 0x72)
19:34:57:WU01:FS01:Sending unit results: id:01 state:SEND error:FAULTY project:11777 run:0 clone:3636 gen:15 core:0x22 unit:0x00000020287234c95e73c43565f3e52a
Thank you for your work.
stefancrain commented
I'm glad to lend a hand, but there is a much bigger Folding@home audience on Reddit and the F@H forums.
BAD_WORK_UNIT seems to be linked to driver versions.
Can you post the output of this command?
docker run \
  --rm \
  --gpus all \
  --entrypoint="nvidia-smi" \
  stefancrain/folding-at-home:latest
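If the full `nvidia-smi` table is too noisy, a narrower query may be enough. The sketch below is only a suggestion: the helper names `gpu_driver_info` and `fah_log_problems` are made up for illustration and are not part of the image, and the log path you pass in is an assumption about where your client writes its log. `--query-gpu` and `--format` are standard `nvidia-smi` options.

```shell
# Hypothetical helpers (illustrative names, not shipped with the image).

# Print "driver_version, gpu_name" for each GPU the container can see.
# Arguments after the image name are passed to the nvidia-smi entrypoint.
gpu_driver_info() {
  docker run \
    --rm \
    --gpus all \
    --entrypoint="nvidia-smi" \
    stefancrain/folding-at-home:latest \
    --query-gpu=driver_version,name --format=csv,noheader
}

# Print only the error, warning, and core shutdown lines from a client log.
# $1 is the path to the log file (assumed; adjust to your setup).
fah_log_problems() {
  grep -E 'ERROR|WARNING|Core Shutdown' "$1"
}
```

Running `gpu_driver_info` and then `fah_log_problems` on your log file should give a short summary (driver version plus the failing lines) that is easier to paste into a comment than the whole log.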
hbontempo-br commented
You were right, it was a driver issue. Thank you for your help.
PS: sorry about the late response.
stefancrain commented
no problem @hbontempo-br - happy folding!