luissen/ESRT

DataLoader worker (pid(s) 9504, 11496, 19564, 12924, 14856, 1256, 20244, 7752) exited unexpectedly

Opened this issue · 8 comments

RuntimeError Traceback (most recent call last)
e:\esrt\train.py in <module>
244 epoch_start = datetime.datetime.now()
245 valid(args.scale)
--> 246 train(epoch)
247 if epoch%10==0:
248 save_checkpoint(epoch)

e:\esrt\train.py in train(epoch)
131 utils.adjust_learning_rate(optimizer, epoch, args.step_size, args.lr, args.gamma)
132 print('epoch =', epoch, 'lr = ', optimizer.param_groups[0]['lr'])
--> 133 for iteration, (lr_tensor, hr_tensor) in enumerate(training_data_loader, 1):
134
135 if args.cuda:

C:\ProgramData\Anaconda3\envs\ESRT\lib\site-packages\torch\utils\data\dataloader.py in __next__(self)
519 if self._sampler_iter is None:
520 self._reset()
--> 521 data = self._next_data()
522 self._num_yielded += 1
523 if self._dataset_kind == _DatasetKind.Iterable and \

C:\ProgramData\Anaconda3\envs\ESRT\lib\site-packages\torch\utils\data\dataloader.py in _next_data(self)
1184
1185 assert not self._shutdown and self._tasks_outstanding > 0
-> 1186 idx, data = self._get_data()
1187 self._tasks_outstanding -= 1
1188 if self._dataset_kind == _DatasetKind.Iterable:

C:\ProgramData\Anaconda3\envs\ESRT\lib\site-packages\torch\utils\data\dataloader.py in _get_data(self)
1150 else:
1151 while True:
-> 1152 success, data = self._try_get_data()
1153 if success:
1154 return data

C:\ProgramData\Anaconda3\envs\ESRT\lib\site-packages\torch\utils\data\dataloader.py in _try_get_data(self, timeout)
1001 if len(failed_workers) > 0:
1002 pids_str = ', '.join(str(w.pid) for w in failed_workers)
-> 1003 raise RuntimeError('DataLoader worker (pid(s) {}) exited unexpectedly'.format(pids_str)) from e
1004 if isinstance(e, queue.Empty):
1005 return (False, None)

RuntimeError: DataLoader worker (pid(s) 9504, 11496, 19564, 12924, 14856, 1256, 20244, 7752) exited unexpectedly

Kindly help me resolve this error. I am also not able to use the GPU with this code.

Did you solve this problem? I'm also having this issue.

You can try setting num_workers to zero; that may help.

I set num_workers to zero but I am still facing the same issue.

parser.add_argument("--threads", type=int, default=0,  # was 8
                    help="number of threads for data loading")
Did you set this?
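For anyone landing here later: with num_workers=0 the DataLoader loads batches in the main process and starts no worker processes, so the "worker exited unexpectedly" error should not be able to appear. If it still does, the value is probably not reaching the loader. Below is a minimal sketch of how the --threads option quoted above could be wired through. The helper names build_parser and loader_kwargs are hypothetical, and it is an assumption that train.py forwards args.threads to DataLoader as num_workers.

```python
import argparse


def build_parser():
    """Parser with only the option discussed in this thread."""
    parser = argparse.ArgumentParser(description="data loading options (sketch)")
    parser.add_argument("--threads", type=int, default=0,
                        help="number of threads for data loading")
    return parser


def loader_kwargs(args):
    # Assumption: train.py passes --threads on as DataLoader(num_workers=...).
    # 0 means "load in the main process, start no workers".
    return {"num_workers": args.threads}


# Explicit argument list so the sketch does not depend on the real command line.
args = build_parser().parse_args(["--threads", "0"])
print(loader_kwargs(args))
```

Printing the value right before the DataLoader is created is a quick way to confirm that the default or the command-line flag is the one actually in effect.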

Put all of the code under the main guard.
You know, if __name__ == '__main__':
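To spell out the guard for later readers: on Windows, child processes are started by spawning a fresh interpreter that re-imports the main script, so any top-level code that builds and iterates over a multi-worker DataLoader runs again inside each worker unless it sits under `if __name__ == "__main__":`. The following is a minimal sketch of that layout, not the repository's actual train.py. The RandomPairs dataset, the tensor shapes, and the train signature are hypothetical stand-ins.

```python
import torch
from torch.utils.data import DataLoader, Dataset


class RandomPairs(Dataset):
    """Hypothetical stand-in for the repository's training set."""

    def __init__(self, length=8):
        self.length = length

    def __len__(self):
        return self.length

    def __getitem__(self, index):
        # Fake low-resolution / high-resolution image pair.
        lr_tensor = torch.zeros(3, 16, 16)
        hr_tensor = torch.zeros(3, 32, 32)
        return lr_tensor, hr_tensor


def train(num_workers=0):
    """Iterate once over the loader and return the number of batches seen."""
    loader = DataLoader(RandomPairs(), batch_size=4, num_workers=num_workers)
    batches = 0
    for iteration, (lr_tensor, hr_tensor) in enumerate(loader, 1):
        batches = iteration
    return batches


if __name__ == "__main__":
    # Only the process that was launched as the script reaches this line;
    # spawned workers re-import the module but skip the guarded block.
    print("batches:", train(num_workers=2))
```

Definitions (the dataset class, the train function) stay at module level so that workers can import them; only the code that starts the run goes under the guard. When training from a Jupyter notebook on Windows, keeping num_workers at 0 is the simpler option.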