ziya_llama hangs after the model is split for TP (tensor parallelism). Does anyone have a solution?
buringslyar opened this issue · 0 comments
buringslyar commented
/usr/local/lib/python3.9/site-packages/pytorch_lightning/trainer/connectors/data_connector.py:240: PossibleUserWarning: The dataloader, val_dataloader 0, does not have many workers which may be a bottleneck. Consider increasing the value of the `num_workers` argument (try 128 which is the number of cpus on this machine) in the `DataLoader` init to improve performance.
rank_zero_warn(
Sanity Checking DataLoader 0: 0%| | 0/2 [00:00<?, ?it/s]