geohot/7900xtx

[PATCH] drm/amdgpu : Add mes_log_enable to control mes log feature

Opened this issue · 1 comments

https://lore.kernel.org/all/9871b5a1-8838-42a0-99c5-18bbb7f90c86@amd.com/T
"Due to a CP interrupt bug, bad packet garbage exception codes are raised.
Do a range check so that the debugger and runtime do not receive garbage
codes.
Update the user api to guard exception code type checking as well."

https://lore.kernel.org/all/e23a0e94-f31e-4711-ad3b-93caca180286@amd.com/T/
"Here I think we should cancel work items that are still in the work
queue, not flush workqueue node->ih_wq. In this case the kfd functions
have been terminated, there is no way to handle the left work items.
That would make work queue flush never finish. I think it is the reason
there are orphan kernel tasks.

After cancel left work items we can call destroy_workqueue."