]> Gentwo Git Trees - linux/.git/commit
drm/amdgpu: Report individual reset error
authorLijo Lazar <lijo.lazar@amd.com>
Mon, 6 Oct 2025 05:09:03 +0000 (10:39 +0530)
committerAlex Deucher <alexander.deucher@amd.com>
Tue, 7 Oct 2025 18:09:19 +0000 (14:09 -0400)
commit2e97663760e5fb7ee14f399c68e57b894f01e505
treef01046bcb0d239e8e434c1d3b1fb43491175475e
parenta107aeb6a2150dd552673caefc771e2222d584de
drm/amdgpu: Report individual reset error

If reinitialization of one of the GPUs fails after reset, it logs
failure on all subsequent GPUs eventhough they have resumed
successfully.

A sample log where only device at 0000:95:00.0 had a failure -

amdgpu 0000:15:00.0: amdgpu: GPU reset(19) succeeded!
amdgpu 0000:65:00.0: amdgpu: GPU reset(19) succeeded!
amdgpu 0000:75:00.0: amdgpu: GPU reset(19) succeeded!
amdgpu 0000:85:00.0: amdgpu: GPU reset(19) succeeded!
amdgpu 0000:95:00.0: amdgpu: GPU reset(19) failed
amdgpu 0000:e5:00.0: amdgpu: GPU reset(19) failed
amdgpu 0000:f5:00.0: amdgpu: GPU reset(19) failed
amdgpu 0000:05:00.0: amdgpu: GPU reset(19) failed
amdgpu 0000:15:00.0: amdgpu: GPU reset end with ret = -5

To avoid confusion, report the error for each device
separately and return the first error as the overall result.

Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
Reviewed-by: Asad Kamal <asad.kamal@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
drivers/gpu/drm/amd/amdgpu/amdgpu_device.c