CVE-2022-49781 Information
Description
In the Linux kernel the following vulnerability has been resolved:
perf/x86/amd: Fix crash due to race between amd_pmu_enable_all perf NMI and throttling
amd_pmu_enable_all() does:
if (!test_bit(idx cpuc->active_mask))
continue;
amd_pmu_enable_event(cpuc->events[idx]);
A perf NMI of another event can come between these two steps. Perf NMI handler internally disables and enables all events including the one which nmi-intercepted amd_pmu_enable_all() was in process of enabling. If that unintentionally enabled event has very low sampling period and causes immediate successive NMI causing the event to be throttled cpuc->events[idx] and cpuc->active_mask gets cleared by x86_pmu_stop(). This will result in amd_pmu_enable_event() getting called with event=NULL when amd_pmu_enable_all() resumes after handling the NMIs. This causes a kernel crash:
BUG: kernel NULL pointer dereference address: 0000000000000198
PF: supervisor read access in kernel mode
PF: error_code(0x0000) - not-present page
[…]
Call Trace:
amd_pmu_disable_all()/amd_pmu_enable_all() calls inside perf NMI handler were recently added as part of BRS enablement but I’m not sure whether we really need them. We can just disable BRS in the beginning and enable it back while returning from NMI. This will solve the issue by not enabling those events whose active_masks are set but are not yet enabled in hw pmu.
Reference
https://git.kernel.org/stable/c/baa014b9543c8e5e94f5d15b66abfe60750b8284 https://git.kernel.org/stable/c/fd5e454b856ed86b090336e269695d9908609b71
Share on: