关于WHEA_UNCORRECTABLE_ERROR(124)导致随机蓝屏/黑屏的硬件故障排查请求
大家好,我的PC会随机出现蓝屏或黑屏,尤其是玩游戏的时候——有些游戏会更快触发BSOD。
最近我升级了CPU、GPU和内存,目前已经做了这些排查:
- 跑完了内存测试,没发现任何问题
- GPU和其他所有系统驱动都更新到了最新版本
最近一次崩溃的dump信息如下:
WHEA_UNCORRECTABLE_ERROR (124)
A fatal hardware error has occurred. Parameter 1 identifies the type of error source that reported the error. Parameter 2 holds the address of the WHEA_ERROR_RECORD structure that describes the error conditon.
Arguments:
Arg1: 0000000000000000, Machine Check Exception
Arg2: ffff988186d70028, Address of the WHEA_ERROR_RECORD structure.
Arg3: 00000000be000000, High order 32-bits of the MCi_STATUS value.
Arg4: 0000000000800400, Low order 32-bits of the MCi_STATUS value.Debugging Details:
KEY_VALUES_STRING: 1
Key : Analysis.CPU.mSec Value: 3109 Key : Analysis.DebugAnalysisManager Value: Create Key : Analysis.Elapsed.mSec Value: 3124 Key : Analysis.IO.Other.Mb Value: 0 Key : Analysis.IO.Read.Mb Value: 0 Key : Analysis.IO.Write.Mb Value: 0 Key : Analysis.Init.CPU.mSec Value: 453 Key : Analysis.Init.Elapsed.mSec Value: 14409 Key : Analysis.Memory.CommitPeak.Mb Value: 99 Key : Bugcheck.Code.LegacyAPI Value: 0x124 Key : Failure.Bucket Value: 0x124_0_AuthenticAMD_PROCESSOR_MAE Key : Failure.Hash Value: {a9e19e3a-0a6a-028e-1a0c-57a9a8b8f3a0} Key : WER.OS.Branch Value: vb_release Key : WER.OS.Timestamp Value: 2019-12-06T14:06:00Z Key : WER.OS.Version Value: 10.0.19041.1BUGCHECK_CODE: 124
BUGCHECK_P1: 0
BUGCHECK_P2: ffff988186d70028
BUGCHECK_P3: be000000
BUGCHECK_P4: 800400
FILE_IN_CAB: MEMORY.DMP
WHEA_ERROR_RECORD: ffff988186d70028
PROCESS_NAME: System
STACK_TEXT:
fffff8063a7f7a98 fffff806388a4a99 : 0000000000000124 0000000000000000 ffff988186d70028 00000000be000000 : nt!KeBugCheckEx
fffff8063a7f7aa0 fffff806388a4e00 : ffff988186d70028 ffff988186d70028 0000000000000000 0000000000000000 : hal!HalBugCheckSystem+0xe9
fffff8063a7f7ae0 fffff806389a974e : 0000000000000000 fffff8063a7f7b80 ffff988186d70028 0000000000000000 : hal!HalpCollectDebugInfo+0x50
fffff8063a7f7b20 fffff806388a5705 : fffff8063a7f7c90 fffff8063a7f7c90 ffff988186d70028 0000000000000000 : nt!WheaReportHwError+0x46e
fffff8063a7f7b90 fffff806388a5a3a : 0000000000000001 0000000000000000 0000000000000000 0000000000000000 : hal!HalpMcaReportError+0xb5
fffff8063a7f7ce0 fffff806388a591c : ffff98817fa52000 0000000000000000 0000000000000000 0000000000000000 : hal!HalpMceHandlerCore+0xea
fffff8063a7f7d30 fffff806388a5b8e : 0000000000000000 0000000000000000 0000000000000000 0000000000000000 : hal!HalpMceHandler+0xcc
fffff8063a7f7d70 fffff806388a5df8 : ffff98817fa52000 fffff8063a7f7ff0 0000000000000000 0000000000000000 : hal!HalpMceHandlerWithRendezvous+0xce
fffff8063a7f7da0 fffff806389b1e39 : 0000000000000000 0000000000000000 0000000000000000 0000000000000000 : hal!HalHandleMcheck+0x48
fffff8063a7f7dd0 fffff80638804973 : 0000000000000000 0000000000000000 0000000000000000 0000000000000000 : hal!HalpHandleMcheck+0x19
fffff8063a7f7e00 fffff80638908a09 : 0000000000000000 0000000000000000 0000000000000000 0000000000000000 : nt!KxMcheckAbort+0x73
fffff8063a7f7f40 00007ffd7d7f2a7a : 0000000000000000 0000000000000000 0000000000000000 0000000000000000 : nt!KiMcheckAbort+0x129
00000000001df7e8 0000000000000000 : 0000000000000000 0000000000000000 0000000000000000 0000000000000000 : 0x00007ffd`7d7f2a7aMODULE_NAME: AuthenticAMD
IMAGE_NAME: AuthenticAMD.sys
STACK_COMMAND: .cxr; .ecxr ; kb
BUCKET_ID_FUNC_OFFSET: 129
FAILURE_BUCKET_ID: 0x124_0_AuthenticAMD_PROCESSOR_MAE
OS_VERSION: 10.0.19041.1
BUILD_VERSION_STRING: 19041.1.amd64fre.vb_release.191206-1406
OSPLATFORM_TYPE: x64
OSNAME: Windows 10
FAILURE_ID_HASH: {a9e19e3a-0a6a-028e-1a0c-57a9a8b8f3a0}
Followup: MachineOwner
我明白这是硬件问题,可能出在CPU或者主板上。考虑到我的主板是较老的型号,大概率是主板的问题。我的主板型号是GA-A320M-S2H (rev.1x),已经更新到了能支持的最新BIOS版本。
我的系统信息如下:
OS Name: Microsoft Windows 10 Pro
Version: 10.0.19041 Build 19041
System Manufacturer: Gigabyte Technology Co., Ltd.
System Model: To be filled by O.E.M.
System Type: x64-based PC
Processor: AMD Ryzen 5 5600X 6-Core Processor, 3700 Mhz, 6 Core(s), 12 Logical Processor(s)
BIOS Version/Date: American Megatrends Inc. F63, 2021/10/21
SMBIOS Version: 3.2
Installed Physical Memory (RAM): 16.0 GB
Total Physical Memory: 15.9 GB
Available Physical Memory: 10.2 GB
Total Virtual Memory: 18.3 GB
Available Virtual Memory: 11.9 GB
Page File Space: 2.40 GB
Page File: C:\pagefile.sys
GPU: NVIDIA GeForce RTX 3060 Ti
希望各位能帮我进一步排查问题,看看有没有其他可能的原因或者解决办法,谢谢!
备注:内容来源于stack exchange,提问作者david plant




