[BSOD] Which Hardware part is it?

Page 1 of 2 12 LastLast

  1. Posts : 5
    Windows 7 Ultimate x64
       #1

    [BSOD] Which Hardware part is it?


    Hi

    I have some strange issues with my computer

    I have my computer since 20months, since I build it I have some BSODs while gaming or watching video.

    Yesterday I bought new RAM which is official supported from the mainboard.
    After I replaced my old RAM with it I had MORE Bluescreens (3 times while playing sc2, 1 time after some seconds of prime95)

    The BSOD
    Code:
    BugCheck 124, {0, fffffa800686b028, fa000000, 400405}

    Code:
    0: kd> !analyze -v
    *******************************************************************************
    *                                                                             *
    *                        Bugcheck Analysis                                    *
    *                                                                             *
    *******************************************************************************
    
    WHEA_UNCORRECTABLE_ERROR (124)
    A fatal hardware error has occurred. Parameter 1 identifies the type of error
    source that reported the error. Parameter 2 holds the address of the
    WHEA_ERROR_RECORD structure that describes the error conditon.
    Arguments:
    Arg1: 0000000000000000, Machine Check Exception
    Arg2: fffffa800686b028, Address of the WHEA_ERROR_RECORD structure.
    Arg3: 00000000fa000000, High order 32-bits of the MCi_STATUS value.
    Arg4: 0000000000400405, Low order 32-bits of the MCi_STATUS value.
    
    Debugging Details:
    ------------------
    
    BUGCHECK_STR:  0x124_GenuineIntel
    CUSTOMER_CRASH_COUNT:  1
    DEFAULT_BUCKET_ID:  VISTA_DRIVER_FAULT
    PROCESS_NAME:  SC2.exe
    CURRENT_IRQL:  f
    STACK_TEXT:  
    fffff800`00ba8a98 00000000`00000000 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`00000000 : nt!KeBugCheckEx
    
    STACK_COMMAND:  kb
    FOLLOWUP_NAME:  MachineOwner
    MODULE_NAME: hardware
    IMAGE_NAME:  hardware
    DEBUG_FLR_IMAGE_TIMESTAMP:  0
    FAILURE_BUCKET_ID:  X64_0x124_GenuineIntel_PROCESSOR_MAE
    BUCKET_ID:  X64_0x124_GenuineIntel_PROCESSOR_MAE
    Followup: MachineOwner
    ---------
    Why does it show VISTA_* I run WIN7
    Can someone say which Hardware Part is causing the BSOD?
    The RAM should be fine, same errors occurs with old Corsair and new Kingson RAM
    Temperatures are OK.



    Don't hate me for that, but I using Linux too (I know thats the wrong place here, but I attach it too)
    Probably helpfull for some, Linux gives some readable error message
    The errors occur when I run memtester linux programm)

    Linux Kernel Messages
    Code:
    Nov  6 01:37:58 localhost kernel: [Hardware Error]: No human readable MCE decoding support on this CPU type.
    Nov  6 01:37:58 localhost kernel: [Hardware Error]: Run the message through 'mcelog --ascii' to decode.
    Nov  6 01:37:58 localhost kernel: [Hardware Error]: Machine check events logged
    mcelog
    # cat /var/log/syslog | grep mcelog
    Nov 6 13:32:18 localhost kernel: [Hardware Error]: Run the message through 'mcelog --ascii' to decode.
    Nov 6 13:32:18 localhost mcelog: HARDWARE ERROR. This is *NOT* a software problem!
    Nov 6 13:32:18 localhost mcelog: Please contact your hardware vendor
    Nov 6 13:32:18 localhost mcelog: MCE 0
    Nov 6 13:32:18 localhost mcelog: CPU 0 BANK 6
    Nov 6 13:32:18 localhost mcelog: MISC 4030314 ADDR 13e8b7280
    Nov 6 13:32:18 localhost mcelog: TIME 1289046738 Sat Nov 6 13:32:18 2010
    Nov 6 13:32:18 localhost mcelog: MCG status:
    Nov 6 13:32:18 localhost mcelog: MCi status:
    Nov 6 13:32:18 localhost mcelog: MCi_MISC register valid
    Nov 6 13:32:18 localhost mcelog: MCi_ADDR register valid
    Nov 6 13:32:18 localhost mcelog: Threshold based error status: green
    Nov 6 13:32:18 localhost mcelog: MCA: corrected filtering (some unreported errors in same region)
    Nov 6 13:32:18 localhost mcelog: Generic CACHE Level-2 Data-Write Error
    Nov 6 13:32:18 localhost mcelog: STATUS 8c2000400020114a MCGSTATUS 0
    Nov 6 13:32:18 localhost mcelog: MCGCAP 1c09 APICID 0 SOCKETID 0
    Nov 6 13:32:18 localhost mcelog: CPUID Vendor Intel Family 6 Model 26
    Nov 6 13:39:17 localhost mcelog: failed to prefill DIMM database from DMI data
    After these errors the system does not have to reboot, sometime it runs for some seconds/minutes and freezes then


    Is it the CPU which is buggy?

    P.S.
    Prime95 Blend Test results in the bluescreen after some seconds
    MemTest86+ 4.1 shows no errors (2 passes)
      My Computer


  2. Posts : 5,705
    Win7 x64 + x86
       #2

    The STOP 0x124 error is primarily a hardware error, but it can also be a compatibility issue, a Windows issue, or a corrupted driver issues (not everyone agrees on the last possibility).

    The Linux kernel message below is most appropriate:
    Nov 6 01:37:58 localhost kernel: [Hardware Error]: No human readable MCE decoding support on this CPU type.
    So, we have to try different troubleshooting steps.

    Run MemTest 86+ v4.10 overnight (at least 3 passes, more is better).
    Run a hard drive diagnostic: HD Diagnostic (read the details at the link)

    I wonder about the 550 watt PSU. In general most PSU's aren't up to what they say that they are rated for. And, you have a lot of extra equipment running (2 video cards and 8 hard drives). To test, remove 1 video card, and unplug most of the hard drives - then see if you still get the BSOD's.

    Please provide the info in this post. It may show clues to other problems: https://www.sevenforums.com/crashes-d...tructions.html

    More info on troubleshooting STOP 0x124 here: Stop 0x124 - what it means and what to try
      My Computer


  3. Posts : 5
    Windows 7 Ultimate x64
    Thread Starter
       #3

    I only have 1 Graphic Card
    I removed all not needed parts and runned prime95 with the same result -> bsod
    Connected were 2HDDs (instead of 6), Graphic Card, W-Lan Card and a DVD Burner

    Samsung HUTIL doesn't report any errors on the 2 disks connected

    At the time of the tests all 6 RAM Dimm were installed

    Attached the output of the tool and the perfmon report


    Edit:
    It needs a lot longer to crash with only 1 RAM Dimm (like 3min instead of 15secs of prime15 test)
    Last edited by rReLmy; 06 Nov 2010 at 12:47.
      My Computer


  4. Posts : 5,705
    Win7 x64 + x86
       #4

    Try Prime 95 using each of the tests (Small FFT, Blended, Large FFT) - no need to repeat whatever tests you've already run. Let us know the differences in time to crash.

    Have you checked the motherboard website to see if your RAM is compatible with the motherboard?
    Have you visited the RAM manufacturer's website and found the settings/timings/voltages - and then gone into the BIOS and set those settings/timings/voltages? (In other words, DON'T use the Auto settings for your RAM)?
      My Computer


  5. Posts : 5
    Windows 7 Ultimate x64
    Thread Starter
       #5

    usasma said:
    Try Prime 95 using each of the tests (Small FFT, Blended, Large FFT) - no need to repeat whatever tests you've already run. Let us know the differences in time to crash.

    Have you checked the motherboard website to see if your RAM is compatible with the motherboard?
    Have you visited the RAM manufacturer's website and found the settings/timings/voltages - and then gone into the BIOS and set those settings/timings/voltages? (In other words, DON'T use the Auto settings for your RAM)?
    Small FFT didn't resultet in BSOD
    In-place Large FFT results in a BSOD within 2/3 minutes
    Blend, max 15 seconds until BSOD


    RAM is official supported by the Mainboard
    Timings are set manually (9-9-9-27 / 1 @ 1.5V)
    auto settings would be something like CL7
      My Computer


  6. Posts : 5,705
    Win7 x64 + x86
       #6

    Small FFT tests mostly stuff associated with the CPU (not much RAM tested)
    Large FFT tests heat and power consumption - more RAM is tested than in Small FFT's
    Blend puts the stress on the RAM and tests most other components

    Try this free CPU stress test: |MG| CPU Stability Test 6.0 Download
      My Computer


  7. Posts : 2,963
    Windows 7 Professional SP1 64-bit
       #7

    Memtest86+ should be ran for 5-7 passes. As recommended, do it overnight so that it can run without keeping you from doing what you want to do. I noticed you said that the same error occured with two different brands of RAM. It could be that one or more RAM slots are bad. The best way to test this out is to test using only 1 stick of RAM in multiple slots. If the errors only show up when the stick is in certain slots, then those slots are probably the problem.
      My Computer


  8. Posts : 5
    Windows 7 Ultimate x64
    Thread Starter
       #8

    CPU Stability Test runned for 30min in normal mode before a BSOD
    Second time it runned 1h 30min without a BSOD

    Thermal Mode doesnt result in BSOD (only 30min tested because temps didnt went high ~50deg)


    I have the BSOD too with only 1 stick in 2 separate slots
    I will run MemTest this night and report the result back
      My Computer


  9. Posts : 5,705
    Win7 x64 + x86
       #9

    Try SiSoft Sandra Lite (free) and FurMark (free)
    SiSoft Sandra Lite: SiSoftware Zone
    Furmark:
    FurMark download site: FurMark: VGA Stress Test, Graphics Card and GPU Stability Test, Burn-in Test, OpenGL Benchmark and GPU Temperature | oZone3D.Net
    FurMark Setup:
    - If you have more than one GPU, select Multi-GPU during setup
    - In the Run mode box, select "Stability Test" and "Log GPU Temperature"
    Click "Go" to start the test
    - Run the test until the GPU temperature maxes out - or until you start having problems (whichever comes first).
    - Click "Quit" to exit
    What I'm looking for is something that fails only when testing one or the other. I am concerned that the CPU stress test failed with a BSOD (which may indicate a problem with the CPU).
      My Computer


  10. Posts : 2,963
    Windows 7 Professional SP1 64-bit
       #10

    usasma said:
    I am concerned that the CPU stress test failed with a BSOD (which may indicate a problem with the CPU).
    I'm thinking it is the CPU as well, but the only way I know to know for sure is to elimanate every other possibility.
      My Computer


 
Page 1 of 2 12 LastLast

  Related Discussions
Our Sites
Site Links
About Us
Windows 7 Forums is an independent web site and has not been authorized, sponsored, or otherwise approved by Microsoft Corporation. "Windows 7" and related materials are trademarks of Microsoft Corp.

© Designer Media Ltd
All times are GMT -5. The time now is 01:25.
Find Us