Random frequent BSOD's - memory related


  1. Posts : 2
    windows 7 ultimate x64
       #1

    Random frequent BSOD's - memory related


    Hi,
    In the last few days I'm getting lots of BSOD's.
    Sometimes it's after an hour or so, and sometimes it's right when the computer starts.
    Each time the message on the BSOD is different, but they are all seems to be related to memory issues.

    However, I have ruled out physical memory issues:
    No errors found on memtest.
    I have two memory chips, I have got the BSOD with either one of them.

    I have also ruled out heating issues:
    The computer is not hot.
    I let it cool down, put it on right next to my airconditioner, and yet I got a BSOD right away.

    I have tried reinstalling my drivers.

    My intuition says that it's a fault driver that's causing memory overflow, but I can't prove that.

    Dump files are attached.
    I couldn't get the system health report, I keep getting an error: "The wait for the report generation tool to finish has timed out"

    My laptop is Samsung NP305E5A-A01US.

    I'm using Windows 7 Ultimate x64.
    This is not the original OS that came with my laptop.
    This is a retail version.
    Both hardware and OS are about 6 month old.


    I will really appreciate any help.
    Last edited by Ronennn; 24 Apr 2012 at 13:25.
      My Computer


  2. Posts : 2
    windows 7 ultimate x64
    Thread Starter
       #2

    Running stress tests with Prime95 results in "Hardware failure detected",
    which makes me think that maybe it is heating issues :/

    Update:
    No, I still don't think this is related to heating.
    The stress test got the temperature over 70 degrees (with no BSOD), and I just got another BSOD while the temperature was about 40 degrees.

    Any ideas?
      My Computer


  3. Posts : 11,269
    Windows 7 Home Premium 64 Bit
       #3

    If I remember correctly, the hardware failure detected message asks that you read the stress.txt file. That file is only provided in the full version of Prime95, but it is provided on an overclocking site:

    stress.txt (Overclockers Australia Forums - View Single Post - Prime95 stress.txt file ???)

    STRESS TESTING YOUR COMPUTER

    BACKGROUND
    ----------

    Today's computers are not perfect. Even brand new systems from major
    manufacturers can have hidden flaws. If any of several key components such
    as CPU, memory, cooling, etc. are not up to spec, it can lead to incorrect
    calculations and/or unexplained system crashes.

    Overclocking is the practice of increasing the speed of the CPU and/or
    memory to make a machine faster at little cost. Typically, overclocking
    involves pushing a machine past its limits and then backing off just a
    little bit.

    For these reasons, both non-overclockers and overclockers need programs
    that test the stability of their computers. This is done by running
    programs that put a heavy load on the computer. Though not originally
    designed for this purpose, this program is one of a few programs that
    are excellent at stress testing a computer.


    RESOURCES
    ---------

    This program is a good stress test for the CPU, memory, L1 and L2 caches,
    CPU cooling, and case cooling. The torture test runs continuously, comparing
    your computer's results to results that are known to be correct. Any
    mismatch and you've got a problem! Note that the torture test sometimes
    reads from and writes to disk but cannot be considered a stress test for
    hard drives.

    You'll need other programs to stress video cards, PCI bus, disk access,
    networking and other important components. In addition, this is only one
    of several good programs that are freely available. Some people report
    finding problems only when running two or more stress test programs
    concurrently. You may need to raise prime95's priority when running two
    stress test programs so that each gets about 50% of the CPU time.

    Forums are a great place to learn about available stability test programs
    and to get advice on what to do when a problem is found.

    The currently popular stability test programs are (sorry, I don't have
    web addresses for these):
    Prime95 (this program's torture test)
    3DMark2001
    CPU Stability test
    Sisoft sandra
    Quake and other games
    Folding@Home
    Seti@home
    Genome@home

    Several useful websites for help (look for overclocking community or forum):
    Overclockers: The Performance Computing Community for Overclocking Hardware and How to Overclock Information
    Ars Technica
    HARDOCP - HardOCP Computer Hardware Reviews and News
    AnandTech
    Tom's Hardware: Hardware News, Tests and Reviews
    PC Overclocking, Modding and Building | Sharky Extreme
    Also try the alt.comp.hardware.overclocking Usenet newsgroup.

    Utility programs you may find useful (I'm sure there are others - look around):
    Motherboard monitor from livewiredev.com - dowload music Resources and Information.
    Memtest86 from Memtest86.com - Memory Diagnostic
    Cpuburn by redelm: http://pages.sbcglobal.net/redelm/
    TaskInfo2002 from Iarsn - High Quality System Software


    WHAT TO DO IF A PROBLEM IS FOUND?
    ---------------------------------

    The exact cause of a hardware problem can be very hard to find.

    If you are not overclocking, the most likely cause is an overheating CPU
    or memory DIMMs that are not quite up to spec. Another possibility is
    you might need a better power supply. Try running MotherBoard monitor
    and browse the forums above to see if your CPU is running too hot.
    If so, make sure the heat sink is properly attached, fans are operational,
    and air flow inside the case is good. For isolating memory problems, try
    swapping memory DIMMs with a co-worker's or friend's machine. If the errors
    go away, then you can be fairly confidant that memory was the cause of
    the trouble. A power supply problem can often be identified by a significant
    drop in the voltages when prime95 starts running. Once again the overclocker
    forums are a good resource for what voltages are acceptable.

    If you are overclocking then try increasing the core voltage, reduce the
    CPU speed, reduce the front side bus speed, or change the memory timings
    (CAS latency). Also try asking for help in one of the forums above - they
    may have other ideas to try.


    CAN I IGNORE THE PROBLEM?
    -------------------------

    Ignoring the problem is a matter of personal preference. There are
    two schools of thought on this subject.

    Most programs you run will not stress your computer enough to cause a
    wrong result or system crash. If you ignore the problem, then video games
    may stress your machine resulting in a system crash. Also, stay away from
    distributed computing projects where an incorrect calculation might cause
    you to return wrong results. Bad data will not help these projects!
    In conclusion, if you are comfortable with a small risk of an occasional
    system crash then feel free to live a little dangerously! Keep in mind
    that the faster prime95 finds a hardware error the more likely it is that
    other programs will experience problems.

    The second school of thought is, "Why run a stress test if you are going
    to ignore the results?" These people want a guaranteed 100% rock solid
    machine. Passing these stability tests gives them the ability to run
    CPU intensive programs with confidence.


    FREQUENTLY ASKED QUESTIONS
    --------------------------

    Q) My machine is not overclocked. If I'm getting an error, then there must
    be a bug in the program, right?

    A) The torture test is comparing your machines results against
    KNOWN CORRECT RESULTS. If your machine cannot generate correct
    results, you have a hardware problem. HOWEVER, if you are failing
    the torture test in the SAME SPOT with the SAME ERROR MESSAGE
    every time, then ask for help at mersenneforum.org - it is
    possible that a recent change to the torture test code may have
    introduced a software bug.

    Q) How long should I run the torture test?

    A) I recommend running it for somewhere between 6 and 24 hours.
    The program has been known to fail only after several hours and in
    some cases several weeks of operation. In most cases though, it will
    fail within a few minutes on a flaky machine.

    Q) Prime95 reports errors during the torture test, but other stability
    tests don't. Do I have a problem?

    A) Yes, you've reached the point where your machine has been
    pushed just beyond its limits. Follow the recommendations above
    to make your machine 100% stable or decide to live with a
    machine that could have problems in rare circumstances.

    Q) A forum member said "Don't bother with prime95, it always pukes on me,
    and my system is stable!. What do you make of that?"

    or

    "We had a server at work that ran for 2 MONTHS straight, without a reboot
    I installed Prime95 on it and ran it - a couple minutes later I get an error.
    You are going to tell me that the server wasn't stable?"

    A) These users obviously do not subscribe to the 100% rock solid
    school of thought. THEIR MACHINES DO HAVE HARDWARE PROBLEMS.
    But since they are not presently running any programs that reveal
    the hardware problem, the machines are quite stable. As long as
    these machines never run a program that uncovers the hardware problem,
    then the machines will continue to be stable.
      My Computer


 

  Related Discussions
Our Sites
Site Links
About Us
Windows 7 Forums is an independent web site and has not been authorized, sponsored, or otherwise approved by Microsoft Corporation. "Windows 7" and related materials are trademarks of Microsoft Corp.

© Designer Media Ltd
All times are GMT -5. The time now is 09:51.
Find Us