MemTest-86 Manual Version 5.0
Copyright 2013 mark® Software
Page 1
Table of Contents 1 Introduction.............................................................................................................................................4 1.1 Memory Reliability.......................................................................................................................4 1.2 MemTest86 Overview...................................................................................................................4 1.3 Compatibility.................................................................................................................................4 2 Setup and Use..........................................................................................................................................6 2.1 Boot-disk Creation using Windows...............................................................................................6 2.2 Boot-disk Creation Using Linux...................................................................................................7 2.3 Building MemTest86 (v4 BIOS) from source...............................................................................7 2.4 Using MemTest86 (UEFI).............................................................................................................9 2.5 Using MemTest86 (v4 BIOS)......................................................................................................17 3 Repairing Memory Faults.....................................................................................................................21 3.1 Anti-Static Handling Procedures.................................................................................................21 3.2 Re-Seating Memory Modules......................................................................................................21 3.3 Replacing Modules......................................................................................................................21 3.4 Error Validity...............................................................................................................................22 4 Over Clocking.......................................................................................................................................23 4.1 Background..................................................................................................................................23 4.2 Operating Margins.......................................................................................................................23 4.3 Using MemTest86 for Over Clocking.........................................................................................23 Appendices...............................................................................................................................................26 Appendix A.Technical Information...................................................................................................26 A.1 Memory Testing Philosophy................................................................................................26 A.2 MemTest86 Test Algorithms................................................................................................26 A.3 Individual Test Descriptions................................................................................................27 A.4 Error Display Information....................................................................................................29 Appendix B.Product .............................................................................................................30 B.1 Known Problems..................................................................................................................30 B.2 Enhancements......................................................................................................................31 Appendix C.Change Log...................................................................................................................32 Appendix D.Acknowledgments........................................................................................................38 D.1 UEFI (v5+)...........................................................................................................................38 D.2 BIOS (v4).............................................................................................................................38
Copyright 2013 mark® Software
Page 2
1 Introduction 1.1 Memory Reliability Properly functioning memory is critical for reliable operation of a PC or laptop. Few computer s fully understand the risks associated with memory errors. Because PCs typically do not have any mechanisms for detecting memory errors, confusing and potentially disastrous consequences can result from these undetected memory problems. Memory errors will often cause erratic behavior with software applications that can mysteriously fail. The most serious risk from memory errors, however, is corruption of data that manages how information is stored on disk. In most cases, this type of corruption will cause one or more files to be lost. There are cases where a memory error can cause the loss of the entire contents of your hard disk. Periodic testing of memory with a rigorous and thorough memory test will greatly reduce the risk of problems and data loss due to memory errors.
1.2 MemTest86 Overview Memory errors are often pattern sensitive and may be very intermittent. Detecting these errors is technically challenging and is an imperfect science. MemTest86 uses advanced algorithms that have been refined for more than 20 years. These testing techniques are highly effective at detecting difficult to find memory errors. In addition, MemTest86 has the capability to test all available memory. Memory testing programs execute from memory and therefore are not able to test the memory that is occupied by the test program itself. When running the BIOS version, MemTest86 is able to move itself to a different portion of memory and then tests the memory that it previously occupied. The UEFI version, due to platform limitations, is unable to remap itself to different portions of memory in order to run tests in the section of memory it was occupying. The UEFI firmware itself also takes up some space compared to a traditional BIOS. So slightly less RAM can be tested compared to the BIOS version.
1.3 Compatibility MemTest86 is designed to work with all processors using the Intel/AMD x86 and X86_64 architecture, running on UEFI or BIOS systems. Most newer systems are able to run the UEFI version of MemTest86, but all systems should be able to boot the traditional BIOS version. MemTest86 is able to test all types of memory. There is no need for MemTest86 to know what type of memory it is testing. MemTest86 attempts to detect and display information about the hardware it is testing but this information is not used during testing. Since MemTest86 is a standalone program it does not require any operating system for execution. It can be used with any PC regardless of what operating system, if any, is installed. MemTest86 is multi-threaded and is able to concurrently use multiple Us to test memory. It may, however, be limited by the implementation in the underlying firmware.
Copyright 2013 mark® Software
Page 3
1.3.1 UEFI (v5+) For UEFI systems, multiprocessor is dependent on the implemention of the multiprocessor services provided by the UEFI firmware. On older UEFI systems, the multiprocessor can be fairly limited, causing issues such as a reduced number of Us available for testing or even program freeze when attempting to run on any U other than the first. It is recommended that MemTest86 is run on only one U first before attempting to run on multiple Us.
1.3.2 BIOS (v4) For older systems that use the traditional BIOS, MemTest86 will function properly with any number of Us but is currently configured to use a maximum of 32 Us for testing. When runnining on 64 bit Us, MemTest86 executes in 32 bit mode using PAE. For 32 bit Us, testing is limited to 64 GB. 64 bit Us running MemTest86 executes in “long” mode which allows for testing of up to 8 TB of memory. Us executing in 32 bit mode can test a maximum of 2 GB of memory at a time. This 2 GB window is then advanced, allowing for all of memory to be tested.
Copyright 2013 mark® Software
Page 4
2 Setup and Use MemTest86 s booting from both the newer UEFI platform and the traditional BIOS. When booting from UEFI, MemTest86 has access to additional services not available in BIOS including: •
Native 64-bit
•
No longer requires the use of the PAE workaround to access more than 4GB of memory. (PAE = Physical Address Extension)
•
Mouse , where ed by the underlying UEFI system. On older systems a keyboard is still required.
•
Improved USB keyboard . The keyboard now works on systems that fail to emulate IO Port 64/60 correctly. So Mac USB keyboards are now ed.
•
Improved multi-threading , where ed by the underlying UEFI system.
•
Reporting of detailed RAM SPD information. Timings, clock speeds, vendor names and much more.
•
to writing to the USB drive that MemTest86 is running from for logging and report generation. In all prior MemTest86 releases, there was no disk .
•
Use of GPT. (GUID Partition Table)
If UEFI is not ed on the system, the older v4 BIOS version is booted. MemTest86 can boot from a CD, USB flash drive or, with Linux systems, by the boot loader (for example, LILO or Grub). To use MemTest86 on a Windows computer, either a CD-ROM drive or USB flash drive is required. Any Unix or Windows system may be used to create the CD or USB flash drive. Once a MemTest86 boot disk has been created, it may be used on any computer.
2.1 Boot-disk Creation using Windows Before you can use MemTest86 on a Windows system it must first be installed on a CD or USB flash drive. To create a CD, a system capable of creating a CD from an ISO file is required. Create a boot-able CD-ROM: 1. the Windows MemTest86 ISO image. 2. Right click on the ed file and select the "Extract to Here” option. This places the CD-ROM ISO image in the current folder. 3. Use the CD burning software available on your system to create a CD-ROM using the extracted ISO image. Be sure that you create a CD image from the ISO file rather than placing a copy of the ISO file onto a data CD. Look for “Burn Image from File” or similar option under the File menu of your CD burning software. Create a boot-able USB Flash drive: 1. the Windows MemTest86 USB image zip file. Copyright 2013 mark® Software
Page 5
2. Extract the contents of the zip file to a directory 3. Plug in the USB drive 4. Launch the ImageUSB application that was included in the zip file 5. Select the USB drive from the list 6. If it is not already selected, select the image file included in the zip file 7. Click 'Write to UFD' 8. After accepting a few more prompts this should give you a working bootable USB drive
2.2 Boot-disk Creation Using Linux It is recommended that Linux s and install pre-compiled packages to create boot-able media. Advanced s may wish to build from source and optionally make source code changes. Create a boot-able CD-ROM: 1. the Linux MemTest86 ISO image. 2. Uncompress the ISO image (gunzip memtest86-iso.gz). 3. Use the CD burning software available on your system to create a CD-ROM using the uncompressed ISO image. Be sure that you create a CD image from the ISO file rather than placing a copy of the ISO file onto a data CD. Look for “Burn Image for File” or similar option under the File menu of your CD burning software. Create a boot-able USB Flash drive: 1. the Linux MemTest86 USB image. 2. Untar the package (tar xvzf memtest86-usb.tar.gz). An image file and a REE file will be created in the current directory. 3. Follow instructions in the REE to write the USB flash disk.
2.3 Building MemTest86 (v4 BIOS) from source 1. Review the Makefile and adjust options as needed. 2. Type "make" This creates a file named "memtest.bin" which is a bootable image. If you encounter build problems a pre-compiled binary image has been included (precomp.bin). This image file may be copied to a floppy disk or may be loaded from a disk partition by Lilo or Grub boot loaders from a hard disk partition. The Makefile creates bootable images that may be written directly to a floppy disk or CD. A method is not provided for creating a bootable USB flash drive from source. Create a MemTest86 floppy disk: 1. Insert a blank write-enabled floppy disk. 2. As root, type "make install" Copyright 2013 mark® Software
Page 6
Create a bootable CD-ROM 1. type “make iso” 2. An ISO image file (memtest86.iso) is created in the source directory. Use CD burning software available on your system to create a CD-ROM using the extracted ISO image. Boot from a disk partition using Grub: 1. Copy memtest.bin to a permanent location (for example: /boot/memtest.bin). 2. Add an entry in the Grub config file (/boot/grub/menu.lst) to boot memtest86. Only the title and kernel fields need to be specified. The following is a sample Grub entry for booting memtest86: title MemTest86 linux16 /boot/memtest.bin Boot from a disk partition using Lilo: 1. Copy memtest.bin to a permanent location (ie. /boot/memtest.bin). 2. Add an entry in the lilo config file (usually /etc/lilo.conf) to boot memtest86. Only the image and label fields need to be specified. The following is a sample Lilo entry for booting memtest86: image = /boot/memtest.bin label = memtest86 3. As root, type "lilo"
Copyright 2013 mark® Software
Page 7
2.4 Using MemTest86 (UEFI) To start MemTest86 insert the CD-ROM or USB flash drive into the appropriate drive and restart your computer.
Note: The UEFI BIOS must be configured to boot from the device that MemTest86 is installed on. Most systems have an optional boot menu that is enabled be pressing a key at startup (often ESC, F9, F11 or F12). If available use the boot menu to select the correct drive. You may see both the UEFI and BIOS as separate options. Please consult your motherboard documentation for details. When MemTest86 boots, a splashscreen is displayed with a 10 second countdown timer which when expires, automatically starts the memory tests with default settings. Pressing a key or moving the mouse shall stop the timer.
Selecting 'Exit' shall reboot your system. To configure the memory tests, select 'Config' and the main menu is displayed. The main menu allows the to customize the memory test settings such as the specific tests to execute, address range to test and which U(s) are used in testing.
Copyright 2013 mark® Software
Page 8
2.4.1 Test Configuration 2.4.1.1 System Info The System Info screen displays the hardware information of the system it is running on.
View detailed RAM (SPD) info - displays the SPD information stored in the individual RAM modules. The RAM SPD information can be saved to a file on disk (Pro version only) View memory usage - displays how the system memory address map is allocated amongst the subsystems. Disable/Enable memory caching (Pro version only)– disable/enable memory caching when the tests are running. Warning - disabling memory caching greatly reduces the performance of the entire system, including screen updates. Disable/Enable ECC polling – if ECC detection/correction is ed and enabled, this option disables/enables periodic checking of any ECC errors that have been detected by the system while the memory tests are running. Enable/Disable ECC injection (Pro version only) - if ECC detection/correction is ed/enabled and ECC injection is ed by the system, this option enables/disables injection of ECC errors to simulate how the system responds to real ECC errors. Save system information summary to file (Pro version only)- Save the system information to a file on disk
Copyright 2013 mark® Software
Page 9
2.4.1.2 Test Selection The Test Selection screen allows the to select the test sequence to run, and the number of es to run each sequence. See Individual Test Descriptions for a detailed description of each test.
To enable/disable a test, use the up / down arrow keys or mouse to highlight a test, then press enter or click. Test 11 and 12 are available only on the Pro version. To change the number of es, select 'Number of es' and enter the desired number of es.
Copyright 2013 mark® Software
Page 10
2.4.1.3 Address Range The Address Range screen allows the to specify a subset of the total system address map to test.
To change the lower or upper limit of the address range to test, select the appropriate setting and enter a new value. To enter a decimal address, enter the address as is. To enter a hexadecimal address, enter '0x' followed by the hex address. You may also use the suffixes 'k', 'm', 'g' to specifiy kilobyte, megabytes and gigabytes respectively. Selecting 'Reset Limits to default' will reset the limits to its maximum values.
Copyright 2013 mark® Software
Page 11
2.4.1.4 U Selection The U Selection screen allows the to specify a single U to test, or cycle through all Us using a selection method.
Single - specifies a single U to test, and prompts the to enter a valid U number Parallel - executes the memory tests on all Us concurrently, on a set of non-overlapping memory segments Round Robin - only one U is running a test at any given time but cycles to the next U in a round robin fashion after every test Sequential - only one U is running a test at any given time but cycles to the next U after a certain memory size has been tested by the U.
Copyright 2013 mark® Software
Page 12
2.4.2 Configuration File (Pro version only) Memory test parameters can also be set via a configuration file (mt86.cfg) that is loaded on startup, without the need to manually configure the memory tests every time MemTest86 is run. This is useful especially in testing environments where memory tests need to be executed in an automated fashion without intervention. A sample configuration file is as follows: # # MemTest86 configuration file # TSTLIST=0,1,3,5,8 NUM=3 ADDRLIMLO=0x10000000 ADDRLIMHI=0x20000000 USEL=PARALLEL UNUM=1 ECOLL=0
Lines that start with '#' indicate a comment line. All parameters are specified as follows: [Parameter_name]=[Parameter_value] The following table summarizes the list of ed parameters: Parameter
Description
TSTLIST
List of tests to execute in the test sequence. Each test is specified by a test number, separated by a comma.
NUM
Number of iterations of the test sequence to execute. This must be a number greater than 0.
ADDRLIMLO
The lower limit of the address range to test. To specify a hex address, the address must begin with '0x'. Otherwise, the address shall be interpreted as a decimal address.
ADDRLIMHI
The upper limit of the address range to test. To specify a hex address, the address must begin with '0x'. Otherwise, the address shall be interpreted as a decimal address.
USEL
One of the following U selection modes: { 'SINGLE', 'PARALLEL', 'RROBIN', 'SEQ' }
UNUM
The U # of the specific U to test. This parameter only has an effect if USEL is set to 'SINGLE'.
ECOLL
Specifies whether ECC errors shall be polled. 0 – Polling disabled, 1 – Polling enabled
2.4.3 Testing Once the memory test has started, the following screen which shows the test status is displayed:
Copyright 2013 mark® Software
Page 13
MemTest86 executes a series of numbered test sections to check for errors. The execution order for these tests has been arranged so that errors will be detected as rapidly as possible. The time required for a complete of MemTest86 will vary greatly depending on U speed, memory speed and memory size. If memory errors are detected they will be displayed on the lower half of the screen. If MemTest86 runs multiple es without errors you can be certain that your memory is functioning properly. Other problems (hardware or software) may exist but at least you can eliminate memory as a culprit. In addition the U must work properly to run MemTest86 so successful execution implicitly assures that your U is also functioning properly. MemTest86 can not diagnose many types of PC failures. For example a faulty U that causes Windows to crash will most likely just cause MemTest86 to crash in the same way. 2.4.3.1 Runtime Configuration Options MemTest86 may be configured during operation via runtime configuration commands. Pressing the “C” key at anytime will display the runtime command menu.
When running the UEFI version, the following options are available in the Configuration command menu: Settings: (1) Skip Current Test (2) End Test (3) Go Back to Main Menu (0) Continue Copyright 2013 mark® Software
Page 14
The runtime configuration commands allow the to adjust the following settings. (1) Skip Current Test- Aborts the current test and starts the next test in the sequence (2) End Test - Stops the test and displays a summary of the results (3) Go Back to Main Menu – Stops the test nd returns to the main menu (4) Continue - Resume the test
2.4.4 Test Report (Pro version only) At the end of the test, a summary of the test results is displayed, as well as the option to save an HTML test report to a file. The test report is fully customizable by modifying the following files: mt86head.htm - The HTML code that will be used as a header to the test report. This can contain a company logo, information or any additional information about the test environment. mt86foot.htm – The HTML code that will be used as a footer to the test report. Examples of usage include a signature line to indicate the technician who performed the test, or a disclaimer about the validity of the test results. report.css – The stylesheet used to specify the appearance of the report. The file itself contains all the properties that are used, along with several templates that can be used to customize the report.
2.4.5 Troubleshooting MemTest86 Problems A log file (MemTest86.log) is automatically created and updated while MemTest86 is running. This log file contains information that is helpful in diagnosing possible memory failures or problems with MemTest86 itself. If you believe you may have encountered a bug with MemTest86, please report problems to
[email protected].
Copyright 2013 mark® Software
Page 15
2.5 Using MemTest86 (v4 BIOS) To start MemTest86, insert the MemTest86 floppy disk, CD-ROM or USB flash drive into the appropriate drive and restart your computer.
Note: The BIOS must be configured to boot from the device that MemTest86 is installed on. Newer computers have an optional boot menu that is enabled be pressing a key at startup (often ESC, F9, F11 or F12). If available use the boot menu to select the correct drive. With older computers the boot sequence must list the drive used to load Memest86 before any other bootable media (i.e. a hard disk where Windows is installed). Most systems will be configured with a suitable boot sequence. If not you can reconfigure the boot sequence using the BIOS settings. Please consult your motherboard documentation for details.
2.5.1 Testing When MemTest86 starts it displays details about the system configuration and then begins testing. MemTest86 executes a repeating cycle of tests. Testing will continue to run until the program execution is interrupted (by pressing the ESC key or pressing the reset button). There is no set time for how long the test should be run. The following is an example of the MemTest86 screen.
MemTest86 executes a series of numbered test sections to check for errors. The execution order for these tests has been arranged so that errors will be detected as rapidly as possible. The counter increments each time that all of the tests have been run. The first is abbreviated to find errors more quickly. Subsequent es execute longer and will be more thorough. Generally a single is sufficient to detect all but the most obscure errors. For Copyright 2013 mark® Software
Page 16
complete confidence in cases where intermittent errors are suspected, testing for a longer period is advised. The time required for a complete of MemTest86 will vary greatly depending on U speed, memory speed and memory size. If memory errors are detected they will be displayed on the lower half of the screen. The default error reporting mode will display a detailed summary of all errors. If MemTest86 runs multiple es without errors you can be certain that your memory is functioning properly. Other problems (hardware or software) may exist but at least you can eliminate memory as a culprit. In addition the U must work properly to run MemTest86 so successful execution implicitly assures that your U is also functioning properly. MemTest86 can not diagnose many types of PC failures. For example a faulty U that causes Windows to crash will most likely just cause MemTest86 to crash in the same way. 2.5.1.1 Runtime Configuration Options MemTest86 may be configured during operation via runtime configuration commands. Pressing the “C” key at anytime will display the runtime command menu. Tthe following options are available in the Configuration command menu:
Settings: (1) Test Selection (2) Address Range (3) Error Report Mode (4) U Selection Mode (5) Refresh Screen (6) Restart Test (7) Miscellaneous Options (0) Continue
The runtime configuration commands allow the to adjust the following settings. (1) Test Selection - Individual tests may be selected for execution or the current test may be skipped. When a failure has been identified it is much faster to troubleshoot by running only the failing test. (2) Address Range - With this option memory testing may be restricted to a selected portion of memory. This is also useful to speed up the troubleshooting process by testing only the failing portion of memory. (3) Error Report Mode - The error report mode may be changed to provide different types of error information. The default “Error Summary” mode provides all of the details required for diagnosing memory problems. However, other error reporting modes are available. "BadRAM” patterns may be used with Linux systems to map out bad memory blocks. (4) U Selection Mode – This option controls how Us are selected for testing. The options are All, Round Robin and Sequential. With Round Robin Us are selected in round robin fashion for each test. With the Sequential option all available Us are Copyright 2013 mark® Software
Page 17
used for each test. (5) Refresh Screen - Redraws the screen when the display becomes garbled. (6) Restart Test – Restarts test with default settings. (7) Miscellaneous Options – Enable and disable runtime options. Currently only two options are available, One and Boot trace. The One feature runs the complete test once and then exits, but only if there were no errors. This provides a convenient method for unattended testing. The Boot Trace option is a debugging tool that is primarily designed for troubleshooting startup problems, but may be enabled at anytime. A help bar is displayed at the bottom of the screen with the following options. Keyboard Assignment
Description
ESC
Exits the test and does a warm restart via the BIOS
C
Enter the configuration menu
SP (Spacebar)
Set scroll lock (Stops scrolling of error messages) Note: Memory testing is suspended when the scroll lock option is set and the scrolling display region is full.
CR (Enter)
Clear scroll lock (Enables error message scrolling)
2.5.2 Startup (boot) Options A number of options may be specified at boot time. The following options are available: One
– Enables the One option. This feature runs the complete test once and then exits, but only if there were no errors. This provides a convenient method for unattended testing. One may also be enabled via an on-line command. The syntax is “one” and there are no additional options. Btrace
– Enables the boot trace option and is used to diagnose test failures please see section 2.7 for details. The syntax is “btrace” and there are no additional options. Maxus
– Set the maximum number of U's to start. This is primarily a troubleshooting option. Setting maxus to one skips all code related to finding and starting additional Us. The syntax is “maxus=N” where N = 1 to 31. Test
List – Allows for execution on a subset of the normal test sequence. The syntax is “tstlist=list” where list is a comma separated list of test numbers to run (do not include spaces). Example: tstlist=1,4,5,7 U
Mask – A mask of U numbers that are started and enables. A 32 bit mask is specified with a 1 set for each U that will be enabled. U 0 cannot be disabled and will be enabled regardless of the mask. The syntax is “umask=N” where N is a 32 bit hexadecimal number with or without a 0x prefix. Example: umask=0x55555555 enables all even numbered Us. Copyright 2013 mark® Software
Page 18
Console
– Used to setup serial console parameters. The syntax is “console=ttySn,baudPL” where n = serial port number (0 or 1), baud = baud rate, P = parity setting (N, O or E) and L is the number of bits (7 or 8). Example: “console=ttyS0,9600e8 uses port 0 at 9600 baud, even parity and 8 bits. The standard Memtest86 media images provided use Syslinux for booting and allow for boot options to be specified interactively. At the “boot:” prompt type “memtest” followed by any of the above options. Multiple options may be specified separated by spaces. For example “memtest one tstlist=1,6,7,8” will start the test with the One option enabled and execute tests 1,6,7 and 8.
2.5.3 Troubleshooting with Boot Trace This is section is targeted at troubleshooting problems with execution of the test code and not for diagnosing reported memory errors. In Version 4.1 a Boot Trace feature was added that provides a simple mechanism for troubleshooting of test failures by both technical and nontechnical s. With the very large variety of computer hardware available it is impossible to test Memtest86 an all platforms and some incompatibilities exist causing failures. In the past determining the cause of these failures has been difficult to impossible. With trace information many of these faults may now be addressed. Enabling Boot Trace - The boot trace feature is enabled with either a boot time option or an on-line command. However, most problems occur during startup so the boot option is the preferred method. All of the released images have a boot option to start the test with boot tracing enabled. When booting from a Linux system with LILO or GRUB add “btrace” to the boot options line. When Boot Trace is enabled two columns of trace information will be displayed one the bottom two thirds of the screen. The U, line number from the source file, a short message and two parameters are displayed in each trace point. A “>” character denotes the current trace point. A total of 26 trace points are displayed and older trace points are lost as execution progresses. Pressing any key will advance to the next trace point. As initialization proceeds other portions of the screen will be filled in with information and the header lines will be erased. Gathering Trace Information – There are two types of failures where trace information is needed to help diagnose the problem, hangs and crashes. Gathering traces for a situation when the test hangs is very simple. Just continue to press return until the test stops. At minimum email the last 5 trace points. Much more useful would be a digital picture of the screen. For cases where the test crashes we need to see the trace points just before the crash. This requires slowly stepping through the traces and identifying the point where the test reboots. Then run the test again and stop at the trace point just before the crash and report the information. Again, email at least the last 5 trace points or better a picture of the screen. Please email failure information to
[email protected]. Using Trace Information - Technical s will be able to diagnose problems using trace data that previously would have required detailed understanding of Memtest86's internal workings. By following the trace points in the source you will be able to follow the path of execution and identify problems. As needed new trace points may be added to the code to provide more detail when diagnosing a problem. Copyright 2013 mark® Software
Page 19
3 Repairing Memory Faults When MemTest86 detects errors the error count will be incremented and the error details will be displayed on the lower half of the screen. The key information needed to diagnose errors are the “Error Confidence Value” and the “Errors per Memory Slot” information. Error confidence values over 100 should always be considered to be legitimate. The errors per memory slot indicate both the number of memory modules present in your PC and the number of errors for each memory module. This information may not be available for older PCs. The remaining error details may be ignored. To diagnose and repair a memory fault requires you to open your computer case and handle sensitive electronic components. With proper handling procedures this is not difficult. If you do not want repair your own hardware you can use MemTest86 to test your RAM, but rely on a third party to do the actual repair.
3.1 Anti-Static Handling Procedures Electronic components may be damaged by static electricity. The key to proper handling is to simply avoid causing a static discharge though the component that is being handled. This is done by discharging static buildup before a component comes in with another surface. For example when installing a component into a computer, first touch a bare metal part of the computer case. If you want to set a component on a table, touch the table first and then set down the component. If you take a step or even shuffle your feet you will need to redischarge any static buildup.
3.2 Re-Seating Memory Modules In many cases memory problems are caused by a poor connection. This can be resolved by simply removing and reinstalling the memory module(s) using the following procedure. 1. Using the documentation for your motherboard locate the memory module slots and identify the memory module(s). 2. Using proper anti-static handing procedures remove the memory module(s). In most cases this is done by pressing down on the locking tabs at the end of the module slot. 3. Carefully clean any dust or debris from the module and the motherboard memory module slots. 4. Reinstall the memory module(s) into their original slot on the motherboard. 5. Rerun MemTest86.
3.3 Replacing Modules If re-seating memory modules does not resolve the problem then a memory module will generally need to be replaced. When more than one module is installed you will need to determine which module is failing. First consult your motherboard documentation to determine if memory modules may be used individually or if they must be used in pairs. If your motherboard is configured with the minimum number of memory modules you will need to purchase a replacement memory Copyright 2013 mark® Software
Page 20
module in order to locate the failing one. Using the guidelines in your motherboard manual to maintain a working configuration selectively remove modules from the system and rerun MemTest86 to find the bad module(s). Be sure to carefully note which modules are in the system when the test es and fails. In most cases memory failures will be due to a faulty memory module and replacing the module will resolve the problem. However, the motherboard affects memory operation and may also cause memory errors. There are instances where only a particular combination of memory and motherboard will cause errors. This is typically due to the memory not being of sufficient quality and speed to keep up with the motherboard. This memory may function properly when installed in a less demanding motherboard. When errors are only detected by test #5 it is usually due to memory not being fast enough to work properly with the motherboard. More conservative memory timing BIOS settings (if ed) may resolve these problems. Otherwise higher quality memory may be required, or possibly the motherboard may need to be replaced.
3.4 Error Validity There are many s who question if errors detected by MemTest86 are valid. In at least 99.9% of cases reported errors will be legitimate and must be corrected. MemTest86 simply exercises all available RAM looking for errors. If any error is detected in RAM, regardless of how or where, it is a legitimate failure that needs to be corrected. There are no known issues regarding compatibility. It is possible, but unlikely, that a particular error will never show up in normal operation. However, operating with marginal memory is risky and can result in data loss and disk corruption. Even if there is no overt indication of problems you cannot assume that your system will be unaffected. There are some rare cases where motherboard manufacturers will map hardware s into space normally occupied by memory. When these locations are tested errors are reported. In this case, the reported errors will be invalid. To better identify these rare cases where invalid errors are reported an error confidence value is created by MemTest86. The program tracks failures and then does some simple analysis to determine the probability of invalid errors. When the confidence value exceeds 100 it is nearly impossible that the reported errors will be invalid.
Copyright 2013 mark® Software
Page 21
4 Over Clocking 4.1 Background MemTest86 is an invaluable tool for over clocking and may be used to increase your systems performance and reliability. Many shy away from over clocking fearing that it will make their PC less reliable. With a proper procedure, fine tuning your system timings is safe and in some cases will even improve reliability. Before embarking on over clocking one must understand the concept of margins or margin for error.
4.2 Operating Margins To achieve high reliability computer systems are designed and tested under conditions that are more strenuous than those expected in normal use. Take for example a computer that is designed to operate with a bus frequency of 133 MHz. A quality manufacturer will design and test for operation at perhaps 140 MHz or higher. This margin for error provides confidence that even with inevitable manufacturing variances and changing conditions operation will be reliable. When components from different manufacturers are combined an even greater margin for error is required since the exact characteristics of the associated components are not known. The result is the majority of computers will end up having a much larger margin for error than is needed. This means that a lot of performance is being wasted. On the other hand there will be a small number of systems that do not have enough margin of error and will have poor reliability even without over clocking. Many think of over clocking as simply making a computer operate as fast as possible. This approach is unwise and will often result in unreliable operation. A much better philosophy is to adjust and fine tune computer timings to maintain an appropriate margin for error. An appropriate margin includes not leaving too much margin, wasting performance. For example let say we have a computer that will operate properly with a system clock of 189 MHz. Obviously a lot of performance will be wasted if we operate the computer at the standard 133 MHz. In addition if the computer fails at 190 MHz it would be unwise to operate at 189 MHz since there is little margin for error and operation will likely not be reliable. An operating margin of 3% to 6% is sufficient to insure good reliability. For this example a system clock of 178 to 183 would be ideal. There may be cases where it will be advisable to under clock. If a system with a normal clock rate of 133 MHz does not operate properly at 136 MHz or more then there is not enough margin and under clocking is required to ensure reliability. To summarize, over clocking should be thought of as fine tuning a computer to ensure reliability first and secondarily maximizing performance.
4.3 Using MemTest86 for Over Clocking The first step in a proper over clocking procedure is to determine the operational limits of your computer. After the operational limits have been identified the system settings may be adjusted to provide enough margin to ensure high reliability. MemTest86 is an ideal tool to accurately determine the operation limits of your memory and U. Using the guidelines below, experiment with the settings available for your motherboard to find settings that result in the highest clock rate combined with the highest reported memory bandwidth. Copyright 2013 mark® Software
Page 22
1. Before attempting to over clock you need to know what system parameters your motherboard will allow you to adjust and how they are adjusted. Some motherboards will allow all useful parameters to be adjusted while others do not allow for any adjustment. Consult your motherboard manual for details. Some of the parameters that may be available are: ●
System clock rate
●
U clock multiplier
●
Memory timing
●
Memory clock multiplier
●
U voltage
●
Memory voltage
2. You need to know how to reset the CMOS to factory settings. It is common when over clocking to end up with settings that will not run the BIOS. When the parameters are set via CMOS the only way to recover is to reset the CMOS to the factory configuration. For some motherboards this is accomplished by removing a jumper. Other will require removal of the CMOS backup battery. Make sure that you know how to recover before starting. 3. Before adjusting any parameters, run MemTest86 for a full to establish a baseline. Record the memory bandwidth and U clock frequency for the default configuration. 4. Typically the best place to start is with the system clock rate. Increase the clock rate in fairly small increments (2-4 MHz) and then run at least a partial of MemTest86. Running at least 15% of tests 1-5 should be the minimum amount of testing for each iteration. Continue increasing the clock rate until you get a failure. Take your time and take good notes. For each step be sure that you record the memory bandwidth reported by MemTest86. Some BIOS's automatically adjust memory timings according to clock rate. You may find that by increasing the clock rate, memory performance will decrease. Once you find a failure back off on the clock rate until you find the point at which you get errors. Once you find the point at which memory errors occur back the clock off one step and run a full of MemTest86 to confirm the operational limit. In some cases the U will hang (stop responding) before memory errors are detected. 5. Some motherboards will allow you to adjust memory timing. Memory timings are typically listed as 4 values separated by hyphens. However, some motherboards only offer choices like fast, faster and fastest. There are two strategies for adjusting memory timing. If memory errors are detected before the U exhibits problems then slower memory timing may be used to allow a higher system clock rate to be used. The second strategy is to use faster memory timings to get more memory bandwidth without increasing the system clock rate. It is impossible to know which values will effect errors. Some of the memory timings will affect memory bandwidth and others will not. Be sure to record the reported memory bandwidth for each parameter change. 6. If in step 4 the U hangs before memory errors appear then the U has less margin for error than the memory. If available you may want to try reducing the U multiplier and then continue to increase the system clock until either the U stops or memory Copyright 2013 mark® Software
Page 23
errors occur. This is helpful if memory timings are not adjustable or are ineffective. 7. U and memory voltages are adjustable on some motherboards and increasing them may allow you to run at higher speeds. In particular higher U voltages tend to be quite effective for over clocking. However, higher voltages also mean higher temperatures so be sure that you have plenty of case cooling and an effective U cooler. Use carefully. 8. Some motherboards allow you to use a system clock multiplier for memory. The default is usually 1:1, or in other words the system and memory clock rates will be the same. This setting is only useful when the memory and U operational limits are significantly different and can not be brought into balance using the techniques listed above. Once you have established the operational limits of your system then you need to select settings that allow for a reasonable margin for error. Do not be tempted to use the maximum settings! A margin of 3% to 6% is recommended for reliable operation. The easiest way to add operational margin is to simply reduce the system clock rate by 3% to 6% from the maximum setting that functioned properly. In many cases the operational limits for the U and memory will be different. For example you can get more U speed by reducing memory settings. Or you can get more memory performance by reducing the U multiplier. For these cases you will need to choose a compromise. Both memory bandwidth and U clock rate are important so don't be tempted to only optimize for one or the other. MemTest86 provides good assurance of reliable memory operation when over clocking. Even when a dramatic increase in memory bandwidth is achieved. As long as MemTest86 does not report errors and appropriate margins have been applied then you should not hesitate to fully maximize memory timings. However, some caution must be exercised for system clock rate increases. With most motherboards the clock rate for the PCI and AGP busses are based on the system clock. Generally these busses will have no problem running at somewhat higher rates. MemTest86 does not test PCI or AGP and do not provide any assurance that anything other than the U and memory are working properly. Sadly there is currently no safe way to determine the operational limits for PCI and AGP and therefor there is no way to assure that there are appropriate margins. Unless your motherboard is able to independently establish the frequency of PCI and AGP busses you should be careful about running with large (more than 15%) increases in the system clock. In addition running your U at higher frequencies will generate more heat. Small frequency increases will generally be fine with the installed U cooler. Larger increases in the system clock rate may necessitate a larger, more effective U cooler.
Copyright 2013 mark® Software
Page 24
Appendices Appendix A.Technical Information Appendix A contains technical information from the MemTest86 REE file that was previously released under the Gnu Public License (GPL). This section provides additional background information and technical information that may be useful for advanced s.
A.1 Memory Testing Philosophy There are many approaches for testing memory. However, many tests simply throw some patterns at memory without much thought or knowledge of memory architecture or how errors can best be detected. This works fine for hard memory failures but does little to find intermittent errors. BIOS based memory tests are useless for finding intermittent memory errors. Memory chips consist of a large array of tightly packed memory cells, one for each bit of data. The vast majority of the intermittent failures are a result of interaction between these memory cells. Often writing a memory cell can cause one of the adjacent cells to be written with the same data. An effective memory test attempts to test for this condition. Therefore, an ideal strategy for testing memory would be the following: 1. write a cell with a zero 2. write all of the adjacent cells with a one, one or more times 3. check that the first cell still has a zero It should be obvious that this strategy requires an exact knowledge of how the memory cells are laid out on the chip. In addition there is a never ending number of possible chip layouts for different chip types and manufacturers making this strategy impractical. However, there are testing algorithms that can approximate this ideal strategy.
A.2 MemTest86 Test Algorithms MemTest86 uses two algorithms that provide a reasonable approximation of the ideal test strategy above. The first of these strategies is called moving inversions. The moving inversion test works as follows: 1. Fill memory with a pattern 2. Starting at the lowest address 2a. Check that the pattern has not changed 2b. Write the patterns complement 2c. Increment the address Repeat 2a - 2c 3. Starting at the highest address 3a. Check that the pattern has not changed 3b. Write the patterns complement 3c. Decrement the address Repeat 3a - 3c Copyright 2013 mark® Software
Page 25
This algorithm is a good approximation of an ideal memory test but there are some limitations. Most high density chips today store data 4 to 16 bits wide. With chips that are more than one bit wide it is impossible to selectively read or write just one bit. This means that we cannot guarantee that all adjacent cells have been tested for interaction. In this case the best we can do is to use some patterns to insure that all adjacent cells have at least been written with all possible one and zero combinations. It can also be seen that caching, buffering and out of order execution will interfere with the moving inversions algorithm and make less effective. It is possible to turn off cache but the memory buffering in new high performance chips can not be disabled. To address this limitation a new algorithm called Modulo-X was created. This algorithm is not affected by cache or buffering. The algorithm works as follows: 1. For starting offsets of 0 - 20 do steps 2-5 2. write every 20th location with a pattern 3. write all other locations with the patterns complement 4. repeat 1b one or more times 5. check every 20th location for the pattern This algorithm accomplishes nearly the same level of adjacency testing as moving inversions but is not affected by caching or buffering. Since separate write es (1a, 1b) and the read (1c) are done for all of memory we can be assured that all of the buffers and cache have been flushed between es. The selection of 20 as the stride size was somewhat arbitrary. Larger strides may be more effective but would take longer to execute. The choice of 20 seemed to be a reasonable compromise between speed and thoroughness.
A.3 Individual Test Descriptions MemTest86 executes a series of numbered test sections to check for errors. These test sections consist of a combination of test algorithm, data pattern and caching. The execution order for these tests were arranged so that errors will be detected as rapidly as possible. A description of each of the test sections follows: Address test, walking ones Tests all address bits in all memory banks by using a walking ones address pattern. Errors from this test are not used to calculate BadRAM patterns. Address test, own address Each address is written with its own address and then is checked for consistency. In theory previous tests should have caught any memory addressing problems. This test should catch any addressing errors that somehow were not previously detected. Moving inversions, ones & zeros This test uses the moving inversions algorithm with patterns of all ones and zeros. Cache is enabled even though it interferes to some degree with the test algorithm. With cache enabled this test does not take long and should quickly find all "hard" errors and some more subtle errors. Copyright 2013 mark® Software
Page 26
Moving inversions, 8 bit pattern This is the same as test 3 but uses a 8 bit wide pattern of "walking" ones and zeros. This test will better detect subtle errors in "wide" memory chips. A total of 20 data patterns are used. Moving inversions, random pattern This test uses the same algorithm as test 3 but the data pattern is a random number and it's complement. This test is particularly effective in finding difficult to detect data sensitive errors. A total of 60 patterns are used. The random number sequence is different with each so multiple es increase effectiveness. Block move This test stresses memory by using block move (movsl) instructions and is based on Robert Redelmeier's burnBX test. Memory is initialized with shifting patterns that are inverted every 8 bytes. Then 4MB blocks of memory are moved around using the movsl instruction. After the moves are completed the data patterns are checked. Because the data is checked only after the memory moves are completed it is not possible to know where the error occurred. The addresses reported are only for where the bad pattern was found. Since the moves are constrained to an 8MB segment of memory the failing address will always be lest than 8MB away from the reported address. Errors from this test are not used to calculate BadRAM patterns. Moving inversions, 32 bit pattern This is a variation of the moving inversions algorithm that shifts the data pattern left one bit for each successive address. The starting bit position is shifted left for each . To use all possible data patterns 32 es are required. This test is quite effective at detecting data sensitive errors but the execution time is long. Random number sequence This test writes a series of random numbers into memory. The initial pattern is checked and then complemented and checked again on the next iteration. However, unlike the moving inversions test, writing and checking can only be done in the forward direction. On the first , a fixed seed number is used so that the random number generator always generates the same sequence of numbers, allowing the test run to be reproducible. All subsequent es use a seed number generated from the system clock, resulting in more permutations being tested. The 64-bit and 128-bit versions of this test is essentially the same as the original 32-bit test, except that it uses native 64-bit and SIMD instructions respectively. Modulo 20, random pattern Using the Modulo-X algorithm should uncover errors that are not detected by moving inversions due to cache and buffering interference with the algorithm. A sequence of 6 32 bit random patterns are used. Bit fade test, 2 patterns The bit fade test initializes all of memory with a pattern and then sleeps for 5 minutes. Then memory is examined to see if any memory bits have changed. All ones and all zero patterns are used. Copyright 2013 mark® Software
Page 27
A.4 Error Display Information MemTest86 has 4 options for reporting errors. The default is error summary mode where key failure information is displayed along with an error confidence value. Reporting of individual errors is also available. In BadRAM Patterns mode patterns are created for use with the Linux BadRAM feature. This feature allows Linux to avoid bad memory pages. Details about the BadRAM feature can be found at: http://home.zonnet.nl/vanrein/badram The error summary mode reports the following data: Error Confidence Value: A value that indicates the validity of the errors being reported with larger values indicating greater validity. There is a high probability that all errors reported are valid regardless of this value. However, when this value exceeds 100 it is nearly impossible that the reported errors will be invalid. Lowest Error Address: The lowest address that where an error has been reported. Highest Error Address: The highest address that where an error has been reported. Bits in Error Mask: A mask of all bits that have been in error (hexadecimal). Bits in Error: Total bit in error for all error instances and the min, max and average bit in error of each individual occurrence. Max Contiguous Errors: The maximum of contiguous addresses with errors. ECC Correctable/Uncorrectable Errors: The number of errors that have been corrected/uncorrected by ECC hardware. Test Errors: On the right hand side of the screen the number of errors for each test are displayed.
When the individual error reporting mode is selected the following information is displayed. An error message is only displayed for errors with a different address or failing bit pattern. All displayed values are in hexadecimal. Label
Description
Tst:
Test number
Failing Address :
Failing memory address
Good:
Expected data pattern
Bad:
Failing data pattern
Err-Bits:
Exclusive or of good and bad data (this shows the position of the failing bit(s))
Count:
Number of consecutive errors with the same address and failing bits
Copyright 2013 mark® Software
Page 28
Appendix B.Product Please report problems to: Email:
[email protected] Please include: •
The software version.
•
A detailed description of the problem and how to reproduce it.
•
A copy of the result files / log files (if available).
•
Details of how we may be able to you.
B.1 Known Problems MemTest86 can not diagnose many types of PC failures. For example a faulty U that causes Windows to crash will most likely just cause MemTest86 to crash in the same way. With some PC's MemTest86 will just die with no hints as to what went wrong. Without any details it is impossible to fix these failures. Fixing these problems will require debugging on your part. There is no point in reporting these failures unless you have a Linux system and would be willing to debug the failure. MemTest86 s all types of memory. If fact the test has absolutely no knowledge of the memory type nor does it need to. This not a problem or bug but is listed here due to the many questions about this issue. B.1.1 UEFI (v5+) MemTest86 depends on the services provided by the UEFI firmware, which include mouse/keyboard , multiprocessor services, file input/output, and PCI device management. Because the services that are provided by the firmware are developed individually by the motherboard manufacturers, there may be functional differences than can limit or prevent the proper operation of MemTest86. Some of these issues are highlighted below: •
Some UEFI implementation do not provide full mouse , preventing the use of the mouse in MemTest86. If this is the case, use the keyboard or try updating to a new firmware build.
•
Multiple U testing may not be available due to limited or non-functional implementations of Multiprocessor services provided by UEFI, especially for older firmware. This may cause a reduced number of processors available for testing, or even program freeze when attempting to run on other processors. If this is the case, run in single U mode or try updating to a new firmware build.
MemTest86 is also inherently limited by the UEFI environment and as such, gives rise to the following limitations: •
MemTest86 USB flash drives cannot be read in Windows XP (32-bit) due to its lack of of GPTdisks
•
MemTest86 cannot remap itself to different portions of memory in order to run tests in
Copyright 2013 mark® Software
Page 29
the section of memory it was occupying. •
Dual UEFI entries may be present in UEFI BIOS as boot devices. There is no difference in selecting either entry This is due to a workaround that allows MemTest86 USB flash drives to be accessible in Windows.
B.1.2 BIOS (v4) Sometimes when booting from a floppy disk the following messages scroll up on the screen: X:8000 AX:0212 BX:8600 CX:0201 DX:0000 This is the BIOS reporting floppy disk read errors. Either re-write or toss the floppy disk. On a small number of machines, false positives were reported when running Test #3 in parallel mode. If this is the case, re-run the test in single U mode before concluding that it is a RAM issue.
B.2 Enhancements Please send enhancement requests to:
[email protected] All requests will be considered, but not all can be implemented
Copyright 2013 mark® Software
Page 30
Appendix C.Change Log New features in v5.0.0 (Dec/2013) •
Completely re-written to work under UEFI.
•
Native 64-bit
•
No longer requires the use of the PAE workaround to access more than 4GB of memory. (PAE = Physical Address Extension)
•
Mouse , where ed by the underlying UEFI system. On older systems a keyboard is still required.
•
Improved USB keyboard . The keyboard now works on systems that fail to emulate IO Port 64/60 correctly. So Mac USB keyboards are now ed.
•
Improved multi-threading , where ed by the underlying UEFI system.
•
Dual boot with Memtest version 4 for ing older systems without UEFI. So with a single USB or CD drive both UEFI systems and BIOS systems can be ed.
•
Reporting of detailed RAM SPD information. Timings, clock speeds, vendor names and much more.
•
to writing to the USB drive that Memtest is running from for logging and report generation. In all prior MemTest releases there was no disk .
•
Use of GPT. (GUID Partition Table)
•
ECC RAM (limited hardware , ongoing development) ◦
Detection of ECC in both the RAM and memory controller
◦
Polling for ECC errors
◦
Injection of ECC errors for test purposes. (limited hardware only)
•
Option to disable U caching for all tests
•
for reading parameters from a configuration file to allow settings to be predefined without the need for keyboard input. This can help with automation.
•
for Secure Boot
•
Speed improvements of between 10% and 30%+. Especially for tests, #5, #8 & #9. This is the result more moving to native 64bit code, removing the PAE paging hack, switching compilers and using faster random number generation algorithms.
•
Addition of 2 new memory tests to take advantage of 64bit data and SIMD instructions.
Copyright 2013 mark® Software
Page 31
Enhancements in v4.3.6 (Nov/2013) •
Fixed crash (particularly for AMD machines) that is seemingly resolved by adding U synchronication barriers before and after performing the memory speed test
•
Fixed an error in setting the barrier structure's base address, preventing a possible crash or freeze of the system.
•
Added a check to perform a spin lock only when more than 1 Us are detected
Enhancements in v4.3.5 (Oct/2013) •
Fixed potential error due to barrier structure located at fixed memory location
•
Fixed block move test freeze on higher memory addresses
Enhancements in v4.3.4 (Oct/2013) •
Fixed incorrect progress calculation for test 0
•
Fixed incorrect memory size due to bug with memory map when the e820 entry size member is 0
•
Fixed incorrect number of U's found due to duplicate entries in the MADT
•
Changed the method used to search for processors to searching the APIC MADT first, then search the MP spec table (as opposed to vice versa). The MP spec table has largely been deprecated.
Enhancements in v4.3.3 (Sept/2013) •
Fixed incorrect progress calculation for test 4
•
Fixed potential false positives in parallel mode caused by overlapped/unaligned memory chunk allocations per U
•
Fixed program freeze when selecting test 0 or 1 when running in non-parallel mode
Enhancements in v4.3.2 (Aug/2013) •
Memory bandwidth is now measured for one U (as opposed to being a total for all Us & Cores). This will lower the reported bandwidth for multi-core machines. But we think it makes more sense this way.
•
Fixed crash when attempting to boot on older single core machines with hyperthreading. Only effects old machines, from around the early Pentium 4 era, that didn't have a MP (Multi-Processor) Spec table defined but did have both a MADT (Multiple APIC Description Table) defined and hyperthreading enabled.
•
Restored the "Start only one U" boot option. This option should not be required in normal use, but might be useful for debugging purposes.
•
Updates to the included help file
Enhancements in v4.3.1 (Aug/2013) •
Fixed bug with Test 6 (Block Move Test) not testing the end of a memory segment correctly
•
Removed unnecessary boot options in menu
Copyright 2013 mark® Software
Page 32
Enhancements in v4.3 (Jul/2013) •
Changed default U selection mode to round robin. Running all Us at once has been shown to cause false positives on a number of systems.
•
Fixed a bug that could cause the program to go into a tight loop that could not be escaped when setting certain memory ranges to test.
•
Fixed a bug displaying the memory location of individual errors. The values after the decimal point in the MB readout were incorrect.
•
Fixed a bug in configuring upper and lower memory limits, previously lower limits equal or grater than 2gb would not work, as well as some other more obsucre configurations.
•
Added a misc option to display the systems memory map.
•
Fixed a bug that would cause the number of es to not correctly reset after changing the selected tests.
•
Added missing source code to some of the packages.
•
Fixed a bug in test 8 causing a single error to cascade into multiple errors.
•
Fixed a bug causing the average error bits to be incorrect once the errors had maxed out at 65k
•
Fixeda bug preventing test 10 to be selected as a single test to run.
•
Fixed bug displaying individual test error counts.
•
Fixed bug making overall errors 10x what they should be.
Enhancements in v4.2 (Mar/2013) •
Fixed issues with USB keyboards. The USB keyboard functionality is memory mapped into a portion of low memory on some (maybe many) machines, typing on a USB keyboard changes some values in RAM as the key presses are stored in memory as you type. This can cause the keyboard to become unresponsive during testing or input from the keyboard to generate errors in the tests.
•
Fixed crash when configuring memory ranges. Changing the meory range during the first test, or changing the memory range multiple times during later tests could cause the current test number to become negative, triggering a crash.
•
Fixed highest error address not reporting correctly on error.
•
Fixed error counters overflowing and reseting to 0 after too many errors.
•
Fixed max contiguous error reporting.
•
Cleaned up some UI text.
•
The Windows USB package now includes ImageUSB to make creating Memtest86 USB drives easier.
Copyright 2013 mark® Software
Page 33
Enhancements in v4.1 (Jan/2013) •
Added a new boot trace option that single steps through the testing process and displays messages and data that is valuable in diagnosing problems with test execution. A large number of trace points have been added in key portions of the code (in particular SMP startup routines) to provide visibility of obscure failures. This feature will allow non-technical s to provide troubleshooting data for better test stability.
•
Added a new One feature. This feature runs the complete test once and then exits, but only if there were no errors. This provides a convenient method for unattended testing. One may be enabled via a boot option or via an on-line command.
•
Images for CD, USB key and Floppy disks now use Syslinux for booting and include a variety of standard options and two previous versions of Memtest86. The new boot time options may be specified at the boot prompt.
•
A feature has been added to allow customization of the list of tests to be run. The test list may be specified via a boot option or via an on-line command.
•
A feature has been added to restrict specific Us that are to be used for testing. The maximum number of Us may be specified or a 32 bit U mask may be specified. These are enabled with boot options.
•
A number of problem with use of on-line commands when testing with more than one U have been fixed.
•
A selection of boot time parameters are were added. These options enable boot tracing, the One feature, limit the maximum number of Us to use, specify a U mask to select Us to be used and setup serial console parameters.
•
Improved and extended U identification routines. Newer UID based method is now used to determine cache sizes for Intel Us for better accuracy and ability.
•
Routines for calculating cache and memory speeds have been reworked for better accuracy. An overflow problem has been fixed that resulted in no memory speed being reported for Us with large L3 caches.
•
Fixed some errors in the crash reporting routines.
•
Misc minor fixes and code cleanup.
Copyright 2013 mark® Software
Page 34
Enhancements in v4.0 (28/Mar/2011) •
Full for testing with multiple Us. All tests except for #11 (Bit Fade) have been multithreaded. A maximum of 16 Us will be used for testing.
•
U detection has been completely re-written to use the brand ID string rather than the cumbersome, difficult to maintain and often out of date UID family information. All new processors will now be correctly identified without requiring code .
•
All code related to controller identification, PCI and DMI has been removed.
•
This may be a controversial decision and was not made lightly. The following are justifications for the decision: 1. Controller identification has nothing to do with actual testing ofmemory, the core purpose of Memtest86. 2. This code needed to be updated with every new chipset. With the ever growing number of chipsets it is not possible to keep up with the changes. The result is that new chipsets were more often than not reported in-correctly. In the authors opinion incorrect information is worse than no information. 3. Probing for chipset information carries the risk of making the program crash. 4. The amount of code involved with controller identification was quite large, making more difficult.
•
Removing this code also had the unfortunate effect of removing reporting of correctable ECC errors. The code to ECC was hopelessly intertwined the controller identification code. A fresh, streamlined implementation of ECC reporting is planned for a future release.
•
A surprising number of conditions existed that potentially cause problems when testing more than 4 GB of memory. Most if not all of these conditions have been identified and corrected.
•
A number of cases were corrected where not all of memory was being tested.
•
For most tests the last word of each test block was not tested. In addition an error in the paging code was fixed that omitted from testing the last 256 bytes of each block above 2 GB.
•
The information display has been simplified and a number of details that were not relevant to testing were removed.
•
Memory speed reporting has been parallelized for more accurate reporting for multi channel memory controllers.
•
This is a major re-write of the Memtest86 with a large number of minor bugfixes and substantial cleanup and re-organization of the code.
Copyright 2013 mark® Software
Page 35
Enhancements in v3.5 (3/Jan/2008) •
Limited for execution with multiple Us. Us are selected round-robin or sequential for each test.
•
for additional chipsets. (from Memtest86+ v2.11).
•
Additions and corrections for U detection including reporting of L3 cache.
•
Reworked information display for better readability and new information.
•
Abbreviated iterations for first .
•
Enhancements to memory sizing.
•
Misc fixes and code cleanup
Enhancements in v3.4 (8/Sep/2007) •
A new error summary display with error confidence analysis
•
Display of memory module information (from Memtest86+ v1.70)
•
Relocated testing reworked to overlap main testing for better error detection
•
for additional chipsets. (from Memtest86+ v1.70)
•
Additions and corrections for U identification
•
Misc bug fixes and code cleanup
Enhancements in v3.3 (12/Jan/2007) •
Added for additional chipsets. (from Memtest86+ v1.60)
•
Changed Modulo 20 test (#8) to use a more effective random pattern rather than simple ones and zeros.
•
Fixed a bug that prevented testing of low memory.
•
Added an advanced menu option to display SPD info (only for selected chipsets).
•
Updated U detection for new Us and corrected some bugs.
•
Reworked online command text for better clarity.
•
Added a fix to correct a Badram pattern bug.
Copyright 2013 mark® Software
Page 36
Appendix D.Acknowledgments D.1 UEFI (v5+) MemTest86 (UEFI) was developed by Mark Software based on the original source code by Chris Brady.
D.2 BIOS (v4) MemTest86 was developed by Chris Brady and maintained by Mark Software with the resources and generous assistance of individuals and sources listed below: The initial versions of the source files bootsect.S, setup.S, head.S and build.c are from the Linux 1.2.1 kernel and have been heavily modified. Doug Sisk provided code to a console connected via a serial port. Code to create BadRAM patterns was provided by Rick van Rein. Tests 5 is based on Robert Redelmeier's burnBX test. Screen buffer code was provided by Jani Averbach. Eric Biederman provided all of the feature content for version 3.0 plus many bugfixes and significant code cleanup. Major enhancements to hardware detection and reporting in version 3.2, 3.3 and 3.4 provided by Samuel Demeulemeester (from MemTest86+ v1.11, v1.60 and v1.70).
Copyright 2013 mark® Software
Page 37