On Sun, 2012-04-15 at 10:52 -0500, Jimmy Hess wrote:
In any given 24 hour period, the probability of at least one single bit error exceeds 98%. Assuming the memory is good and functioning correctly;
It's expected to see on average approximately 3 to 4 1-bit errors per day. More are frequently seen.
Now if most of this 16GB of memory is unused, you will never notice that over 30 days, 120 or so bits have been flipped from their proper value..
Hi, I've been operating 4 desktop PCs with each the following configuration: 16 GB of RAM (4x4GB Kingston) running Linux about 15 VM (KVM) on DRBD disks using more than 10 GB of RAM for nearly a year now in a room without cooling. Over the year I've got one dead HDD and one dead SSD (both replaced) but no data corruption or host or VM crash. Do you have reference to recent papers with experimental data about non ECC memory errors? It should be fairly easy to do (write and read scan memory in a loop) and given your computations you should get bit errors in less than a day. I remember this paper in 2003 but this was using abnormal heat: http://www.cs.princeton.edu/~sudhakar/papers/memerr-slashdot-commentary.html Thanks in advance, Sincerely, Laurent