Stability problems solved (hardware problem)

After putting the disks of the 7-stable system which exhibited stability problems into a completely different system (it is a rented root-server, not our own hardware), the system has now survived more than a day with the UFS setup, and there is still no trace of problems. Previously it would crash after a few minutes.

The ZFS setup on the changed hardware had a problem during the night before (as always after all my ZFS-related changes on this machine). However, on this machine I had changed all locks in ZFS from shared locks to exclusive locks (this extended the uptime from 4-6 hours to “until I rebooted the morning after because of hanging processes”), so the problem may be caused by that change. I do not know yet whether we will test the ZFS setup with the pure 7-stable source we use now (the goal was to get back a stable system, not to play around with unrelated stuff).

It looks like some kind of hardware problem was uncovered by updating from 7.1 to 7.2 (and subsequently to 7-stable). The new machine has a completely different chipset, a new CPU, RAM, PSU and … so I do not really know what caused this. Still, the fact that the previous system did not recognize the CPU after it was replaced with a bigger one, and the observation that only shared locks with a specific usage pattern were affected, make me suspect missing microcode updates…