« Servers Still Down Monday AM | Home | Intel Compilers & MKL for OS X »
January 9, 2006
Downtime on Tuesday, January 10th
The machines were turned back on at ca. 10:30 am when the room had cooled enough to take the extra systems.
Now I'm told that the ventillation will go down tomorrow am from ca. 5 - 7. The machines can easily overheat - which could result in file-system damage, even hardware damage - before two hours without ventillation. I'm going going to shut down everything but mathserv, bayes and freesurface this time. Redpine, spruce, modelmath and mathserv2 will shutdown at ca. 4:00 am.
Yes, this is ridiculous. I'm going to contact PP tomorrow and see what can be done to avoid these frequent, sudden shutdowns. Part of the problem is the way the server room was divided at the 11th hour during the 2003 renovations: a server room as large as was intended would overheat far less quickly. Is this fixable? Yes - it would be expensive and involve having signifigant downtime. I'm hoping that that won't start to look desirable.