April 24, 2006
Mathserv Problem Monday Morning
The primary server, mathserv, stopped responding to network activity at 4:00 am this morning; services were restored at 9:45 am. There is no evidence of data loss; msprime workstations began working again as soon as service was restored.
Email and web sites were down while mathserv was unresponsive, and the msprime systems were hung. Shortly after 9:45, email queued since 4:00 am was delivered and the msprime systems unfroze. I can't say what the precise cause of the problem was; it had something to do with the network-traffic routing at the kernel level. I will be keeping an eye on the system.