Tuesday, April 27, 2010

Network Issues

"When a Fail-Safe system fails, it fails by failing to fail safe." - John Gall

I came into work today, and on the elevator someone from another department told me I was going to have a fun day: all the computers were down. Okay, whatever. I got to my office, and yes, the entire network was down. I could not get to any internal sites or to the internet! Then I found out that the shipment of COWs had just shown up in receiving. Well, that at least gave me something to do. We brought the herd of 51 up to our area, and then I went to the computer room to see how things were going. They were still trying to figure out what had happened. It looked like all the servers and applications were running, but there was no communication. They have been trying to bring things back slowly, and parts of the system have been going up and down throughout. I just got internet back and am still waiting on the internal stuff to come up. What happened to the redundant systems that are supposed to prevent this? 5-6 hours of downtime is a long time.

Update: It looks like the redundant system that was supposed to prevent this was the cause.
