SIDEBAR
»
S
I
D
E
B
A
R
«
VMWare Cluster Disk Failure
May 30th, 2014 by Tim Watts

Update: Now replaced

Nothing to worry about – Disk 19 failed. The system has one hotspare remaining and is functionally normally.

Dell have dispatched a replacement disk that should be here today.

Backup server down
May 19th, 2014 by Tim Watts

Update – 19/5/2014 – Cause of failure determined to be one of the ITS network switches had failed. This has been fixed and everything is working normally.

Update – 15/5/2014 16pm – OK – Almost all backup sets have been retried with success. I think this will be safe for the time being until I do some hardware tests on the server.

Update – 15/5/2014 11am – It’s back. Thanks to Paul V who was passing and very kindly peplugged its network into another CISCO switch.

It is currently running the backups that it should have done last night. However it’s Remote Access (LOM) card seems to have failed which may have caused the network glitch (that model of LOM shares the host servers Network Interface). It’s a recycled server – but we have several spares of that model so I should be able to effect a repair by Monday.

In the meantime, please continue to be careful as the backup system is “at-risk”.

 

Right now, I cannot ping miner.cch.kcl.ac.uk – this lives in Drury Lane and is physically where our backup files live.

I have reason to believe that it is a network issue not a server problem, but we are investigating.

For now, please be on guard as I cannot restore any files nor are backups currently running. We should still be covered by the last successful backups from 2 days ago though.

gsr2.cch failed
May 2nd, 2014 by Tim Watts

Due to the SAN LUN behind it filling. This has now been corrected and gsr2 restarted.

SIDEBAR
»
S
I
D
E
B
A
R
«
»  Substance:WordPress   »  Style:Ahren Ahimsa