The Glasgow computing service lost power to the main campus routers this morning, and the UPS didn't work. So Glasgow was down completely for three or so hours, failing everything. Durham and Edinburgh were also failing RM tests, because the Glasgow BDII could not be contacted.
Single points of failure, eh! It's long been a Glasgow complaint that only one BDII can be specified, with no failover.
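For illustration, here is a rough sketch of the kind of client-side failover we'd like to see: walk an ordered list of BDII endpoints and use the first one that answers. This is not how the middleware works today. The function names and hostnames are made up, port 2170 is assumed to be the BDII LDAP port, and a bare TCP connect stands in for a real LDAP query.

```python
import socket
from typing import Callable, Iterable, Optional, Tuple

Endpoint = Tuple[str, int]  # (hostname, port)


def tcp_reachable(endpoint: Endpoint, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to the endpoint succeeds in time.

    A crude stand-in for a real health check: it shows only that something
    is listening, not that the BDII is returning sensible data.
    """
    try:
        with socket.create_connection(endpoint, timeout=timeout):
            return True
    except OSError:  # covers refused connections, timeouts, DNS failures
        return False


def first_reachable(
    endpoints: Iterable[Endpoint],
    probe: Callable[[Endpoint], bool] = tcp_reachable,
) -> Optional[Endpoint]:
    """Return the first endpoint the probe accepts, or None if all fail."""
    for endpoint in endpoints:
        if probe(endpoint):
            return endpoint
    return None


# Hypothetical hostnames; 2170 is assumed to be the BDII LDAP port.
bdii_candidates = [
    ("bdii.glasgow.example", 2170),
    ("bdii.other-site.example", 2170),
]

# Simulate this morning's outage with a stub probe instead of the network.
down = {"bdii.glasgow.example"}
chosen = first_reachable(bdii_candidates, probe=lambda ep: ep[0] not in down)
```

With a list like this, losing the Glasgow BDII would just have moved other sites on to the next entry instead of failing their tests.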
1 comment:
Indeed. There seem to be a couple of places where the middleware requires you to have no fall-back position at all, which leads to precisely this kind of problem.
Maybe we should all run a top-level BDII per site...