RPM Service Report

From RPM Wiki

2011 Unplanned Outages

  • 3.25 hours on Sep 26
    • RESOLVE and AGENT both completely down.
    • Cause was a network software problem at our hosting provider.
  • 1.5 hours on Dec 31
    • All AGENT subscribers were affected to varying degrees.
    • At 8:36pm MST the central database server reported its first error. After further errors we rebooted the server, which takes at least 2 hours to return to full performance while interrupted databases are recovered.
    • This affected all AGENT users to some degree because this server contains the global user database. It severely affected subscribers hosted on that server.
    • AGENT was effectively unavailable for some or all subscribers from 8:45pm to about 10pm MST. The outage also pushed back the regular Saturday night optimization, so it ran later into Sunday morning than usual.
  • This page was last modified 18:13, 9 Jan 2012.