ARSC system downtime for all systems (all)
Menu to filter items by type
Contents for all systems
Start Time: 05/18/2013 -- 18:00 End Time: 05/20/2013 -- 09:00 Reason: University FAST power outage. Routine system maintenance.
Start Time: 05/18/2013 -- 18:00 End Time: 05/19/2013 -- 21:00 Reason: University FAST power outage. Routine system maintenance.
Start Time: 05/18/2013 -- 12:00 End Time: 05/19/2013 -- 21:00 Reason: University FAST power outage and OS upgrade to Default modules will be updated. Routine system maintenance.
Start Time: 05/18/2013 -- 12:00 End Time: 05/24/2013 -- 12:00 Reason: University FAST power outage and OS upgrade to RHEL 6.4. Default modules will be updated. *** Recompiling of user code required. *** If users wish to recompile your pacman code in the RHEL 6.4 environment prior to the May 18th scheduled downtime, please log onto pacman14.arsc.edu and recompile using the software available via modules and /usr/local/pkg. If users would like to submit batch jobs in the RHEL 6.4 environment prior to the scheduled downtime, please contact the ARSC Help Desk.
Start Time: 04/20/2013 -- 11:00 End Time: 04/20/2013 -- 12:35 Reason: Due to a network outage, many 4 core nodes were rebooted.
Start Time: 04/06/2013 -- 00:09 End Time: 04/11/2013 -- 00:09 Reason: The reoccuring issue with the pacman batch scheduler has been resolved. Users who lost jobs as a result of the 12:09am batch scheduler failure on April 6th, 7th, and 8th were notified. All previously held long running jobs have been released and all pacman queues are now functioning normally.
Start Time: 04/06/2013 -- 00:00 End Time: 04/06/2013 -- 14:00 Reason: There was an issue with the administrative node on pacman which resulted in jobs on 12 core, 16 core and bigmem nodes failing. Jobs running on 4 core nodes should not have been affected by this outage.
Machines: linuxws pacman bigdipper fish
Start Time: 03/27/2013 -- 15:30 End Time: 03/28/2013 -- 12:55 Reason: Power was lost to the machine room. An emergency power down was initiated on pacman, fish, bigdipper. All running jobs were lost. Users with running job at the time of the power outage will be contacted with a list of lost jobs. 03/27/2013 -- 20:30 - Linux Workstations were returned to service 03/27/2013 -- 23:00 - Fish was returned to service 03/28/2013 -- 00:00 - Pacman was returned to service - 03/28/2013 -- 09:30 - Web Servers and the License server are still being recovered. 03/28/2013 -- 11:30 - Web Servers have been returned to service. 03/28/2013 -- 12:55 - The license server was returned to service.