Messages & Announcements

  • 2017-08-21:  Crane: service restored
    Category:  System Failure

    At 4:30am today, Crane experienced a failure of the core 10Gbps Ethernet switch. Service was restored at 9:22am. We apologize for the inconvenience.

    This outage affected communication between nodes (except over InfiniBand or Omni-Path) and access to network filesystems. Jobs running during the outage likely experienced errors, and we encourage you to review the output of any jobs that ran during this time.



  • 2017-08-21:  Crane unexpected downtime
    Category:  System Failure

    HCC staff are working to get Crane back online. The system is currently unreachable; more details will be announced as they become available.



  • 2017-08-16:  Crane unexpected downtime, reboot of login node
    Category:  System Failure

    On Wednesday evening, the Crane login node, crane.unl.edu, had a software issue causing running processes to hang. The service is now running normally.

    Fixing the issue required a reboot of the login node. Running jobs were not affected. We apologize for the inconvenience.



  • 2017-06-05:  Crane /work filesystem downtime resolved
    Category:  General Announcement

    The /work filesystem for Crane is restored as of 2:55pm.

    One of the storage servers crashed and rebooted. A filesystem check was completed with no errors found. Running jobs which were accessing /work stalled until the filesystem was restored. This may have caused jobs to exceed their time limit. There was no data loss from this outage.

    We believe the storage server crash was triggered by I/O delays as the RAID controller was rebuilding a failed disk drive. The rebuild is still running and we are monitoring the system.


  • 2017-06-05:  Crane /work filesystem unplanned downtime
    Category:  System Failure

    The /work filesystem for Crane is partially unavailable. One of the storage servers crashed and rebooted. We are now running a filesystem check before placing the server back online. Pending jobs will be held until the maintenance is complete.


    The filesystem check has been completed with no errors found. The /work filesystem is back online. Running jobs may be affected, but there was no data loss from this outage.

    We believe the storage server crash was triggered by I/O delays as the RAID controller was rebuilding a failed disk drive. The rebuild is still running and we are monitoring the system.
