Open vSwitch Crash in cg.d.faelix.net Virtual Servers

Event Started
2025-02-02 22:43
Report Published
2025-02-02 22:49
Last Updated
2025-02-03 07:55
Event Finished
2025-02-03 03:00

A VPS hosting node in Telehouse West has caused its ovs-vswitchd process to restart, but not cleanly reapplied the configuration to all guest VPSs in the process.

Timeline (most recent first)
  • 2025-02-03
    00:16:00

    Strangely, this might have been caused by one of the SSDs in the VPS host going awry, and now reporting:

    Read SMART Data failed: scsi error aborted command
    
    === START OF READ SMART DATA SECTION ===
    SMART Status command failed: scsi error aborted command
    SMART overall-health self-assessment test result: UNKNOWN!
    SMART Status, Attributes and Thresholds cannot be read.
    
  • 2025-02-02
    23:10:00

    We've migrated most VPSs off cg.d.faelix.net which has solved most problems. Cleanly restarting the OVS process and this seems to be bringing the rest back to life.