Thursday, December 17, 2009

DS4300 Controller Issues - CONTROLLER HEALTH CHECK FAILURE

This past weekend we started encountering some odd messages in the AIX error report.  When inspecting the Storage Manager client we also saw that all paths had moved over to the second controller.

BC669AA7   1212153609 P H dac1           CONTROLLER HEALTH CHECK FAILURE
BC669AA7   1212152409 P H dac1           CONTROLLER HEALTH CHECK FAILURE
483C9D10     1212151109 I H dac0            ARRAY ACTIVE CONTROLLER SWITCH
D5385D18     1212151109 T H hdisk3       ARRAY OPERATION ERROR
C86ACB7E    1212151109 I H hdisk3        ARRAY CONFIGURATION CHANGED
483C9D10      1212151109 I H dac0            ARRAY ACTIVE CONTROLLER SWITCH
D5385D18     1212151109 T H hdisk5        ARRAY OPERATION ERROR
C86ACB7E    1212151109 I H hdisk5         ARRAY CONFIGURATION CHANGED
BC669AA7    1212150309 P H dac1           CONTROLLER HEALTH CHECK FAILURE

After inspecting the controller errors via the Storage Manager client, I Google'd them and saw references to the controller being faulty.  And when looking at the details of the AIX errors, they also seemed to point to the controller as the issue.  All HBA's were also online and available so it wasn't a connection issue in those terms.

After a call to IBM support and some onsite troubleshooting with the CE, we reseated the controller.  This also had the benefit of power cycling it.  Once the controller was reseated and brought online, it functioned just fine and even moved the paths back over automatically.

No comments:

Post a Comment