This past weekend we started seeing some odd entries in the AIX error report. When inspecting the Storage Manager client, we also saw that all paths had moved over to the second controller.
BC669AA7 1212153609 P H dac1 CONTROLLER HEALTH CHECK FAILURE
BC669AA7 1212152409 P H dac1 CONTROLLER HEALTH CHECK FAILURE
483C9D10 1212151109 I H dac0 ARRAY ACTIVE CONTROLLER SWITCH
D5385D18 1212151109 T H hdisk3 ARRAY OPERATION ERROR
C86ACB7E 1212151109 I H hdisk3 ARRAY CONFIGURATION CHANGED
483C9D10 1212151109 I H dac0 ARRAY ACTIVE CONTROLLER SWITCH
D5385D18 1212151109 T H hdisk5 ARRAY OPERATION ERROR
C86ACB7E 1212151109 I H hdisk5 ARRAY CONFIGURATION CHANGED
BC669AA7 1212150309 P H dac1 CONTROLLER HEALTH CHECK FAILURE
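The timestamps in these entries are in MMDDhhmmYY format, so the sequence of events is easier to follow once they are parsed. Below is a minimal Python sketch, assuming the six-column summary layout shown above (identifier, timestamp, type, class, resource, description). The function name `parse_errpt_line` and the dictionary keys are made up for illustration and are not part of any AIX tooling.

```python
from collections import Counter
from datetime import datetime

# The summary lines from the error report above, newest first.
ERRPT_LINES = """\
BC669AA7 1212153609 P H dac1 CONTROLLER HEALTH CHECK FAILURE
BC669AA7 1212152409 P H dac1 CONTROLLER HEALTH CHECK FAILURE
483C9D10 1212151109 I H dac0 ARRAY ACTIVE CONTROLLER SWITCH
D5385D18 1212151109 T H hdisk3 ARRAY OPERATION ERROR
C86ACB7E 1212151109 I H hdisk3 ARRAY CONFIGURATION CHANGED
483C9D10 1212151109 I H dac0 ARRAY ACTIVE CONTROLLER SWITCH
D5385D18 1212151109 T H hdisk5 ARRAY OPERATION ERROR
C86ACB7E 1212151109 I H hdisk5 ARRAY CONFIGURATION CHANGED
BC669AA7 1212150309 P H dac1 CONTROLLER HEALTH CHECK FAILURE
""".splitlines()


def parse_errpt_line(line):
    """Split one summary line into its six columns (hypothetical helper)."""
    ident, stamp, etype, eclass, resource, description = line.split(None, 5)
    return {
        "id": ident,
        # Timestamp layout is MMDDhhmmYY, e.g. 1212153609 is Dec 12 2009, 15:36.
        "when": datetime.strptime(stamp, "%m%d%H%M%y"),
        "type": etype,    # P = permanent, T = temporary, I = informational
        "class": eclass,  # H = hardware
        "resource": resource,
        "description": description,
    }


entries = [parse_errpt_line(line) for line in ERRPT_LINES]

# Count permanent (type P) errors per resource to see which device stands out.
permanent = Counter(e["resource"] for e in entries if e["type"] == "P")

# Print the events in chronological order (sorted() is stable for ties).
for e in sorted(entries, key=lambda e: e["when"]):
    print(e["when"].strftime("%Y-%m-%d %H:%M"), e["resource"], e["description"])
print(dict(permanent))
```

Read oldest to newest, the entries show a health check failure on dac1 at 15:03, the controller switch on dac0 at 15:11, and further health check failures on dac1 afterwards. All of the permanent errors are logged against dac1.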
After inspecting the controller errors in the Storage Manager client, I Googled them and found references to the controller being faulty. The details of the AIX errors also seemed to point to the controller as the issue. All HBAs were online and available, so it wasn't a connection issue at the adapter level.
After a call to IBM support and some onsite troubleshooting with the CE, we reseated the controller, which also had the benefit of power cycling it. Once the controller was reseated and brought back online, it functioned just fine and even moved the paths back over automatically.
Thursday, December 17, 2009