Hello managers,
I have been getting some messages from our HSZ70 controllers,
and I don't know what these messages mean or what I should do about them.
Here is an explanation of the situation:
We are using a DEC 8400 5/625 running DU 4.0D with patch kit 3, connected to
two ESA10000 cabinets with HSZ70 controllers (firmware V71Z and V70Z).
I am using the SWCC agent (SWCC201) for monitoring, remote control,
and configuration changes.
Messages:
...
...
Sep 2 15:16:46 iski01 steamd[576]: WARNING: - iski01 yeni_alt 00000000001
HSZ70 luns(D000:0)
Sep 2 15:16:46 iski01 steamd[576]: WARNING: Socket error - connecting to
cihan - Error 0 occurred. (SP_SOCKET: socketConnectByService)
Sep 2 15:29:46 iski01 steamd[576]: INFORMATION: Validation successful -
Client: cihan (SP_TCP: ValidateClient)
Sep 2 15:56:47 iski01 steamd[576]: WARNING: - open(): device:/dev/rrz25c,
Error 0 occurred. (ScsiDevOpen())
Sep 2 15:56:47 iski01 steamd[576]: WARNING: - A subsytem change has been
detected: iski01 eski_ust OVRL=1
Sep 2 15:56:47 iski01 steamd[576]: WARNING: - iski01 eski_ust 10000000002
HSZ70 luns(D100:4)
Sep 2 15:56:48 iski01 steamd[576]: WARNING: Socket error - connecting to
cihan - Error 0 occurred. (SP_SOCKET: socketConnectByService)
Sep 2 16:06:48 iski01 steamd[576]: WARNING: - A subsytem change has been
detected: iski01 eski_ust OVRL=0
Sep 2 16:06:48 iski01 steamd[576]: WARNING: - iski01 eski_ust 00000000001
HSZ70 luns(D100:0)
Sep 2 16:06:48 iski01 steamd[576]: WARNING: Socket error - connecting to
cihan - Error 0 occurred. (SP_SOCKET: socketConnectByService)
Sep 2 16:15:59 iski01 steamd[576]: INFORMATION: Validation successful -
Client: cihan (SP_TCP: ValidateClient)
Sep 2 16:16:08 iski01 last message repeated 3 times
Sep 2 16:29:59 iski01 steamd[576]: INFORMATION: Validation successful -
Client: cihan (SP_TCP: ValidateClient)
Sep 2 16:30:47 iski01 last message repeated 7 times
Sep 2 17:00:22 iski01 steamd[576]: INFORMATION: Validation successful -
Client: cihan (SP_TCP: ValidateClient)
Sep 2 17:00:31 iski01 last message repeated 3 times
Sep 2 17:16:49 iski01 steamd[576]: WARNING: - open(): device:/dev/rrzb72c,
Error 0 occurred. (ScsiDevOpen())
Sep 2 17:16:50 iski01 steamd[576]: CRITICAL: Unable to open device -
rrzb72c (SP_MONITOR: MonitorSubsys)
Sep 2 17:16:50 iski01 steamd[576]: WARNING: - A subsytem change has been
detected: iski01 yeni_ust OVRL=1
Sep 2 17:16:50 iski01 steamd[576]: WARNING: - iski01 yeni_ust 10000002000
HSZ70
Sep 2 17:26:51 iski01 steamd[576]: WARNING: - A subsytem change has been
detected: iski01 yeni_ust OVRL=0
Sep 2 17:26:51 iski01 steamd[576]: WARNING: - iski01 yeni_ust 00000001000
HSZ70
Sep 2 18:26:51 iski01 steamd[576]: WARNING: - open(): device:/dev/rrz24c,
Error 0 occurred. (ScsiDevOpen())
Sep 2 18:26:52 iski01 steamd[576]: CRITICAL: Unable to open device - rrz24c
(SP_MONITOR: MonitorSubsys)
Sep 2 18:26:53 iski01 steamd[576]: WARNING: - A subsytem change has been
detected: iski01 eski_ust OVRL=1
Sep 2 18:26:53 iski01 steamd[576]: WARNING: - iski01 eski_ust 10000002000
HSZ70
Sep 2 18:36:53 iski01 steamd[576]: WARNING: - A subsytem change has been
detected: iski01 eski_ust OVRL=0
Sep 2 18:36:53 iski01 steamd[576]: WARNING: - iski01 eski_ust 00000001000
HSZ70
Sep 2 19:46:53 iski01 steamd[576]: WARNING: - open(): device:/dev/rrzb72c,
Error 0 occurred. (ScsiDevOpen())
Sep 2 19:46:54 iski01 steamd[576]: CRITICAL: Unable to open device -
rrzb72c (SP_MONITOR: MonitorSubsys)
Sep 2 19:46:54 iski01 steamd[576]: WARNING: - A subsytem change has been
detected: iski01 yeni_ust OVRL=1
Sep 2 19:46:55 iski01 steamd[576]: WARNING: - iski01 yeni_ust 10000002000
HSZ70
Sep 2 19:56:55 iski01 steamd[576]: WARNING: - A subsytem change has been
detected: iski01 yeni_ust OVRL=0
Sep 2 19:56:55 iski01 steamd[576]: WARNING: - iski01 yeni_ust 00000001000
HSZ70
....
....
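In the excerpt above, each "OVRL=1" change for a subsystem is followed by an
"OVRL=0" change for the same subsystem about ten minutes later. The Python
sketch below pairs those two messages and prints how long each episode lasted.
It is only an illustration: the sample lines are copied from the excerpt with
the mail wrapping undone, the year is assumed because syslog does not record
it, the function name is hypothetical, and nothing here depends on what OVRL
actually means to the agent.

```python
import re
from datetime import datetime

# Sample lines copied from the excerpt above, with the mail wrapping undone.
LOG = """\
Sep 2 15:56:47 iski01 steamd[576]: WARNING: - A subsytem change has been detected: iski01 eski_ust OVRL=1
Sep 2 16:06:48 iski01 steamd[576]: WARNING: - A subsytem change has been detected: iski01 eski_ust OVRL=0
Sep 2 17:16:50 iski01 steamd[576]: WARNING: - A subsytem change has been detected: iski01 yeni_ust OVRL=1
Sep 2 17:26:51 iski01 steamd[576]: WARNING: - A subsytem change has been detected: iski01 yeni_ust OVRL=0
"""

# "subsytem" is spelled exactly as the agent writes it in the log.
CHANGE = re.compile(
    r"^(\w{3}\s+\d+ \d\d:\d\d:\d\d) \S+ steamd\[\d+\]: WARNING: - "
    r"A subsytem change has been detected: \S+ (\S+) OVRL=([01])$"
)


def ovrl_episodes(text, year=1999):
    """Return (subsystem, start, end) for each OVRL=1 ... OVRL=0 pair.

    The year is an assumption supplied by the caller; syslog omits it.
    """
    open_since = {}   # subsystem -> time of the pending OVRL=1 message
    episodes = []
    for line in text.splitlines():
        match = CHANGE.match(line)
        if not match:
            continue  # ignore every other kind of steamd message
        stamp, subsystem, flag = match.groups()
        when = datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
        if flag == "1":
            open_since[subsystem] = when
        elif subsystem in open_since:
            episodes.append((subsystem, open_since.pop(subsystem), when))
    return episodes


for name, start, end in ovrl_episodes(LOG):
    print(name, start.time(), "->", end.time(), "lasted", end - start)
```

Feeding the whole messages file through the same function (after rejoining
wrapped lines) would show whether the ten-minute spacing holds for every
episode or only for the ones quoted here.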
The agent is also sending traps to my computer:
Trap received from (Host:iski01, Subsystem:eski_ust) : virtual disk failed.
Trap received from (Host:iski01, Subsystem:yeni_alt) : virtual disk failed.
Trap received from (Host:iski01, Subsystem:yeni_ust) : Communication with
the subsystem failed.
**********************
And here is the information about the controllers:
(from controllers yeni_ust and yeni_alt)
YENI_UST >show this
Controller:
HSZ70 ZG83418738 Firmware V71Z-0, Hardware H02
Configured for dual-redundancy with ZG83116708
In dual-redundant configuration
Device Port SCSI address 7
Time: NOT SET
Host port:
SCSI target(s) (0, 1, 2, 3, 4)
Preferred target(s) (0, 1)
TRANSFER_RATE_REQUESTED = 20MHZ
Host Functionality Mode = A
Allocation class 0
Command Console LUN is target 0, lun 1
Cache:
64 megabyte write cache, version 4
Cache is GOOD
Battery is GOOD
Unflushed data in cache
CACHE_FLUSH_TIMER = 45 (seconds)
NOCACHE_UPS
Mirrored Cache:
Not enabled
--------------------
(from controllers eski_ust and eski_alt)
Controller:
HSZ70 ZG80808404 Firmware V70Z-0, Hardware H01
Configured for dual-redundancy with ZG80808431
In dual-redundant configuration
Device Port SCSI address 7
Time: NOT SET
Host port:
SCSI target(s) (0, 1, 2, 3)
Preferred target(s) (0, 1)
TRANSFER_RATE_REQUESTED = 20MHZ
Host Functionality Mode = A
Command Console LUN is target 0, lun 0
Cache:
64 megabyte write cache, version 4
Cache is GOOD
Battery is GOOD
Unflushed data in cache
CACHE_FLUSH_TIMER = 45 (seconds)
NOCACHE_UPS
Mirrored Cache:
Not enabled
************************
Failure output from the controller:
FMU> show LAST_FAILURE MOST_RECENT
Last Failure Entry: 4. Flags: 000FF380
Template: 1.(01) Description: Last Failure Event
Power On Time: 0. Years, 175. Days, 21. Hours, 43. Minutes, 28. Seconds
Controller Model: HSZ70
Serial Number: ZG83418738 Hardware Version: H02(48)
Firmware Version: V71Z(00)
Informational Report
Instance Code: 01010302 Description:
An unrecoverable hardware detected fault occurred.
Reporting Component: 1.(01) Description:
Executive Services
Reporting component's event number: 1.(01)
Event Threshold: 2.(02) Classification:
HARD. Failure of a component that affects controller performance or
precludes access to a device connected to the controller is indicated.
Last Failure Code: 018000A0 (No Last Failure Parameters)
Last Failure Code: 018000A0 Description:
A powerfail interrupt occurred.
Reporting Component: 1.(01) Description:
Executive Services
Reporting component's event number: 128.(80)
Restart Type: 2.(02) Description: Automatic hardware restart
Last Failure Entry: 3. Flags: 000FF300
Template: 1.(01) Description: Last Failure Event
Power On Time: 0. Years, 172. Days, 11. Hours, 59. Minutes, 5. Seconds
Controller Model: HSZ70
Serial Number: ZG83418738 Hardware Version: H02(48)
Firmware Version: V71Z(00)
Informational Report
Instance Code: 0102030A Description:
An unrecoverable firmware inconsistency was detected or an intentional
restart or shutdown of controller operation was requested.
Reporting Component: 1.(01) Description:
Executive Services
Reporting component's event number: 2.(02)
Event Threshold: 10.(0A) Classification:
SOFT. An unexpected condition detected by a controller firmware component
(e.g., protocol violations, host buffer access errors, internal
inconsistencies, uninterpreted device errors, etc.) or an intentional
restart or shutdown of controller operation is indicated.
Last Failure Code: 081E0000 (No Last Failure Parameters)
Last Failure Code: 081E0000 Description:
In order to go into nomirrored cache mode, the controllers must be
restarted
Reporting Component: 8.(08) Description:
Nonvolatile Parameter Memory Failover Control
Reporting component's event number: 30.(1E)
Restart Type: 0.(00) Description: Full firmware restart
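
Comparing the hexadecimal codes with the fields FMU decodes next to them
suggests a pattern: in both entries the first byte of each code equals the
reporting component and the second byte equals the component's event number,
and the last byte of an instance code equals the event threshold. The Python
sketch below splits a code on that assumption. The layout is inferred from
these two entries only and is not taken from HSZ70 documentation; the
function names are hypothetical and the remaining bytes are deliberately left
uninterpreted.

```python
def split_code(code_hex):
    """Split an 8-hex-digit code into four bytes, most significant first."""
    value = int(code_hex, 16)
    return [(value >> shift) & 0xFF for shift in (24, 16, 8, 0)]


def describe_instance_code(code_hex):
    """Assumed layout: component, event number, (unknown), threshold."""
    component, event, _unknown, threshold = split_code(code_hex)
    return {"component": component, "event": event, "threshold": threshold}


def describe_last_failure_code(code_hex):
    """Assumed layout: component, event number, then two bytes left as-is."""
    component, event, _byte2, _byte3 = split_code(code_hex)
    return {"component": component, "event": event}


# Codes taken from the two Last Failure entries above.
for code in ("01010302", "0102030A"):
    print("instance code", code, describe_instance_code(code))
for code in ("018000A0", "081E0000"):
    print("last failure code", code, describe_last_failure_code(code))
```

The decoded values can be checked against the FMU text: for example,
081E0000 gives component 8 and event 30, matching "Reporting Component:
8.(08)" and "event number: 30.(1E)" in entry 3.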
*************************************
That's all.
Do you think a firmware update can solve our problem, or should I do
something else?
Thanks. :)
Cihan Ozgen
System Administrator.
Istanbul Water and Sewage Organization
Received on Fri Sep 03 1999 - 09:14:36 NZST