Advfs I/O errors

From: Manish Vashi <manish_vashi_at_fanniemae.com>
Date: Tue, 28 Aug 2001 14:19:25 -0400

Greetings

          We have a ES40 at 5.1 and patchkit3 , the server is connected
with Fiber Channel { HSG80 and KGPSA } using a Fiber HUB. The sever is
running oracle 8.1.7.1 , now the problem is we are getting advfs I/O
errors on the system time to time, and at times corrupts the data.
# tail /var/adm/messages
Aug 28 07:51:26 pwarehouse14 vmunix: Block: 4179808
Aug 28 07:51:26 pwarehouse14 vmunix: Block count: 8192
Aug 28 07:51:26 pwarehouse14 vmunix: Type of operation: Read
Aug 28 07:51:26 pwarehouse14 vmunix: Error: 5
Aug 28 07:51:26 pwarehouse14 vmunix: EEI: 0x6400
Aug 28 07:51:26 pwarehouse14 vmunix: I/O error appears to be due to
a hardware problem.
Aug 28 07:51:26 pwarehouse14 vmunix: Check the binary error log for
details.
Aug 28 07:51:26 pwarehouse14 vmunix: To obtain the name of the file
on which
Aug 28 07:51:26 pwarehouse14 vmunix: the error occurred, type the
command:
Aug 28 07:51:26 pwarehouse14 vmunix: /sbin/advfs/tag2name
/db/db014/.tags/9

when u see using dec event we see command time out errors. We have
replaced the KGPSA { HBA } also , the errors are not particular to one
drive or one file system, everytime errors are on different filesystems
and different drives so that rules out possibilty that HSG80 can be bad
and we have replaced KGPSA. Has anyone seen such problem with oracle,
and what was the solution to it. We upgraded the oracle version from
8.0.6.3 to 8.1.7.1 last week.

# dia -R |more

DECevent V3.3


**** V3.3 ********************* ENTRY 1
********************************


Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 1743.
Timestamp of occurrence 27-AUG-2001 13:41:26
Host name pwarehouse1

System type register x00000022 Systype 34. (Regatta Family)
Number of CPUs (mpnum) x00000004
CPU logging event (mperr) x00000000

Event validity 1. O/S claims event is valid
Event severity 5. Low Priority
Entry type 310. Time Stamp
                                 -1. - (minor class)


**** V3.3 ********************* ENTRY 2
********************************


Logging OS 2. Digital UNIX
System Architecture 2. Alpha
Event sequence number 1708.
Timestamp of occurrence 27-AUG-2001 07:51:25
Host name pwarehouse1

System type register x00000022 Systype 34. (Regatta Family)
Number of CPUs (mpnum) x00000004
CPU logging event (mperr) x00000000

Event validity 1. O/S claims event is valid
Event severity 3. High Priority
Entry type 199. CAM SCSI Event Type
                                  0. - (minor class)


------- Unit Info -------
Bus Number 3.
Unit Number x00C4 Target = 0.
                                     LUN = 4.
------- CAM Data -------
Class x00 Disk
Subsystem x00 Disk
Number of Packets 7.

------ Packet Type ------ 258. Module Name String

Routine Name cdisk_complete

------ Packet Type ------ 256. Generic String

                                     Retries Exhausted


------ Packet Type ------ 260. Hardware Error String

Error Type Hard Error Detected

------ Packet Type ------ 257. Device Name String

Device Name DEC HSG80 V85F

------ Packet Type ------ 256. Generic String

                                     Active CCB at time of error

------ Packet Type ------ 256. Generic String

                                     Command timed out

------ Packet Type ------ 1. SCSI I/O Request CCB(CCB_SCSIIO)
Packet Revision 76.

CCB Address xFFFFFC01E7E5FE30
CCB Length x00C0
XPT Function Code x01 Execute requested SCSI I/O
CAM Status x0B Command Timeout
Path ID 3.
Target ID 0.
Target LUN 4.
CAM Flags x00000442 SIM Queue Actions are Enabled
                                     Data Direction (01: DATA IN)
                                     Disable the SIM Queue Frozen State
*pdrv_ptr xFFFFFC01E7E5FA70
*next_ccb x0000000000000000
*req_map xFFFFFC01EEDB9180
void (*cam_cbfcnp)() xFFFFFC0000670990
*data_ptr x0000000140110000
Data Transfer Length 8192.
*sense_ptr xFFFFFC01E7E5FAD0
Auotsense Byte Length 255.
CDB Length 10.
Scatter/Gather Entry Cnt 0.
SCSI Status x00 Good Condition
Autosense Residue Length x00
Transfer Residue Length x00002000
(CDB) Command & Data Buf

          15--<-12 11--<-08 07--<-04 03--<-00 :Byte Order
 0000: 00000010 000060C7 3F000028 * (..?.`......*

Timeout Value x0000003C
*msg_ptr x0000000000000000
Message Length 0.
Vendor Unique Flags x4000
Tag Queue Actions x20 Tag for Simple Queue


Regards
Manish Vashi

Received on Tue Aug 28 2001 - 18:24:48 NZST

This archive was generated by hypermail 2.4.0 : Wed Nov 08 2023 - 11:53:42 NZDT