3
NOID and Controller Output differences
Noid, 05, 29, differences
5
05 output (1)
SCell: Active MFC via Port MIRR6-00508b-400080d0bx 00009-0000002-[0008]x
SCell Master - Quorum POIDs:2001x 2013x 2002x 2003x 2004x
WWN: 5001-4380-013B-D630 MODEL: HSV300 SIDE: B(1) LOC: RIGHT <--- Position of current controller
NSC: UUID = 5-00508b-400080cf6x 00000-0000000-0000x
MEMBER_TAG = 6-00508b-400080d0bx 00009-000001a-[0021]x
FC Nodes: Temp ID ID
Lp ALPA Lp ALPA Type Node WWN FNB Noid State Usablty
0. dc 1. ef disk 2-004-000cca-0c29d8x 211e67bc 0 Valid Usable
0. ca 1. d2 disk 2-004-000cca-0c2a34x 211e73cc 0 Valid Usable
0. d2 1. ca disk 2-004-000cca-0c2a88x 211e4690 0 Valid Usable
0. c6 1. c6 disk 2-000-001d38-059f09x 211e9b80 0 Valid ILFDisk
0. e0 1. d4 disk 2-004-000cca-10fcc4x 211e3a80 0 Valid Usable
0. ce 1. c7 disk 2-004-000cca-10fd10x 211e4994 0 Valid Usable
0. c5 1. cc disk 2-004-000cca-10fee0x 211e5eb0 0 Valid Usable
0. c6 1. cd disk 2-004-000cca-1100b4x 211e5bac 0 Valid Usable
0. 2 1. 2 NSC 5-00508b-400080cf6x 211e2b6c 0 <------- controller
0. 1 1. 1 NSC 5-00508b-400080d0bx 211e82e0 0 <------- controller
0. f 1. 8 err 5-00508b-400080f72x 211e7cd8 80 Unknown Usable <------- diskshelf
0. 17 1. 10 err 5-00508b-400080f90x 211e7fdc 81 Unknown Usable <------- diskshelf
0. 8 1. f err 5-00508b-400080fc9x 211e76d0 82 Unknown Usable <------- diskshelf
0. 10 1. 17 err 5-00508b-4000812adx 211e79d4 83 Unknown Usable <------- diskshelf
6
05 output (2)
All RSSs:
RSS LDAD Free Members LDAD
Noid Noid PSEGS Type
Abnrml Missing MgrFlgs MbrMgrt SrcRIdx TrgRIdx Migration Array
Member Blk
Volnoid Capacity
400x 0x 0. 5. 255. <------------------------ Quorum RSS Group
ffe0x ffe0x 0x 0000x 0x 0x 00000000x 00000000x 00000000x 00000000x
7
05 output (3)
8
29 output - Loop configuration
WWN: 5001-4380-013B-D630
9
29 output Enclosure Management Process
EVA ENCLOSURE MANAGMENT PROCESS
Notes:
- Out of the 14 listed tasks, only 10 are used
- Only one task can be active at any given time
- When two or more tasks need to be run, the lowest task number is run first
ENC_MGT TASK 00 is OFF 00
ENC_MGT TASK 01 is OFF 00
ENC_MGT TASK 02 is OFF 00
ENC_MGT TASK 03 is OFF 00 Task 03 - Codeload Enclosure A
ENC_MGT TASK 04 is OFF 00 Task 04 - Codeload Enclosure B
ENC_MGT TASK 05 is OFF 00 Task 05 - SES Receive
ENC_MGT TASK 06 is OFF 00 Task 06 - Shutdown Enclosures
ENC_MGT TASK 07 is OFF 00 Task 07 - Enclosure Numbering
ENC_MGT TASK 08 is OFF 00 Task 08 - Add Element Status
ENC_MGT TASK 09 is OFF 00 Task 09 - Drive Bypass
ENC_MGT TASK 10 is OFF 00 Task 10 - Change LED Pattern
ENC_MGT TASK 11 is OFF 00 Task 11 - Drive Power
ENC_MGT TASK 12 is OFF 00 Task 12 - Set Enclosure time
ENC_MGT TASK 13 is OFF 00
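The selection rule from the notes (one active task at a time, lowest task number first) can be sketched as below. The task names are taken from the listing; the function name `next_task` and the idea of passing in a set of pending task numbers are illustrative assumptions, not the firmware's actual interface.

```python
# Task names as annotated in the listing (tasks 00-02 and 13 carry no name).
ENC_MGT_TASKS = {
    3: "Codeload Enclosure A",
    4: "Codeload Enclosure B",
    5: "SES Receive",
    6: "Shutdown Enclosures",
    7: "Enclosure Numbering",
    8: "Add Element Status",
    9: "Drive Bypass",
    10: "Change LED Pattern",
    11: "Drive Power",
    12: "Set Enclosure time",
}


def next_task(pending):
    """Return (number, name) of the task to run next, or None when idle.

    `pending` is a hypothetical set of task numbers waiting to run.
    """
    runnable = [n for n in pending if n in ENC_MGT_TASKS]
    if not runnable:
        return None
    lowest = min(runnable)  # the lowest task number is run first
    return lowest, ENC_MGT_TASKS[lowest]


print(next_task({9, 5, 11}))
```

With Drive Bypass, SES Receive and Drive Power all pending, the sketch picks SES Receive because 05 is the lowest number.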
10
29 output FW and Bypass Mask
Enclosure #02 - WWN 0x500508B400080F90
ASIDE FW - 0050 AVAILABLE <---- FW of IO Module (EWE)
BSIDE FW - 0050 AVAILABLE <---- FW of IO Module (EWE)
Bay 01 02 03 04 05 06 07 08 09 10 11 12 <----- Slot Id in Enclosure
Overall Bypass 00 00 11 00 00 11 00 00 00 00 11 00 <----- Overall Bypass Mask (OR of the next 3 fields)
Ctrl Bypass 00 00 00 00 00 00 00 00 00 00 00 00 <----- Bypass Mask set by Controller
Device Bypass 00 00 11 00 00 11 00 00 00 00 11 00 <----- Bypass Mask set by Device
Enc Bypass 00 00 00 00 00 00 00 00 00 00 00 00 <----- Bypass Mask set by Enclosure (IO Module)
Powered Off 00 00 00 00 00 00 00 00 00 00 00 00 <----- Power UP status
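As a quick check of the "OR of the next 3 fields" annotation, the rows above can be recombined as follows. This is only a sketch: it assumes each two-digit field is a two-bit value (one bit per loop), and `parse_mask_row` is a made-up helper name.

```python
def parse_mask_row(row):
    """Turn a row such as '00 00 11 ...' into one integer per bay.

    Assumption: every field is a two-bit binary value, one bit per loop.
    """
    return [int(field, 2) for field in row.split()]


# Values copied from the Enclosure #02 listing.
ctrl = parse_mask_row("00 00 00 00 00 00 00 00 00 00 00 00")
device = parse_mask_row("00 00 11 00 00 11 00 00 00 00 11 00")
enc = parse_mask_row("00 00 00 00 00 00 00 00 00 00 00 00")

# Overall Bypass is the bitwise OR of the three source masks, bay by bay.
overall = [c | d | e for c, d, e in zip(ctrl, device, enc)]
overall_row = " ".join(format(v, "02b") for v in overall)
print(overall_row)

# Bays are numbered from 01 in the listing.
bypassed_bays = [bay for bay, v in enumerate(overall, start=1) if v]
print(bypassed_bays)
```

The recomputed row matches the Overall Bypass line of the listing, and the bypassed bays are 03, 06 and 11, all bypassed by the device itself.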
11
29 output Bypass Mask
- The Overall Bypass
- The Controller Bypass
- The Device Bypass
- The Enclosure Bypass
12
29 output Bypass coding
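The two leading bits are the Bypass Mask (A loop, B loop); the six trailing bits are the Bypass Source (controller, drive, IO Module, each with an A loop and a B loop bit). On the original slide the mask is shown in blue and the source in red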
00000000 => OK
01000001 => Bypassed by IO Module on B loop
10000010 => Bypassed by IO Module on A loop
11000011 => Bypassed by IO Module on BOTH loops
01000100 => Bypassed by drive on B loop
10001000 => Bypassed by drive on A loop
11001100 => Bypassed by drive on BOTH loops
01010000 => Bypassed by controllers on B loop
10100000 => Bypassed by controllers on A loop
11110000 => Bypassed by controllers on BOTH loops
01010001 => Bypassed by controller and IO Module on B loop
10100010 => Bypassed by controller and IO Module on A loop
11110011 => Bypassed by controller and IO Module on BOTH loops
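The table above can be decoded mechanically. The sketch below infers the bit layout from the listed examples (two mask bits, then controller, drive and IO Module bit pairs, A loop before B loop); treat that layout, and the name `decode_bypass`, as assumptions rather than a documented specification.

```python
SOURCES = ("controller", "drive", "IO Module")


def loops_set(pair):
    """Map a two-bit string to loop names; first bit is A loop, second is B loop."""
    return [name for name, bit in zip("AB", pair) if bit == "1"]


def decode_bypass(code):
    """Decode an 8-bit bypass string such as '01010001'."""
    if len(code) != 8 or set(code) - {"0", "1"}:
        raise ValueError(f"expected 8 binary digits, got {code!r}")
    pairs = [code[i:i + 2] for i in range(0, 8, 2)]
    mask, source_pairs = pairs[0], pairs[1:]
    sources = {}
    for source, pair in zip(SOURCES, source_pairs):
        loops = loops_set(pair)
        if loops:
            sources[source] = loops
    return {"loops": loops_set(mask), "sources": sources}


for code in ("00000000", "10001000", "01010001", "11110011"):
    print(code, decode_bypass(code))
```

For example, `10001000` decodes to a bypass on the A loop requested by the drive, matching the table entry.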
13
IO Modules CLI
useful commands
The processing unit inside the IO
modules of EVA LE
-PIC
-IO Modules
15
LIP primitive path through the CTS
For the counters discussed later, the data is displayed for
-Disk 1-12
-SMC
-SFP 1-2
16
IO Modules Log Output Capture
ELMo and EMP are the same thing
The term ELMo and/or EMP should no longer be used
Whenever these terms are found, read "IO Module"
This log is captured from the IO Module
Use standard serial cable (316131-001) and connect from laptop
or PC (DB9 end) to either I/O module (RJ45 end)
Use HyperTerminal to set the enclosure serial number
115200 baud
One stop bit
No parity
No flow control
17
IO Module CLI usage
CLI for support only
Not supported for customers
Useful only for troubleshooting
Logs are cryptic
Prompt explanation
Format NNES:####>
NN = decimal display shelf ID
E = a or b I/O module
S = 2 or 4 (shelf speed)
#### = command number since boot
Case sensitive
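A captured prompt can be split into its parts with a small regular expression. This is a minimal sketch of the NNES:####> format described above; `parse_prompt` is a made-up helper name, and the command number is kept as text because the slide does not say whether it is decimal or hex.

```python
import re

# NN = shelf ID, E = a or b, S = 2 or 4, #### = command number since boot.
PROMPT_RE = re.compile(
    r"^(?P<shelf>\d{2})(?P<module>[ab])(?P<speed>[24]):(?P<count>[0-9A-Fa-f]{4})>"
)


def parse_prompt(text):
    """Return the prompt fields as a dict, or None if the text has no prompt."""
    match = PROMPT_RE.match(text)
    if match is None:
        return None
    return {
        "shelf_id": int(match["shelf"]),
        "io_module": match["module"],  # lowercase only: the CLI is case sensitive
        "shelf_speed": int(match["speed"]),
        "command_number": match["count"],
    }


print(parse_prompt("22a2:0004>logRead 1"))
```

The sample line is the prompt that appears in the log example later in this deck: shelf 22, IO Module a, speed 2, command 0004.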
18
IO Module CLI useful commands
Some useful commands
menu Display list of commands
prompt Turn prompt on/off
info Display static HW and FW information
stat Status of dynamic elements
ctsErrCnt X [Y] Get Loop counters
X = a, d, or c (a=All, d=Delta, c=Clear)
X = zero: zeroes out the counters
logRead X [Y] Read log X
(X: 1 = event log, 2 = temp log; Y = event to start at, 0 = oldest)
Temp log is hardly useful for us
19
Typical IO Module command usage
It is usually best to capture the output of
info
stat
ctsErrCnt a <---- absolute count output
ctsErrCnt d <---- from last clear command
wait 5 minutes
ctsErrCnt c <---- set new baseline
if possible, wait another 30 minutes
ctsErrCnt d <---- from last clear command
All in a single file per IO Module
20
Info Command Output
ASCII file containing output on IO Module parts
Starts with a build Date and Time
Enclosure Info
Vendor ID, Product ID, Product Rev., WWID, Serial Number
IO Module
HW Type, HW Rev., App Rev. (is FW), Boot Rev., 7 Segment FW. Rev.,
WWID, Serial Number
Midplane
HW Type, HW Revision, PIC FW, PIC Boot Rev., Module Serial Number
Backplane
HW Revision
21
Stat Command Output
ASCII file containing status of the following
Enclosure
Actual time of capture, Run time
IO Modules
Run time, E2E Status
Drives
Bay, alpa, Present, Bypass (Source of Bypass), Code (Bypass Reason)
Transceivers
Number (XCVR 1 or XCVR 2), Pres, LSR, Lnk, TxF, Type, MaxSpd
Power Supplies
Num (Power 1 or Power 2), Pres (Yes or No), Stat (On or Off), DCOK
Fan Modules
Num, Pres, Stat, Spd, Tach
Temp sensors
Loc (IOA, IOB, MP1, MP2), Val (temp in degrees), Warn, Fail
22
ctsErrCnt - read loop statistics
The command shows the error counters of the CTS chip
X = a for absolute count of values in SES database.
equivalent to 34 command
X = d for delta from last clear command.
equivalent to 36 command
X = c for clear relative counts to equal absolute values so that
next delta command will show change.
equivalent to 35 command
Y can be entered with a or d options as either N or n to only
display non-zero error counts.
X = zero [exact] to zero out the database of counters, which is
what shows up in the SES pages.
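The relationship between the a, d, c and zero options can be pictured with a toy model. This is not the IO Module firmware, only a sketch of the semantics described above; the class name, the element labels, and the choice to reset the baseline on `zero` are all assumptions.

```python
class LoopCounterModel:
    """Toy model of absolute counters plus a baseline for delta reporting."""

    def __init__(self):
        self.absolute = {}  # counts since the database was last zeroed
        self.baseline = {}  # snapshot taken by the last clear

    def record(self, element, count=1):
        self.absolute[element] = self.absolute.get(element, 0) + count

    def show_absolute(self, nonzero_only=False):
        """Like 'ctsErrCnt a [n]': absolute counts."""
        return self._filter(dict(self.absolute), nonzero_only)

    def show_delta(self, nonzero_only=False):
        """Like 'ctsErrCnt d [n]': change since the last clear."""
        delta = {k: v - self.baseline.get(k, 0) for k, v in self.absolute.items()}
        return self._filter(delta, nonzero_only)

    def clear(self):
        """Like 'ctsErrCnt c': set the relative baseline to the absolute values."""
        self.baseline = dict(self.absolute)

    def zero(self):
        """Like 'ctsErrCnt zero': zero the counter database.

        Assumption: the baseline is reset as well; the slide does not say.
        """
        self.absolute = {k: 0 for k in self.absolute}
        self.baseline = dict(self.absolute)

    @staticmethod
    def _filter(values, nonzero_only):
        return {k: v for k, v in values.items() if v or not nonzero_only}


counters = LoopCounterModel()
counters.record("Disk 03", 2)
counters.record("SMC", 0)
counters.clear()               # new baseline
counters.record("Disk 03", 1)  # one more error after the baseline
print(counters.show_absolute())
print(counters.show_delta(nonzero_only=True))
```

After the clear, the absolute view still reports three errors on the hypothetical element "Disk 03", while the delta view reports only the one that arrived after the baseline.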
23
ctsErrCnt - Output
This is what is output for every
element of the IO Module
All disk slots (1-12)
SMC
Transceiver 01 (Left)
Transceiver 02 (Right)
The value displayed is the
number of occurrences of
exceeding the threshold for the
counter as defined in the Policy
Matrix (causing the interrupt)
24
IO Module Log File
26
Internal log entries Layout
Collected through the CLI command logRead
Very verbose and requires a decoder ring
Only two logs available, event log and temperature log
Temperature log only updates once an hour unless the reported temperature changes by three or more degrees
###### : ############ - ####.## ## ## . . . ## (all numbers are HEX)
Fields, left to right:
Log type
Event number
TimeStamp (ms)
Entity Code
Event Code
Length
Parameter 1
Parameter 2
. . .
Parameter N
27
Rev. 8.11 HP Restricted
Example of Internal log entries
Example of a Log entry
22a2:0004>logRead 1
010000 : 00dc70fe0801 - 0801.01 00
010001 : 00dc71010801 - 0801.01 00
010002 : 00dc71030801 - 0801.01 00
010003 : 00dc710c0801 - 0801.01 00
010004 : 00dc71240801 - 0801.01 00
010005 : 00dc71400801 - 0801.01 00
010006 : 00dc71441702 - 1702.05 10 80 00 02 00
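The entries above can be split into fields with a regular expression. The sketch below makes several assumptions that the slide does not spell out: the first field is read as a 2-digit log type plus a 4-digit event number, the 4-digit code is read as a 2-digit entity code plus a 2-digit event code, and the 12-digit field is kept as raw text. `parse_log_entry` is a made-up name.

```python
import re

HEX = "[0-9a-fA-F]"
ENTRY_RE = re.compile(
    rf"^(?P<head>{HEX}{{6}}) : (?P<stamp>{HEX}{{12}}) - "
    rf"(?P<code>{HEX}{{4}})\.(?P<length>{HEX}{{2}})(?P<params>(?: {HEX}{{2}})*)$"
)


def parse_log_entry(line):
    """Split one logRead line into its fields (all values are hex in the log)."""
    match = ENTRY_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised log entry: {line!r}")
    head, code = match["head"], match["code"]
    params = [int(p, 16) for p in match["params"].split()]
    length = int(match["length"], 16)
    if length != len(params):
        raise ValueError("length field does not match the number of parameters")
    return {
        "log_type": int(head[:2], 16),      # assumed 2-digit log type
        "event_number": int(head[2:], 16),  # assumed 4-digit event number
        "timestamp_field": match["stamp"],  # left undecoded on purpose
        "entity_code": int(code[:2], 16),   # assumed 2-digit entity code
        "event_code": int(code[2:], 16),    # assumed 2-digit event code
        "length": length,
        "params": params,
    }


entry = parse_log_entry("010006 : 00dc71441702 - 1702.05 10 80 00 02 00")
print(entry)
```

In the sample entry the Length field is 05 and five parameter bytes follow, which is the consistency check the function enforces.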
28
Drive enclosure 28 Rev. 8.11 HP Restricted
Event Logging
Each IO Module contains a persistent data log with 1998 entries
Circular log
Element status events are formatted following SES
Important timestamps
Jan 1, 1970 Starting time of IO Module clock
Indicates that IO Module has been unable to communicate with
midplane since boot
Jan 1, 2000 Starting time of midplane clock
Indicates that power was recently applied and time has not been
set by the storage controller
Always sort on sequence number, not on date (see above)
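The advice above can be sketched as a small helper that orders events by sequence number and flags the two default clock epochs. The event records are made-up sample values, the `seq` and `time` field names are hypothetical, and matching on the year alone is a simplifying assumption.

```python
from datetime import datetime


def clock_note(timestamp):
    """Flag timestamps that sit in one of the default clock epochs."""
    if timestamp.year == 1970:
        return "IO Module clock default: no communication with midplane since boot"
    if timestamp.year == 2000:
        return "midplane clock default: time not yet set by the storage controller"
    return ""


def order_events(events):
    """Sort by sequence number, never by date, and attach a clock note."""
    ordered = sorted(events, key=lambda event: event["seq"])
    return [(e["seq"], e["time"], clock_note(e["time"])) for e in ordered]


# Made-up sample records for illustration only.
events = [
    {"seq": 12, "time": datetime(2000, 1, 1, 0, 3, 10)},
    {"seq": 13, "time": datetime(2011, 5, 4, 9, 30, 0)},
    {"seq": 11, "time": datetime(1970, 1, 1, 0, 0, 42)},
]

for seq, time, note in order_events(events):
    print(seq, time.isoformat(), note)
```

Sorting these records by date would give the same order here, but after a reboot the clock can jump back to 1970 or 2000, at which point only the sequence number preserves the true order.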
29
Importing the Report(s)
Open Navigator
Create a workspace if needed
Select File Import Storage shelf log (One file per Shelf, or All shelves in one file)
Or File Import All from ZIP if all logs are in a ZIP
You will now see one or more Shelf logs in the GUI
A number at the end of the SD_2000XXXX name indicates multiple files of the
same date. Not necessarily same enclosure or same IO Module
These 3 characters indicate the Enclosure number (01) and loop (B)
Introduced in V4.06.006
30
Reports
After selecting the appropriate log, a new screen opens and you have to select Shelf Data Event Log
This report does not show the parameter used throughout the rest of this PPT
Create a custom report, add the Parameter field, and save this in your config if needed
31
SES Status Event Example
Exceptions:
Temperature delta 4 degrees for logging
Fan Actual speed (RPM) filtered
32
Event Logs: Filtered content at decoder
Some fields may be hidden based on value
Status parameters include previous status values (won't be decoded)
Bit/byte 7 6 5 4 3 2 1 0
0 Swap Element Status Code
33
Midplane
Bit/byte 7 6 5 4 3 2 1 0
0 Element Status Code
1
2
3
34
I/O Module
Bit/byte 7 6 5 4 3 2 1 0
0 Swap Element Status Code
1 Ident Fail
2
3
35
Power Supply
Bit/byte 7 6 5 4 3 2 1 0
0 Swap Element Status Code
1
2
3 Rqstd On DC Fail
Swap
Element Status Code
1 - OK
3 - Non-critical, DC not OK
5 - Not Installed
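Byte 0 of a status element can be decoded as sketched below. The extracted tables lost their column positions, so the bit layout here is taken from the SES-2 common status byte (SWAP in bit 4, Element Status Code in bits 3-0), which is an assumption for this enclosure. Only codes 1, 3 and 5 appear on the slide; the other names come from the SES-2 standard.

```python
# Element Status Code values per SES-2; the slide lists only 1, 3 and 5.
ELEMENT_STATUS_CODES = {
    0x0: "Unsupported",
    0x1: "OK",
    0x2: "Critical",
    0x3: "Non-critical",
    0x4: "Unrecoverable",
    0x5: "Not Installed",
    0x6: "Unknown",
    0x7: "Not Available",
}


def decode_status_byte0(byte0):
    """Split byte 0 into the Swap bit and the Element Status Code."""
    code = byte0 & 0x0F  # bits 3-0 (assumed SES-2 layout)
    return {
        "swap": bool(byte0 & 0x10),  # bit 4 (assumed SES-2 layout)
        "code": code,
        "meaning": ELEMENT_STATUS_CODES.get(code, "Reserved"),
    }


for value in (0x01, 0x03, 0x15):
    print(f"{value:#04x}", decode_status_byte0(value))
```

For a power supply, a code of 3 corresponds to the "DC not OK" condition listed above, and 0x15 would indicate a swapped element that is now not installed.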
39
Drives (2/3)
Amber LED (Fault)
Fault Sensed - Drive activated fault light
Fault Requested - Controller requested IO Module to activate fault light
Bypass
Only bypass bits for the IO Module's active loop are populated
App. Client Bypass N
Controller requested bypass on loop N
Enclosure Bypass N
Enclosure requested bypass on loop N
Device Bypass N
Drive requested bypass on loop N
Bypassed N
Drive is bypassed by one or more of the above requests on loop N
40
Drives (3/3)
Source of Bypass
N = Not Bypassed
Bypass LIP
Indicates a LIP triggered by a port being (un)bypassed
Bypass Update
Records reason for ports to enter/exit bypassed state
CTS LIP Start
Start of LIP
CTS LIP Done
LIP Completed
CTS LIP Test
Drives are LIP tested before being unbypassed. Failures are reported
SES Control
Usually best indicator of when a controller has previously bypassed a drive
Example annotations:
Port1=7 means Port 1 is disk 7
Port1 R = Not bypassed means the port is Not Bypassed
Port3=9 means Port 3 is disk 9
Port3 R = Not bypassed means the port is Not Bypassed
42
Enclosure
Bit/byte 7 6 5 4 3 2 1 0
0 Element Status Code
1 Ident, Failure Requested, Warning Requested, Failure Indication, Warning Indication
2 CTPC Unit Countdown to Power Down
3 RPOD Unit Duration of Power Down
5
44
Reset Event Sequence (2/4)
Reset on power failure
45
Reset Event Sequence (3/4)
Reset on Watchdog Expiration
Event indicating thread non-responsive
46
Reset Event Sequence (4/4)
Reset on Codeload
47