Troubleshooting
BRKDCT-3708
Ozden Karakok
BRKDCT-3708
Cisco Public
Housekeeping
We value your feedback- don't forget to complete your
online session evaluations after each session & complete
the Overall Conference Evaluation which will be available
online from Thursday
Visit the World of Solutions
Please remember this is a 'non-smoking' venue!
Please switch off your mobile phones
Please make use of the recycling bins provided
Please remember to wear your badge at all times
including the Party
BRKDCT-3708
Cisco Public
Objective
How to use MDS supportability features and tools to
troubleshoot your SAN
PAA-2
BRKDCT-3708
Cisco Public
Agenda
Section 1: Fibre Channel and MDS Basics
FC Operations Review
Addressing, FC Services
Domains, Zoning
ISLs
Zoning
IVR
NPV
BRKDCT-3708
Cisco Public
Section 1
Fibre Channel Operations
Physical Layer Basics
BRKDCT-3708
Cisco Public
FC-2defines how
information is transported
Frames, sequences,
exchanges, login sessions
FC-4s
SCSI
IP
FC-3
Common Services
FC-2
Signaling Protocol
FC-1
Transmission Code
FC-0
Physical Interface
FC-PH
BRKDCT-3708
Cisco Public
BRKDCT-3708
Cisco Public
BRKDCT-3708
Cisco Public
Port B
AC
AC
NOS
LF
LF
OLS
OL
LR
LR
LR = Link Reset
LRR
LR
Idle
AC
Idle
AC = Active State
AC
BRKDCT-3708
Cisco Public
Tip:
Clear Counters and Monitor to Verify Active
Issues, Use Device Manager Monitor Tool to
Monitor Live; Set and Activate Threshold
Manager to Alert You;
MDS# clear counters interface fc 1/11
BRKDCT-3708
Cisco Public
10
Sw
-------------4.1(1c)
4.1(1c)
4.1(1c)
4.1(1c)
4.1(1c)
4.1(1c)
4.1(1c)
4.1(1c)
Mod
--2
3
4
5
6
7
8
10
MAC-Address(es)
-------------------------------------00-14-a9-74-f1-c4 to 00-14-a9-74-f1-c8
00-14-a9-74-f1-dc to 00-14-a9-74-f1-e0
00-14-a9-74-eb-6c to 00-14-a9-74-eb-70
00-17-5a-f8-54-d0 to 00-17-5a-f8-54-d4
00-05-30-00-80-ee to 00-05-30-00-80-f2
00-17-94-4b-22-c8 to 00-17-94-4b-22-cc
00-17-94-4b-22-88 to 00-17-94-4b-22-8c
00-14-a9-74-eb-f0 to 00-14-a9-74-eb-f4
Xbar Ports
--- ----1
0
Hw
-----1.3
1.3
0.403
1.3
0.3
1.3
1.3
0.303
Model
-----------------DS-X9112
DS-X9124
DS-X9148
DS-X9704
DS-X9032
DS-X9530-SF2-K9
DS-X9530-SF2-K9
DS-X9112
Status
-----------ok
ok
ok
ok
ok
active *
ha-standby
ok
World-Wide-Name(s) (WWN)
-------------------------------------------------20:41:00:0d:ec:24:f4:c0 to 20:4c:00:0d:ec:24:f4:c0
20:81:00:0d:ec:24:f4:c0 to 20:98:00:0d:ec:24:f4:c0
20:c1:00:0d:ec:24:f4:c0 to 20:f0:00:0d:ec:24:f4:c0
21:01:00:0d:ec:24:f4:c0 to 21:04:00:0d:ec:24:f4:c0
21:41:00:0d:ec:24:f4:c0 to 21:60:00:0d:ec:24:f4:c0
--22:41:00:0d:ec:24:f4:c0 to 22:4c:00:0d:ec:24:f4:c0
Serial-Num
---------JAB1011002C
JAB1011001L
JAB095102UP
JAB101100V7
JAB063605HP
JAB102000BR
JAB1020009N
JAB095200EK
Module-Type
Model
Status
-------------------------------- ------------------ -----------Fabric Module 2
DS-13SLT-FAB2
ok
Xbar Sw
--- -------------1
NA
Hw
-----0.111
World-Wide-Name(s) (WWN)
--------------------------------------------------Serial-Num
---------JAE1212BO09
Exit to Detach
Mod
--2
3
4
5
6
7
8
10
Xbar MAC-Address(es)
--- -------------------------------------1
NA
BRKDCT-3708
Cisco Public
11
FC Port 1/2
LR_RECEIVE(03) =>ACTIVE(01)
OLS_TRANSMIT(07)
=>LR_RECEIVE(03)
SENTINAL(00)
=>OLS_TRANSMIT(07)
Port Tries
Loop First
When Port Is
Set to Auto
Interface Shut/
No Shut via
Configuration
BRKDCT-3708
MASTER_LIFA_WAIT(19)
ACTIVE(01)
Cisco Public
LPSM_STARTED(01)
=>SENTINAL(00)
12
LC
BRKDCT-3708
014953
014953
014953
014953
014953
014949
usecs
Event
369768 (0000)
368963 (0000)
365690 (0000)
365593 (0001)
360463 (016D)
710690 (413C)
Cisco Public
E_LINK_IDLE
E_LINK_LR
E_LINK_NOS
E_LINK_MIN_OLS
E_LINK_LINK_INIT
E_LINK_CLEANUP
Current State
LINK_ACTIVE
LINK_LR_RX
LINK_NOS_RX
LINK_OLS_TX
LINK_INIT
LINK_DIS
LINK_ACTIVE
13
Section 1
Fibre Channel Operations
Understanding FC
Addressing
BRKDCT-3708
Cisco Public
14
BRKDCT-3708
Cisco Public
15
BRKDCT-3708
Cisco Public
16
BRKDCT-3708
Cisco Public
17
BRKDCT-3708
Cisco Public
18
BRKDCT-3708
Cisco Public
19
Addresses
From Fabric Manager
FCID Domain/Area/Host
Port World Wide Name
0b / 00 / 01
BRKDCT-3708
Cisco Public
20
Cisco Public
21
Addresses
From Performance Manager
BRKDCT-3708
Cisco Public
22
Fabric Services
FC Standards
FFFFFEWell-known address of login server
Basic responsibility to assign FCID
Cisco Public
23
Accept
PLOGI
Accept
SCR
Accept
Name Server Query
Fabric F_Port
Login Server
XFF FF FE
Name Server
XFF FF FC
Fabric Controller
XFF FF FD
Distributed
Name Server
Database;
One for
Each SAN
Name Server
XFF FF FC
Accept
Cisco Public
24
FC Switch Network
Name Server
XFF FF FC
X680100
DB
FSPF
Routing
PRLI
D_ID x650500
S_ID x680100
Accept
D_ID x650500
S_ID x680100
Accept
SCSI Inquiry
Report LUNS
Presents LUNs to Host
BRKDCT-3708
Cisco Public
X680100
Storage on Port
1/6 of Switch
25
Wireshark Uses
Wireshark (Once known as
Ethereal) is part of the SANOS system image and can
be run directly on the
switch via ssh/telnet.
(FCAnalyzer command)
BRKDCT-3708
Cisco Public
26
FLOGI
PLOGI
SCR
BRKDCT-3708
Cisco Public
27
ISL
BRKDCT-3708
Cisco Public
28
Section 1
Fibre Channel Operations
Domains
BRKDCT-3708
Cisco Public
29
Domain IDs
Configured Domains in a VSAN
BF
RCF
BRKDCT-3708
Cisco Public
30
Must do a non-disruptive
(BF) restart
BRKDCT-3708
Cisco Public
31
BRKDCT-3708
Cisco Public
32
Section 1
Fibre Channel Operations
Zoning
BRKDCT-3708
Cisco Public
33
BRKDCT-3708
Cisco Public
34
1/8
7/11
VSAN 100
1/23
Domain 0x4
Domain 0x5
1/2
7/13
3/5
Zoning Choices
Switch FC Interface or Fabric Port WWN (FWWN)
BRKDCT-3708
Cisco Public
35
Cisco Public
36
BRKDCT-3708
Cisco Public
37
ACC
SFCStage Fabric Conf
ACC
UFCUpdate Fabric Conf
Trigger Switches to
Activate the New Zoneset
ACC
RCARelease Change Auth
ACC
Cisco Public
38
Enhanced Zoning
Enhanced zoning provides the following advantages
Disallow parallel configuration attempts
Standardized generation of RSCN
Reduced payload size of the SFC frame
Fabric-wide policy enforcement (default zone, merge control)
Enhanced error reporting
Distributing zonesets without activation
Unique Vendor Types
FWWN based member type standardized
Enhanced interop thru ESS (Exchange Switch Support) defined in SW-3
BRKDCT-3708
Cisco Public
39
BRKDCT-3708
Cisco Public
40
Cisco Public
41
Recommendations
To be able to manage zones from any switch in the
fabric, configure all switches/VSANs for distribute full
zone database.
To manage zone changes from only 1 switch in the
fabric, leave the default configuration of distribute
active zone database only.
Avoid inconsistent zone distribution policies.
BRKDCT-3708
Cisco Public
42
Section 2
MDS Serviceability Tools
FC Analyzer
BRKDCT-3708
Cisco Public
43
Run Wireshark
on IP Host
iSCSI.server1
FC Analyzer
Local
SI
iSC
iS
C
SI
iSCSI.server2
iSCSIserver3
iSCSI.server4
Server2
Terminal
Server3
PAA
Server1
Storage
Ethernet
PC
FC Analyzer
Remote
Out-of-Band
Management Network
Wireshark
PC
Cisco Public
44
BRKDCT-3708
Cisco Public
45
Cisco Public
46
FC Analyzer Options
Local or remoteWhere to send the trace. To local
devices or remote PAA attached to different MDS
switch.
Brief or detailedHeader information vs. full output of
frame including hex. Detail is default.
Limit-captured-framesNumber of frames to capture.
Default is FC analyzer will trace 100 frames. Specifying
zero is unlimited frame capture.
Limit-capture-sizeAllows to capture N number of
bytes of frame.
BRKDCT-3708
Cisco Public
47
Display-Filter Options
When not specified, FC analyzer will capture all traffic on
VSAN 1
Example: fcanalyzer local brief
Cisco Public
48
Cisco Public
49
Capture Is Done in
Configuration Mode
BRKDCT-3708
Cisco Public
2.829871
2.853261
2.853422
2.853592
2.853565
2.859648
2.860885
2.861007
2.861175
2.862053
2.865904
2.865981
2.866153
2.866297
2.866445
2.866496
2.866615
2.868792
2.868857
2.871132
2.872013
2.872021
2.872139
2.872163
2.872234
2.891239
2.891359
2.891469
2.891613
2.900160
2.900394
2.901916
2.902151
2.908296
2.908444
2.919880
50
Section 2
MDS Serviceability Tools
SPAN and PAA
BRKDCT-3708
Cisco Public
51
BRKDCT-3708
Cisco Public
52
FC Linecard
MDS9124
Only 1 SPAN session, 1 Direction at a time
BRKDCT-3708
IBM Blade
Cisco Public
53
Section 2
MDS Serviceability Tools
SAN-OS Tools
BRKDCT-3708
Cisco Public
54
BRKDCT-3708
Cisco Public
55
Debug Logging
1. MDS-9216i# debug logfile networkers_debugs size 5000
2. Display captured debug to screen
Tip: use show debug to see name of debug file
MDS-9216i# show debug logfile networkers_debugs
Cisco Public
56
BRKDCT-3708
Cisco Public
57
57
BRKDCT-3708
Cisco Public
58
Section 2
MDS Serviceability Tools
Performance Manager
BRKDCT-3708
Cisco Public
59
BRKDCT-3708
Cisco Public
60
Accounting
BRKDCT-3708
Cisco Public
61
BRKDCT-3708
Cisco Public
62
BRKDCT-3708
Cisco Public
63
Section 3
Troubleshooting
with the Tools
Device Connections
and ISL
BRKDCT-3708
Cisco Public
64
MDS 9124
Simple SAN FC Device Operation
MDS 9124 1 SPAN session, look at Tx first then Rx
SPAN Host session to SD port so to see complete FC
login and monitor SCSI traffic
PAA
Ethernet
Fibre Channel
Wireshark
Application
Cisco Public
65
Response Side
side of Trace
BRKDCT-3708
Cisco Public
66
BRKDCT-3708
Cisco Public
67
Wireshark
Application
BRKDCT-3708
Cisco Public
68
ISL Trace
ACK1 Filter Applied
Exchange Link Parameters
Exchange Switch Capabilities
Exchange Peer Parameters
(This Is Proprietary to MDS Only
to Determine if Connecting
Switch Is Another MDS)
Exchange Fabric Parameters
Build Fabric
Domain ID Assign by Existing
Principal Switch
Request Domain ID from
New Switch
Enhanced Zoning Merge Request
Resource Allocation (New in FCSW3 Standard)
Zone Merge Request
Break down of how the Domains were
selected and distributed on next slide
BRKDCT-3708
Cisco Public
69
VSAN 100
BRKDCT-3708
Cisco Public
70
Section 3
Troubleshooting
with the Tools
Zoning
BRKDCT-3708
Cisco Public
71
Debugging Zoning
Understanding what is Active
Fabric Manager / CLI
Basic/Enhanced
Zoneset activate failure?
Zoneset merge failure?
Members not able to communicate?
Host can not see storage?
BRKDCT-3708
Cisco Public
72
BRKDCT-3708
Cisco Public
73
BRKDCT-3708
Cisco Public
74
BRKDCT-3708
Cisco Public
Command to view
change history for
VSAN 100
One event.
Time of event is noted
previous state, event, and
next state are shown.
Domain that initiated the
event is also seen
75
BRKDCT-3708
Cisco Public
76
BRKDCT-3708
Cisco Public
77
LUNs
BRKDCT-3708
Targets
Cisco Public
78
Success
BRKDCT-3708
Cisco Public
79
that is deemed
undesirable
Command to bypass
merge checking and force
import of zone data
<show port internal info> will give greater detail on merger failure reason
BRKDCT-3708
Cisco Public
80
VSAN 10 is no
longer isolated
VSAN 10 zoneset
has been imported
Entry in the
accounting log
reflecting the
import action
Cisco Public
81
Different
member
Same zone
name
Cisco Public
82
VSANs with no
merge conflicts
will show here.
VSAN 1 is
isolated
Cisco Public
83
BRKDCT-3708
Cisco Public
84
BRKDCT-3708
Cisco Public
85
MR
command
code
BRKDCT-3708
Cisco Public
86
BRKDCT-3708
Cisco Public
87
Cisco Public
88
Best Practices
Make periodic backups of zoning database
Prior to any changes, make a backup of the current
zoning
Single initiator zones
Meaningful zone names
Default-zone set to deny
Distribute full zoneset
Use alias, device-alias preferred
Manage from designated seed switch or switches
BRKDCT-3708
Cisco Public
89
Section 3
Troubleshooting
with the Tools
Inter VSAN Routing
BRKDCT-3708
Cisco Public
90
Reading IVR
Where, What
VSAN10
VSAN 200
BRKDCT-3708
Cisco Public
91
20:00:00:05:30:00:58:de yes
yes 2,4
BRKDCT-3708
Cisco Public
92
BRKDCT-3708
Cisco Public
93
BRKDCT-3708
Cisco Public
94
Zoning
Keep default zone policy at deny
Manage local zones from IVR enabled switch
Cisco Public
95
IVR Troubleshooting
Database checks
Zone checks
Do FC devices show
correctly in the Transit
VSAN, if one is in use?
Cisco Public
Miscellaneous Checks
Is it possible that a natted
FCID changed because of a
reload causing AIX or HP-UX
to have target binding
issues?
96
BRKDCT-3708
Cisco Public
97
BRKDCT-3708
Cisco Public
98
Section 3
Troubleshooting
with the Tools
NPV Troubleshooting
BRKDCT-3708
Cisco Public
99
fwwn of Port P2
P1
VSAN WWN of NPV
NP
fc1/2
IP Address of NPV
P2
Switch name of NPV &
Interface name of P2
NPV Switch
fwwn of Port P1
BRKDCT-3708
Cisco Public
100
F
NP
fc1/2
P1
P2
NPV
VSAN:1
FCID:0x331e00
-----------------------port-wwn (vendor)
node-wwn
class
node-ip-addr
ipa
fc4-types:fc4_features
symbolic-port-name
:2f:ff:00:06:2b:10:c4:4c (LSI)
:2f:ff:00:06:2b:10:c4:4c
:3
:0.0.0.0
:ff ff ff ff ff ff ff ff
:scsi-fcp:init
:LSI7404XP-LC BR A.1 03-01081-03A
FW:01.03.17 Port 0
symbolic-node-name
:
port-type
:N
port-ip-addr
:0.0.0.0
fabric-port-wwn
:20:02:00:0d:ec:04:99:40
hard-addr
:0x000000
permanent-port-wwn (vendor) :20:02:00:0d:ec:2f:c1:40 (Cisco)
F P3
fwwn of Port P2
fc1/1
fwwn of Port P1
P4
FCID 0X331e00
BRKDCT-3708
Cisco Public
101
NP
Ports
NPV Switch
FC
P2
P1
NP Port
P1
P2
(fc1/7, fc1/21)
P3
P3
BRKDCT-3708
Cisco Public
102
(shut no shut)
If no NP port is available
F ports would remain in down state waiting for an NP port
NP
Ports
P1
P2 P3
NPV
F F F
BRKDCT-3708
Cisco Public
103
NP Ports
P2
P1
P3
FC
BRKDCT-3708
Cisco Public
104
FLOGI (acc)
PLOGI (acc)
ACC
PLOGI (acc)
F
FLOGI
NP
PLOGI
FDISC
F
FLOGI
BRKDCT-3708
Cisco Public
PLOGI
PRLI
105
FDISC- converted
FLOGI or connecting
host or storage device
on the NPV switch
PLOGI accept to the
real HBA on the MDS
NPV switch, FCID of
attached device is
0b0002
Remaining exchanges
are from device 0b0002
to the code MDS switch
BRKDCT-3708
Cisco Public
106
There is no familiar fcns or flogi database to view, for these services are
not running in a NPV enabled switch
show npv flogi-table
show npv status
show tech-support npv
show npv internal event-history { errors | events | ext-if-fsm [ <interface > ] |
flogi-fsm [ interface < interface > | pwwn <wwn> ] }
show npv internal event-history msgs
show npv internal event-history svr-if-fsm [ interface < interface > ]
show
show
show
show
show
show
show
npv
npv
npv
npv
npv
npv
npv
internal
internal
internal
internal
internal
internal
internal
info
info external-interface { all | < interface > }
info global
info interface { all | < interface > }
info server-interface { all | < interface > }
mem-stats [ detail ]
pending-queue interface { all | < interface >
BRKDCT-3708
Cisco Public
107
The following show commands can be used in the NPV-core switch to display
information on the NPV devices. Since these outputs are based on the name
server information, this command can be run from any non-NPV MDS switch
running 3.2(1) or later release
show
show
fcns
fcns
database
database
npv
npv
Example Outputs
npv# show fcns database npv
VSAN 1:
------------------------------------------------------------------------------NPV NODE-NAME
NPV IP_ADDR
NPV IF CORE SWITCH WWN
CORE IF
------------------------------------------------------------------------------20:00:00:0d:ec:3d:62:80 10.1.96.24
fc1/20 20:00:00:0d:ec:2d:af:40 fc4/4
20:00:00:0d:ec:3d:62:80 10.1.96.24
fc1/19 20:00:00:0d:ec:2d:af:40 fc4/3
20:00:00:0d:ec:3d:62:80 10.1.96.24
fc1/17 20:00:00:0d:ec:2d:af:40 fc4/1
...
BRKDCT-3708
Cisco Public
108
Wrap-up
Cornerstone to SAN network troubleshooting is to
understand Standards operation
Each SAN-OS feature has methods of troubleshooting,
Basic most seen issues were covered in this session
Familiarity with available tools in SAN-OS will narrow time
to resolution if either O/S bug or operational issues
Have other insight on HBA operations & firmware,
Array tools and Applications
Interop with Brocade/McData requires another skill set
Understanding architecture of Cisco Fabric Manager
deployments and best practices also a plus
BRKDCT-3708
Cisco Public
109
Appendix - Extras
BRKDCT-3708
Cisco Public
110
Core Dumps
BRKDCT-3708
Cisco Public
111
BRKDCT-3708
Cisco Public
112
Cisco Public
113
BRKDCT-3708
Cisco Public
114
BRKDCT-3708
Cisco Public
115
Cisco Public
116
BRKDCT-3708
Cisco Public
117
Core Dumps
Show cores
MDS-9509# sh cores
Module-num
----------------1
Process-name
PID
Core-create-time
--------------------- ---------cimxmlserver
20029
----------------------Jul 18 08:39
BRKDCT-3708
Core Log-create-time
------- -------------------------------N Tue Dec 7 19:08:09 2004
N Tue Dec 7 19:08:20 2004
N Tue Dec 7 19:08:30 2004
N Sun Sep 23 18:47:15 2007
N Sun Sep 23 18:47:17 2007
N Thu Sep 7 13:29:12 2006
N Thu Sep 7 13:30:12 2006
N Thu Sep 7 13:31:12 2006
Cisco Public
118
Q and A
BRKDCT-3708
Cisco Public
119
BRKDCT-3708
Cisco Public
120
Recommended Reading
BRKDCT-3708
Cisco Public
121
BRKDCT-3708
Cisco Public
122