Best Practices to Deploy High-Availability in Service Provider Edge and Aggregation Architectures
BRKSPG-2402
Cisco Public
Abstract
user, overall Service Availability becomes increasingly important. High Availability techniques such as Fast Convergence or MPLS TE FRR have focused on raising the availability of the network core in the past. Recently, these techniques are being increasingly deployed in
As Service Providers are deploying value-added triple-play or quadruple play services to maintain or generate a higher average revenue per
Ethernet Aggregation networks, for example by introducing MPLS TE FRR in the aggregation. Also, additional high-availability mechanism are being developed to enhance the resilience of the IP Edge against failures. Examples of new developments include IP Fast-Reroute, BGP
Prefix Independent Convergence for both the Core and Edge, or even stateful application inter-chassis redundancy mechanisms to overcome
single-system outages. This Session aims to provide the audience with best current practices to increase service availability by deploying Cisco High-Availability mechanisms in both the Aggregation and the IP Edge. Traditional HA techniques such as NSF/SSO, BFD, Fast convergence or NSR are reviewed. The details of new technologies such as IP FRR, BGP PIC are discussed in depth. Furthermore,
advanced topics such as achieving HA for Layer 4-7 services or stateful inter-chassis redundancy solutions are introduced. The Session also
provides the best current practices of deploying the tools offered by the Cisco High-availability toolset, in particular the deployment of MPLS TE FRR in the aggregation. Furthermore, possible stateful and stateless clustering approaches are introduced, which SPs may use to increase the availability of their IP Edge architecture.
BRKSPG-2402
Cisco Public
Glossary
NHAT ACL ACT APS ARP AS ATM BFD BNG BW CC CC CDR CE CE CF CFM CLI CM CP CPLD CSC DHCP DP DPM DSLAM E2E ECMP EEM EOAM
BRKSPG-2402
next hop address tracking Access Control List Active Automatic Protection Switching Address Resolution Protocol autonomous System Asynchronous Transfer Mode Bi Directional Forwarding Detection Broadband Network Gateway Bandwidth Continuity Check control connection call detail record Customer Edge Customer Edge checkpoint facility Configuration and Fault Management Command Line Interface Chassis Manager Control Plane Complex Programmable Logic Device ? Carrier's Carrier Dynamic Host Configuration Protocol Data Plane Defects per Million DSL Access Multiplexer End to end equal cost multipath Embedded Event Manager Ethernet OAM
EOBC ESP EVC EVDO FECP FIB FM FR FRR FSOL FWLB GEC GLBP GR GRE GW HA HSRP HW IETF IF IGP IOCP IOS IP IPC ISG iSPF ISSU IWF
Ethernet out of band management Embedded Services Processor Ethernet Virtual Circuit Evolution Data Only Forwarding Engine Control Processor Forwarding Information Base Forwarding Manager Frame Relay Fast Re Route First Sign of Life Firewall Loadbalancing Gigabit Ether Channel Global Load Balancing Protocol Graceful Restart Generic Route Encapsulation Gateway High Availability Hot Standby Routing Protocol Hardware Internet Engineering Task Force Interface Internal Gateway Protocol Input Output control Processor Internet Operating System Internet Protocol Inter process communication Intelligent Services Gateway incremental Shortest Path First in service software upgrade Interworking function
Cisco Public
4
Glossary (Cont.)
L2TP LAC LACP LAN LC LDP LFA LI LMI LNS LOS LSDB LSP LTE MC LAG mcast MD5 MFIB MLD MME MoFRR MPLS MRIB MSC MSPP MST MTBF MTSO MTTR NAT
BRKSPG-2402
Layer 2 transport protocol L2TP access concentrator Link aggregation control Protocol Local Area Network Linecard label Distribution Protocol loop free alternate Lawful Intercept Local management interface L2TP network Server Loss of signal link state database label switched path long term evolution multi chassis link aggregation multicast message Digest algorithm 5 multicast forwarding information base multicast listener discovery mobile management entity Multicast Only fast reroute Multiprotocol label switching multicast routing information base mobile switching center Multi-service provisioning platform Minimum spanning tree mean time between failures mobile telephone switching office mean time to repair network address translation
NIC Nr Ns NSF NSR NVRAM OAM OCE OIR OS PADR PE PIC PIM PPP PS PSN PTA PVRSTP PW QFP RADIUS RF RMA RNC RP RPR RSP RSVP SAA
network interface card receive sequence number send sequence number non stop forwarding non stop routing non volatile random access memory operations, administration and maintenance Object Chain Element online insertion and removal operating system PPP active discovery provider edge prefix independent convergence protocol independent multicast Point to point protocol power supply Packet Switched Network PPP termination and aggregation Per VLAN rapid spanning tree pseudowire Quantum flow Processor remote authentication dial in user service redundancy facility Return material authorization radio network controller route processor route processor redundancy route switch processor resource reservation protocol service assurance agent
Cisco Public
5
Glossary (Cont.)
SBC SBY SGW SIP SLA SLB SP SPA SPF SRLG SSH SSO STP SW T&C TCAM TE TR UC uRPF VAI VC VCCV VIP VLAN VMAC VPN VRF VRRP WAN
BRKSPG-2402
session border controller standby SAE gateway Session initiation protocol service level assurance server loadbalancing service Provider Shared port adapter shortest path first shared risk link group secure shell stateful switchover spanning tree protocol software terms & conditions ternary content addressable memory traffic engineering Traceroute unified communications unicast reverse path forwarding virtual access interface Virtual Circuit VC connection verification virtual IP virtual LAN virtual MAC virtual private network virtual routing and forwarding table virtual router redundancy protocol wide area network
Cisco Public
6
Agenda
Motivation for High Availability in SP Aggregation Networks
Network Level High Availability System High Availability Service High Availability Stateful Inter-chassis Redundancy
Case Studies
Summary and Conclusions
BRKSPG-2402
Cisco Public
Metrics
Service Availability (averaged over time) Mean time to repair (MTTR) Packet Loss / Delay / Jitter
Examples
AT&T Sprint Verizon Business BT Level 3
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
DPM
10000 5000 1000 500 100
Downtime per Year (24x365) 3 Days 1 Day 15 Hours 19 Hours 8 Hours 4 Hours 36 Minutes 48 Minutes 46 Minutes 23 Minutes 53 Minutes 5 Minutes 30 Seconds Predictive High Availability Proactive Reactive
10 1
Two ways to state availability of a network: Percentage Method DPM Method = Defects per Million (Hours of Running Time)
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
10
Availability Definitions
MTBF Availability = MTBF + MTTR
Uptime divided by the total time to create the percentage time your network is operational MTBF is Mean Time Between Failure
When does it fail?
Cisco Public
11
A Series =
AK
k =1
= A1 A2 .... AN
(1 - A k )
K =1
= 1 - (1 - A1 ) .... (1 - AN )
Cisco Public
12
Reduction of MTTR
Stateful inter-chassis redundancy allows for additional resilience against
System Failures Interface Failures
Product ID ASR1000-RP2 ASR1000-ESP20 ASR1000-SIP10 ASR1006 ASR1006-PWR-AC ASR1006-PWR-DC ASR1000-SIP40 ASR1000-ESP40 SPA-8X1GE-V2 SPA-1X10GE-L-V2
MTBF (hrs) 380532 335317 287549 1986649 570776 357781 283225 118790 482023 411892
Interface failures
Power outages
Goal of stateful inter-chassis redundancy is subsecond failover with state preservation for applications
BRKSPG-2402
Cisco Public
13
IF1
IF2
CPU
NPE-400
P/S P/S
PWR-7200-AC
PA-E3 PA-POS-OC3
Cisco 7206
IOS =
IF1 = IF2 =
CPU =
490.000 = 0.999992 490.000 + 4 460.000 BB = = 0.999983 460.000 + 8 750.000 = 0.999995 P/S = 750.000 + 4
Cisco Public
14
R3
R4
but not considered: -Links (WAN, LAN) -Computer NICs -Computer OS -Computer Applications
Cisco Public
15
Embedded Embed intelligent event management for proactive maintenance Management Automation and configuration management to reduce human and Automation errors
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
16
Access Layer
BRKSPG-2402
Cisco Public
17
Network-Level Resiliency
Network Design Resiliency
Dual-homing APS, GEC, MC-LAG
SP Core IP Edge DC
Internet
Graceful Restart (MBGP, OSPF, RSVP, LDP) EMCP, Anycast, dual RR VRRP/HSRP/GLBP/SLB/FWLB MPLS High Availability LDP Graceful Restart MPLS/VPN NSF BFD MPLS FRR Path Protection MoFRR IP FRR Pseudowire Redundancy Spanning Tree (MST, PVRSTP...) ..................
BRKSPG-2402
Access
Cisco Public
18
Redundant System Components (route processors, forwarding processors, power supplies, etc.)
Operational costs
Lower steady-state Utilization levels
Increased configuration and management Tighter maintenance windows
0% Availability 100%
BRKSPG-2402
Cisco Public
19
Values for AN, Agg1, Agg2, Agg3 and Edge nodes only (No Pp-routers). Cumulative redundancy Schemes, GPL
Residential Residential
MPLS Aggregation
Edge
Core
AN 200,000 Generic
P 74 CRS-3
20
Aggregation
3
7
5 4
SP Core
1
0 2
0 System-level HA (Baseline) RSP failover: 0 packet loss All L3 protocols are NSF capable NSR: OSPF, ISIS, BGP Routing timers and protocol configs are optimized by default
BRKSPG-2402
Cisco Public
22
HA Network Map
Access Aggregation Edge Core
Access Domain
<->
AGG
<->
<->
Keepalives NSR / NSF HSRP / VRRP/GLBP/SLB/FWLB Multicast HA Keepalives PPP / FR / ATM / HDLC / GE SSO Interrupts Module Redundancy
Cisco Public
BFD, Keepalives ECMP, iSPF, BGP PIC Core / Edge, IP / MPLS FRR, LNS Load sharing / Anycast / Dual RR, Fast Hello EOAM, MPLS Ping / TR GEC, APS, MC-LAG
L2
EOAM, (VCCV) GEC / APS / MC-LAG Loss of Signal Path diversity / dual homing
VCCV, EOAM, MPLS Ping / TR GEC / APS / MCLAG Loss of Signal Path diversity / dual homing
L0/1
BRKSPG-2402
23
HA Network Map
Access Aggregation Edge Core
Access Domain
<->
AGG
<->
<->
Keepalives NSR / NSF HSRP / VRRP/GLBP/SLB/FWLB Multicast HA Keepalives PPP / FR / ATM / HDLC / GE SSO Interrupts Module Redundancy
Cisco Public
BFD, Keepalives ECMP, iSPF, BGP PIC Core / Edge, IP / MPLS FRR, LNS Load sharing / Anycast / Dual RR, Fast Hello EOAM, MPLS Ping / TR GEC, APS, MC-LAG
L2
EOAM, (VCCV) GEC / APS / MC-LAG Loss of Signal Path diversity / dual homing
VCCV, EOAM, MPLS Ping / TR GEC / APS / MCLAG Loss of Signal Path diversity / dual homing
L0/1
BRKSPG-2402
24
CC CP
CC DP
Loopback
-
Performance Traceroute
-
IEEE 802.1ah
LSP Ping
-
LSP TR
Cisco Public
IP TR
25
BRKSPG-2202
Family of protocols that provides capabilities to detect, verify, isolate and report end-to-end ethernet connectivity faults
Protocols (Continuity Check, Loopback and Linktrace) used for Fault Management activities
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
26
BRKSPG-2202
Core
E-LMI 802.3ah
L2 connectivity management
IEEE 802.3ah: When applicable, physical connectivity management between devices. IEEE 802.1ag: Connectivity Fault Management (CFM) Uses Domains to contain OAM flows and bound OAM responsibilities Provides per EVC connectivity management and fault isolation Three types of packets: Continuity Check, L2 Ping, L2 Traceroute
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
27
Maintenance Domain
Maintenance Association
Catalogued by MIPs at the same MD-Level and service, Terminated by remote MEPs in the same MA
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
28
Ethernet Loopback (ETH-LB) (Y.1731 adds multicast LBM) Ethernet Linktrace (ETH-LT) Ethernet Remote Defect Indication (ETH-RDI) Ethernet Alarm Indication Signal (ETH-AIS) Ethernet Locked Signal (ETH-LCK) In addition: ETH-TEST, ETH-APS, ETH-MCC, ETH-EXP, ETH-VSP
Cisco Public
29
Type 1
(in-band vccv)
Signal out-of-band VCCV inserting MPLS router alert label between tunnel and PW Labels Manipulate and Signal TTL exhaust (TTL == 1) for multiple switching point PEs
marks the payload as control packet for switching purpose; packet follows the PW data path Control packets sent over the AToM tunnels are intercepted by the egress PE
BRKSPG-2402
Cisco Public
30
PE2
AC
PW Status Signaling method selected if supported by both peers. PEs exchange label mapping messages upon PW configuration. Simple Label Withdraw status method will be used if one of the peers doesnt support PW Status Signaling. PW label wont be withdrawn unless AC is administratively down or the PW configuration is deleted. PW state set to down if the Label mapping is not available. Capability is on by default.
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
31
CE1
PE1
Native Service
PW1
PW2
PE2
CE2
Native Service
Motivation
One tunnel can serve many pseudo-wires. MPLS LSP ping is sufficient to monitor the PSN tunnel (PE-PE connectivity), but not Virtual Circuits (VCs) inside of tunnel.
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
32
BRKSPG-2402
Cisco Public
33
BFD Details
Async Mode
Session established between two peers
Timers are negotiated Hello packets similar to IGP control packets Does NOT react to failures itself -> notifies clients
green is alive
orange is alive
Echo Mode
orange is alive
Echo mode: echo packets sent at negotiated rate, used for failure detection
Control packets sent at low rate
green is alive
Cisco Public
34
10s of ms
100s of ms
10s of ms
BRKSPG-2402
Cisco Public
35
Report failure to Route Controller Generate and flood an LSP Trigger and Compute an SPF
10s of ms
BGP PIC
100s of ms
10s of ms
BRKSPG-2402
Cisco Public
36
Hierarchical CEF
Optimizes the data plane for sub-second convergence CEF Data Structure Enhancements Solves the FIB Download Convergence Bottleneck
LSP and Prefix Independent Optimizes FIB
Default IP to MPLS
Failure
BGP Prefix FIB Entry BGP Prefix FIB Entry BGP Prefix FIB Entry MPLS Label OCE MPLS Label OCE MPLS Label OCE Adjacency OCE - Interface
Repair
BGP Prefix FIB Entry BGP Prefix FIB Entry BGP Prefix FIB Entry MPLS Label OCE MPLS Label OCE MPLS Label OCE
Repair
BGP Prefix FIB Entry BGP Prefix FIB Entry BGP Prefix FIB Entry MPLS Label OCE MPLS Label OCE MPLS Label OCE Load Balance OCE
Cisco Public
37
BRKRST-3363
BW Reservation
Per Tunnel Traffic Statistics
Link Failure
IP/MPLS Aggregation
Caveats
Requires MPLS and MPLS-TE No Protection for Ingress or Egress Tunnel Failures Requires Pre-Computed Backup Paths Requires (n-1)! Tunnels for Full Protection
Applicability
Protecting Links in the aggregation network
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
MPLS-FRR CEF
MPLS-FRR IP and MPLS
Failure
Loadbalance OCE
IP & MPLS CEF
Midchain OCE
Label OCE
Repair
Loadbalance OCE
IP & MPLS CEF
Typical FIB Programming Rate - ~5000 10,000 CEF Updates per second
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
39
Link Failure
IP/MPLS Aggregation
Caveats
Requires MPLS and MPLS-TE No Protection for Ingress or Egress Tunnel Failures
Applicability
Protecting Ring Topologies
Tunnel LSP
BRKSPG-2402
Cisco Public
40
Midchain OCE
Midchain OCE
One CEF Update Message per IGP Prefix and L2VPN LSP!
Loadbalance OCE Loadbalance OCE Midchain OCE Label OCE Adjacency OCE - Interface
Typical FIB Programming Rate - ~5000 10,000 CEF Updates per second
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
41
BRKRST-3363
IP FRR-LFA: 50 ms Convergence
R5
Key Features
50 msec Convergence for Link and Node Failures Works for MPLS and IP Only Environments Simple Automatic configuration of Loop Free Alternate Paths via OSPF or ISIS No Tunnels
Link Failure
R3
R4
Caveats
Requires a Loop Free Path for Protection No Bandwidth Reservation No Support for SRLG New Feature
R1
R2
Applicability
Strong Solution for Deployments with Cost Effective Bandwidth
No Convergence Required on Routers R2, R3, R4 and R5 to Maintain Green Traffic Flow!
Cisco Public
42
BRKSPG-2402
Two-plane network topologies are most friendly for IPFRR LFA deployments
Topology analysis required to assess IPFRR LFA efficiency
Two-plane Network Topology
LSDB = Link State DataBase Cisco PublicLDP = Label Discovery Protocol
BRKSPG-2402
43
IP FRR-LFA IP to MPLS
Repair
IP Prefix FIB Entry IP Prefix FIB Entry IP Prefix FIB Entry Load Balance OCE
IP-FRR OCE
Typical FIB Programming Rate - ~5000 10,000 CEF Updates per second
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
44
P1
PE1
CE1
Routers to maintain forwarding state when communication between them is lost Routing sessions are established with NSF aware peers. Upon HA event, neighboring peers maintain forwarding until routing sessions are reestablished. Copy of FIB maintained on secondary and used on failure for continuously traffic flow. Requires neighboring routers to be NSF aware.
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
45
BRKRST-3363
Applicability
PE Routers
VPN1
CE1 PE1 PE3 CE2
VPN1
BRKSPG-2402
Cisco Public
46
PE2
RR2
PE3
PE1
BRKSPG-2402
Cisco Public
47
PE2
CE1
PE1
BRKSPG-2402
Cisco Public
48
PE2
CE1
PE1
PE-CE link Failure Detects that link is down and CEF layer will switch to precomputed backup path
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
49
PE2
RR1 CE2
CE1
RR2
The best BGP path to CE1 is now through PE2
PE3
PE1
PE-CE link Failure
BRKSPG-2402
Cisco Public
50
MPLS-VPN
PE3 CE2
PE1 and PE2 precomputes bgp backup paths using bgp best-external approach When primary link PE1 - CE1 fails:
PE1 holds on to the bgp local labels and re-routes CE1s traffic to PE2 using labels advertised by PE2 PE1 uses fixed timer to clean up stale local labels
PE3 is expected to converge to start using PE2 as the BGP nexthop and IGP label for PE2 to send traffic from CE2 to CE1
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
51
RR2
The best BGP path to CE1 is through PE1 PE3
PE1
BRKSPG-2402
Cisco Public
52
PE2
RR1 CE2 RR2 The best BGP path to CE1 is through PE1 PE3 CE1
PE1
BRKSPG-2402
Cisco Public
53
PE2
Detects that PE1 is down and CEF layer will switch to precomputed backup path
PE3
PE1
BRKSPG-2402
Cisco Public
54
PE2
RR1
CE2
RR2
CE1
next BGP next-hop scan the path through PE2 will become the best Path
PE3
PE1
BRKSPG-2402
Cisco Public
55
PE2
PE3 CE2
MPLS-VPN
CE1 PE1
On Primary PE: router bgp 100 address-family ipv4 vrf red bgp additional-paths install bgp advertise-best-external
On Ingress PE: router bgp 100 address-family ipv4 vrf red bgp additional-paths install
PE1, PE2 and PE3 precomputes bgp backup When node PE1 fails:
IGP notification on PE3 invalidates active path PE3 switches to backup path
PE3 is expected to converge to start using PE2 as the BGP nexthop and IGP label for PE2 to send traffic from CE2 to CE1
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
56
BRKSPG-2402
Cisco Public
57
Repair
BGP Prefix FIB Entry BGP Prefix FIB Entry BGP Prefix FIB Entry
Typical FIB Programming Rate - ~5000 10,000 CEF Updates per second
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
58
3
CE1
CE2
VPN1 site1
PE3
VPN1 Site2
PE1
Examples: PE-P or P-P link failure, P node failure Sub-second convergence (prefix independent) vs. multiple seconds convergence (prefix and hardware dependent) Enabled by default since IOS XE 2.5.0 (cef table output-chain build favor
convergence-speed)
when remote PE node fails or no longer reachable. when PE-CE link fails. Immediate to sub-second convergence (prefix independent) vs. multiple seconds convergence (prefix and hardware dependent)
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
59
Operational Simplicity
O(x00ms)
Provides local protection (link, node) with <50msec recovery Tool to improve on IGP FC for most topologies (triangle, square, mesh)
<50ms <50ms
O(x00ms)
Feasible to deliver very tight E2E Service Availability SLAs without increasing operational complexity
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
60
Interface Redundancy typically achieved using IEEE 802.3ad / LACP or APS Hardware Redundancy needs to be complemented by Software redundancy Features Cisco Platforms supporting hardware redundancy
CRS-3
BRKSPG-2402
ASR 9000
ASR 5000
ASR 1000
Cisco 12000
Cisco Public
Cisco 7600
62
RSP(s)
(Autonomous Forwarding)
CPU
BITS/DTI
BITS/DTI
BRKSPG-2402
Cisco Public
63
Control Plane
Embedded Services Processor (standby)
FECP
RP
RP
QFP subsystem
QFP subsystem
Crypto assist
QFP subsys-tem
Crypto assist
QFP subsys-tem
Interconn.
Interconn.
Interconn.
Interconn.
Passive Midplane
Interconn. Interconn. Interconn. Interconn. Interconn.
Passive Midplane
Interconn.
SPA Agg.
IOCP
SPA Agg.
IOCP
SPA Agg.
IOCP
SPA
Agg.
IOCP
SPA
Agg.
IOCP
SPA
Agg.
IOCP
SPA
SPA
SPA
SPA
SPA
SPA
SPA
SPA
SPA
SPA
SPA
SPA
BRKSPG-2402
Cisco Public
64
RP
CPU
Chassis Mgr.
IOS
Forwarding Mgr.
ESP
FECP
IOCP
Chassis Mgr.
SPA Agg.
SPA
BRKSPG-2402
Cisco Public
65
IOSact
IOSsby
Non-HA-Aware Application
RPsby
CF
IPC Message Qs
I P C
RF
MRIB RT
I P C
CF
IPC Message Qs
Config MLD
RF
MFIB FIB
RIB
IDB
FMRP
MFIB FIB
FMRP
ESPact
FMESP QFP Client
ESPsby
HA operates in a similar manner to other protocols on the ASR 1000 Reliable IPC transport used for synchronization
FMESP
QFP Client
SPAs
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
66
BRKSPG-2402
Cisco Public
67
FECP
Detected using
CPLD interrupts / register bits within O(ms) controlled by CMRP
Interconn. Interconn.
QFP subsys-tem
Crypto assist
Watchdog timers: low level watchdogs running in O(min) that can initiate a reset (e.g. RP)
SIP IOCP
JTAG: RP can program CPLD on other modules. Test interconnects and other boards (primarily for RMAd hardware)
Interconn.
SPA Agg.
Hardware failures are typically fatal such that modules need to be replaced!
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
SPA
SPA
Cisco Public
68
Chassis Mgr.
IOS
Forwarding Mgr.
ESP
Detected using
Kernel: the kernel supervises middleware or SPA driver processes (kmonitor()). It always knows if a process is healthy IPC: between 2 IOS (and only for IOS)
Interconn.
Crypto assist
IOCP
SPA SPA drive SPA SPA r drive r drive r drive r
Chassis Mgr.
Note: some other processes are re-startable (CMRP, FMRP, SSH, Telnet)
Kernel will try to re-start the processes in this case
SPA
SPA
BRKSPG-2402
Cisco Public
69
ESP (slot 1)
FMESP CMESP Kernel
SBY
RP (slot 0)
CMRP FMRP Kernel IOS
ACT Failure
RP (slot 1)
CMRP FMRP Kernel IOS
SBY
State information If not received in time, send restart message. Update H/W component file system
Cisco Public
70
ESP (slot 1)
FMESP CMESP Kernel
RP (slot 0)
CMRP FMRP Kernel IOS
RP (slot 1)
CMRP FMRP Kernel IOS
Failover Take-over control using checkpointed state
Forwarding State information Check updated state and discard old state Check updated state and discard old state
Service recovered
H/W initialization Initialize EOBC start Start kernel Start IOS
Start CM Start FM
Detect RPsby
ESP (slot 1)
FMESP CMESP Kernel
SBY Interrupt SBY
RP (slot 0)
CMRP FMRP Kernel IOS
ACT ACT
RP (slot 1)
CMRP FMRP Kernel IOS
SBY
State information of failed ESP Failed Disable ESI link w/ failed ESP Change state of ESI link w/ new ESPact ACT Reconfigure ESI link w/ RPs
BRKSPG-2402
Cisco Public
72
ESP (slot 1)
FMESP CMESP Kernel
RP (slot 0)
CMRP FMRP Kernel IOS
RP (slot 1)
CMRP FMRP Kernel IOS
Restart
H/W initialization
Initialize EOBC
BRKSPG-2402
Cisco Public
73
Active RP
Standby RP
Non-HA-Aware Application
Config L2TP
CF
IPC Message Queues
CF
I P C
I P C
CEF
IPC Message Queues
RF
RF
FIB
IDB
FIB
IDB
Line Card
BRKSPG-2402
Cisco Public
74
ASR 1000
FR, PPP, MLPPP, HDLC, 802.1Q, BFD (BGP, IS-IS, OSPF) RP, HSRP, IPv6 NDP, uRPF, SNMP, GLBP, VRRP, NSR (MP-iBGP, eBGP), ISSU, GRE, IPv4 Multicast (IGMP), IPv6 Multicast (MLD, PIM-SSM, MLD Access group), MoFRR MPLS L3VPN, MPLS LDP , VRF-aware BFD, Roadmap: NSR LDP, T-LDP PPPoE, L2TP (LAC, LNS), DHCPv4/v6, AAA, session state (virtual templates), ISG, ANCP, LI SSO, Stateful Inter-chassis redundancy for FW / NAT SSO
ASR 9000
BFD (OSPF, BGP, IS-IS, Static) NSF (ISIS, OSPF, BGP), NSR (ISIS, OSPFv2, OSPFv3, BGP) NSF Multicast, BFD for PIM, MoFRR NSF (LDP, T-LDP, RSVP-TE) NSR (LDP), BFD for MPLS FRR, VRF-aware BFD PPPoE (including nV) Roadmap Roadmap
BRKSPG-2402
Cisco Public
75
P1
PE1
CE1
Routers to maintain routing state and forwarding state when communication between them is lost
Routing sessions are maintained between processors on a failure, allowing routing sessions to stay up with Peer
Copy of FIB maintained on secondary and used on failure for continuously traffic flow
No need for neighboring routers to be NSF aware or capable. Can give high reliability without upgrading CE.
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
76
BGP NSR
Implemented by hardening code for
BGP RIB checkpointing BGP TCP interaction
Only supported for IPv4 unicast, VPNv4 unicast Address families in Cisco IOS Configuration
router bgp <asn> address-family ipv4 vrf RED neighbor x.x.x.x ha-mode sso
BRKSPG-2402
Cisco Public
77
OSPFv2 NSR
Provides the ability to perform hitless RP switchovers when OSPF is used as the routing protocol (Expect zero-traffic loss across such HA events)
Activated on a per-process basis (for both ipv4 or ipv4 VRF for PE-CE sessions) Depends on the forwarding planes ability to retain state across control plane restarts and RP switchovers Alleviates dependency on OSPFv2 protocol extensions (NSF)
Neighboring routers are unaware that a router is NSR-capable Neighboring routers are unaware that a router has gone through an RP switchover
Cisco Public
78
LI
Some Services can be made highly-available using Intra-chassis redundancy (e.g. IPSec, Firewall, NAT, PPPoX, L2TP) Stateless inter-chassis redundancy available for BNG Stateful Inter-chassis redundancy available for NAT, Firewall and SBC on the Cisco ASR 1000
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
80
Chassis Redundancy
NSF/NSR
Multihoming
Link Detection
IP Event Dampening BFD
CPE
Mobile Business
VRF Blue
Access
Edge
Core
Edge
Access
CPE
Mobile Business
VRF Blue
Business
VRF Red
Business
VRF Red
Site A Business
VRF Green
Site C Business
VRF Green
Business
VRF Orange
Business
VRF Orange
BRKSPG-2402
Site D
Cisco Public
81
Access
Edge
Core
Edge
Access
CPE
Mobile Business
VRF Blue
Business
VRF Red
Business
VRF Red
Site A Business
VRF Green
Site C Business
VRF Green
Business
VRF Orange
Business
VRF Orange
Site B
Site D
CPE
BFD for PE-CE Link Detection NSF/NSR for Chassis HA PE Multihoming
Intra-Site PE for PE Diversity Inter-Site for SP Facility Diversity
Access
Circuit Diversity - Physical Diversity for Multihomed CPE
Physical Circuit Diversity is Not the Default Must be Requested from the SP
Edge
BFD for PE-CPE / PE-P Link Detection NSF/NSR for Chassis HA IP Event Dampening for PE-CPE IP-FRR for PE-P
For Cost Effective PE-P Bandwidth
Cisco Public
82
BRKSPG-2207
Detection Mechanisms
IGP Convergence for Remote PE Failure LDP Signaling for PE-CE Failure LDP Timeout for Remote PE Software Failure
Standby PW
P P
PE2 CE
PE1
P P
CE PE23
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Active PW
Cisco Public
83
RPact
RPsby
IOSact
IOSsby
Config
CF
I P C
RF
I P C
CF
MLD CEF
RF
Mcast
After failure
RPsby sends out PIM hellos to all neighbors PIM neighbors re-send PIM state Newly active RP re-builds the PIM state IGP reconverges to assure uRPF check MFIB and ESP updates proceed to incorporate refreshed PIM state Forwarding of multicast packets is NOT disrupted
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
FMRP
MFIB MFIB
FMRP
ESPact
FMESP QFP Client
ESPsby
FMESP QFP Client
SPAs
ESPact continues to forward multicast traffic based on its version of the MFIB
Cisco Public
84
Data packets are received from the primary and secondary paths The redundant packets are discarded at topology merge points due to RPF checks Failure:
Interface chance on where packets are accepted Backup path interfaces become active
POP1 POPN
BRKSPG-2402
Cisco Public
85
BRKSPG-2402
Cisco Public
86
RPact ESP
L2TP Tunnel
LNS
Cisco Public
87
STB 1
PADI PADO
2 PADR 3 PADS 4
PPP Smart Server Selection allows user to configure specific PADO delay for a received PADI packet
Can be configured per bba-group or based on circuit-id/remote-id
In case of an outage of a BNG in the cluster, other BNG stand ready to accept subscriber sessions
Detection of failure possible at both ends of PPPoE session because of missing keepalives
Subscriber sessions have to be re-established
Cisco Public
88
Any other failures may result in recovery times O(hours) Inter-chassis redundancy provides additional resilience against
Interface Failures System failures Site failures (allowing for geographic redundancy)
RP RP FP FP SIP SIP
SIP SIP
BRKSPG-2402
Cisco Public
90
Cisco Public
91
Forwarding-plane active-active E.g. can have one set of firewall services resilient, and other set of firewall services non-resilient
Failover
RPsby act
Fabric LC LC
RPact
FW
RPact
FW
ESP SIP
ESP
LC
LC
SIP
SIP
SIP
BRKSPG-2402
Cisco Public
92
Building-blocks required to achieve stateful interchassis redundancy are for ASR 1006 / 1013: 1. Redundant Hardware components
RP
RIB RT NAT
RP
ESP
SIP SIP
ESP SIP
2. Forwarding / Application State Tables 3. Control mechanism to synchronize between active-standby components
Who is active / who is standby? Initiate failover in case of failure
Active Enhanced Serdes Link ESI (internal dataplane) Standby Enhanced Serdes Link ESI (internal dataplane)
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
93
nV Edge Overview
Control Plane EOBC Extension (L1 or L2 connection) One or two 10G/1G from each RSP
External EOBC link fail wont cause RP failover as long as it has alternative EOBC link
Active RSP
Secondary RSP
1 Standby RSP
Secondary RSP
Internal EOBC
LC
LC
LC
LC
LC
LC
LC
LC
Inter-chassis data link (L1 connection) 10G or 100 G bundle (up to 32 ports)
Control plane EOBC extension is through special 1G or 10G EOBC ports on the RSP. External EOBC could be over dedicated L1 link, or over port-mode L2 connection
Data plane extension is through regular LC ports (it can even mix regular data ports and inter-chassis data plane ports on the same LC) Doesnt require dedicated fabric chassis flexible co-located or different location deployment, lower cost
For redundancy purpose, minimal two control plane and two data plane links are required
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
94
BRKSPG-2402
Cisco Public
95
Multi-chassis LAG
mLACP uses ICCP to synchronize LACP configuration & operational state between PoAs, to provide DHD the perception of being connected to a single switch All PoAs use the same System MAC Address & System Priority when communicating with DHD
Configurable or automatically synchronized via ICCP
Every PoA in the RG is configured with a unique Node ID (value 0 to 7). Node ID + 8 forms the most significant nibble of the Port Number For a given bundle, all links on the same PoA must have the same Port Priority
Port #: 0x9001, Port Priority 1 PoA1 Node ID: 1
DHD
ICCP
Node ID: 2
LACP
BRKSPG-2402
Cisco Public
96
ICCP allows two or more devices to form a Redundancy Group ICCP provides a control channel for synchronizing state between devices ICCP uses TCP/IP as the underlying transport
ICCP rides on targeted LDP session, but MPLS need not be enabled
ICCP over Dedicated Link
RG
Cisco Public
97
Only one Active RSP, Only one standby RSP at a given time, which are located on two different chassis
SSO/NSF/NSR works exactly the same way as two RSPs on the same chassis
Virtual Chassis is always on as long as there is one chassis and one RSP alive
* Practically, recommend maximum 10msec latency between two chassis
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
98
LC
LC
LC
LC
LC
LC
LC
LC
BRKSPG-2402
Cisco Public
99
Data Forwarding
InterChassis LC
Data Plane
Encapsula tion 3
Decapsulat ion
InterChassis LC
Data Plane
3 Encapsula tion P1 P2
Chassis 0
1 2
Ingress Forwarding Lookup L2/L3/Mcast regular lookup
Decapsulat ion
Chassis 1
3
Inter-Chassis Encapsulation
Inter-Chassis Decapsulation
Cisco Public
LOOKUP
Introduction to RG-Infra
RG Infra is the IOS Redundancy Group Infrastructure to enable the synchronization of application state data between different physical systems
Does the job of RF/CF between chassis
Assumptions
Application state has to be supported by RG infra (ASR 1000 currently supports NAT, Firewall, SBC)
Connectivity redundancy solved at the architectural level (need to externalize the redundant ESI links of the intra-chassis redundancy solution)
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
101
RG state data
RPact ESP
RG
SIP
SPA
BRKSPG-2402
SIP
SPA SPA SPA
SIP
SPA
SIP
SPA SPA
102
SPA
Cisco Public
Peer Management
Maintain information about peers
Fault Handling
Changing priorities of RG (may affect RG state) Fault event dampening Logging Integration with Enhanced Object tracking / BFD
RG State control
Init, Active, Standby, disabled Communicating state changes to other software entities in the system (e.g. QFP software)
Synchronization management
Synchronization state tracking (standby has to request bulkupdates from active) Determines when synchronization is started (e.g. ensures transport is available)
Transport Connectivity
Knows via which interface application state is synchronized Can be different for application state data and RG control messages
BRKSPG-2402
Cisco Public
103
Function / Method Hardware redundancy Redundant connectivity Redundancy control State synchronization Failure detection mechanism
Stateful Intra-chassis ESP, RP Internal ESI links RF/CF IPC over EOBC Interrupts
Stateful Inter-chassis ESP, RP, Interfaces Redundant links to neighbor nodes RG External GEC BFD, Hellos
Failover mechanism
Chassis Manager
BRKSPG-2402
Cisco Public
104
Active-Standby All application traffic associated with a SINGLE RG instance Failures would switch all traffic over to the standby chassis Active-Active
Multiple RG instances configured per system Subset of traffic associated with a particular RG instance
RG1act RG3sby RG1sby RG2act RG3act RG2sby RGact RGsby
2+1 Active-Standby
2 or more chassis loadshare application traffic, backed up by a
RG2act
RG1sby RG2sby
different chassis
Single failure only affects subset of overall application traffic
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
105
Case Studies
Case Study: Highly Available IP Architecture for Mobile One Second Convergence Requirement
MPLS VPN
EvDO/LTE VRF
CSR
QFP
Agg1
LTE Core
VRF EvDO/LTE
MME SGW
1xRTT VRF
VRF 1xRTT
MSC
EvDO/LTE VRF
CSR
RNC
MSPP
1xRTT VRF
QFP
Agg2 PE Cellsite
Local VLANs or T1s
Internet Core
EoMPLS Backhaul
MTSO
OSPF/RIP/VRRP
PE
L2 Domain
L3 Domain
L3 Domain
FE T1
QFP
GE
10 GE
Service Termination
107
BRKSPG-2402
Cisco Public
CSR
VRF VRF QFP
Agg1
LTE Core VRF
VRF EvDO/LTE
MME SGW
1xRTT VRF
VRF 1xRTT
MSC
EvDO/LTE VRF
CSR
RNC
MSPP
1xRTT VRF
QFP
VRF
PE PE
Agg2
Internet Core
Static routes for cellsite reachability BGP PIC Edge for Layer-3 convergence VRRP for MTSO
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
108
Case Study: Highly Available IP Architecture for Mobile Steady-State Traffic Flows
MPLS VPN
EvDO/LTE VRF 1xRTT EVDO
CSR
Agg1
LTE Core
VRF EvDO/LTE
1xRTT VRF
QFP
MME SGW
VRF 1xRTT
MSC
EvDO/LTE VRF
CSR
MSPP MSPP
QFP
1xRTT VRF
CDMA Core
PE Agg2 PE
Internet Core
Steady state: CSR distributes flows across both Aggs using ECMP. Traffic could flow across Agg inter switch links. Each Agg handles traffic related to all services from the cell-site.
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
109
EvDO/LTE VRF
CSR
MPLS VPN QFP
Agg1
MME SGW
1xRTT VRF
x
CSR
MSPP MSPP
QFP
VRF 1xRTT
MSC RNC
EvDO/LTE VRF
1xRTT VRF
CDMA Core
PE Agg2 PE
Internet Core
Steady state: Traffic flows distributed across both Agg. Failure: GE link from MSPP to Agg1 fails. Action: BFD session to Agg1 times out at CSR. Agg1 next hop removed from forwarding table. Traffic flows resume across existing path to Agg2. Results: Traffic flows to Agg1 via Agg2.
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
110
Case Study: Highly Available IP Architecture for Mobile Aggregation Switch Failure
EvDO/LTE VRF
CSR
QFP
MPLS VPN
Agg1
1xRTT VRF
LTE Core
VRF EvDO/LTE
MME SGW
VRF 1xRTT
MSC
EvDO/LTE VRF
CSR
RNC
MSPP
1xRTT VRF
QFP
PE
Agg2
Internet Core
PE
Steady state: Traffic flows distributed across Agg. Failure: Agg1 power outage. Action: BFD and VRRP sessions time out BGP and OSPF neighbors drop due to BFD BGP PIC Edge ensures sub-second convergence Traffic flows resume across existing path thru Agg2. Results: Traffic flows via Agg2 to end hosts.
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
111
x
QFP QFP
CSR
MPLS VPN
Agg1
LTE Core
VRF EvDO/LTE
MME SGW
VRF 1xRTT
MSC
EvDO/LTE VRF
CSR
RNC
MSPP
1xRTT VRF
PE Agg2 PE
Internet Core
Steady state: Traffic flows distributed across CSR. Failure: CSR power outage. Action: BFD sessions time out BGP neighbors drop due to BFD Mobile handsets resync to neighboring cell site Results: Mobile handset voice connectivity is maintained.
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
112
Summary
Summary
Motivation for High Availability in SP Aggregation Networks
BRKSPG-2402
Cisco Public
114
Key Takeaways
High-Availability becoming increasingly deployed in Aggregation Networks
Motivated by experiences with MPLS Core Networks
Many high-availability techniques deployed in the core are now applied in MPLS aggregation networks
MPLS TE FRR, BFD, EOAM, Pseudowire Redundancy
Stateful Inter-chassis redundancy increasingly being considered to provide geographic redundancy for applications
BRKSPG-2402
Cisco Public
115
Call to Action
Get hands-on experience attending one of the Walk-in Labs Schedule face to face meeting with one of Ciscos engineers at the Meet the Engineer center Discuss your projects challenges at the Technical Solutions Clinics
BRKSPG-2402
Cisco Public
116
BRKSPG-2402
Cisco Public
117
Recommended Reading
N. Stringfield et. Al, Cisco Express Forwarding,
ISBN-13: 978-1-58705-236-1
T. Szigeti, C. Hattingh, End-to-End QoS Network Design: Quality of Service in LANs, WANs, and VPNs:,
ISBN-13: 978-1-58705-176-0
BRKSPG-2402
Cisco Public
118
Whitepapers on CCO
Cisco IOS High Availability
http://www.cisco.com/en/US/tech/tk869/tk769/tech_white_papers_list.html
http://www.cisco.com/en/US/products/ps6550/prod_white_papers_list.html
ASR 9000
Cisco ASR 9000 Series High Availability: Continuous Network Operations Introduction to Cisco ASR 9000 Series Network Virtualization Technology Distributed Virtual Data Center for Enterprise and Service Provider Cloud
ASR 1000
Cisco ASR 1000 Series Aggregation Services Routers Cisco ASR 1000 Series: ISSU Deployment Guide and Case Study Cisco Unified Border Element (SP Edition) on Cisco ASR 1000 Series Cisco Unified WAN Services: Services, Security, Resiliency, and Intelligence
BRKSPG-2402 2013 Cisco and/or its affiliates. All rights reserved.
Cisco Public
119