Internet DRAFT - draft-zhangwu-bier-nvo3-analysis

draft-zhangwu-bier-nvo3-analysis







BIER WG                                                         Z. Zhang
Internet-Draft                                                     B. Wu
Intended status: Informational                           ZTE Corporation
Expires: November 25, 2017                                  May 24, 2017


                    Analysis of BIER in NVO3 network
                  draft-zhangwu-bier-nvo3-analysis-00

Abstract

   This document analyses BIER deployment in NVO3 network.  The intent
   is to evaluate whether BIER could achieve some simplicity and
   efficiency.

Status of This Memo

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF).  Note that other groups may also distribute
   working documents as Internet-Drafts.  The list of current Internet-
   Drafts is at http://datatracker.ietf.org/drafts/current/.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   This Internet-Draft will expire on November 25, 2017.

Copyright Notice

   Copyright (c) 2017 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (http://trustee.ietf.org/license-info) in effect on the date of
   publication of this document.  Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document.  Code Components extracted from this document must
   include Simplified BSD License text as described in Section 4.e of
   the Trust Legal Provisions and are provided without warranty as
   described in the Simplified BSD License.





Zhang & Wu              Expires November 25, 2017               [Page 1]

Internet-Draft                  BIER NVO3                       May 2017


Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
   2.  BIER in NVO3 network  . . . . . . . . . . . . . . . . . . . .   2
     2.1.  Leaf devices as the edge BIER nodes . . . . . . . . . . .   4
     2.2.  NVE as edge BIER nodes  . . . . . . . . . . . . . . . . .   7
   3.  Other Consideration . . . . . . . . . . . . . . . . . . . . .   8
   4.  Normative References  . . . . . . . . . . . . . . . . . . . .   9
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  10

1.  Introduction

   As mentioned in [I-D.ietf-nvo3-mcast-framework], there is multicast
   requirement in NVO3 network, such as BUM packets for infrastructure
   multicast and other application-specific multicast.  Besides PIM,
   BIER is mentioned to be used as one of IP multicast underlay
   technology in section 3, which introduces several multicast
   mechanisms for NVO3 network.

   Bit Index Explicit Replication (BIER) [I-D.ietf-bier-architecture] is
   a new architecture for the forwarding of multicast data packets.  It
   provides optimal forwarding of multicast packets through a "multicast
   domain".  It does not require a protocol for explicitly building
   multicast distribution trees, nor does it require intermediate
   devices to maintain any per-flow state.  When a multicast data packet
   enters the BIER domain, the BIER ingress router determines the set of
   BIER egress routers to which the packet needs to be sent.  The BIER
   ingress router then encapsulates the packet in a BIER header.  The
   BIER header contains a bitstring in which each bit represents exactly
   one BIER egress router in the domain; to forward the packet to a
   given set of egress routers, the bits corresponding to those routers
   are set in the BIER header.  In this way, elimination of the per-flow
   state and the explicit tree-building protocols results in a
   considerable simplification.

   This document will give some basic analysis on how to deploy BIER in
   the NVO3 network to mitigate multicast states.

2.  BIER in NVO3 network












Zhang & Wu              Expires November 25, 2017               [Page 2]

Internet-Draft                  BIER NVO3                       May 2017


       +--------+                                             +--------+
       | Tenant +--+                                     +----| Tenant |
       | System |  |                                    (')   | System |
       +--------+  |          ................         (   )  +--------+
                   |  +-+--+  .              .  +--+-+  (_)
                   |  | NVE|--.              .--| NVE|   |
                   +--|    |  .              .  |    |---+
                      +-+--+  .              .   +--+-+
                      /       .              .
                     /        .  L3 Overlay  .   +--+-++--------+
       +--------+   /         .    Network   .   | NVE|| Tenant |
       | Tenant +--+          .              .- -|    || System |
       | System |             .              .   +--+-++--------+
       +--------+             ................
                                     |
                                   +----+
                                   | NVE|
                                   |    |
                                   +----+
                                     |
                                     |
                           =====================
                             |               |
                         +--------+      +--------+
                         | Tenant |      | Tenant |
                         | System |      | System |
                         +--------+      +--------+
                  Figure 1: NVO3 Generic Reference Model

   As described in [RFC8014], figure 1 is a generic reference model of
   NVO3.  BIER can be used in this model to provide underlay multicast
   function.



















Zhang & Wu              Expires November 25, 2017               [Page 3]

Internet-Draft                  BIER NVO3                       May 2017


                  ...........................................
                  .                                 ------- .   -------
                  .            ................     | Leaf| .---| NVE |
                  .-------     .              .-----|     | .   |     |
        -------   .| Leaf|     .              .     ------- .   -------
        | NVE |---.|     |-----.              .             .
        |     |   .-------     .    Spine     .             .
        -------   .            .              .             .
                  .            .              .             .
                  .            .              .     ------- .
                  .            .              .-----| Leaf| .   -------
                  .            ................     |     | .---| NVE |
                  .                   |             ------- .   |     |
                  .                   |                     .   -------
                  .                 -------                 .
                  .                 | Leaf|                 .
                  .                 |     |                 .
                  .                 -------                 .
                  ...........................................
                                |              |
                                |              |
                             -------        -------
                             | NVE |        | NVE |
                             |     |        |     |
                             -------        -------
              Figure 2: Example topology of NVO3 underlay network

   The architecture of L3 underlay Network can be various.  As depicted
   in figure 2, the Clos topology mentioned in [RFC7938] is taken as an
   example to offer underlay L3 connectivity.  And it is assumed that
   the leaf and spine devices support BIER functionality including
   exchanging BFR-id and other information and building BIER forwarding
   table.

   In this document, two possible ways to handle multicast packets are
   identified.  One of the choices is placing BIER boundaries at leaf
   switches which are a general choice of implementing BIER.  The other
   way is putting the BIER boundary at NVE to achieve better efficiency.

2.1.  Leaf devices as the edge BIER nodes

   Leaf devices are taken as BIER edge node.  Each leaf device is
   allocated one unique BFR-id.  And leaf devices are the ingress BIER
   nodes and egress BIER nodes.  Both the leaf and spine devices
   exchange the BIER information by IGP/BGP extensions.  The according
   extensions are defined in [I-D.ietf-bier-ospf-bier-extensions],
   [I-D.ietf-bier-isis-extensions], and [I-D.ietf-bier-idr-extensions].




Zhang & Wu              Expires November 25, 2017               [Page 4]

Internet-Draft                  BIER NVO3                       May 2017


   IGMP/MLD protocol is used between NVE and leaf device like the
   description in [I-D.ietf-nvo3-mcast-framework].  The BUM flows are
   encapsulated corresponding multicast group address.  BMLD
   [I-D.pfister-bier-mld] protocol is used among all the leaf devices to
   exchange multicast group information.  After BMLD protocol process is
   completed, each leaf devices knows the other leaf devices associated
   with specific multicast group address.  When one packet with
   multicast group address reaches leaf device, leaf device encapsulates
   the packet with BIER header which indicates all the destination leaf
   devices that belong to the same multicast group.

   After the BIER packet reaches the destination leaf devices through
   the spine network forwarding, the destination leaf device removes the
   BIER header of packet and forwards to corresponding NVEs.

   So the multicast state is eliminated because of removing of multicast
   tree in L3 underlay network.  NVE only needs to support IGMP/MLD
   protocol.  But one or several multicast group addresses for a tenant
   is still needed.
































Zhang & Wu              Expires November 25, 2017               [Page 5]

Internet-Draft                  BIER NVO3                       May 2017


                                                             IGMP/MLD
        IGMP/MLD  ...........................................  |
            \     .    ................................     .  |
             \    .    .  BIER forwarding             .     .  |
              \   .    .                            ------- .  |-------
               \  .    .       ................     |Leaf2| .---| NVE2|
                \ .-------     .              .-----|     | .   |     |
        -------   .|Leaf1|     .              .     ------- .   -------
        | NVE1|---.|     |-----.              .       .     .
        |     |   .-------     .    Spine     .       .     .
        -------   .    .       .              .       .     .
                  .    .       .              .       .     .
                  .    .       .              .     ------- .
                  .    .       .              .-----|Leaf3| .   -------
                  .    .       ................     |     | .---| NVE3|
                  .    .              |             ------- . | |     |
                  .    .              |               .     . | -------
                  .    . BMLD       -------           .     . |
                  .    ............ |Leaf4|............     . IGMP/MLD
                  .                 |     |                 .
                  .                 -------                 .
                  ...........................................
                                |   IGMP/MLD   |
                                |              |
                             -------        -------
                             | NVE4|        | NVE5|
                             |     |        |     |
                             -------        -------
                             Figure 3: BIER in VNO3

   For example, as illustrated in Figure 3, NVE1 needs to send BUM flows
   to NVE2, NVE3, NVE4 and NVE5.  The NVE supports the basic IGMP/MLD
   snooping function.  In most of condition, there would have to be a
   separate group for each tenant, plus a separate group for each
   multicast address (used for multicast applications) within a tenant.
   NVE1 sends the packet to Leaf1.  According the BMLD exchange Leaf1
   encapsulate BIER header with the destination of Leaf2, Leaf3 and
   Leaf4.  After BIER forwarding the packet reaches Leaf2, Leaf3 and
   Leaf4, the leaf devices remove the BIER header and send it to NVE2,
   NVE3, NVE4 and NVE5.

   In some use cases, there could be a big amount of leaf switch, and it
   is impossible to encapsulate destination BFR-ids in one same BIER
   header because of the limitation of BitStringLength, the packet
   should be sent more than once to reach destination.  In case the BFR-
   id of Leaf2 is 28, the BFR-id of Leaf3 is 78; the BitStringLength
   used in BIER encapsulation is 64, it is impossible to encapsulate the
   two BFR-ids in one BIER header.  In this situation, one solution is



Zhang & Wu              Expires November 25, 2017               [Page 6]

Internet-Draft                  BIER NVO3                       May 2017


   Leaf1 send two copies of packet to Leaf2 and Leaf3.  SI in BIER
   header defined in [I-D.ietf-bier-architecture] is be used to
   distinguish these two copies to deliver to the final destination.
   The other solution is increase the forwarding BitStringLength to 128.
   From the above analysis, BIER is quite applicable in the scenario
   with limited size of leaf switches.  But in a large scale of NVO3
   underlay network, there is some limitation due to the
   BitStringLength.

2.2.  NVE as edge BIER nodes

   In last section, IGMP/MLD is still needed to run between NVE and leaf
   devices.  And the multicast groups for Tenants and specific multicast
   applications are needed.

   If NVE is the edge BIER node, IGMP/MLD protocol does not need to run
   between NVE and leaf device.  NVE encapsulates the BIER header with
   the BFR-ids of destination NVEs straightly and send it to leaf
   devices.  Leaf and spine devices forward the BIER packet to
   destination NVEs.  Destination NVEs remove the BIER header and
   forwarding according to the inner encapsulation.

   In this situation, leaf device does not need to be allocated BFR-id.
   Every NVE should be allocated with one unique BFR-id.  The BFR-id
   information should be exchanged within the L3 underlay network.  If
   NVE supports the IGP/BGP BIER extension, NVE takes part in the BIER
   information exchange.  The forwarding plane will be established
   easily.  But in some situations NVE does not support IGP or BGP; NVE
   can not take part in the BIER information exchange.  So the BFR-id of
   NVE should be advertised by some other method.

   Besides multicast state elimination in L3 underlay network, IGMP/MLD
   does not need to run between NVE and leaf devices.  And BMLD does not
   need to run among all the leaf devices.  This function makes the BIER
   deployment more simply.
















Zhang & Wu              Expires November 25, 2017               [Page 7]

Internet-Draft                  BIER NVO3                       May 2017


                  ...........................................
                  .    ................................     .
                  .    .  BIER forwarding             .     .
                  .    .                            ------- .   -------
                  .    .       ................     |Leaf2| .---| NVE2|
                  .-------     .              .-----|     | .   |     |
        -------   .|Leaf1|     .              .     ------- .   -------
        | NVE1|---.|     |-----.              .       .     .
        |     |   .-------     .    Spine     .       .     .
        -------   .    .       .              .       .     .
                  .    .       .              .       .     .
                  .    .       .              .     ------- .
                  .    .       .              .-----|Leaf3| .   -------
                  .    .       ................     |     | .---| NVE3|
                  .    .              |             ------- .   |     |
                  .    .              |               .     .   -------
                  .    .            -------           .     .
                  .    ............ |Leaf4|............     .
                  .                 |     |                 .
                  .                 -------                 .
                  ...........................................
                                |              |
                                |              |
                             -------        -------
                             | NVE4|        | NVE5|
                             |     |        |     |
                             -------        -------
                             Figure 4: BIER in VNO3

   The same example for figure 3, NVE1 should send BUM packet to NVE2,
   NVE3, NVE4 and NVE5.  NVE1 encapsulates BIER header with NVE2, NVE3,
   NVE4 and NVE5 as destination and send it to Leaf1.  Leaf1 and spine
   devices forward the packet to NVE2, NVE3, NVE4 and NVE5 according to
   the BIER header.  The NVEs remove the BIER header and forward it.

   The BitStringLength limitation also remains in this solution.  And
   the situation may be more seriously because of the larger number of
   NVEs than leaf devices.  If the BFR-id of NVE is not allocated
   reasonably, in the worst situation, the forwarding efficiency is the
   same with the source replication described in
   [I-D.ietf-nvo3-mcast-framework] section 3.2.

3.  Other Consideration

   As illustrated in [RFC7938], there could be hundred thousand servers
   connected by underlay network in some NVO3 network.  So there are
   more than thousands of NVEs and leaf devices in the network.  Using
   BIER as multicast underlay protocol make significant advantage



Zhang & Wu              Expires November 25, 2017               [Page 8]

Internet-Draft                  BIER NVO3                       May 2017


   because of the elimination of multicast state stored in the underlay
   network.  But the BitStringLength limitation is one problem.

   In order to achieve the optimization of BIER, the BFR-ids allocation
   should be more reasonable.  The BFR-id of NVE/leaf device that is
   belong to one same VN could be allocated adjacent as much as
   possible.  So the encapsulation of BIER header can be more efficient.

   Along with the number of BFR-id increasing for NVE/leaf devices,
   there are thousands BIER forwarding items in the L3 underlay network.
   The forwarding efficiency in the L3 underlay network should also be
   considered.

4.  Normative References

   [I-D.ietf-bier-architecture]
              Wijnands, I., Rosen, E., Dolganow, A., Przygienda, T., and
              S. Aldrin, "Multicast using Bit Index Explicit
              Replication", draft-ietf-bier-architecture-06 (work in
              progress), April 2017.

   [I-D.ietf-bier-idr-extensions]
              Xu, X., Chen, M., Patel, K., Wijnands, I., and T.
              Przygienda, "BGP Extensions for BIER", draft-ietf-bier-
              idr-extensions-02 (work in progress), January 2017.

   [I-D.ietf-bier-isis-extensions]
              Ginsberg, L., Przygienda, T., Aldrin, S., and Z. Zhang,
              "BIER support via ISIS", draft-ietf-bier-isis-
              extensions-04 (work in progress), March 2017.

   [I-D.ietf-bier-ospf-bier-extensions]
              Psenak, P., Kumar, N., Wijnands, I., Dolganow, A.,
              Przygienda, T., Zhang, Z., and S. Aldrin, "OSPF Extensions
              for BIER", draft-ietf-bier-ospf-bier-extensions-05 (work
              in progress), March 2017.

   [I-D.ietf-nvo3-mcast-framework]
              Ghanwani, A., Dunbar, L., McBride, M., Bannai, V., and R.
              Krishnan, "A Framework for Multicast in Network
              Virtualization Overlays", draft-ietf-nvo3-mcast-
              framework-08 (work in progress), May 2017.

   [I-D.pfister-bier-mld]
              Pfister, P., Wijnands, I., Venaas, S., Wang, C., Zhang,
              Z., and M. Stenberg, "BIER Ingress Multicast Flow Overlay
              using Multicast Listener Discovery Protocols", draft-
              pfister-bier-mld-03 (work in progress), March 2017.



Zhang & Wu              Expires November 25, 2017               [Page 9]

Internet-Draft                  BIER NVO3                       May 2017


   [RFC7938]  Lapukhov, P., Premji, A., and J. Mitchell, Ed., "Use of
              BGP for Routing in Large-Scale Data Centers", RFC 7938,
              DOI 10.17487/RFC7938, August 2016,
              <http://www.rfc-editor.org/info/rfc7938>.

   [RFC8014]  Black, D., Hudson, J., Kreeger, L., Lasserre, M., and T.
              Narten, "An Architecture for Data-Center Network
              Virtualization over Layer 3 (NVO3)", RFC 8014,
              DOI 10.17487/RFC8014, December 2016,
              <http://www.rfc-editor.org/info/rfc8014>.

Authors' Addresses

   Zheng(Sandy) Zhang
   ZTE Corporation
   No. 50 Software Ave, Yuhuatai Distinct
   Nanjing
   China

   Email: zhang.zheng@zte.com.cn


   Bo Wu
   ZTE Corporation
   No. 50 Software Ave, Yuhuatai Distinct
   Nanjing
   China

   Email: wu.bo@zte.com.cn






















Zhang & Wu              Expires November 25, 2017              [Page 10]