Internet DRAFT - draft-balaji-panet-dc-label-semantic-for-pwr

draft-balaji-panet-dc-label-semantic-for-pwr



 



INTERNET-DRAFT                                Balaji Venkat Venkataswami
Intended Status: Experimental RFC                       Bhargav Bhikkaji
Expires: August 2013                                                DELL
                                                           Shankar Raman
                                                              IIT Madras
                                                       February 17, 2013


  Load-balancing to Data Centers in a L3VPN environment based on Power
            draft-balaji-panet-dc-label-semantic-for-pwr-00


Abstract

   Data Centers may be spread across different locations for a
   particular enterprise. Different locations may mean within the same
   country but across different geographical locations, or outside the
   country even in a different continent. These data centers may be
   serving the enterprise or multiple enterprises / tenants wherein the
   regular enterprise site may request data from a data center site
   which could be one of the data center sites proximal to the
   enterprise site. Proximity is usually calculated based on a metric
   that is bandwidth driven or in terms with regard to the number of
   hops to reach that data center site hence bringing into play delay
   characteristics. Assume a topology where the data center sites and
   the enterprise sites are MPLS based L3VPN sites that are being
   provided connectivity through a Service Provider deploying Layer 3
   VPNs. Given such a topology it is possible that replication of data
   happens across the data centers in a timely manner to keep the data
   active and refreshed across all data center sites. Suitable
   mechanisms for such replication will come into play for this purpose.
   Thus any of the data centers can cater to the request from a user
   site.

   It is possible that power consumption in each data center may vary
   according to the load on each data center. It would be prudent to
   introduce a scheme where the power metric coupled with other metrics
   such as bandwidth and delay be used by a Provider Edge router in a
   L3VPN scenario to direct the packets or requests from regular user
   sites to such data centers with the least such metric. This is in
   line with the follow-the-moon strategy of directing requests for data
   and compute to data centers which are power-wise more efficient
   during the night or during the day. This draft document lays out one
   such proposal.


Status of this Memo

 


Balaji Venkat et.al       Expires August 2013                   [Page 1]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


   This Internet-Draft is submitted to IETF in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF), its areas, and its working groups.  Note that
   other groups may also distribute working documents as
   Internet-Drafts.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   The list of current Internet-Drafts can be accessed at
   http://www.ietf.org/1id-abstracts.html

   The list of Internet-Draft Shadow Directories can be accessed at
   http://www.ietf.org/shadow.html


Copyright and License Notice

   Copyright (c) 2013 IETF Trust and the persons identified as the
   document authors. All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents
   (http://trustee.ietf.org/license-info) in effect on the date of
   publication of this document. Please review these documents
   carefully, as they describe your rights and restrictions with respect
   to this document. Code Components extracted from this document must
   include Simplified BSD License text as described in Section 4.e of
   the Trust Legal Provisions and are provided without warranty as
   described in the Simplified BSD License.



Table of Contents

   1  Introduction  . . . . . . . . . . . . . . . . . . . . . . . . .  4
     1.1  Terminology . . . . . . . . . . . . . . . . . . . . . . . .  4
     2.0 Reference Topology . . . . . . . . . . . . . . . . . . . . .  4
     2.1 Methodology of the new scheme  . . . . . . . . . . . . . . .  5
     2.2 Calculation of the Compound Power Metric . . . . . . . . . .  8
       2.2.1 Power metric calculation for the DC as a whole and per
             tenant . . . . . . . . . . . . . . . . . . . . . . . . .  9
     2.3 Extending the labeling scheme to the CE  . . . . . . . . . .  9
     2.4 Extensions to MP-iBGP for this scheme. . . . . . . . . . . .  9
 


Balaji Venkat et.al       Expires August 2013                   [Page 2]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


   3  Security Considerations . . . . . . . . . . . . . . . . . . . . 10
   4  IANA Considerations . . . . . . . . . . . . . . . . . . . . . . 10
   5  References  . . . . . . . . . . . . . . . . . . . . . . . . . . 10
     5.1  Normative References  . . . . . . . . . . . . . . . . . . . 10
     5.2  Informative References  . . . . . . . . . . . . . . . . . . 10
   APPENDIX - A : References for power saving related material  . . . 10
   Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . . 12









































 


Balaji Venkat et.al       Expires August 2013                   [Page 3]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


1  Introduction

   Data Centers may be spread across different locations for a
   particular enterprise. Different locations may mean within the same
   country but across different geographical locations, or outside the
   country even in a different continent. These data centers may be
   serving the enterprise or multiple enterprises / tenants wherein the
   regular enterprise site may request data from a data center site
   which could be one of the data center sites proximal to the
   enterprise site. Proximity is usually calculated based on a metric
   that is bandwidth driven or in terms with regard to the number of
   hops to reach that data center site hence bringing into play delay
   characteristics. Assume a topology where the data center sites and
   the enterprise sites are MPLS based L3VPN sites that are being
   provided connectivity through a Service Provider deploying Layer 3
   VPNs. Given such a topology it is possible that replication of data
   happens across the data centers in a timely manner to keep the data
   active and refreshed across all data center sites. Suitable
   mechanisms for such replication will come into play for this purpose.
   Thus any such data center site can cater to a request from a user
   site.

   It is possible that power consumption in each data center may vary
   according to the load on each data center. It would be prudent to
   introduce a scheme where the power metric coupled with other metrics
   such as bandwidth and delay be used by a Provider Edge router in a
   L3VPN scenario to direct the packets or requests from regular user
   sites to such data centers with the least such metric. This is in
   line with the follow-the-moon strategy of directing requests for data
   and compute to data centers which are power-wise more efficient
   during the night or during the day. This draft document lays out one
   such proposal.

1.1  Terminology

   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
   document are to be interpreted as described in RFC 2119 [RFC2119].


2.0 Reference Topology

   Assume the following topology where Data Center 1 and 2 are
   geographically dispersed within the county or across different
   continents and there is sufficient desirable properties of delay and
   bandwidth allocated to each of them. End User Sites depicted below
   the CEs would be enterprise sites that would request data and compute
   from these Data center sites. The DC GWs connecting the Data centers
 


Balaji Venkat et.al       Expires August 2013                   [Page 4]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


   would play the role of CEs for the Data Centers. The PEs (1-5) are
   the Provider Edge routers of the ISP Core network that provide
   regular MPLS based L3VPN services for the inter-connection of the
   Data Center sites with the End User sites.

   During the normal course of events VPN instance labels would be
   exchanged between the PEs and the ISP core would provide LDP or RSVP
   based connectivity amongst these PEs. Thus a stack of labels with
   inner VPN instance label with the outer label being LDP or RSVP would
   be used to direct traffic from End User Sites to the Data Center
   sites and even amongst the End User Sites themselves. It is also
   possible that replication services would run between the Data Center
   sites as well using this mechanism.


                            __________                 ,---------. 
                           /                         ,'           `. 
                          ;Data Center)              (  Data Center ) 
                          (    2      '               `.    1      ,' 
                          +-----------+                 `-+------+' 
                                       \            /
                                     +--+--+   +-+---+ 
                                     |DC GW|   |DC GW| 
                                     +-+---+   +-----+ 
                                         |       |
                                        PE5     PE4
                                        .--. .--. 
                                      (    '    '.--. 
                                    .-.' ISP Core     ' 
                                   (     network      ) 
                                    (             .'-' 
                                     '--'._.'.    )PE3 
                                     PE1     '--'  \ \ 
                                    / /      PE2    \ \ 
                                   / /        |      \ \
                              +---+--+   +------+  +--+----+ 
                              | CE1  |   | CE2  |  |  CE3  | 
                              +-+--`.+   +-+----+  +-+--+--+ 
                            __/_                \           \__ 
                     '--------'                '--------'   '--------' 
                     :End User:                :End User:   :End User: 
                     : Site   :                : Site   :   : Site   : 
                     '--------'                '--------'   '--------' 

   Figure 1 : A Generic Architecture for Multiple Data Centers providing
   Data and Compute services for End user sites.

2.1 Methodology of the new scheme
 


Balaji Venkat et.al       Expires August 2013                   [Page 5]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


   When a site on the enterprise connected through a L3 VPN intends to
   access a data center, load-balancing mechanisms may use the nearest
   data center or that within the nearest country or even in another
   continent depending on the availability and load being serviced.  The
   method proposed in this document would advertise a power consumption
   rating along with a L3VPN service instance label for that Data Center
   site and the enterprise site PE accessing the Data Center may arrive
   at a decision as to which DC it is to access based on the power
   consumption rating so advertised in the MP-iBGP update. Trust level
   between Provider and customer is advised in this case.

   The PE devices would receive different VPN instance labels from each
   of the Data Centers using a MP-iBGP update with the compound power
   based metric in the attribute information in the update. Suitable
   extensions to extended community attributes would be done to
   facilitate the passage of such information. Assume Label 100 is sent
   from PE4 to the enterprise sites with Compound power metric 1200 and
   Label 200 is sent from the PE5 to the enterprise sites with Compound
   power metric 1400 at a certain point in time. 

   When the End User sites request a service from a data center, the PE
   (1-3) which have the labels 100 and 200 with their respective
   compound power metrices 1200 and 1400 respectively will choose which
   Data Center (1 or 2) has the least compound metric and direct the
   services towards that PE connected to the data center with that least
   power metric. 






















 


Balaji Venkat et.al       Expires August 2013                   [Page 6]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


                            __________                 ,---------. 
                           /                         ,'           `. 
                          ;Data Center)              (  Data Center ) 
                          (    2      '               `.    1      ,' 
                          +-----------+                 `-+------+' 
                           pwr = 1400  \            /     pwr = 1200
                                     +--+--+   +-+---+ 
                                     |DC GW|   |DC GW| 
                                     +-+---+   +-----+ 
                                         |       |
                    (200, pwr = 1400)   PE5     PE4 (100, pwr = 1200)
                                        .--. .--. 
                                      (    '    '.--. 
                                    .-.' ISP Core     ' 
                                   (     network      ) 
                                    (             .'-' 
                                     '--'._.'.    )PE3 
                                     PE1     '--'  \ \ 
                                    / /      PE2    \ \ 
                                   / /        |      \ \
                              +---+--+   +---+--+  +--+----+ 
                              | CE1  |   | CE2  |  |  CE3  | 
                              +-+--`.+   +-+----+  +-+--+--+ 
                            __/_                \           \__ 
                     '--------'                '--------'   '--------' 
                     :End User:                :End User:   :End User: 
                     : Site   :                : Site   :   : Site   : 
                     '--------'                '--------'   '--------' 

   Figure 2: Control plane exchanges between PE5, PE4 and PEs (1-3)


   In this case Data Center 1 has the least Compound power metric 1200
   and hence the traffic from say CE1 would be sent to Data Center 1.
   There is a periodic calculation of the compound power metric in each
   of the Data Center sites and this is exchanged with the PE through
   eBGP (say for example) between the respective CEs and the PEs. The
   VPN instance label used would be 100 to get to the Data Center 1 and
   the reachable PE would be set to PE4.









 


Balaji Venkat et.al       Expires August 2013                   [Page 7]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


                            __________                 ,---------. 
                           /                         ,'           `. 
                          ;Data Center)              (  Data Center ) 
                          (    2      '               `.    1      ,' 
                          +-----------+                 `-+------+' 
                           pwr = 1400  \            /     pwr = 1200
                                     +--+--+   +-+---+ 
                                     |DC GW|   |DC GW| 
                                     +-+---+   +-----+ 
                                         |       |
                                        PE5     PE4 
                                        .--. .--. 
                                      (    '    '.--. 
                                    .-.' ISP Core     ' 
                                   (     network      ) 
                                    (             .'-' 
                                     '--'._.'.    )PE3 
                    (LDP Label, 100) PE1     '--'  \ \ 
                                    / /      PE2    \ \ 
                                   / /        |      \ \
                              +---+--+   +---+--+  +--+----+ 
                              | CE1  |   | CE2  |  |  CE3  | 
                              +-+--`.+   +-+----+  +-+--+--+ 
                            __/_                \           \__ 
                     '--------'                '--------'   '--------' 
                     :End User:                :End User:   :End User: 
                     : Site   :                : Site   :   : Site   : 
                     '--------'                '--------'   '--------' 

   Figure 3: Data Plane path from say CE1 to PE4 based on Power Metric
   for DC.

   Here the power metric is not discrete but in intervals of thresholds.
   Only the threshold interval is thus exchanged between the CEs and the
   PEs. This way dampening frequent fluctuations or oscillations within
   a given power metric interval is taken care of.

   Inter-DC connectivity and replication may also benefit from this
   scheme. DC A would choose to replicate with DC B at a time when the
   power consumption rating in both sites or in DC B is the lowest.

2.2 Calculation of the Compound Power Metric

   Factors such as compute load, cooling power consumption and network
   bandwidth within the Data Center would be used to compute the
   compound power metric within a Data Center to be advertised to the
   PEs within the Layer 3 VPN. The actual calculation is out of scope of
   this document and there exists sufficient literature to suggest a
 


Balaji Venkat et.al       Expires August 2013                   [Page 8]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


   suitable method for such calculation. For now, this draft proposes a
   scheme for exchanging such power metrices and using them for load-
   balancing within a L3VPN scenario.

2.2.1 Power metric calculation for the DC as a whole and per tenant

   Many schemes exist for calculating the power consumed per tenant
   based on the occupancy of each tenant in a DC and for calculating the
   power consumed by the DC as a whole. Appropriate labels can be
   exchanged per tenant with the respective power metrices factored in
   per tenant if necessary. The specific end user site that is using the
   data and compute power of a data center may belong to a particular
   tenant and may wish to direct its traffic to the data center based on
   the power consumed for that specific tenant Identifier and hence use
   the appropriate label for the same in the PE. If the Data Center
   cater to a single tenant and are owned by the tenant then the overall
   power consumed by the DC will be used in the MP-iBGP update.

2.3 Extending the labeling scheme to the CE

   It is possible to extend the label imposition from the CE itself
   towards the ISP core in order to ensure that the CE can be made aware
   of such a scheme being available and use appropriate labels to
   indicate to the PEs that the CE requires power metric based load-
   balancing. Two different labels could be used by the CEs one for
   conventional methods of requesting services and the other the power
   metric based method where the PEs would consult the power metrices
   available and direct the request towards the low power consuming data
   center.

   If trust levels are not to be adhered to the label may be propogated
   along with the power consumption ratio to the CE and the CE would
   make the appropriate decision.

2.4 Extensions to MP-iBGP for this scheme.

   A future version of the draft will outline the actual extensions to
   the BGP protocol and its attributes with regard to how the compound
   power metric is carried in the actual BGP exchanges.









 


Balaji Venkat et.al       Expires August 2013                   [Page 9]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


3  Security Considerations

   Trust levels between the Provider Edge and the customer edge should
   be proper in order that the Data Center's power metrices are
   exchanged between the PE and CE. eBGP as a PE-CE protocol could be
   adhered to for this purpose. Appropriate security mechanisms would
   have to be taken into account if the Data Center is serving multiple
   tenants. The computed Compound Power Metric may be calculated for
   each tenant and mechanisms should be adopted that one tenant's
   compound metric is not shared with other tenants. Appropriate label
   exchanges with each tenant's Label information and corresponding
   power metrices should be done with such separation in mind. If there
   is a collated power metric for all the tenants put together then the
   PE device should make sure that other Data Center provider's
   information is held separately in it's tables.


4  IANA Considerations

   Suitable IANA considerations for extending the BGP extended community
   attribute for accommodating the power metric information in the MP-
   iBGP update are to be taken into account. This will be made more
   clear in subsequent versions of the document.


5  References

5.1  Normative References

   Please see Appendix A.


5.2  Informative References

              Please see Appendix A.

APPENDIX - A : References for power saving related material


              M. Zhang, J. Dong, B. Zhang, "Use Cases for Power-Aware
              Networks", draft-zhang-panet-use-cases (work in progress)

              B. Nordman, K. Christensen, "Nanogrids", draft-nordman-
              nanogrids-00 (work in progress)

              T. Suzuki, T. Tarui, "Requirements for an Energy-Efficient
              Network System", draft-suzuki-eens-requirements (work in
              progress)
 


Balaji Venkat et.al       Expires August 2013                  [Page 10]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


              Z. Cao, "Synchronization Layer: an Implementation Method
              for Energy Efficient Sensor Stack", draft-cao-lwig-syn-
              layer (work in progress)

              A. Junior, R. Sofia, "Energy-awareness metrics global
              applicability guideline", draft-ajunior-energy-awareness-
              00 (work in progress)

              B. Zhang, J. Shi, M. Zhang, J. Dong, "Power-aware Routing
              and Traffic Engineering: Requirements, Approaches, and
              Issues", draft-zhang- greennet (work in progress)

              T. Suganuma, N. Nakamura, S. Izumi, H. Tsunoda, M.
              Matsuda, K. Ohta, "Green Usage Monitoring Information
              Base", draft-suganuma-greenmib (work in progress)

              S. Raman, B. V. Venkataswami, G. Raina, V. Srini, "Power
              Based Topologies and TE-Shortest Power Paths in OSPF",
              draft-mjsraman- rtgwg-ospf-power-topo-01 (work in
              progress)

              S. Raman, B. V. Venkataswami, G. Raina, V. Srini,
              "Building power optimal Multicast Trees", draft-mjsraman-
              rtgwg-pim-power-02 (work in progress)

              S. Raman, B. V. Venkataswami, G. Raina, "Reducing Power
              Consumption using BGP", draft-mjsraman-rtgwg-inter-as-psp-
              03 (work in progress)

              S. Raman, B. V. Venkataswami, G. Raina, "Building power
              shortest inter-Area TE LSPs using pre-computed paths",
              draft-mjsraman-rtgwg- intra-as-psp-te-leak-02 (work in
              progress)

              S. Raman, B. V. Venkataswami, G. Raina, V. Srini,
              "Reducing Power Consumption using BGP path selection",
              draft-mjsraman-rtgwg-bgp- power-path-02 (work in progress)











 


Balaji Venkat et.al       Expires August 2013                  [Page 11]

INTERNET DRAFT     Load-balancing in L3VPN based DCs   February 17, 2013


Authors' Addresses



    Balaji Venkat Venkataswami
    DELL
    Plot #1 SIDCO Estate
    Olympia Tech Park
    Guindy
    Chennai
    India

    EMail: balaji_venkat_venkat@dell.com



    Bhargav Bhikkaji
    DELL
    350 Holger Way
    San Jose, CA
    U.S.A

    Email: Bhargav_Bhikkaji@dell.com



    Shankar Raman
    Department of Computer Science and Engineering
    IIT Madras
    Chennai - 600036
    TamilNadu
    India

    EMail: mjsraman@cse.iitm.ac.in

















Balaji Venkat et.al       Expires August 2013                  [Page 12]