ST Working Group                                            L. Delgrossi
Internet-Draft                                              October 1994
File: draft-ietf-st2-spec-00.txt                   Expires: January 1995


                Internet Stream Protocol Version 2 (ST2)


                Protocol Specification - Version ST2Plus


Status of this Memo

   This document is an Internet-Draft. Internet-Drafts are working
   documents of the Internet Engineering Task Force (IETF), its Areas,
   and its Working Groups. Note that other groups may also distribute
   working documents as Internet-Drafts.

   Internet-Drafts are draft documents valid for a maximum of six
   months. Internet-Drafts may be updated, replaced, or obsoleted by
   other documents at any time. It is not appropriate to use Internet-
   Drafts as reference material or to cite them other than as "work in
   progress".

   To learn the current status of any Internet-Draft, please check the
   "1id-abstracts.txt" listing contained in the Internet-Drafts Shadow
   Directories on ds.internic.net (US East Coast), nic.nordu.net
   (Europe), ftp.isi.edu (US West Coast), or munnari.oz.au (Pacific
   Rim).


   Abstract:

   This memo contains a revised specification of the Internet STream
   Protocol Version 2 (ST2). ST2 is a resource reservation protocol
   intended to provide end-to-end real-time guarantees over an internet.
   It allows its applications to build multi-destination simplex data
   streams with a desired quality of service. The revised version of ST2
   specified in this memo is called ST2Plus.


   Editor's Note:

   This memo is available both in ASCII format (file: draft-ietf-st2-
   spec-00.txt) and in PostScript (file: draft-ietf-st2-spec-00.ps). The
   PostScript version contains some additional pictures that help to
   clarify the text, and it is therefore recommended.


L. Delgrossi (ed.)                                              [Page 1]

INTERNET DRAFT                                              October 1994


   Introduction    6

   1 What is ST2?  6

           1.1 Protocol History    6

           1.2 Streams     7

           1.3 Data Transmission   8

           1.4 Flow Specifications 9

           1.5 ST2 and IP  9

           1.6 Outline of This Document    10

   2 ST User Service Description   11

           2.1 Stream Operations and Primitive Functions   11

           2.2 State Diagrams      12

           2.3 State Transition Tables     15

   SCMP Functional Description     16

   3 Stream Setup  17

           3.1 Initial Setup at the Origin 17

           3.1.1 Invoking the Routing Function     17

           3.1.2 Reserving Resources       17

           3.2 Sending CONNECT Messages    18

           3.2.1 Empty Target List 18

           3.2.2 Long Target Lists 19

           3.3 Processing CONNECT Messages 19

           3.3.1 CONNECT Processing by an Intermediate Agent       19

           3.3.2 Setup at the Targets      19

           3.4 Processing ACCEPT Messages  20


L. Delgrossi (ed.)                                              [Page 2]

INTERNET DRAFT                                              October 1994


           3.4.1 ACCEPT Processing by an Intermediate Agent        20

           3.4.2 ACCEPT Processing by the Origin   20

           3.5 Processing REFUSE Messages  20

           3.5.1 REFUSE Processing by the Intermediate Agent       20

           3.5.2 REFUSE Processing by the Origin   21

   4 Stream Options        21

           4.1 No Recovery 21

           4.2 Join Authorization  21

   5 Data Transfer 21

   6 Modifying an Existing Stream  22

           6.1 The Origin Adding New Targets       23

           6.2 A Target Joining a Stream   23

           6.2.1 FlowSpec  24

           6.2.2 Router as Origin  24

           6.3 The Origin Removing Targets 24

           6.4 A Target Deleting Itself    25

           6.5 Changing a Stream's FlowSpec        25

   7 Stream Tear Down      26

   8 Exceptional Cases     27

           8.1 Setup Failures      27

           8.1.1 Setup Failure due to CONNECT Timeout      27

           8.1.2 Setup Failure due to ACCEPT Timeout       27

           8.1.3 Setup Failure due to Routing Failures     27

           8.2 Further Issues      28


L. Delgrossi (ed.)                                              [Page 3]

INTERNET DRAFT                                              October 1994


           8.2.1 Problems due to Routing Inconsistency     28

           8.2.2 Path Convergence  29

           8.2.3 Problems in Reserving Resources   29

           8.2.4 Problems Caused by CHANGE Messages        30

   9 Failure Detection and Recovery        31

           9.1 Failure Detection   31

           9.1.1 Network Failures  31

           9.1.2 Detecting ST Agents Failures      31

           9.2 Failure Recovery    33

           9.2.1 Problems in Stream Recovery       34

           9.3 Stream Preemption   35

   10 A Group of Streams   36

           10.1 Group Name Generator       36

           10.2 Basic ST Relationships     37

           10.2.1 Bandwidth Sharing        37

           10.2.2 Fate Sharing     37

           10.2.3 Route Sharing    38

           10.2.4 Subnet Resources Sharing 38

           10.3 Relationships Orthogonality        38

   11 Ancillary Functions  39

           11.1 Stream IDs Generation      39

           11.2 SCMP Reliability   39

           11.3 IP Encapsulation of ST     39

           11.4 IP Multicasting    40


L. Delgrossi (ed.)                                              [Page 4]

INTERNET DRAFT                                              October 1994


           11.5 Routing    40

           11.6 Security   40

   12 FlowSpec     40

           12.1 FlowSpec Versions  41

           12.2 The Null FlowSpec (#0)     41

           12.3 The ST Current FlowSpec (#7)       41

           12.3.1 Qos Classes      42

           12.3.2 Maximum Message Size     42

           12.3.3 Rate or Throughput       42

           12.3.4 Maximum Delay and Delay Jitter   42

   13 ST State Machines    43

   ST Protocol Data Units  44

   14 ST Data Packets      45

           14.1 Stream ID  45

   15 SCMP Protocol Data Units     45

           15.1 ST Control Messages        46

           15.2 Common SCMP Elements       47

           15.2.1 ErroredPDU       48

           15.2.2 FlowSpec 48

           15.2.3 Group    49

           15.2.4 MulticastAddress 49

           15.2.5 NextHopIPAddress 50

           15.2.6  Origin  50

           15.2.7 RecordRoute      51


L. Delgrossi (ed.)                                              [Page 5]

INTERNET DRAFT                                              October 1994


           15.2.8 Target and TargetList    51

           15.2.9 UserData 52

   16 ST Control Message PDUs      53

           16.1 ACCEPT     53

           16.2 ACK        54

           16.3 CHANGE     54

           16.4 CONNECT    55

           16.5 DISCONNECT 56

           16.6 ERROR      57

           16.7 HELLO      58

           16.8 JOIN-REQUEST       59

           16.9 NOTIFY     59

           16.10 REFUSE    60

           16.11 STATUS    61

           16.12 STATUS-RESPONSE   62

   17 Suggested Protocol Constants 63

           17.1 SCMP Messages      63

           17.2 SCMP Parameters    63

           17.3 ReasonCode 64

           17.4 IP Multicast Addresses     64

   18 Notation     64

   19 Further Study        64

   20 References   64

   Introduction


L. Delgrossi (ed.)                                              [Page 6]

INTERNET DRAFT                                              October 1994


   1  What is ST2?

   The Internet Stream Protocol, Version 2 (ST2) is a connection-
   oriented internetworking protocol that operates at the same layer as
   connectionless IP. It has been developed to support the efficient
   delivery of data streams to single or multiple destinations in
   applications that require guaranteed data throughput and controlled
   delay characteristics. The main application area of the protocol is
   the real-time transport of digital audio and video packet streams
   across internets.

   ST2 can be used to reserve bandwidth for multimedia streams across
   network routes. This reservation, together with appropriate network
   access and packet scheduling mechanisms in all nodes running the
   protocol, guarantees a well-defined quality of service to ST2
   applications. It ensures that each multimedia packet is delivered
   within its deadline, that is, at the time where it needs to be
   presented. This facilitates a smooth playout of digital audio and
   video that is essential for this time-critical data, but can
   typically not be provided by best-effort IP communication.

   Just like IP, ST2 actually consists of two protocols: ST for the data
   transport and SCMP, the Stream Control Message Protocol, for all
   control functions, mainly those for resource reservation. ST is
   simple and contains only one PDU that is designed for fast and
   efficient data forwarding in order to achieve low communication
   delays. SCMP, however, is quite complex. As with ICMP and IP, SCMP
   packets are transferred within ST packets as shown in Figure 1.

   1.1  Protocol History

   The first version of ST was published in the late 1970's and was used
   throughout the 1980's for experimental voice and video transmission.
   The experience gained in these applications led to the development of
   the revised protocol version ST2. The revision extends the original
   protocol to make it more complete and more applicable to emerging
   multimedia environments. The specification of this protocol version
   is contained in Internet RFC 1190 which was published in October 1990
   [RFC1190].

   With more and more developments of commercial distributed multimedia
   applications underway and with a growing dissatisfaction at the
   transmission quality for audio and video over IP in the MBONE,
   interest in ST2 has grown over the last years. Companies such as BBN
   have products available incorporating the protocol. The BERKOM
   project of the German PTT uses ST2 as its core protocol for the
   provision of multimedia teleservices such as conferencing and
   mailing. Among others, Digital, HP, IBM, and Siemens-Nixdorf


L. Delgrossi (ed.)                                              [Page 7]

INTERNET DRAFT                                              October 1994


   participate in this project. In addition, implementations of ST2 for
   Sun, Silicon Graphics, Macintosh, NeXT, and PC platforms are
   available.

   In 1993, the IETF has started a new working group on ST2. Its mission
   is to clean up the current protocol specification to ensure better
   interoperability between the existing and emerging implementations.
   It shall also reflect the experiences gained with the current ST2
   implementations and applications. This has led to the specification
   of the ST2Plus version contained in this document.

   1.2  Streams

   Streams form the core concepts of ST2. They are established between a
   sending origin and one or more receiving targets in the form of a
   routing tree. Nodes in the tree represent so-called ST agents,
   entities executing the ST2 protocol; links in the tree are called
   hops.

   Figure 2 illustrates a stream from an origin to four targets, where
   the ST agent on Target 2 also functions as a router. Let us use this
   Target 2/Router node to explain some basic ST2 terminology: the
   direction of the stream from this node to Target 3 and 4 is called
   downstream, the direction towards the Origin node upstream. ST agents
   that are one hop away from a given node are called previous-hops in
   the upstream, and next-hops in the downstream direction.

   Streams are maintained using SCMP messages. Typical SCMP messages are
   CONNECT and ACCEPT to build a stream, DISCONNECT and REFUSE to close
   a stream, or CHANGE to modify the stream characteristics, for example
   the set of targets or the quality of service.

   Each ST agent maintains state information describing the streams
   flowing through it. It can actively gather and distribute such
   information. If, for example, an intermediate ST agent fails, the
   neighboring agents can recognize this via HELLO messages that are
   periodically exchanged between ST agents that share streams. STATUS
   packets can be used to ask other ST agents about a particular stream.
   These agents then send back a STATUS-RESPONSE message. NOTIFY
   messages serve to inform ST agents of changes such as a route change.

   ST2 offers a wealth of functionalities for stream management. Streams
   can be grouped together to minimize allocated resources or to process
   them in the same way in case of failures. During audio conferences,
   for example, only one person should speak at a time. Using the group
   mechanism, resources for only one audio stream of the group need to
   be reserved. Using the same concept, an entire group of related audio
   and video streams can be dropped if one of them fails.


L. Delgrossi (ed.)                                              [Page 8]

INTERNET DRAFT                                              October 1994


   1.3  Data Transmission

   Data transfer in ST2 is simplex in the downstream direction. Data
   transport through streams is very efficient. ST2 puts only a small
   header in front of the user data. The header contains a protocol
   identification that distinguishes ST2 from IP packets, an ST2 version
   number, a priority field (specifying a relative importance of streams
   in cases of conflict), a length counter, a stream identification, and
   a checksum. These elements form an 8-byte header which can be
   extended by an optional 8-byte timestamp.

   Efficiency is also achieved by avoiding fragmentation and reassembly
   on router nodes. Negotiations at stream establishment time yield a
   maximum transmission unit (MTU) for data packets on a stream. This
   MTU is communicated to the upper layers, so that they provide data
   packets of suitable size to ST2.

   Communication with multiple next-hops can be made even more efficient
   using MAC Layer multicast. If a subnet supports multicast, a single
   multicast packet is sufficient to reach all next- hops connected to
   this subnet. This leads to a significant reduction of the bandwidth
   requirements of a stream. If multicast is not provided, separate
   packets need to be sent to each next-hop.

   As ST2 relies on reservation, it does not contain error correction
   mechanisms features for data exchange such as retransmission known
   from TCP. It is assumed that digital audio and video require
   partially correct delivery only. In many cases, retransmitted packets
   would arrive too late to meet their real-time delivery requirements.
   On the other hand, depending on the data encoding and the particular
   application, a small number of errors in audio and video streams are
   acceptable. In any case, reliability can be provided by layers on top
   of ST2 if needed.

   1.4  Flow Specifications

   As part of establishing a connection, SCMP negotiates quality-of-
   service parameters for a stream. In ST2 terminology, these parameters
   form a flow specification (FlowSpec, for short) which is associated
   with the stream. Different versions of FlowSpecs exist and can be
   distinguished by a version number. Typically, they contain parameters
   such as average and maximum throughput, end-to-end delay, and delay
   variance of a stream.

   Three kinds of entities participate in the quality-of-service
   negotiation: application entities on the origin and target sites as
   the service users, ST agents, and local resource managers (LRM). The
   origin application supplies the initial FlowSpec requesting a


L. Delgrossi (ed.)                                              [Page 9]

INTERNET DRAFT                                              October 1994


   particular service quality. Each ST agent which obtains the
   specification as part of a connection establishment message initiates
   the reservation of local resources by the corresponding resource
   manager. These resource managers control the usage of CPU capacity
   for protocol processing, buffer space for storing messages, and
   bandwidth in the outgoing network. ST2 does not determine how
   resource managers make reservations and how resources are scheduled
   according to these reservations; ST2, however, assumes these
   mechanisms as its basis.

   The FlowSpec negotiation procedure is illustrated in Figure 3.
   Depending on the success of its local reservations, an ST agent
   updates the FlowSpec while the connection establishment message
   passes downstream (for example, keeping track of accumulated delay).
   The final FlowSpec is communicated to the target application which
   may base its accept/reject decision for establishing the connection
   on it and may finally also modify the FlowSpec. If a target accepts
   the connection, the (possibly modified) FlowSpec is propagated back
   to the origin which can then calculate an overall service quality for
   all targets. If all targets in a particular ST2 connection need to
   adhere to the same FlowSpec, the origin may - during a second phase
   of connection establishment - issue a CHANGE request to adjust
   reservations.

   1.5  ST2 and IP

   ST2 is designed to coexist with IP on each node. A typical
   distributed multimedia application would use both protocols: IP for
   the transfer of traditional data and control information, and ST2 for
   the transfer of digital audio and video. Whereas IP typically will be
   accessed from TCP or UDP, ST2 will have new multimedia end-to-end
   protocols on top of it.

   Both ST2 and IP apply the same addressing schemes to identify
   different hosts and use ARP for address resolution. ST2 can easily be
   modified to include the longer host addresses of the next generation
   IP. ST2 uses the same Layer 2 SAPs as IP. ST2 and IP packets differ
   in the first four bits, containing the internetwork protocol version
   number: number 5 is reserved for ST2 (IP itself has version number
   4). An ST agent receives a packet over the IP SAP using the first 4
   bits of the frame to select ST2 packets.

   As a special function, ST2 messages can be encapsulated in IP
   packets. This allows them to pass through routers which do not run
   ST2. Resource management is typically not available for these IP
   route segments. IP encapsulation is, therefore, suggested only for
   portions of the network which do not constitute a system bottleneck.


L. Delgrossi (ed.)                                             [Page 10]

INTERNET DRAFT                                              October 1994


   1.6  Outline of This Document

   This document contains the specification for the ST2Plus version of
   the ST2 protocol. In the rest of the document, whenever the terms
   "ST" or "ST2" are used, they refer to ST2Plus.

   The document is organized as follows: Section 2 describes the ST user
   service; Section 3 through Section 7 describe stream setup,
   modification, and tear down; exceptional cases are handled in Section
   8; failure detection and groups of streams respectively in Section 9
   and Section 10; the FlowSpec is presented in Section 12; finally, the
   formats of the different protocol elements and PDUs are defined by
   Section 14 through Section 20.  Figure 1: ST2 Data and Control Path
   Figure 2: The Stream Concept Figure 3: Quality-of-Service Negotiation
   with FlowSpecs

   Figure 6: ST Service at the Target Figure 4: Primitives for the OPEN
   Stream Operation Figure 5: ST Service at the Origin

   2  ST User Service Description

   This section describes the ST user service from the high-level point
   of view of an application. It defines the ST stream operations and
   primitive functions. It specifies which operations on streams can be
   invoked by the applications built on top of ST and when the ST
   primitive functions can be legally executed. Note that the ST
   primitives do not form an API. They are used here with the only
   purpose of illustrating the service model for ST.

   2.1  Stream Operations and Primitive Functions

   An ST application at the origin may create, expand, reduce, change,
   send data to, and delete a stream. When a stream is expanded, new
   targets are added to the stream; when a stream is reduced, some of
   the current targets are dropped from it. When a stream is changed,
   the associated quality of service is modified.

   An ST application at the target may join, receive data from, and
   leave a stream.

   This translates into the following stream operations:

   o       OPEN: create new stream [origin], CLOSE: delete stream
   [origin],

   o       ADD: expand stream, i.e. add new targets to it [origin],

   o       DROP: reduce stream, i.e. drop targets from it [origin],


L. Delgrossi (ed.)                                             [Page 11]

INTERNET DRAFT                                              October 1994


   o       JOIN: join a stream [target], LEAVE: leave a stream [target],

   o       DATA: send data through stream [origin],

   o       CHG: change a stream's qos [origin],

   Each stream operation may require the execution of several primitive
   functions to be completed. For instance, to open a new stream, a
   request is first issued by the sender and an indication is generated
   at the receiver; then, the receiver may accept or refuse the request
   and the correspondent indication is generated at the sender. This is
   shown in Figure 4 below.

   Table 1 defines the ST service primitive functions associated to each
   stream operation. The column labelled "O/T" indicates whether the
   primitive is executed at the origin or at the target.

   2.2  State Diagrams

   It is not sufficient to define the set of ST stream operations. It is
   also necessary to specify when the operations can be legally
   executed. For this reason, a set of states are now introduced and the
   transitions from one state to the others are specified. States are
   defined with respect to a single stream. The previously defined
   stream operations can be legally executed only from an appropriate
   state.

   An ST agent may, with respect to an ST stream, be in one of the
   following states:

   o       IDLE: the stream has not been created yet.

   o       PENDING: the stream is in the process of being established.

   o       ACTIVE: the stream is established and active.

   o       ADDING: the stream is established. A stream expansion is
   underway.

   o       CHGING: the stream is established. A stream change is
   underway.

   Previous experience with ST suggested to impose limits on the stream
   operations that can be executed at the same time. These restrictions
   are:

   1.      A single ADD or CHG operation can be processed at one time.
   If another ADD or CHG is already underway, further requests are


L. Delgrossi (ed.)                                             [Page 12]

INTERNET DRAFT                                              October 1994


   queued by the ST agent and handled only after the previous operation
   has been completed. It also applies to two subsequent requests of the
   same kind, e.g. two ADD or two CHG operations. The second operation
   is not executed until the first one has been completed.

   2.      Deleting a stream, leaving a stream, or dropping targets from
   a stream is possible only after stream establishment has been
   completed. A stream is considerate to be established when all the
   next-hops of the origin have either accepted or refused the stream.
   Note that stream refuse is automatically forced after timeout if no
   reply comes from a next-hop.

   3.      An ST agent forwards data only along already established
   paths to the targets. A path is con- sidered to be established when
   the next-hop on the path has explicitly accepted the stream. This
   implies that the target and all other intermediate ST agents are
   ready to handle the incoming data packets. In no cases an ST agent
   will forward data to a next-hop agent that has not explicitly
   accepted the stream. To be sure that all targets receive the data, an
   application should send the data only after all paths have been
   established, i.e. the stream is established.

   4.      It is allowed to send data from the CHGING and ADDING states.
   When sending data from the CHGING state the quality of service to the
   targets affected by the change is undefined. When sending data from
   the ADDING state the targets that receive the data include at least
   all the targets that were already part of the stream at the time the
   ADD operation was invoked.

   The rules introduced above require ST agents to queue incoming
   requests when the current state does not allow to process them
   immediately. In order to preserve the semantics, ST agents have to
   maintain the order of the requests, i.e. implement FIFO queuing.
   Exceptionally, the CLOSE request at the origin and the LEAVE request
   at the target may be immediately processed: in this cases, the queue
   is deleted and it is possible that requests in the queue are not
   processed.

   The following state diagrams define the ST service. Separate diagrams
   are presented for the origin and the targets. To keep the figure
   simple, only the primitives that cause state transitions are
   represented.

   The symbol (a/r)* indicates that all targets in the target list have
   explicitly accepted or refused the stream, or refuse has been forced
   after timeout. If the target list is empty, i.e. it contains no
   targets, the (a/r)* condition is immediately satisfied, so the empty
   stream is created and state ESTBL is entered.


L. Delgrossi (ed.)                                             [Page 13]

INTERNET DRAFT                                              October 1994


   2.3  State Transition Tables

   Table 2 and Table 3 define which primitives can be processed from
   which states and the possible state transitions.

   Figure 7: Sample Topology for an ST Stream

   SCMP Functional Description

   ST agents create and manage streams using the ST Control Message
   Protocol (SCMP). Conceptually, SCMP resides immediately above ST (as
   does ICMP above IP). SCMP follows a request-response model. SCMP
   messages are made reliable through the use of retransmission after
   timeout, cf. Section 11.2.

   This section contains a functional description of SCMP. To help
   clarify the SCMP exchanges used to setup and maintain ST streams, we
   include an example of a simple network topology, represented in
   Figure 7. The topology is used to illustrate the protocol
   interactions during the execution of stream operations. For instance,
   an ST application may:

   o       Create a stream from A to the peers at B, C and D,

   o       Add a peer at E,

   o       Drop peers B and C, and

   o       Let F join the stream

   o       Delete the stream.

   We begin with a description of stream setup, see Section 3; stream
   option are presented in Section 4; data transfer in Section 5;
   Section 6 illustrates stream modification including stream expansion,
   reduction, changes of the quality of service associated to a stream.
   Finally, stream deletion is handled in Section 7.

   3  Stream Setup

   This section presents a description of stream setup. For simplicity,
   we assume that everything succeeds, e.g. any required resources are
   available, and the routing is correct. Possible failures in the setup
   phase are handled in Section 8.1.

   3.1  Initial Setup at the Origin

   Before stream setup can be started, the application has to collect


L. Delgrossi (ed.)                                             [Page 14]

INTERNET DRAFT                                              October 1994


   the necessary information to determine the structure of the
   communication. This includes identifying the participants and
   selecting the characteristics of the data flow. Such information is
   passed to the ST agent at the stream's origin. The ST agent performs
   the following operations:

   o       allocates a stream ID (SID) for the stream, cf. Section 11.1,

   o       invokes the routing function to determine the set of next-
   hops for the stream, cf. Section 3.1.1,

   o       invokes the Local Resource Manager (LRM), cf. Section 3.1.2,
   to reserve local and network resources

   o       creates local database entries to store information on the
   new stream,

   o       propagates the stream creation request to the next-hops
   determined by the routing function, see Section 3.2.

   3.1.1  Invoking the Routing Function

   An ST agent that is setting up a stream invokes the routing function
   to find a path to reach each of the targets specified by the target
   list provided by the application. This is similar to the routing
   decision in IP. However, in this case the route is to a multitude of
   targets rather than to a single destination. The routing function is
   not part of the ST protocol and therefore it is not specified by this
   document.

   The result of the routing function is a set of next-hop ST agents.
   The set of next-hops selected by the routing function is not
   necessarily the same as the set of next-hops that IP would select
   given a number of independent IP datagrams to the same destinations.
   The routing algorithm may attempt to optimize parameters other than
   the number of hops that the packets will take, such as delay, local
   network bandwidth consumption, or total internet bandwidth
   consumption.

   3.1.2  Reserving Resources

   An ST agent helps reserving both local and network resources. Local
   resources may include CPU processing time and buffer space at the
   local host. Network resources may comprise bandwidth over the
   outgoing links to the next-hops determined by the routing function.
   Resource reservation is not part of the ST protocol and therefore it
   is not specified by this document. ST invokes at every host the Local
   Resource Manager (LRM) to perform the appropriate reservations.


L. Delgrossi (ed.)                                             [Page 15]

INTERNET DRAFT                                              October 1994


   Functions as resource scheduling and reservation enforcement are part
   of the LRM's tasks and not of an ST agent's.

   The ST FlowSpec contains all the information needed to allocate the
   necessary resources. The information contained in the FlowSpec is
   passed to the LRM as parameter of the reservation functions. The LRM
   updates the FlowSpec information before it passes it back to the ST
   agent. Further information on the ST FlowSpec can be found in Section
   12.

   Note that if the data has to be sent across a network to a single
   next-hop, then only the point-to- point bandwidth needs to be
   reserved. If the data has to be sent to multiple next-hop agents
   across a single network and network layer multicasting is not
   available, the ST agent replicates the data to each next-hop agent
   and therefore bandwidth has to be reserved by the LRM for all the
   next-hops. If network layer multicast is supported, its use reduces
   the bandwidth required since one single copy of the data is received
   by all next-hop agents. The membership of a stream in a Group may
   also affect the amount of resources that have to be allocated by the
   LRM, cf. Section 10.

   Effects similar to reservation of the necessary resources may be
   obtained even when the network cannot provide direct support for the
   reservation. Certainly if total reservations are a small fraction of
   the overall resources, such as packet switch processing bandwidth,
   buffer space, or network bandwidth, then the desired performance can
   be honoured if the degree of confidence is consistent with the
   requirements as stated in the FlowSpec. Other solutions can be
   designed for specific networks.

   3.2  Sending CONNECT Messages

   The ST agent sends a CONNECT message to each of the next-hop ST
   agents identified by the routing function. Each CONNECT message
   contains the SID, an updated FlowSpec, and a TargetList. In general,
   the FlowSpec and TargetList depend on both the next-hop and the
   intervening network. Each TargetList is a subset of the original
   TargetList, identifying the targets that are to be reached through
   the next-hop to which the CONNECT message is being sent.

   The TargetList may be empty, see Section 3.2.1.; if the TargetList
   causes a too long CONNECT message to be generated, the CONNECT
   message is partitioned as explained in Section 3.2.2. If multiple
   next-hops are to be reached through a network that supports network
   level multicast, a different CONNECT message must nevertheless be
   sent to each next-hop since each will have a different TargetList.


L. Delgrossi (ed.)                                             [Page 16]

INTERNET DRAFT                                              October 1994


   Let us consider the network topology in Figure 7 on page 16. Suppose
   that the original TargetList contains targets B, C, and D. The
   routing function invoked at A returns that B is reachable via Router
   1 and C and D are reachable via Router 2. Thus, A generates two
   CONNECT messages, one for Router 1 and one for Router 2. The CONNECT
   message for Router 1 contains a TargetList including target B only;
   the CONNECT message for Router 2 contains a TargetList including
   targets C and D.

   3.2.1  Empty Target List

   An application at the origin may request the local ST agent to create
   empty streams. It does so by passing an empty TargetList to the local
   ST agent during the initial stream setup. When the local ST agent
   receives request to create an empty stream, it allocates the stream
   ID (SID), updates its local database entries to store information on
   the new stream and notifies the application that stream setup is
   complete. The local ST agent does not generate any CONNECT message
   for streams with an empty TargetList.

   3.2.2  Long Target Lists

   Each ST agent knows the MTU of the networks to which it is connected,
   and those MTUs restrict the size of the SCMP message it can send.
   SCMP messages with long TargetList can cause the size of the SCMP
   message to exceed the network MTU. The ST agent which receives an
   SCMP message bigger than its MTU must break the original message into
   multiple fragments, each carrying part of the TargetList. The effect
   of this partition is to compromise the performance but still carry
   out the function of the SCMP message. If the original SCMP message
   contains any Userdata parameters, these parameters are replicated in
   each fragment for delivery to all targets. Applications that support
   a large number of receivers may avoid using long target lists by
   exploiting the stream joining functions, cf. Section 6.2.

   3.3  Processing CONNECT Messages

   3.3.1  CONNECT Processing by an Intermediate Agent

   An ST agent receiving a CONNECT message, assuming no errors, responds
   to the previous-hop with an ACK. The ACK message must identify the
   CONNECT to which it corresponds by including the reference number
   indicated by the Reference field of the CONNECT message. The
   intermediate ST agent invokes the routing function, reserves
   resources via the LRM, and then propagates the CONNECT messages to
   its next-hops, as described in the previous section.

   3.3.2  Setup at the Targets


L. Delgrossi (ed.)                                             [Page 17]

INTERNET DRAFT                                              October 1994


   An ST agent that is the target of a CONNECT message, assuming no
   errors, responds to the previous-hop with an ACK. The ST agent
   reserves local resources and inquires from the specified application
   process whether or not it is willing to accept the connection.

   In particular, the application must be presented with parameters from
   the CONNECT, such as the SID, FlowSpec, Options, and Group, to be
   used as a basis for its decision. The application is identified by a
   combination of the NextPcol field and the SAP field included in the
   correspondent (usually single remaining) Target of the TargetList.
   The contents of the SAP field may specify the port or other local
   identifier for use by the protocol layer above the host ST layer.
   Subsequently received data packets will carry the SID, that can be
   mapped into this information and be used for their delivery.

   Finally, based on the application's decision, the ST agent sends to
   the previous-hop from which the CONNECT was received an ACCEPT or
   REFUSE message. Since the ACCEPT (or REFUSE) message has to be
   acknowledged by the previous-hop, it is assigned a new Reference
   number that will be returned in the ACK. The CONNECT to which the
   ACCEPT (or REFUSE) is a reply is identified by placing the CONNECT's
   Reference number in the LnkReference field of the ACCEPT (or REFUSE).
   The ACCEPT message contains the FlowSpec as accepted by the
   application at the target.

   3.4  Processing ACCEPT Messages

   3.4.1  ACCEPT Processing by an Intermediate Agent

   When an intermediate ST agent receives an ACCEPT, it first verifies
   that the message is a response to an earlier CONNECT. If not, it
   responds to the next-hop ST agent with an ERROR message, with
   ReasonCode (LnkRefUnknown). Otherwise, it responds to the next-hop ST
   agent with an ACK, and propagates the ACCEPT message to the
   previous-hop along the same path traced by the CONNECT but in the
   reverse direction toward the origin.

   The FlowSpec is included in the ACCEPT message so that the origin and
   intermediate ST agents can gain access to the information that was
   accumulated as the CONNECT traversed the internet. Note that the
   resources, as specified in the FlowSpec in the ACCEPT message, may
   differ from the resources that were reserved by the agent when the
   CONNECT was originally processed. However, the agent does not adjust
   the reservation in response to the ACCEPT. It is expected that any
   excess resource allocation will be released for use by other stream
   or datagram traffic through an explicit CHANGE message initiated by
   the application at the origin if it does not wish to be charged for
   any excess resource allocations.


L. Delgrossi (ed.)                                             [Page 18]

INTERNET DRAFT                                              October 1994


   3.4.2  ACCEPT Processing by the Origin

   The origin will eventually receive an ACCEPT (or REFUSE) message from
   each of the targets. As each ACCEPT is received, the application is
   notified of the target and the resources that were successfully
   allocated along the path to it, as specified in the FlowSpec
   contained in the ACCEPT message. The application may then use the
   information to either adopt or terminate the portion of the stream to
   each target. When ACCEPT (or REFUSE) from all targets have been
   received at the origin, the application is notified that stream setup
   is complete. For problems due to CONNECT timeout, please refer to
   Section 8.1.1.

   When an ACCEPT is received by the origin, the path to the target is
   considered to be established and the ST agent is allowed to forward
   the data along this path as explained in Section 5 and in the ST user
   service description in Section 2.

   3.5  Processing REFUSE Messages

   3.5.1  REFUSE Processing by the Intermediate Agent

   If an application at a target does not wish to participate in the
   stream, it sends a REFUSE message back to the origin with ReasonCode
   (ApplDisconnect). An intermediate ST agent that receives a REFUSE
   message with ReasonCode (ApplDisconnect) acknowledges it by sending
   an ACK to the next-hop, considers which resources are to be released,
   deletes the target entry from the internal database, and propagates
   the REFUSE message back to the previous-hop ST agent.

   If, after deleting the specified target, the next-hop has no
   remaining targets, then those resources associated with that next-hop
   agent may be released. Note that network resources may not actually
   be released if network multicasting is being used since they may
   still be required for traffic to other next-hops in the multicast
   group.

   3.5.2  REFUSE Processing by the Origin

   When the REFUSE reaches the origin, the origin sends an ACK and
   notifies the application that the target is no longer part of the
   stream and also if the stream has no remaining targets. If there are
   no remaining targets, the application may wish to terminate the
   stream or keep the stream active to allow stream joining as described
   in Section 6.2.

   4  Stream Options


L. Delgrossi (ed.)                                             [Page 19]

INTERNET DRAFT                                              October 1994


   An application may select among some stream options. The desired
   options are indicated to the ST agent at the origin when a new stream
   is created. Options apply to single streams and are valid during the
   whole stream's lifetime. The options chosen by the application at the
   origin are included into the initial CONNECT message(s). When a
   CONNECT message reaches a target, the application at the target is
   notified of the stream options that have been selected.

   4.1  No Recovery

   The NoRecovery option is used to indicate that ST agents should not
   attempt recovery in case of network or component failure. If a
   failure occurs, the origin will be notified via a REFUSE message and
   the targets via a DISCONNECT, with an appropriate ReasonCode
   indicating the reason of the failure. The application at the origin
   may decide whether to rebuild the deleted portion of the stream by
   sending a CONNECT message. The NoRecovery option is specified by
   setting the S-bit in the CONNECT message, see Section 16.4.

   4.2  Join Authorization

   To Be Written

   5  Data Transfer

   An application is not guaranteed that the data reaches its
   destinations: ST is unreliable and it does not make any attempt to
   recover from packet loss, e.g. due to the underlying network. In case
   the data reaches its destination, it does it accordingly to the
   negotiated quality of service.

   An ST agent forwards the data only along already established paths to
   targets. A path is considered to be established when the ST next-hop
   agent on the path sends an ACCEPT message. This implies that the
   target and all other intermediate ST agents on the path to the target
   are ready to handle the incoming data packets. In no cases an ST
   agent will forward data to a next-hop agent that has not explicitly
   accepted the stream.

   To be fairly sure that all targets receive the data with the desired
   quality of service, an application should send the data only after
   the whole stream has been established. Depending on the local API, an
   application may not be prevented to send data before the completion
   of stream setup, but it should be aware that the data could be lost
   or not reach all the intended targets.

   At the end of the connection setup phase, the origin, each target,
   and each intermediate ST agent has a database entry that allows it to


L. Delgrossi (ed.)                                             [Page 20]

INTERNET DRAFT                                              October 1994


   forward the data packets from the origin to the targets and to
   recover from failures of the intermediate agents or networks. The
   database should be optimized to make the packet forwarding task most
   efficient. The time critical operation is an intermediate agent
   receiving a packet from the previous-hop agent and forwarding it to
   the next- hop agents. The database entry must also contain the
   FlowSpec, utilization information, the address of the origin and
   previous-hop, and the addresses of the targets and next-hops, so it
   can perform enforcement and recover from failures.

   An ST agent receives data packets encapsulated by an ST header. A
   data packet received by an ST agent contains the SID. This SID was
   selected at the origin so that it is globally unique and thus can be
   used as an index into the database, to obtain quickly the necessary
   replication and forwarding information.

   The forwarding information will be network and implementation
   specific, but must identify the next-hop agents. It is suggested that
   the cached information for a next-hop agent include the local network
   address of the next- hop. If the data packet must be forwarded to
   multiple next- hops across a single network that supports multicast,
   the database may specify the next-hops by a (local network) multicast
   address. If the network does not support multicast, or the next-hops
   are on different networks, multiple copies of the data packet must be
   sent.

   No data fragmentation is supported during the data transfer phase.
   The application is expected to segment its PDUs according to the
   minimum MTU over all paths in the stream. The application receives
   information on the MTUs relative to the paths to the targets as part
   of the FlowSpec contained in the ACCEPT message, see also Section 12.
   The minimum MTU over all paths has to be calculated from the MTUs
   relative to the single paths. If the application at the origin sends
   a too large data packet, the ST agent at the origin generates an
   error and it does not forward the data.

   6  Modifying an Existing Stream

   Some applications may wish to modify a stream after it has been
   created. Possible changes include expanding a stream, reducing it,
   and changing its FlowSpec. In ST, changes to a stream may be
   initiated both by the origin and the targets. Targets may be added by
   the origin as described in Section 6.1 or they may request to join
   the stream as described in Section 6.2. The origin can reduce a
   stream by dropping some or all of its targets. This is described in
   Section 6.3. Targets may spontaneously decide to leave a stream as
   described in Section 6.4. Section 6.5 explains how to change a
   stream's FlowSpec.


L. Delgrossi (ed.)                                             [Page 21]

INTERNET DRAFT                                              October 1994


   As defined by the ST service model, see Section 2, an ST agent can
   handle only one stream modification at a time. If a stream
   modification operation is already underway, further requests are
   queued and handled when the previous operation has been completed.
   This also applies to two subsequent requests of the same kind, e.g.
   two subsequent changes to the FlowSpec.

   6.1  The Origin Adding New Targets

   It is possible for an application at the origin to add new targets to
   an existing stream any time after the stream has been established.
   Before new targets are added, the application has to collect the
   necessary information on the new targets. Such information is passed
   to the ST agent at the origin.

   The ST agent at the origin issues a CONNECT message that contains the
   SID, the FlowSpec, and the TargetList specifying the new targets.
   This is similar to sending a CONNECT message during stream
   establishment, with the following exceptions: the origin checks that
   a) the SID is valid, b) the targets are not already members of the
   stream, c) the FlowSpec of the new target, if present, matches the
   FlowSpec of the existing stream, i.e it requires an equal or smaller
   amount of resources to be allocated. If the FlowSpec of the new
   target does not match the FlowSpec of the existing stream, it is
   simply ignored.

   An intermediate ST agent that is already a node in the stream looks
   at the SID and verifies that the stream is the same. It then checks
   if the intersection of the TargetList and the targets of the
   established stream is empty. If this is not the case, it responds
   with an ERROR message with the appropriate ReasonCode (RouteLoop)
   that contains a TargetList of those targets that were duplicates.

   For each new target in the TargetList, processing is much the same as
   for the original CONNECT. The CONNECT is acknowledged, propagated,
   and network resources are reserved. However, it may be possible to
   route to the new targets using previously allocated paths or an
   existing multicast group. In that case, additional resources do not
   need to be reserved but more next-hops might have to be added to an
   existing multicast group.

   Intermediate or target ST agents that are not already nodes in the
   stream behave as in case of stream setup (see Section 3.3.1 and
   Section 3.3.2).

   6.2  A Target Joining a Stream

   An application may request to join an existing stream. It has to


L. Delgrossi (ed.)                                             [Page 22]

INTERNET DRAFT                                              October 1994


   collect information on the stream including the stream ID (SID) and
   the IP address of the stream's origin. This can be done out-of- band,
   e.g. via regular IP. The information is then passed to the local ST
   agent together with the FlowSpec. The ST agent generates a JOIN
   message containing the application's request to join the stream and
   sends it toward the stream origin.

   An ST agent receiving a JOIN message, assuming no errors, responds
   with an ACK. The ACK message must identify the JOIN message to which
   it corresponds by including the Reference number indicated by the
   Reference field of the Join message. If the ST agent is not traversed
   by the stream that has to be joined, it propagates the JOIN message
   toward the stream's origin. Eventually, an ST agent traversed by the
   stream or the stream's origin itself is reached. This ST agent
   responds to the join request based on the join authorization level
   associated with the stream, cf. Section 4.2.:

   o       level 0 (refuse join)

   It is not allowed to join the stream. No further actions are taken.

   o       level 1 (ask origin)

   The JOIN message is propagated back until the origin is reached. At
   the origin, the appli- cation is requested to either grant or deny
   the permission to join the stream. If the permis- sion is denied, no
   further actions are taken. Otherwise, the origin issues a CONNECT
   message with a TargetList including the target that requested to join
   the stream. The target is then added as in normal stream setup.

   o       level 2 (ok, notify origin)

   The ST agent sends a CONNECT message with a TargetList including the
   target that requested to join the stream. This results in adding the
   target to the stream. When the ST agent which is already part in the
   stream receives the ACCEPT message indicating that the new target has
   been added, it does not propagate the ACCEPT message backwards.
   Instead, it issues a NOTIFY message with ReasonCode(TargetJoined) to
   inform the origin of the new target.

   o       level 3 (ok)

   The ST agent sends a CONNECT message with a TargetList including the
   target that requested to join the stream. This results in adding the
   target to the stream. When the ST agent which is already part in the
   stream receives the ACCEPT message indicating that the new target has
   been added, it does not propagate the ACCEPT message backwards, nor
   it notifies the origin.


L. Delgrossi (ed.)                                             [Page 23]

INTERNET DRAFT                                              October 1994


   6.2.1  FlowSpec

   To Be Written

   6.2.2  Router as Origin

   To Be Written

   6.3  The Origin Removing Targets

   The application at the origin specifies a set of targets that are to
   be removed from the stream and an appropriate ReasonCode
   (ApplDisconnect). The targets are partitioned into multiple
   DISCONNECT messages based on the next-hop to the individual targets.
   If the TargetList is too long to fit into one DISCONNECT message, it
   is partitioned as described in Section 3.2.2.

   An ST agent that receives a DISCONNECT message acknowledges it by
   sending an ACK back to the previous-hop. The DISCONNECT is also
   propagated to the relevant next-hop ST agents. Before propagating the
   message, the TargetList is partitioned based on next-hop ST agents.

   If, after deleting the specified targets, any next-hop has no
   remaining targets, then those resources associated with that next-hop
   agent may be released. Note that network resources may not actually
   be released if network multicasting is being used since they may
   still be required for traffic to other next-hops in the multicast
   group.

   When the DISCONNECT reaches a target, the target sends an ACK and
   notifies the application that it is no longer part of the stream and
   for which reason. The ST agent at the target deletes the stream from
   its database after performing any necessary management and accounting
   functions. Note that the stream is not deleted if the ST agent is
   also a router for the stream and there are remaining downstream
   targets.

   6.4  A Target Deleting Itself

   The application at the target may inform ST that it wants to be
   removed from the stream and the appropriate ReasonCode
   (ApplDisconnect). The agent then forms a REFUSE message with itself
   as the only entry in the TargetList. The REFUSE is sent back to the
   origin via the previous-hop. If a stream has multiple targets and one
   target leaves the stream using this REFUSE mechanism, the stream to
   the other targets is not affected; the stream continues to exist.

   An ST agent that receives such a REFUSE message acknowledges it by


L. Delgrossi (ed.)                                             [Page 24]

INTERNET DRAFT                                              October 1994


   sending an ACK to the next-hop. The target is deleted and, if the
   next-hop has no remaining targets, then the resources associated with
   that next-hop agent may be released. Note that network resources may
   not actually be released if network multicasting is being used since
   they may still be required for traffic to other next-hops in the
   multicast group. The REFUSE is also propagated back to the previous-
   hop ST agent.

   When the REFUSE reaches the origin, the origin sends an ACK and
   notifies the application that the target is no longer part of the
   stream.

   6.5  Changing a Stream's FlowSpec

   The application at the sender may wish to change the FlowSpec of an
   established stream. To do so, it informs the ST agent at the origin
   of the new FlowSpec and of the list of targets relative to the
   change. The origin then issues one CHANGE message with the new
   FlowSpec per next-hop and sends it to the relevant next-hop agents.
   CHANGE messages are structured and processed similarly to CONNECT
   messages.

   A next-hop agent that is an intermediate agent and receives a CHANGE
   message similarly determines if it can implement the new FlowSpec
   along the hop to each of its next-hop agents, and if so, it
   propagates the CHANGE messages along the established paths. If this
   process succeeds, the CHANGE messages will eventually reach the
   targets, which will each respond with an ACCEPT (or REFUSE) message
   that is propagated back to the origin.

   If the change to the FlowSpec is in a direction that makes fewer
   demands of the involved networks, then the change has a high
   probability of success along the path of the established stream. Each
   ST agent receiving the CHANGE message makes the necessary requested
   changes to the network resource allocations, and if successful,
   propagates the CHANGE message along the established paths. If the
   change cannot be made then the ST agent must recover using DISCONNECT
   and REFUSE messages as in the case of a network failure, see Section
   9.2. Note that a failure to change the resources requested for
   specific targets should not cause other targets in the stream to be
   deleted.

   7  Stream Tear Down

   A stream is usually terminated by the origin when it has no further
   data to send, but may also be partially torn down by the individual
   targets. These cases will not be further discussed since they have
   already been described above.


L. Delgrossi (ed.)                                             [Page 25]

INTERNET DRAFT                                              October 1994


   A stream is also torn down if the application should terminate
   abnormally. Processing in this case is identical to the previous
   descriptions except that the ReasonCode (ApplAbort) is different.

   When all targets have left a stream, the origin notifies the
   application of that fact, and the application then is responsible for
   terminating the stream. Note, however, that the application may
   decide to add targets to the stream instead of terminating it.

   8  Exceptional Cases

   The previous descriptions covered the simple cases where everything
   worked. We now discuss what happens when things do not succeed.
   Included are situations where messages are lost, the requested
   resources are not available, the routing fails or is inconsistent.

   8.1  Setup Failures

   8.1.1  Setup Failure due to CONNECT Timeout

   When sending a CONNECT message, an ST agent expects an ACK from the
   next hop ST agent. If the CONNECT fails due to timeout (see Section
   11.2), the ST agent sends a REFUSE message back in the direction of
   the origin with the appropriate ReasonCode (ConnectTimeout).

   8.1.2  Setup Failure due to ACCEPT Timeout

   An ST agent that propagates an ACCEPT message backward toward the
   origin expects an ACK from the previous hop ST agent. If the ACCEPT
   fails due to timeout (see Section 11.2), the ST agent replaces the
   ACCEPT with a REFUSE and sends a DISCONNECT in the direction toward
   the target. Both REFUSE and DISCONNECT must identify the affected
   targets and specify the appropriate ReasonCode (AcceptTimeout).

   8.1.3  Setup Failure due to Routing Failures

   It is possible for an agent to receive a CONNECT message that
   contains a known SID, but from an agent other than the previous-hop
   agent of the stream with that SID. This may be:

   1.      that two branches of the tree forming the stream have joined
   back together,

   2.      the result of an attempted recovery of a partially failed
   stream, or

   3.      an erroneous routing loop.


L. Delgrossi (ed.)                                             [Page 26]

INTERNET DRAFT                                              October 1994


   The TargetList contained in the CONNECT is used to distinguish the
   different cases by comparing each newly received target with those of
   the previously existing stream:

   o       if the IP address of the targets differ, it is case 1;

   o       if the target matches a target in the existing stream, it may
   be case #2 or #3.

   Case #1 is handled in Section 8.2.2 on path convergence. The
   remaining cases requiring recovery, a partially failed stream and an
   erroneous routing loop, are not easily distinguishable. In attempting
   recovery of a failed stream, an agent may issue new CONNECT messages
   to the affected targets. Such a CONNECT may reach an agent downstream
   of the failure before that agent has received a DISCONNECT from the
   neighbourhood of the failure. Until that agent receives the
   DISCONNECT, it cannot distinguish between a failure recovery and an
   erroneous routing loop. That agent must therefore respond to the
   CONNECT with a REFUSE message with the affected targets specified in
   the TargetList and an appropriate ReasonCode (StreamExists).

   The agent immediately preceding that point, i.e., the latest agent to
   send the CONNECT message, will receive the REFUSE message. It must
   release any resources reserved exclusively for traffic to the listed
   targets. If this agent was not the one attempting the stream
   recovery, then it cannot distinguish between a failure recovery and
   an erroneous routing loop. It should repeat the CONNECT after a
   ToConnect timeout, cf. Section 11.2 and Section 8.1.1. If after
   NConnect retransmissions it continues to receive REFUSE messages, it
   should propagate the REFUSE message toward the origin, with the
   TargetList that specifies the affected targets, but with a different
   error code (RouteLoop).

   The REFUSE message with this error code (RouteLoop) is propagated by
   each ST agent without retransmitting any CONNECT messages. At each
   agent, it causes any resources reserved exclusively for the listed
   targets to be released. The REFUSE will be propagated to the origin
   in the case of an erroneous routing loop. In the case of stream
   recovery, it will be propagated to the ST agent that is attempting
   the recovery, which may be an intermediate agent or the origin
   itself. In the case of a stream recovery, the agent attempting the
   recovery may issue new CONNECT messages to the same or to different
   next-hops.

   If an agent receives both a REFUSE message and a DISCONNECT message
   with a target in common then it can release the relevant resources
   and propagate neither the REFUSE nor the DISCONNECT.


L. Delgrossi (ed.)                                             [Page 27]

INTERNET DRAFT                                              October 1994


   If the origin receives such a REFUSE message, it should attempt to
   send a new CONNECT to all the affected targets. Since routing errors
   in an internet are assumed to be temporary, the new CONNECTs will
   eventually find acceptable routes to the targets, if one exists. If
   no further routes exist after NRetryRoute tries, the application
   should be informed so that it may take whatever action it seems
   necessary.

   8.2  Further Issues

   8.2.1  Problems due to Routing Inconsistency

   When an intermediate agent receives a CONNECT, it invokes the routing
   algorithm to select the next-hop agents based on the TargetList and
   the networks to which it is connected. If the resulting next-hop to
   any of the targets is across the same network from which it received
   the CONNECT (but not the previous-hop itself), there may be a routing
   problem. However, the routing algorithm at the previous-hop may be
   optimizing differently than the local algorithm would in the same
   situation. Since the local ST agent cannot distinguish the two cases,
   it should permit the setup but send back to the previous-hop agent an
   informative NOTIFY message with the appropriate ReasonCode
   (RouteBack), pertinent TargetList, and in the NextHopIPAddress
   element the address of the next-hop ST agent returned by its routing
   algorithm.

   The agent that receives such a NOTIFY should ACK it. If the agent is
   using an algorithm that would produce such behaviour, no further
   action is taken; if not, the agent should send a DISCONNECT to the
   next-hop agent to correct the problem.

   Alternatively, if the next-hop returned by the routing function is in
   fact the previous-hop, a routing inconsistency has been detected. In
   this case, a REFUSE is sent back to the previous- hop agent
   containing an appropriate ReasonCode (RouteInconsist), pertinent
   TargetList, and in the NextHopIPAddress element the address of the
   previous-hop. When the previous-hop receives the REFUSE, it will
   recompute the next-hop for the affected targets. If there is a
   difference in the routing databases in the two agents, they may
   exchange CONNECT and REFUSE messages again. Since such routing errors
   in the internet are assumed to be temporary, the situation should
   eventually stabilize.

   8.2.2  Path Convergence

   It is possible for an agent to receive a CONNECT message that
   contains a known SID, but from an agent other than the previous hop
   agent of the stream with that SID. This might be the result of two


L. Delgrossi (ed.)                                             [Page 28]

INTERNET DRAFT                                              October 1994


   branches of the tree forming the stream have joined back together.
   Other cases are discussed in Section 8.1.3.

   This version of ST does not allow streams which have converged path,
   i.e streams are always tree-shaped and not graph-like. The ST agent
   which detects this condition informs the previous hop ST agent (the
   latest ST agent to send the CONNECT message) by sending a NOTIFY
   message with ReasonCode(PathConverge). Upon receipt of the NOTIFY
   message, the previous hop ST agent will find alternate route to the
   listed targets with a different next hop ST agent. If there is no
   next hop ST-agent other than the one it receives the NOTIFY message
   from, the ST agent must release any resources reserved for the listed
   targets and send a REFUSE message with ReasonCode(PathConverge) to
   its previous hop ST agent. In the same way, the REFUSE message is
   possibly propagated back by each ST agent. At each agent, it causes
   any resources reserved exclusively for the listed targets to be
   released. When the REFUSE reaches the origin, the ST agent at the
   origin should attempt to send a CONNECT with the listed targets to a
   different route. If no route exists, or after NRetryRoute tries, the
   application should be informed so that it may take whatever actions
   it seems necessary.

   8.2.3  Problems in Reserving Resources

   If the local or network resources are not available, an ST agent may:

   o       try alternative paths to the targets: the ST agent calls the
   routing function to find a different path to the targets. If an
   alternative path is found, stream connection setup continues in the
   usual way, as described in Section 3.

   o       preempt one or more of the already established streams: this
   way, the ST agent attempts to free enough resources to allow for the
   new stream to be established. Stream preemption is discussed in
   Section 9.3.

   o       refuse to establish the stream along this path: the origin ST
   agent informs the application of the stream setup failure; an ST
   agent at a router or target issues a REFUSE message (as described in
   Section 3.5) with ReasonCode (CantGetResrc).

   It depends on the local implementations whether an ST agent tries
   alternative paths or preempts other streams. Also, the order of the
   actions taken is not defined here. In any case, if enough resources
   cannot be found over different paths or as a consequence of stream
   preemption, the agent has to explicitly refuse to establish the
   stream.


L. Delgrossi (ed.)                                             [Page 29]

INTERNET DRAFT                                              October 1994


   8.2.4  Problems Caused by CHANGE Messages

   A CHANGE might fail for several reasons, including:

   o       the request may be for a larger amount of network resources
   when those resources are not available;

   o       it might be required that all the former resources are
   released before the new ones are requested and, due to unlucky
   timing, an unrelated request for network resources might be processed
   between the time the resources are released and the time the new
   resources are requested, so that the former resources are no longer
   available.

   If the attempt to change the FlowSpec fails then the ST agent where
   the failure occurs must intentionally break the affected portion of
   the stream. This is done by sending REFUSE and DISCONNECT messages
   with ReasonCode (ChgFailed).

   9  Failure Detection and Recovery

   9.1  Failure Detection

   The ST failure detection mechanism is based on two assumptions:

   1.      If a neighbor of an ST agent is up, and has been up without a
   disruption, and has not notified the ST agent of a problem with
   streams that pass through both, then the ST agent can assume that
   there has not been any problem with those streams.

   2.      A network through which an ST agent has routed a stream will
   notify the ST agent if there is a problem that affects the stream
   data packets but does not affect the control packets.

   The purpose of the robustness protocol defined here is for ST agents
   to determine that the streams through a neighbor have been broken by
   the failure of the neighbor or the intervening network. This protocol
   should detect the overwhelming majority of failures that can occur.
   Once a failure is detected, the recovery procedures described in
   Section 9.2 are initiated by the ST agents.

   9.1.1  Network Failures

   An ST agent can detect network failures by two mechanisms:

   o       the network can report a failure, or

   o       the ST agent can discover a failure by itself.


L. Delgrossi (ed.)                                             [Page 30]

INTERNET DRAFT                                              October 1994


   They differ in the amount of information that an ST agent has
   available to it in order to make a recovery decision. For example, a
   network may be able to report that reserved bandwidth has been lost
   and the reason for the loss and may also report that connectivity to
   the neighboring ST agent remains intact. In this case, the ST agent
   may request the network to allocate bandwidth anew. On the other
   hand, an ST agent may discover that communication with a neighboring
   ST agent has ceased because it has not received any traffic from that
   neighbor in some time period. If an ST agent detects a failure, it
   may not be able to determine if the failure was in the network while
   the neighbor remains available, or the neighbor has failed while the
   network remains intact.

   9.1.2  Detecting ST Agents Failures

   Each ST agent periodically sends each neighbour with which it shares
   one or more streams a HELLO message. This message exchange is between
   ST agents, not entities representing streams or applications. That
   is, an ST agent need only send a single HELLO message to a neighbour
   regardless of the number of streams that flow between them. All ST
   agents (host as well as intermediate) must participate in this
   exchange. However, only agents that share active streams need to
   participate in this exchange and it is an error to send a HELLO
   message to a neighbour ST agent with no streams in common, e.g. to
   check whether it is active. Note that STATUS messages can be used to
   poll neighbour ST agents.

   A HELLO message is ACKed if the Reference field is non-zero. As well
   as identifying the sender, the HELLO message has two fields:

   o       a HelloTimer field that is in units of milliseconds modulo
   the maximum for the field size, and

   o       a Restarted-bit specifying that the ST agent has been
   restarted recently.

   The HelloTimer must appear to be incremented every millisecond
   whether a HELLO message is sent or not, but it is allowable for an ST
   agent to create a new HelloTimer only when it sends a HELLO message.
   The HelloTimer wraps around to zero after reaching the maximum value.
   Whenever an ST agent suffers a catastrophic event that may result in
   it losing ST state information, it must reset its HelloTimer to zero
   and must set the Restarted-bit for the following HelloTimerHoldDown
   seconds.

   Each ST stream has a RecoveryTimeout value associated with it. This
   value is assigned by the origin and carried into the CONNECT message,
   see Section 16.4.


L. Delgrossi (ed.)                                             [Page 31]

INTERNET DRAFT                                              October 1994


   An ST agent must send HELLO messages to its neighbour with a period
   shorter than the smallest RecoveryTimeout of all the active streams
   that pass between the two agents, regardless of direction. This
   period must be smaller by a factor, called HelloLossFactor, which is
   at least as large as the greatest number of consecutive HELLO
   messages that could credibly be lost while the communication between
   the two ST agents is still viable.

   An ST agent may send simultaneous HELLO messages to all its neighbors
   at the rate necessary to support the smallest RecoveryTimeout of any
   active stream. Alternately, it may send HELLO messages to different
   neighbors independently at different rates corresponding to
   RecoveryTimeouts of individual streams.

   The agent that receives a HELLO message expects to receive at least
   one new HELLO message from a neighbor during the RecoveryTimeout of
   every active stream through that neighbor. It can detect duplicate or
   delayed HELLO messages by saving the HelloTimer field of the most
   recent valid HELLO message from that neighbor and comparing it with
   the HelloTimer field of incoming HELLO messages. It will only accept
   an incoming HELLO message from that neighbor if it has a HelloTimer
   field that is greater than the most recent valid HELLO message by the
   time elapsed since that message was received plus twice the maximum
   likely delay variance from that neighbor. If the ST agent does not
   receive a valid HELLO message within the RecoveryTimeout of a stream,
   it must assume that the neighboring ST agent or the communication
   link between the two has failed and it must initiate stream recovery
   activity.

   Furthermore, if an ST agent receives a HELLO message that contains
   the Restarted-bit set, it must assume that the sending ST agent has
   lost its ST state. If it shares streams with that neighbor, it must
   initiate stream recovery activity. If it does not share streams with
   that neighbor, it should not attempt to create one until that bit is
   no longer set. If an ST agent receives a CONNECT message from a
   neighbor whose Restarted-bit is still set, it must respond with ERROR
   with the appropriate ReasonCode (RemoteRestart). If it receives a
   CONNECT message while its own Restarted-bit is set, it must respond
   with ERROR with the appropriate ReasonCode (RestartLocal).

   9.2  Failure Recovery

   If an intermediate agent fails or a network or part of a network
   fails, the previous-hop agent and the various next-hop agents will
   discover the fact by the failure detection mechanism described in
   Section 9.1.

   The recovery of an ST stream is a relatively complex and time


L. Delgrossi (ed.)                                             [Page 32]

INTERNET DRAFT                                              October 1994


   consuming effort because it is designed in a general manner to
   operate across a large number of networks with diverse
   characteristics. Therefore, it may require information to be
   distributed widely, and may require relatively long timers. On the
   other hand, since a network is a homogeneous system, failure recovery
   in the network may be a relatively faster and simpler operation.
   Therefore an ST agent that detects a failure should attempt to fix
   the network failure before attempting recovery of the ST stream. If
   the stream that existed between two ST agents before the failure
   cannot be reconstructed by network recovery mechanisms alone, then
   the ST stream recovery mechanism must be invoked.

   If stream recovery is necessary, the different ST agents may need to
   perform different functions, depending on their relation to the
   failure:

   o       An agent that is a next-hop of a failure should first verify
   that there was a failure. It can do this using STATUS messages to
   query its upstream neighbor. If it cannot communicate with that
   neighbor, then it should first send a REFUSE message with the
   appropriate ReasonCode ("failure") to the neighbor to speed up the
   failure recovery in case the hop is unidirectional, i.e., the
   neighbor can hear the agent but the agent cannot hear the neighbor.
   The ST agent detecting the failure must then send DISCONNECT messages
   with the same ReasonCode toward the targets.

   The intermediate agents process this DISCONNECT message just like the
   DISCON- NECT that tears down the stream. However, a target ST agent
   that receives a DISCONNECT message with the appropriate ReasonCode
   ("failure") will maintain the stream state and notify the next higher
   protocol of the failure. In effect, these DISCONNECT messages tear
   down the stream from the point of the failure to the targets, but
   inform the targets that the stream may be fixed shortly.

   o       An agent that is the previous-hop before the failed component
   first verifies that there was a failure by querying the downstream
   neighbor using STATUS messages. If the neighbor has lost its state
   but is available, then the ST agent may reconstruct the stream if the
   NoRecovery option is not selected. If it cannot communicate with the
   next-hop, then the agent detecting the failure releases any resources
   that are dedicated exclusively to sending data on the broken branch
   and sends a DISCONNECT message with the appropriate ReasonCode
   ("failure") toward the affected targets. It does so to speed up
   failure recovery in case the communication may be unidirectional and
   this message might be delivered successfully.

   The agent that is the previous-hop before the failed component can
   attempt to recover the streams for which the NoRecovery option is not


L. Delgrossi (ed.)                                             [Page 33]

INTERNET DRAFT                                              October 1994


   selected:

   o       If the NoRecovery option is selected, then the ST agent sends
   a REFUSE message with the appropriate ReasonCode ("failure") to the
   previous-hop. The TargetList in these messages contains all the
   targets that were reached through the broken branch. Multiple REFUSE
   mes- sages may be required if the PDU is too long for the MTU of the
   intervening network. The REFUSE message is propagated all the way to
   the origin, which can attempt recovery of the stream by sending a new
   CONNECT to the affected targets. The new CONNECT will be treated by
   intermediate ST agents as an addition of new targets into the
   established stream.

   o       If the NoRecovery option is not selected, the ST agent can
   attempt recovery of the stream. It does so by issuing a new CONNECT
   message to the affected targets. If the ST agent can- not find new
   routes to some targets, or if the only route to some targets is
   through the previ- ous-hop, then it sends one or more REFUSE messages
   to the previous-hop with the appropriate ReasonCode ("failure")
   specifying the affected targets in the TargetList. The pre- vious-hop
   can then attempt recovery of the stream by issuing a CONNECT to those
   targets. If it cannot find an appropriate route, it will propagate
   the REFUSE message toward the ori- gin.

   Regardless of which agent attempts recovery of a damaged stream, it
   will issue one or more CONNECT messages to the affected targets.
   These CONNECT messages are treated by intermediate ST agents as
   additions of new targets into the established stream. The FlowSpecs
   of the new CONNECT messages are the same as the ones contained in the
   most recent CONNECT or CHANGE messages that the ST agent had sent
   toward the affected targets when the stream was operational.

   9.2.1  Problems in Stream Recovery

   The reconstruction of a broken stream may not proceed smoothly. Since
   there may be some delay while the information concerning the failure
   is propagated throughout an internet, routing errors may occur for
   some time after a failure. As a result, the ST agent attempting the
   recovery may receive ERROR messages for the new CONNECTs that are
   caused by internet routing errors. The ST agent attempting the
   recovery should be prepared to resend CONNECTs before it succeeds in
   reconstructing the stream. If the failure partitions the internet and
   a new set of routes cannot be found to the targets, the REFUSE
   messages will eventually be propagated to the origin, which can then
   inform the application so it can decide whether to terminate or to
   continue to attempt recovery of the stream.

   The new CONNECT may at some point reach an ST agent downstream of the


L. Delgrossi (ed.)                                             [Page 34]

INTERNET DRAFT                                              October 1994


   failure before the DISCONNECT does. In this case, the agent that
   receives the CONNECT is not yet aware that the stream has suffered a
   failure, and will interpret the new CONNECT as resulting from a
   routing failure. It will respond with an ERROR message with the
   appropriate ReasonCode (StreamExists). Since the timeout that the ST
   agents immediately preceding the failure and immediately following
   the failure are approximately the same, it is very likely that the
   remnants of the broken stream will soon be torn down by a DISCONNECT
   message with the appropriate ReasonCode ("failure"). Therefore, the
   ST agent that receives the ERROR message with ReasonCode
   (StreamExists) should retransmit the CONNECT message after the
   ToConnect timeout expires. If this fails again, the request will be
   retried for NConnect times. Only if it still fails will the ST agent
   send a REFUSE message with the appropriate ReasonCode (RouteLoop) to
   its previous-hop. This message will be propagated back to the ST
   agent that is attempting recovery of the damaged stream. That ST
   agent can issue a new CONNECT message if it so chooses. The REFUSE is
   matched to a CONNECT message created by a recovery operation through
   the LnkReference field in the CONNECT.

   ST agents that have propagated a CONNECT message and have received a
   REFUSE message should maintain this information for some period of
   time. If an agent receives a second CONNECT message for a target that
   recently resulted in a REFUSE, that agent may respond with a REFUSE
   immediately rather than attempting to propagate the CONNECT. This has
   the effect of pruning the tree that is formed by the propagation of
   CONNECT messages to a target that is not reachable by the routes that
   are selected first. The tree will pass through any given ST agent
   only once, and the stream setup phase will be completed faster.

   If a CONNECT message reaches a target, the target should as
   efficiently as possible use the state that it has saved from before
   the stream failed during recovery of the stream. It will then issue
   an ACCEPT message toward the origin. The ACCEPT message will be
   intercepted by the ST agent that is attempting recovery of the
   damaged stream, if not the origin. If the FlowSpec contained in the
   ACCEPT specifies the same selection of parameters as were in effect
   before the failure, then the ST agent that is attempting recovery
   will not propagate the ACCEPT. If the selections of the parameters
   are different, then the agent that is attempting recovery will send
   the origin a NOTIFY message with the appropriate ReasonCode
   (FailureRecovery) that contains a FlowSpec that specifies the new
   parameter values. The origin may then have to change its data
   generation characteristics and the stream's parameters with a CHANGE
   message to use the newly recovered subtree.

   9.3  Stream Preemption


L. Delgrossi (ed.)                                             [Page 35]

INTERNET DRAFT                                              October 1994


   An intermediate ST agent may decide to break a stream intentionally.
   This is called stream preemption. Usually streams are preempted in
   order to free resources for a new stream which has a higher priority.
   ST does not define when stream preemption should be used but it
   provides the means to implement it.

   If an ST agent decides that it is necessary to preempt one or more of
   the stream traversing it, the decision on which streams have to be
   preempted has to be made. ST provides two ways of optimizing such
   decision:

   1.      Streams can be assigned an StreamImportance value from 0
   (most important) to 7 (least important). This value is carried in the
   CONNECT message when the stream is setup, see Section 16.4.

   2.      An application may specify that a set of streams are related
   to each other and that they are all candidate for preemption if one
   of them gets preempted. It can be done by using the fate- sharing
   relationship defined in Section 10. This helps making a good choice
   when more than one stream have to be preempted, because it leads to
   breaking a single application as oppo- site to as many applications
   as the number of preempted streams.

   Stream preemption requires the following actions from the ST agents:

   o       An intermediate agent that breaks the stream intentionally
   sends DISCONNECT mes- sages with the appropriate ReasonCode
   (StreamPreempted) toward the affected targets. It sends a REFUSE
   message with the appropriate ReasonCode (StreamPreempted) to the
   previ- ous-hop.

   o       A previous-hop agent of the preempted stream acts as in case
   of failure recovery, cf. Sec- tion 9.2. If the NoRecovery option is
   set, is propagates the REFUSE message back to the ori- gin. If the
   NoRecovery option is not set, it attempts to rebuild the deleted
   paths and, in case this does not work, it propagates the REFUSE
   message to the previous-hop.

   o       A target or next-hop agent of the preempted stream acts as in
   case of failure recovery, cf. Section 9.2. It releases resources that
   are allocated to the stream, but it maintains the internal state
   information describing the stream for some time in case the stream is
   quickly fixed.

   Note that, as opposite to failure recovery, there is no need to
   verify that the failure actually occurred, because this is explicitly
   indicated by the ReasonCode (StreamPreempted).


L. Delgrossi (ed.)                                             [Page 36]

INTERNET DRAFT                                              October 1994


   10  A Group of Streams

   There may be need to associate related streams. The group mechanism
   is simply an association technique that allows ST agents to identify
   the different streams that are to be associated.

   A group consists of a set of streams and a relationship. The set of
   streams may be empty. The relationship applies to all group members.
   Each group is identified by a group name. The group name is unique
   across the Internet.

   Streams belong to the same group if they have the same GroupName in
   the GroupName field of the Group parameter. The relationship is
   defined by the Relationship field. Group membership must be specified
   at stream creation time and persists for the whole stream lifetime. A
   single stream may belong to multiple groups.

   The ST agent that creates a new group is called group initiator. Any
   ST agent can be a group initiator. The initiator allocates the
   GroupName and the Relationship among group members. The initiator may
   or may not be the origin of a stream belonging to the group. The
   group name has to be generated as described in Section 10.1.
   Relationships defined by this version of the protocol are listed in
   Section 10.2.

   10.1  Group Name Generator

   The GroupName includes a 16-bit unique identifier, a 32-bit IP
   address, and a 32-bit creation timestamp. It is defined in Section
   10.3. An ST implementation has to provide a group name generator
   facility, so that an application or higher layer protocol can obtain
   a unique GroupName from the ST layer. This is a mechanism for the
   application to request the allocation of a GroupName that is
   independent of the request to create a stream. The GroupName is used
   by the application or higher layer protocol when creating the streams
   that are to be part of the group.

   For instance, the following two functions could be made available:

   o       AllocateGroupName() -> result, GroupName

   o       ReleaseGroupName() -> result

   10.2  Basic ST Relationships

   This version of ST defines four basic relationships. An ST2PLus
   implementation must support all four basic relationships. The basic
   relationships are described in detail below in Section 10.2.1 -


L. Delgrossi (ed.)                                             [Page 37]

INTERNET DRAFT                                              October 1994


   Section 10.2.4.

   ST provides the means to define new relationships as the need for
   them becomes clear in the future. This can be done by assigning one
   of the unused bits of the Relationship field of the Group parameter.

   The next sections describe the four basic relationships.

   10.2.1  Bandwidth Sharing

   Streams belonging to this group share the same network bandwidth.
   This is intended to support applications as audio conferences where,
   of all participants, only some at a time are allowed to speak. In
   such a scenario, global bandwidth utilization can be optimized, e.g.
   it is sufficient to reserve bandwidth for a small set of audio
   streams.

   The N parameter indicates for how many streams at the same time
   bandwidth should be allocated. An ST agent allocates N times the
   bandwidth required by the most demanding stream in the group, say
   Bmax. If the application intends for instance to allow three
   participants to speak at the same time, N has a value of three and
   the ST agent will allocate for the group an amount of bandwidth equal
   to 3*Bmax.

   This mechanism does not always allocate an optimal amount of
   bandwidth (as when a stream requires 4 Mbits/s and all the other
   streams in the same group require 1 Mbits/s only: N=3 causes the
   allocation of 12 Mbits/s). However, it is simple to implement and it
   works well with streams that have homogeneous requirements. An
   alternative would be to keep track of the single streams requirements
   and allocate the exact amount of bandwidth.

   An ST agent always attempts to reserve N*Bmax bandwidth. If less
   bandwidth than N*Bmax is available, the new stream is not built. If
   bandwidth for the group has already been allocated and a new stream
   with a bandwidth demand inferior to Bmax is being established, the ST
   agent, depending on the local implementation, may not need to contact
   the local resource manager and it can proceed directly with the
   stream setup.

   Note that ST agents become aware of a group's requirements only when
   the streams belonging to the group are created. In case of the
   bandwidth sharing relationship, an application should attempt to
   establish the most demanding streams first to minimize stream setup
   efforts. If on the contrary the less demanding streams are built
   first, it will be always necessary to allocate additional bandwidth
   in consecutive steps as the most demanding streams are built.


L. Delgrossi (ed.)                                             [Page 38]

INTERNET DRAFT                                              October 1994


   10.2.2  Fate Sharing

   Streams belonging to this group share the same fate. If a stream is
   deleted, the other members of the group are also deleted. This is
   intended to support stream preemption by indicating which streams are
   mutually related. If preemption of multiple streams is necessary,
   this information can be used to delete a set of related streams, e.g.
   with impact on a single application, instead of making a random
   choice with the possible effect of interrupting several different
   applications.

   This relationship provides a hint on which streams should be
   preempted. Still, the entity responsible for the preemption is not
   forced to behave accordingly, and other streams could be preempted
   first based on different criteria.

   10.2.3  Route Sharing

   Streams belonging to this group share the same paths. This can be
   desirable for several reasons, e.g. to exploit the same allocated
   resources or in the attempt to maintain the transmission order. An ST
   agent attempts to select the same path although the way this is
   implemented depends heavily on the routing algorithm which is used.

   If the routing algorithm is sophisticated enough, an ST agent can
   suggest that a stream is routed over an already established path.
   Otherwise, it can ask the routing algorithm for a set of legal routes
   to the destination and check whether the desired path is included in
   those feasible.

   Route sharing is a hint to the routing algorithm used by ST. Failing
   to route a stream through the shared path does not normally cause the
   deletion of the stream: the stream is built over an alternative path
   whenever possible.

   10.2.4  Subnet Resources Sharing

   Streams belonging to this group share the same MAC layer subnetwork
   addresses. As an example, the same MAC layer multicast address can be
   used for all the streams in a given group. This mechanism allows for
   a better utilization of MAC layer multicast addresses and it is
   especially useful when used with network adapters that offer a very
   small number of MAC layer multicast addresses.

   This relationship provides a hint to the data link layer functions.

   10.3  Relationships Orthogonality


L. Delgrossi (ed.)                                             [Page 39]

INTERNET DRAFT                                              October 1994


   The four basic relationships, as they have been defined, are
   orthogonal. This means, any combinations of the basic relationships
   are allowed. For instance, let's consider an application that
   requires full-duplex service for a stream with multiple targets.
   Also, let's suppose that only N targets are allowed to send data back
   to the origin at the same time. In this scenario, all the reverse
   streams could belong to the same group. They could be sharing both
   the paths and the bandwidth. The Path&Bandwidth sharing relationship
   is obtained from the basic set of relationships. This example is
   important because it shows how full-duplex service can be obtained in
   ST.

   As new relationships are defined, it should be indicated whether they
   are or not orthogonal with respect to the previously defined ones.
   This will be reflected by illegal values for the Relationship field
   of the Group parameter (see Section 15.2.3).

   11  Ancillary Functions

   11.1  Stream IDs Generation

   To Be Written

   11.2  SCMP Reliability

   The ST Control Message Protocol is made reliable through the use of
   retransmission when response is not received in a timely manner. In
   general, when sending a SCMP messages which requires an ACK back, the
   sending ST agent needs to set the Toxxxx timer (where xxxx is the
   SCMP message type, e.g. ToConnect). If it does not receive an ACK
   back before the Toxxxx timer expires, the ST agent should retransmit
   the SCMP message. If no ACK has been received within Nxxxx
   retransmissions, then a SCMP timeout condition occurs and the ST
   agent enters its SCMP timeout recovery state. The actions performed
   by the ST agent as the result of the SCMP timeout condition differ
   for different SCMP message. In some cases (CONNECT,ACCEPT) the ST
   agent handles the timeout by sending additional SCMP message
   (REFUSE/DISCONNECT) to its neighbour ST agents (see Section 8.1.1 &
   Section 8.1.2), while in other cases (REFUSE, DISCONNECT) it simply
   gives up sine there is nothing else it can do.

   For some SCMP messages (CONNECT,CHANGE) the sending ST agent also
   expects a response back (ACCEPT/REFUSE) after ACK has been received.
   For these cases, the ST agent needs to set the Rtoxxxx timer after it
   receives the ACK. If it does not receive the appropriate response
   back when Rtoxxxx expires, the ST agent updates its state data and
   perform appropriate recovery action as described in other sections.


L. Delgrossi (ed.)                                             [Page 40]

INTERNET DRAFT                                              October 1994


   Timeout and retransmission algorithm is implementation dependent and
   it is outside the scope of this document. However, it must be
   reasonable enough not to cause excessive retransmission of SCMP
   message while maintain the robustness of the protocol. Algorithms on
   this subject are described in [RFC1122], [Jaco88], [KaPa87].

   11.3  IP Encapsulation of ST

   ST packets may be encapsulated in IP to allow them to pass through
   routers that don't support the ST Protocol. Of course, ST resource
   management is precluded over such a path, and packet overhead is
   increased by encapsulation, but if the performance is reasonably
   predictable this may be better than not communicating at all.

   IP-encapsulated ST packets begin with a normal IP header. Most fields
   of the IP header should be filled in according to the same rules that
   apply to any other IP packet. Three fields of special interest are:

   o       Protocol is 5 to indicate an ST packet is enclosed, as
   opposed to TCP or UDP, for example. The assignment of protocol 5 to
   ST is an arranged coincidence with the assignment of IP Version 5 to
   ST [RFC1190].

   o       Destination Address is that of the next-hop ST agent. This
   may or may not be the target of the ST stream. There may be an
   intermediate ST agent to which the packet should be routed to take
   advantage of service guarantees on the path past that agent. Such an
   intermediate agent would not be on a directly-connected network (or
   else IP encapsulation wouldn't be needed), so it would probably not
   be listed in the normal routing table. Additional routing mechanisms,
   not defined here, will be required to learn about such agents.

   o       Type-of-Service may be set to an appropriate value for the
   service being requested (usually low delay, high throughput, normal
   reliability). This feature is not implemented uniformly in the
   Internet, so its use can't be precisely defined here.

   IP encapsulation adds little difficulty for the ST agent that
   receives the packet. However, when IP encapsulation is performed it
   must be done in both directions. To process the encapsulated IP
   message, the ST agents simply remove the IP header and proceed with
   ST header as usual.

   The more difficult part is during setup, when the ST agent must
   decide whether or not to encapsulate. If the next-hop ST agent is on
   a remote network and the route to that network is through a router
   that supports IP but not ST, then encapsulation is required. The ST
   agents make encapsulation decision based on information provided by


L. Delgrossi (ed.)                                             [Page 41]

INTERNET DRAFT                                              October 1994


   routing function to indicate whether the routers in the path support
   ST. It is outside the scope of this document to address routing
   function and therefore neither its algorithm nor implementation is
   specified here. ST assumes that appropriate routing algorithm exists
   to which ST has access.

   On forwarding, the (mostly constant) IP Header must be inserted and
   the IP checksum appropriately updated.

   11.4  IP Multicasting

   To Be Written

   11.5  Routing

   To Be Written

   11.6  Security

   To Be Written

   12  FlowSpec

   The FlowSpec is used to convey stream service requirements end-to-
   end. The contents of the FlowSpec are transparent to the ST agents.
   An ST agent extracts the FlowSpec from the correspondent incoming
   SCMP message and passes it to the LRM as required. The LRM updates
   the FlowSpec values based on the amount of resources that it has
   allocated to the stream.

   12.1  FlowSpec Versions

   ST is not dependent on a particular FlowSpec format and it is
   expected that other versions of the FlowSpec than those introduced
   below in this section will be needed in the future. Different
   FlowSpec formats are distinguished by the value of the Version field.
   The following values are reserved:

   0 - Null FlowSpec /* mandatory */

   1 - ST Version 1

   2 - ST Vesrion 1.5

   3 - RFC 1190 FlowSpec

   4 - HeiTS FlowSpec


L. Delgrossi (ed.)                                             [Page 42]

INTERNET DRAFT                                              October 1994


   5 - BerKom FlowSpec

   6 - RFC 1363 FlowSpec

   7 - ST2Plus FlowSpec /* mandatory */

   A single stream is always associated to a single FlowSpec format.
   Changes to the FlowSpec are also relative to the same FlowSpec
   format, i.e. the value of the Version field cannot be changed during
   the lifetime of the stream.

   12.2  The Null FlowSpec (#0)

   The FlowSpec identified by a value of 0 for its Version field is
   called the "Null FlowSpec". An ST agent that receives the Null
   FlowSpec always assumes that sufficient resources for the stream are
   available. The Null FlowSpec fields values are never updated. Stream
   setup takes place in the usual way, but no resources are actually
   reserved.

   The main purpose of the Null FlowSpec is that of facilitating
   interoperability tests by allowing streams to be built without
   actually allocating the correspondent amount of resources. The Null
   FlowSpec may also be used for testing and debugging purposes.

   The complete format is specified in Section 15.2.2.

   12.3  The ST Current FlowSpec (#7)

   FlowSpec #7 is the FlowSpec to be used by the current version of ST.
   It contains values for 3 basic QoS parameters: message size,
   throughput, and delay. Also, it is possible to specify a QoS class,
   e.g. guaranteed. Each parameter has to be expressed via a set of
   values:

   o       the "desired" values are assigned by the application and
   never changed by the LRM

   o       the "limit" values are assigned by the application and never
   changed by the LRM

   o       the "actual" values indicate the guarantees that the system
   is able to provide. They are updated by the LRM at each node. The
   "actual" values are always bounded by the "limit" values.

   12.3.1  Qos Classes

   We also define two QoS classes:


L. Delgrossi (ed.)                                             [Page 43]

INTERNET DRAFT                                              October 1994


   1.      QOS_GUARANTEED

   2.      QOS_PREDICTIVE

   o       The QOS_GUARANTEED service class implies that the negotiated
   QoS for the stream is never violated during the data transfer. For
   instance, the desired rate is the peak rate for the transmission.
   This may sometimes lead to overbooking of resources, but it provides
   strict real-time guarantees.

   o       The QOS_PREDICTIVE service class implies that the negotiated
   QoS may be violated for short time intervals. Reservations are done
   for the average case as opposite to the peak case required by the
   QOS_GUARANTEED service class.

   If a LRM that doesn't support class QOS_PREDICTIVE (QOS_GUARANTEED)
   receives a FlowSpec containing a QOS_PREDICTIVE (QOS_GUARANTEED)
   class, it informs the local ST agent. The ST agent may try different
   paths or delete the correspondent portion of the stream with
   ReasonCode (QoSClassUnknown).

   12.3.2  Maximum Message Size

   This parameter is expressed in bytes. It represents the maximum size
   allowed for messages sent as part of the stream. The LRM first checks
   whether it is possible to get the value desired by the application
   (DesMaxSize). If not, it updates the actual value (ActMaxSize) with
   the available size unless this value is inferior to the minimum
   allowed by the application (LimitMaxSize), in which case it informs
   the local ST agent that it is not possible to build the stream along
   this path.

   12.3.3  Rate or Throughput

   This parameter is expressed in bytes/seconds. It represents the
   transmission rate for the stream. The LRM first checks whether it is
   possible to get the value desired by the application (DesRate). If
   not, it updates the actual value (ActRate) with the available rate
   unless this value is inferior to the minimum allowed by the
   application (LimitRate), in which case it informs the local ST agent
   that it is not possible to build the stream along this path.

   12.3.4  Maximum Delay and Delay Jitter

   This parameter is expressed in milliseconds. It represents the
   maximum end-to-end for the stream. The LRM first checks whether it is
   possible to get the value desired by the application (DesMaxDelay).
   If not, it updates the actual value (ActMaxDelay) with the available


L. Delgrossi (ed.)                                             [Page 44]

INTERNET DRAFT                                              October 1994


   rate unless this value is greater than the maximum delay allowed by
   the application (LimitMaxDelay), in which case it informs the local
   ST agent that it is not possible to build the stream along this path.

   The LRM also updates the MinDelay field by adding the minimum
   possible delay to the next- hop. Information on the minimum possible
   delay allows to calculate another important QoS parameter, the delay
   jitter.

   The complete format is specified in Section 15.2.2.

   13  ST State Machines

   To Be Written

   ST Protocol Data Units

   The ST PDUs sent between ST agents consist of an ST Header
   encapsulating either a higher layer PDU or an ST Control Message.
   Since ST operates as an extension of IP, the packet arrives at the
   same network service access point that IP uses to receive IP
   datagrams, e.g., ST would use the same ethertype (0x800) as does IP.
   The two types of packets are distinguished by the IP Version Number
   field (the first four bits of the packet); IP currently uses a value
   of 4, while ST has been assigned the value 5 (see [RFC791]). There is
   no requirement for compatibility between IP and ST packet headers
   beyond the first four bits.

   The ST Header also includes an ST Version Number, a total length
   field, a header checksum, a unique id, and the origin IP address as
   shown in Figure 8. See Section 18 for an explanation of the notation.

   Figure 8: ST Header

   o       ST is the IP Version Number assigned to identify ST packets.
   The value for ST is 5.

   o       Ver is the ST Version Number. This document defines ST
   Version 3.

   o       ??? (TBD)

   o       D (bit 8) is set to 1 in all ST data packets and to 0 in all
   SCMP control messages.

   o       TotalBytes is the length, in bytes, of the entire ST packet,
   it includes the ST Header but does not include any local network
   headers or trailers. In general, all length fields in the ST Proto-


L. Delgrossi (ed.)                                             [Page 45]

INTERNET DRAFT                                              October 1994


   col are in units of bytes.

   o       HeaderChecksum covers only the ST Header (12 bytes). The ST
   Protocol uses 16-bit checksums here in the ST Header and in each
   Control Message. The standard Internet check- sum algorithm is used:
   "The checksum field is the 16-bit one's complement of the one's com-
   plement sum of all 16-bit words in the header. For purposes of
   computing the checksum, the value of the checksum field is zero (0)."
   See [RFC1071], [RFC1141], and [RFC791] for sug- gestions for
   efficient checksum algorithms.

   o       UniqueID is the first element of the stream id (SID). It is
   locally unique at the origin.

   o       OriginIPAddress is the second element of the SID. It is the
   IP address of the origin.

   14  ST Data Packets

   ST packets whose D-bit is non-zero are data packets. Their
   interpretation is a matter for the higher layer protocols and
   consequently is not specified here. The data packets are not
   protected by an ST checksum and will be delivered to the higher layer
   protocol even with errors. ST agents will not pass data packets over
   a new hop whose setup is not complete.

   14.1  Stream ID

   The UniqueID and OriginIPAddress fields form the Stream ID (SID),
   which is used by the ST agents to identify which stream the data
   packet belongs to. The same SID is used in data packets and control
   messages.

   In certain situations, e.g. usually due to a crash and subsequent
   reboot, it is possible that an ST agent receives a data packet
   belonging to a stream of which the agent has lost state information.
   In this case, the agent is not able to forward the packet and has to
   discard it. SIDs include the IP address of the origin. This allows to
   request the origin that the unidentified data flow is stopped.

   15  SCMP Protocol Data Units

   ST Control Messages are between a previous-hop agent and its next-hop
   agents using a D-bit of zero (0). The control protocol follows a
   request-response model with all requests expecting responses.
   Retransmission after timeout (see Section 11.2) is used to allow for
   lost or ignored messages. Control messages do not extend across
   packet boundaries; if a control message is too large for the MTU of a


L. Delgrossi (ed.)                                             [Page 46]

INTERNET DRAFT                                              October 1994


   hop, its information is partitioned and a control message per
   partition is sent (see Section 3.2.2). All control messages have the
   following format:

   Figure 9: ST Control Message Format

   o       OpCode identifies the type of control message. Each is
   described in detail in following sec- tions.

   o       Options is used to convey OpCode-specific variations for a
   control message.

   o       TotalBytes is the length of the control message, in bytes,
   including all OpCode specific fields and optional parameters. The
   value is always divisible by four (4).

   o       Reference is a transaction number. Each sender of a request
   control message assigns a Refer- ence number to the message that is
   unique with respect to the stream. The Reference number is used by
   the receiver to detect and discard duplicates. Each acknowledgment
   carries the Reference number of the request being acknowledged.
   Reference zero (0) is never used, and Reference numbers are assumed
   to be monotonically increasing with wraparound so that the older-than
   and more-recent-than relations are well defined.

   o       LnkReference contains the Reference field of the request
   control message that caused this request control message to be
   created. It is used in situations where a single request leads to
   multiple responses from the same ST agent. Examples are CONNECT and
   CHANGE mes- sages that are first acknowledged hop-by-hop and then
   lead to an ACCEPT or REFUSE response from each target.

   o       SenderIPAddress is the 32-bit IP address of the network
   interface that the ST agent used to send the control message. This
   value changes each time the packet is forwarded by an ST agent (hop-
   by-hop).

   o       Checksum is the checksum of the control message. Because the
   control messages are sent in packets that may be delivered with bits
   in error, each control message must be checked before it is acted
   upon.

   o       ReasonCode is set to zero (0 = NoError) in most SCMP
   messages. Otherwise, it can be set to an appropriate value to
   indicate an error situation as defined in Section 17.3.

   o       OpCode Specific Data contains any additional information that
   is associated with the con- trol message. It depends on the specific


L. Delgrossi (ed.)                                             [Page 47]

INTERNET DRAFT                                              October 1994


   control message and is explained further below. In some response
   control messages, fields of zero (0) are included to allow the format
   to match that of the corresponding request message. The OpCode
   Specific Data may also contain any of the optional Parameters defined
   in Section 15.2.

   15.1  ST Control Messages

   The CONNECT message is used to establish a stream. It is an end-to-
   end message created by the origin. It propagates all the way to the
   targets, and require an ACK in response. It causes the targets to
   issue ACCEPT or REFUSE messages. The CONNECT message is also used to
   add one or more targets to an existing stream and during recovery of
   a broken stream.

   The CHANGE message is used to change the characteristics of an
   established stream. It is processed similarly to the CONNECT message,
   but it propagates along an already established stream.

   The ACCEPT message is an end-to-end message generated by a target and
   is used to signify the successful completion of the setup of a stream
   or part of a stream, or the change of the FlowSpec. There are no
   other messages that are similar to it.

   The REFUSE message is sent by a target to refuse the setup of a
   stream or the change of the FlowSpec. In these cases, it is an end-
   to-end message. An intermediate ST agent issues a REFUSE if it cannot
   find a route to a target, can only find a route to a target through
   the previous-hop, preempts a stream, or detects a failure in a next-
   hop ST agent or network. In all cases a REFUSE propagates in the
   direction toward the origin.

   The JOIN-REQUEST message is used to request to join an already
   established stream. It propagates in the upstream direction until
   either the origin is hit or a router that is traversed by the stream.
   As a consequence of this message, a CONNECT will be delivered to the
   target that requested to join. An authorization scheme prevents
   undesired destinations to join the stream.

   The DISCONNECT message is used to tear down streams or parts of
   streams. It propagates in the direction from the origin toward the
   targets. It is either used as an end-to-end message generated by the
   origin that is used to completely tear down a stream, or is generated
   by an intermediate ST agent that preempts a stream or detects the
   failure of its previous-hop agent or network in the stream. In the
   latter case, it is used to tear down the part of the stream from the
   failure to the targets, thus the message propagates all the way to
   the targets.


L. Delgrossi (ed.)                                             [Page 48]

INTERNET DRAFT                                              October 1994


   Usually, SCMP messages are acknowledged by the receiver. This is done
   via the ACK message. If an SCMP message contains errors and it cannot
   be identified or interpreted, an ERROR message is issued. Other SCMP
   messages include HELLO, NOTIFY, STATUS, and STATUS- RESPONSE.

   The following sections contain descriptions of common fields and
   parameters, followed by descriptions of the individual control
   messages, both listed in alphabetical order. A brief description of
   the use of the control message is given. The packet format is shown
   graphically.

   15.2  Common SCMP Elements

   Several fields and parameters (referred to generically as elements)
   are common to two or more PDUs. They are described in detail here
   instead of repeating their description several times. In many cases,
   the presence of a parameter is optional. To permit the parameters to
   be easily defined and parsed, each is identified with a PCode byte
   that is followed by a PBytes byte indicating the length of the
   parameter in bytes (including the PCode, PByte, and any padding
   bytes). If the length of the information is not a multiple of 4
   bytes, the parameter is padded with one to three zero (0) bytes.
   PBytes is thus always a multiple of four (4). Parameters can be
   present in any order.

   15.2.1  ErroredPDU

   The ErroredPDU parameter (PCode = 1) is used for diagnostic purposes
   to encapsulate a received ST PDU that contained an error. It may be
   optionally included in the ERROR message. Its use is primarily
   diagnostic.

   Figure 10: ErroredPDU

   o       PDUBytes indicates how many bytes of the PDUInError are
   actually present.

   o       PDUInError is the PDU in error, beginning with the ST Header.

   15.2.2  FlowSpec

   The FlowSpec is used to convey stream service requirements end-to-
   end. We expect that other versions of FlowSpec will be needed in the
   future, which may or may not be subsets or supersets of the version
   described here. PBytes will allow new constraints to be added to the
   end without having to simultaneously update all implementations in
   the field.


L. Delgrossi (ed.)                                             [Page 49]

INTERNET DRAFT                                              October 1994


   The FlowSpec parameter (PCode = 2) is used in several messages to
   convey the FlowSpec. The format of the FlowSpec field depends on the
   FlowSpec version. For details on the current Flowspec version, see
   Section 12.

   Figure 11: FlowSpec

   15.2.3  Group

   The Group parameter (PCode = 3) is an optional argument used to
   indicate that the stream is a member of the specified group.

   Figure 12: Group Parameter

   o       GroupUniqueID, GroupInitiatorIPAddress, and GroupCreationTime
   are allocated by the group name generator functions (see Section
   10.1). These three fields together form the GroupName field.

   o       Relationship has the following format:

   Figure 13: Relationship Field

   The B, F, P, S bits correspond to Bandwidth, Fate, Path, and Subnet
   resources sharing. A value of 1 indicates that the relationship
   exists. All combinations of these four bits are allowed because the
   four basic relationships are orthogonal. Bits 0-11 of the
   Relationship field are reserved for future use and must be set to 0.

   N contains a legal value only if the B-bit is set. It is the value of
   the N parameter to be used as explained in Section 10.2.1.

   15.2.4  MulticastAddress

   The MulticastAddress parameter (PCode = 4) is an optional parameter
   that is used when setting up a network level multicast group, to
   communicate an IP and/or local network multicast address to the
   next-hop agents that should become members of the group.

   Figure 14:  MulticastAddress

   o       LocalNetBytes is the length of the Local Net Multicast
   Address.

   o       IPMulticastAddress is described in [RFC1112]. This field is
   zero (0) if no IP multicast address is known or is applicable. The
   block of addresses 224.1.0.0 - 224.1.255.255 has been allocated for
   use by ST, see Section 17.4.


L. Delgrossi (ed.)                                             [Page 50]

INTERNET DRAFT                                              October 1994


   o       Local Net Multicast Address is the multicast address to be
   used on the local network. It cor- responds to the IPMulticastAddress
   when the latter is non-zero.

   15.2.5  NextHopIPAddress

   The NextHopIPAddress parameter (PCode = 5) is an optional parameter
   of NOTIFY or REFUSE and contains the IP address of a suggested next-
   hop ST agent.

   Figure 15:  NextHopIPAddress

   15.2.6   Origin

   The Origin parameter (PCode = 6) is used to identify the next higher
   protocol, and the SAP being used in conjunction with that protocol.

   Figure 16: Origin

   o       NextPcol is an 8-bit field used in demultiplexing operations
   to identify the protocol to be used above ST. The values of NextPcol
   are in the same number space as the IP Header's Pro- tocol field and
   are consequently defined in the Assigned Numbers RFC [RFC791].

   o       OriginSAPBytes specifies the length of the OriginSAP,
   exclusive of any padding required to maintain 32-bit alignment.

   o       OriginSAP identifies the origin's SAP associated with the
   NextPcol protocol.

   Note that the IP address of the origin is not included in this
   parameter because it is always available as part of the ST header.

   15.2.7  RecordRoute

   The RecordRoute parameter (PCode = 7) may be used to request that the
   route between the origin and a target be recorded and returned to the
   origin. It is included in the CONNECT and ACCEPT messages.

   Figure 17: RecordRoute

   o       FreeOffset is the offset to the position where the next
   next-hop IP address should be inserted. It is initialized to four (4)
   and incremented by four each time an agent inserts its IP address.

   15.2.8  Target and TargetList

   Several control messages use a parameter called TargetList (PCode =


L. Delgrossi (ed.)                                             [Page 51]

INTERNET DRAFT                                              October 1994


   13), which contains information about the targets to which the
   message pertains. For each Target in the TargetList, the information
   includes the IP address of the target, the SAP applicable to the next
   higher layer protocol, and the length of the SAP (SAPBytes).
   Consequently, a Target structure can be of variable length. Each
   entry has the format shown in Figure 18.

   Figure 18: Target

   o       TargetIPAddress is the IP Address of the Target.

   o       TargetBytes is the length of the Target structure, beginning
   with the TargetIPAddress and including any SrcRoute parameters.

   o       SAPBytes is the length of the SAP, excluding any padding
   required to maintain 32-bit align- ment.

   o       SAP may be longer than 2 bytes and it includes a padding when
   required. There would be no padding required for SAPs with lengths of
   2, 6, 10, etc., bytes.


   Figure 19: TargetList

   15.2.9  UserData

   The UserData parameter (PCode = 14) is an optional parameter that may
   be used by the next higher protocol or an application to convey
   arbitrary information to its peers. Note that since the size of
   control messages is limited by the smallest MTU in the path to the
   targets, the maximum size of this parameter cannot be specified a
   priori. If the parameter is too large for some network's MTU, a
   UserDataSize error will occur. The parameter must be padded to a
   multiple of 32 bits.

   Figure 20:  UserData

   o       UserBytes specifies the number of valid UserInformation
   bytes.

   o       UserInformation is arbitrary data meaningful to the next
   higher protocol layer or applica- tion.

   16  ST Control Message PDUs

   Each control message is described in a following section. See Section
   18 for an explanation of the notation.


L. Delgrossi (ed.)                                             [Page 52]

INTERNET DRAFT                                              October 1994


   16.1  ACCEPT

   ACCEPT (OpCode = 1) is issued by a target as a positive response to a
   CONNECT message. It implies that the target is prepared to accept
   data from the origin along the stream that was established by the
   CONNECT. ACCEPT is also issued as a positive response to a CHANGE
   message. It implies that the target accepts the proposed stream
   modification.

   The ACCEPT includes the FlowSpec that contains the cumulative
   information that was calculated by the intervening ST agents as the
   CONNECT (or CHANGE) made its way from the origin to the target, as
   well as any modifications made by the application at the target. The
   FlowSpec is not modified on this trip from the target back to the
   origin.

   The ACCEPT is relayed by the ST agents from the target to the origin
   along the path established by the CONNECT (or CHANGE) but in the
   reverse direction. The ACCEPT must be acknowledged with an ACK at
   each hop.

   Since the cumulative FlowSpec information can be different for
   different targets, no attempt is made to combine the ACCEPTs from the
   various targets. The TargetList included in each ACCEPT contains the
   IP address of a single target, i.e. the one that issued the ACCEPT.

   Figure 21: ACCEPT Control Message

   Reference contains a number assigned by the agent sending the ACCEPT
   for use in the acknowledging ACK.

   LnkReference is the Reference number from the corresponding CONNECT
   (or CHANGE).

   16.2  ACK

   ACK (OpCode = 2) is used to acknowledge a request. The ACK message is
   not propagated beyond the previous-hop or next-hop agent.

   Reference is the Reference number of the control message being
   acknowledged.

   ReasonCode is usually NoError, but other possibilities exist, e.g.,
   DuplicateIgn.

   Figure 22: ACK Control Message

   16.3  CHANGE


L. Delgrossi (ed.)                                             [Page 53]

INTERNET DRAFT                                              October 1994


   CHANGE (OpCode = 3) is used to change the FlowSpec of an established
   stream. The CHANGE message is processed similarly to the CONNECT
   message, except that it travels along the path of an established
   stream. The CHANGE must be propagated until it reaches all the
   stream's targets. It must be ACKed at every hop.

   G (bit 8) is used to request a global, stream-wide change; the
   TargetList parameter may be omitted when the G bit is specified.

   Figure 23: CHANGE Control Message

   16.4  CONNECT

   CONNECT (OpCode = 5) requests the setup of a new stream or an
   addition to or recovery of an existing stream. Only the origin can
   issue the initial set of CONNECTs to setup a stream, and the first
   CONNECT to each next-hop is used to convey the SID.

   The CONNECT message must fit within the maximum allowable packet size
   (MTU) for the intervening network. If a CONNECT message is too large,
   it must be fragmented into multiple CONNECT messages by partitioning
   the TargetList (see Section 3.2.2). Any UserData parameter will be
   replicated in each fragment for delivery to all targets.

   The next-hop initially responds with an ACK, which implies that the
   CONNECT was valid and is being processed. The next-hop will later
   relay back either an ACCEPT or REFUSE from each target.

   An intermediate ST agent that receives a CONNECT selects the next-hop
   ST agents, partitions the TargetList accordingly, reserves network
   resources in the direction toward the next-hop, updates the FlowSpec
   accordingly, and sends the resulting CONNECTs.

   If the intermediate ST agent that is processing a CONNECT fails to
   find a route to a target, it responds with a REFUSE with the
   appropriate reason code, e.g., NoRouteToDest. If the next- hop to a
   target is by way of the network from which it received the CONNECT,
   then it sends a NOTIFY with the appropriate reason code, e.g.,
   RouteBack. In either case, the TargetList specifies the affected
   targets. The intermediate ST agent will only route to and propagate a
   CONNECT to the targets for which it does not issue a REFUSE.

   If a received CONNECT contains a new SID, a new stream should be
   created. If the SID is known, there are four cases

   TargetList is the list of IP addresses of the target processes. It is
   of arbitrary size up to the maximum allowed for packets travelling
   across the specific network.


L. Delgrossi (ed.)                                             [Page 54]

INTERNET DRAFT                                              October 1994


   Figure 24: CONNECT Control Message

   16.5  DISCONNECT

   DISCONNECT (OpCode = 6) is used by an origin to tear down an
   established stream or part of a stream, or by an intermediate agent
   that detects a failure between itself and its previous-hop, as
   distinguished by the ReasonCode. The DISCONNECT message specifies the
   list of targets that are to be disconnected. An ACK is required in
   response to a DISCONNECT message. The DISCONNECT message is
   propagated all the way to the specified targets. The targets are
   expected to terminate their participation in the stream.

   Note that in the case of a failure it may be advantageous to retain
   state information as the stream should be repaired shortly, see
   Section 9.2.

   G (bit 8) is used to request a DISCONNECT of all the stream's
   targets; the TargetList parameter may be omitted when the G bit is
   set (1).

   Figure 25: DISCONNECT Control Message

   16.6  ERROR

   ERROR (OpCode = 7) is sent in acknowledgment to a request in which an
   error is detected. No action is taken on the erroneous request. No
   ACK is expected. The ERROR message is not propagated beyond the
   previous-hop or next-hop agent.

   An ERROR is never sent in response to another ERROR. The receiver of
   an ERROR is encouraged to try again without waiting for a
   retransmission timeout.

   Reference is the Reference number of the erroneous request.

   Figure 26: ERROR Control Message

   16.7  HELLO

   HELLO (OpCode = 8) is used as part of the ST failure detection
   mechanism, see Section 9.1.

   R (bit 8) is used for the Restarted-bit.

   Reference is non-zero to inform the receiver that an ACK should be
   promptly sent so that the sender can update its round-trip time
   estimates. If the Reference is zero, no ACK should be sent.


L. Delgrossi (ed.)                                             [Page 55]

INTERNET DRAFT                                              October 1994


   TBD: HelloTimer

   Figure 27: HELLO Control Message

   16.8  JOIN-REQUEST

   TBD

   Figure 28: JOIN-REQUEST Control Message

   16.9  NOTIFY

   NOTIFY (OpCode = 10) is issued by an agent to inform other agents,
   the origin, or targets of events that may be significant. The action
   taken by the receiver of a NOTIFY depends on the ReasonCode. Possible
   events are suspected routing problems or resource allocation changes
   that occur after a stream has been established. These changes occur
   when network components fail and when competing streams preempt
   resources previously reserved by a lower precedence stream. We also
   anticipate that NOTIFY can be used in the future when additional
   resources become available, as is the case when network components
   recover or when higher precedence streams are deleted.

   NOTIFY may contain a FlowSpec that reflects that revised guarantee
   that can be promised to the stream. NOTIFY may also identify those
   targets that are affected by the change. In this way, NOTIFY is
   similar to ACCEPT.

   When NOTIFY is received at the origin, the application should be
   notified of the target and the change in resources allocated along
   the path to it, as specified in the FlowSpec contained in the NOTIFY
   message. The application may then use the information to either
   adjust or terminate the portion of the stream to each affected
   target.

   The NOTIFY may be propagated beyond the previous-hop or next-hop
   agent; it must be acknowledged with an ACK.

   Reference contains a number assigned by the agent sending the NOTIFY
   for use in the acknowledging ACK.

   ReasonCode identifies the reason for the notification.

   LnkReference, when non-zero, is the Reference number from a command
   that is the subject of the notification.

   NextHopIPAddress is an optional parameter and contains the IP address
   of a suggested next- hop ST agent.


L. Delgrossi (ed.)                                             [Page 56]

INTERNET DRAFT                                              October 1994


   TargetList is present when the notification is related to one or more
   targets.

   Figure 29: NOTIFY Control Message

   16.10  REFUSE

   REFUSE (OpCode = 11) is issued by a target that either does not wish
   to accept a CONNECT message or wishes to remove itself from an
   established stream. It might also be issued by an intermediate agent
   in response to a CONNECT or CHANGE either to terminate a routing
   loop, or when a satisfactory next-hop to a target cannot be found. It
   may also be a separate command when an existing stream has been
   preempted by a higher precedence stream or an agent detects the
   failure of a previous-hop, next-hop, or the network between them. In
   all cases, the TargetList specifies the targets that are affected by
   the condition. Each REFUSE must be acknowledged by an ACK.

   The REFUSE is relayed back by the agents to the origin (or
   intermediate agent that created the CONNECT or CHANGE) along the path
   traced by the CONNECT. The agent receiving the REFUSE will process it
   differently depending on the condition that caused it, as specified
   in the ReasonCode field. In some cases, such as if a next-hop cannot
   obtain resources, the agent can release any resources reserved
   exclusively for transmissions in the stream in question to the target
   specified in the TargetList, and the previous-hop can attempt to find
   an alternate route. In some cases, such as a routing failure, the
   previous-hop cannot determine where the failure occurred, and it
   propagates the REFUSE back to the origin, which can attempt recovery
   of the stream by issuing a new CONNECT.

   No special effort is made to combine multiple REFUSE messages since
   it is considered most unlikely that separate REFUSEs will happen to
   both pass through an agent at the same time and be easily combined,
   e.g., have identical ReasonCodes and parameters.

   Reference contains a number assigned by the agent sending the REFUSE
   for use in the acknowledging ACK.

   LnkReference is either the Reference number from the corresponding
   CONNECT or CHANGE, if it is the result of such a message, or zero
   when the REFUSE was originated as a separate command.

   Figure 30: REFUSE Control Message

   16.11  STATUS

   STATUS (OpCode = 12) is used to inquire about the existence of a


L. Delgrossi (ed.)                                             [Page 57]

INTERNET DRAFT                                              October 1994


   particular stream identified by the SID.

   Use of STATUS is intended for diagnostic purposes and to assist in
   stream cleanup operations. It is possible in cases of multiple
   failures or network partitioning for an ST agent to have information
   about a stream after the stream has either ceased to exist or has
   been rerouted around the agent. When an agent concludes that a stream
   has not been used for a period of time and might no longer be valid,
   it can probe the stream's previous-hop or next-hops to see if they
   believe that the stream still exists through the interrogating agent.

   When a stream has been identified, a STATUS-RESPONSE is returned that
   will contain no optional parameters if the specified stream is
   unknown, or will otherwise contain the current FlowSpec, TargetList,
   and possibly Groups of the stream.

   Q (bit 9) is set to one (1) for remote diagnostic purposes when the
   receiving agent should return a stream's parameters, whether or not
   the source of the message is believed to be a previous-hop or next-
   hop in the specified stream. Note that this use has potential for
   disclosure of sensitive information.

   Figure 31: STATUS Control Message

   16.12  STATUS-RESPONSE

   STATUS-RESPONSE (OpCode = 13) is the reply to a STATUS message. If
   the stream specified in the STATUS message is not known, the STATUS-
   RESPONSE will contain the specified SID but no other parameters. It
   will otherwise contain the current SID, FlowSpec, TargetList, and
   possibly Groups of the stream.

   Figure 32: STATUS-RESPONSE Control Message

   17  Suggested Protocol Constants

   The ST Protocol uses several fields that must have specific values
   for the protocol to work, and also several values that an
   implementation must select. This section specifies the required
   values and suggests initial values for others. It is recommended that
   the latter be implemented as variables so that they may be easily
   changed when experience indicates better values. Eventually, they
   should be managed via the normal network management facilities.

   ST uses IP Version Number 5.

   When encapsulated in IP, ST uses IP Protocol Number 5.


L. Delgrossi (ed.)                                             [Page 58]

INTERNET DRAFT                                              October 1994


   17.1  SCMP Messages

   1.      ACCEPT

   2.      ACK

   3.      CHANGE

   4.      CONNECT

   5.      DISCONNECT

   6.      ERROR

   7.      HELLO

   8.      JOIN

   9.      NOTIFY

   10.     REFUSE

   11.     STATUS

   12.     STATUS-RESPONSE

   17.2  SCMP Parameters

   1.      ErroredPDU

   2.      FlowSpec

   3.      Group

   4.      MulticastAddress

   5.      NextHopIPAddress

   6.      Origin

   7.      RecordRoute

   8.      TargetList

   9.      UserData

   17.3  ReasonCode


L. Delgrossi (ed.)                                             [Page 59]

INTERNET DRAFT                                              October 1994


   Several errors may occur during protocol processing. All ST error
   codes are taken from a single number space. The currently defined
   values and their meaning is presented in the list below. Note that
   new error codes may be defined from time to time. All implementations
   are expected to handle new codes in a graceful manner. If an unknown
   ReasonCode is encountered, it should be assumed to be fatal. The
   ReasonCode is an 8-bit field. Following values are defined:

   To Be Written

   17.4  IP Multicast Addresses

   The following permanent IP multicast addresses have been assigned to
   ST:

   224.0.0.7 All ST routers

   224.0.0.8 All ST hosts

   In addition, a block of transient IP multicast addresses, 224.1.0.0 -
   224.1.255.255, has been allocated for ST multicast groups. Note that
   in the case of Ethernet, an ST Multicast address of 224.1.cc.dd maps
   to an Ethernet Multicast address of 01:00:5E:01:cc:dd, see [RFC1112].

   18  Notation

   To Be Written

   19  Further Study

   To Be Written

   20  References

   [RFC1071]

   Braden, Borman, Partridge: Computing the Internet Checksum, RFC 1071,
   USC/Information Sciences Institute, Cray Research, BBN Laboratories,
   Sep- tember 1988.

   [RFC1112]

   Deering, S.: Host Extensions for IP multicasting, RFC 1112, Stanford
   Univer- sity, August 1989.

   [RFC1122]

   Braden, R.: Requirements for Internet Hosts -- Communication Layers,


L. Delgrossi (ed.)                                             [Page 60]

INTERNET DRAFT                                              October 1994


   RFC 1122, USC/Information Sciences Institute, October 1989.

   [Jaco88]

   Jacobson, V.: Congestion Avoidance and Control, ACM SIGCOMM-88,
   August 1988.

   [KaPa87]

   Karn, P. and C. Partridge: Round Trip Time Estimation, ACM SIGCOMM-
   87, August 1987.

   [RFC1141]

   Mallory, T. and A. Kullberg: Incremental Updating of the Internet
   Checksum, RFC 1141, BBN, January 1990.

   [RFC 1363]

   C. Partridge: A Proposal Flow Specification, RFC 1363.

   [RFC791]

   Postel: Internet Protocol, RFC 791, DARPA, September 1981.

   [RFC1060]

   Reynolds, Postel: Assigned Numbers, RFC 1060, USC/ISI, March 1990.

   [RFC1190]

   Topolcic C.: Internet Stream Protocol Version 2 (ST2), October 1990.


L. Delgrossi (ed.)                                             [Page 61]