FIND Working Group J. Allen Internet Draft WebTV Paul J. Leach Expires October 99 Microsoft Roland Hedberg Catalogix CIP Transport Protocols Status of this Memo This document is an Internet-Draft and is in full conformance with all provisions of section 10 of RFC 2026. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its wroking groups. Note that other groups may also distribute working documents as Internet-Drafts. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress". The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt. Distribution of this document is unlimited. Please send comments to the FIND working group at . Discussions of the working group are archived at . Abstract This document specifies three protocols for transporting CIP requests, responses and index objects, utilizing TCP, mail, and HTTP. The objects themselves are defined in [CIP-MIME] and the overall CIP architecture is defined in [CIP-ARCH]. 1. Protocol In this section, the actual protocol for transmitting CIP index objects and maintaining the mesh is presented. While companion documents ([CIP-ARCH] and [CIP-MIME]) describe the concepts involved and the formats of the CIP MIME objects, this document is the authoritative definition of the message formats and transfer mechanisms of CIP used over TCP, HTTP and mail. 1.1 Philosophy The philosophy of the CIP protocol design is one of building-block design. Instead of relying on bulky protocol definition tools, or ad-hoc text encodings, CIP draws on existing, well understood Internet technologies like MIME, RFC-822, Whois++, FTP, and SMTP. Hopefully this will serve to ease implementation and consensus building. It should also stand as an example of a simple way to leverage existing Internet technologies to easily implement new application-level services. 1.2 Conventions The key words "MUST" and "MAY" in this document are to be interpreted as described in "Key words for use in RFCs to Indicate Requirement Levels" [KEYWORDS]. Formal syntax is defined using ABNF [ABNF]. In examples octets sent by the sender-CIP are preceded by ">>> " and those sent by the receiver-CIP by "<<< ". 2 MIME message exchange mechanisms CIP relies on interchange of standard MIME messages for all requests and replies. These messages are passed over a bidirectional, reliable transport system. This document defines transport over reliable network streams (via TCP), via HTTP, and via the Internet mail infrastructure. The CIP server which initiates the connection (conventionally referred to as a client) will be referred to below as the sender-CIP. The CIP server which accepts a sender-CIP's incoming connection and responds to the sender-CIP's requests is called a receiver-CIP. 2.1 The Stream Transport CIP messages are transmitted over bi-directional TCP connections via a simple text protocol. The transaction can take place over any TCP port, as specified by the mesh configuration. There is no "well known port" for CIP transactions. All configuration information in the system must include both a hostname and a port. All sender-CIP actions (including requests, connection initiation, and connection finalization) are acknowledged by the receiver-CIP with a response code. See section 2.1.1 for the format of these codes, a list of the responses a CIP server may generate, and the expected sender-CIP action for each. In order to maintain backwards compatibility with existing Whois++ servers, CIPv3 sender-CIPs MUST first verify that the newer protocol is supported. They do this by sending the following illegal Whois++ system command: "# CIP-Version: 3". On existing Whois++ servers implementing version 1 and 2 of CIP, this results in a 500-series response code, and the server terminates the connection. If the server implements CIPv3, it MUST instead respond with response code 300. Future versions of CIP can be correctly negotiated using this technique with a different string (i.e. "CIP-Version: 4"). An example of this short interchange is given below. Note: If a sender-CIP can safely assume that the server implements CIPv3, it may choose to send the "# CIP-Version: 3" string and immediately follow it with the CIPv3 request. This optimization, useful only in known homogeneous CIPv3 meshes, avoids waiting for the roundtrip inherent in the negotiation. Once a sender-CIP has successfully verified that the server supports CIPv3 requests, it can send the request, formatted as a MIME message with Mime-Version and Content-Type headers (only), using the network standard line ending: "". Cip-Req = Req-Hdrs CRLF Req-Body Req-Hdrs = *( Version-Hdr | Req-Cntnt-Hdr ) Req-Body = Body ; format of request body as in [CIP-MIME] Body = Data CRLF "." CRLF Data = ; data with CRLF "." CRLF replaced by ; CRLF ".." CRLF Version-Hdr = "Mime-Version:" "1.0" CRLF Req-Cntnt-Hdr = "Content-Type:" Req-Content CRLF Req-Content = ; format is specified in [CIP-MIME] Cip-Rsp = Rsp-Code CRLF [ Rsp-Hdrs CRLF Rsp-Body ] [ Indx-Cntnt-Hdr CRLF Index-Body ] Rsp-Code = DIGIT DIGIT DIGIT Comment Comment = ; any chars except CR and LF Rsp-Hdrs = *( Version-Hdr | Rsp-Cntnt-Hdr ) Rsp-Cntnt-Hdr = "Content-Type:" Rsp-Content CRLF Rsp-Content = ; format is specified in [CIP-MIME] Rsp-Body = Body ; format of response body as in [CIP-MIME] Indx-Cntnt-Hdr = "Content-Type:" Indx-Obj-Type CRLF Indx-Obj-Type = ; any registered index object's MIME-type ; the format is specified in [RFC2045] Index-Body = Body ; format defined in each index ; specifications CRLF = CR LF ; Internet standard newline CR = %x0D ; carriage return LF = %x0A ; linefeed DIGIT = %x30-39 The message is terminated using SMTP-style message termination. The data is sent octet-for-octet, except when the pattern 1*["."] is seen, in which case one more period is added. When the data is finished, the octet pattern "." is transmitted to the receiver-CIP. On the receiver-CIP's side, the reverse transformation is applied, and the message read consists of all bytes up to, but not including, the terminating pattern. In response to the request, the receiver-CIP sends a response code, from either the 200, 400, or 500 series. The receiver-CIP then processes the request and replies, if necessary, with a MIME message. This reply is also delimited by an SMTP-style message terminator. After responding with a response code, the receiver-CIP MUST prepare to read another request message, resetting state to the point when the sender-CIP has just verified the CIP version. If the sender-CIP is finished making requests, it may close the connection. In response the receiver-CIP MUST abort reading the message and prepare for a new sender-CIP connection (resetting its state completely). An example is given below. It is again worth reiterating that the command format is defined in [CIP-MIME] whereas the message body is defined in each index object definition. In this example the index object definition in [CIP-TIO] will be used. Line endings are explicitly shown in anglebrackets; newlines in this text are added only for readability. Comments occur in curly-brackets. { sender-CIP connects to receiver-CIP } <<< % 220 Example CIP server ready >>> # CIP-Version: 3 <<< % 300 CIPv3 OK! >>> Mime-Version: 1.0 >>> Content-type: application/index.cmd.datachanged; type= >>> x-tagged-index-1; dsi=1.2.752.17.5.10 >>> >>> updatetype: incremental tagbased >>> thisupdate: 855938804 >>> lastupdate: 855940000 >>> . <<< % 200 Good MIME message received >>> MIME-Version: 1.0 >>> Content-Type: application/index.obj.tagged; >>> dsi=1.2.752.17.5.10; >>> base-uri="ldap://ldap.umu.se/dc=umu,dc=se" >>> >>> version: x-tagged-index-1 >>> updatetype: incremental >>> lastupdate: 855940000 >>> thisupdate: 855938804 >>> BEGIN IO-schema >>> cn: TOKEN >>> sn: FULL >>> title: FULL >>> END IO-Schema >>> BEGIN Update Block >>> BEGIN Old >>> title: 3/testpilot >>> END Old >>> BEGIN New >>> title: 3/chiefpilot >>> END New >>> END Update Block >>> . <<< % 200 Good MIME message received { Sender-CIP shuts down socket for writing } <<< % 222 Connection closing in response to sender-CIP shutdown { receiver-CIP closes its side, resets, and awaits a new sender-CIP } An example of an unsuccessful version negotiation looks like this: { sender-CIP connects to receiver-CIP } <<< % 220 Whois++ server ready >>> # CIP-Version: 3 <<< % 500 Syntax error { server closes connection } The sender-CIP may attempt to retry using version 1 or 2 protocol. Sender-CIP may cache results of this unsuccessful negotiation to avoid later attempts. 2.1.1 Transport specific response codes The following response codes are used with the stream transport: Code Suggested description Sender-CIP action text 200 MIME request received Expect no output, continue session and processed (or close) 201 MIME request received Read a response, delimited by SMTP- and processed, output style message delimiter. follows 220 Initial server banner Continue with Whois++ interaction, message or attempt CIP version negotiation. 222 Connection closing (in Done with transaction. response to sender-CIP close) 300 Requested CIP version Continue with CIP transaction, in accepted the specified version. 400 Temporarily unable to Retry at a later time. May be used process request to indicate that the server does not currently have the resources available to accept an index. 500 Bad MIME message format Retry with correctly formatted MIME 501 Unknown or missing Retry with correct CIP command request in application/index.cmd 502 Request is missing Retry with correct CIP attributes. required CIP attributes 520 Aborting connection for Alert local administrator. some unexpected reason 530 Request requires valid Sign the request, if possible, and signature retry. Otherwise, report problem to the administrator. 531 Request has invalid Report problem to the administrator. signature 532 Cannot check signature Alert local administrator, who should cooperate with remote administrator tp diagnose and resolve the problem. (Probably missing a public key.) 2.2 Internet mail infrastructure as transport As an alternative to TCP streams, CIP transactions can take place over the existing Internet mail infrastructure. There are two motivations for this feature of CIP. First, it lowers the barriers to entry for leaf servers. When the need for a full TCP implementation is relaxed, leaf nodes (which, by definition, only send index objects) can consist of as little as a database and an indexing program (possibly written in a very high level language) to participate in the mesh. Second, it keeps with the philosophy of making use of existing Internet technology. The MIME messages used for requests and responses are, by definition of the MIME specification, suitable for transport via the Internet mail infrastructure. With a few simple rules, we open up an entirely different way to interact with CIP servers which choose to implement this transport. See Protocol Conformance, below, for details on what options server implementers have about supporting the various transports. The basic rhythm of request/response is maintained when using the mail transport. The following sections clarify some special cases which need to be considered for mail transport of CIP objects. In general, all mail protocols and mail format specifications (especially MIME Security Multiparts) can be used with the CIP mail transport. 2.2.1 CIP-Version negotiation Since no information on which CIP-version is in use is present in the MIME message, this information has to be carried in the mailheader. Therefor CIP requests sent using the mail transport MUST include a CIP-version headerline, to be registered according to [MHREG]. The format of this line is: DIGIT = %x30-39 number = 1*DIGIT cipversion = "CIP-Version:" number["." number] 2.2.2 Return path When CIP transactions take place over a bidirectional stream, the return path for errors and results is implicit. Using mail as a transport introduces difficulties to the recipient, because it's not always clear from the headers exactly where the reply should go, though in practice there are some heuristics used by MUA's. CIP solves this problem by fiat. CIP requests sent using the mail transport MUST include a Reply-To header as specified by RFC-822. Any mail received for processing by a CIP server implementing the mail transport without a Reply-To header MUST be ignored, and a message should be logged for the local administrator. The receiver MUST not attempt to reply with an error to any address derived from the incoming mail. If there are no circumstances under which a response is to be sent to a CIP request, the sender should include a Reply-To header with the address "<>" in it. Receivers MUST never attempt to send replies to that address, as it is defined to be invalid (both here, and by the BNF grammar in RFC-822). It should be noted that, in general, it is a bad idea to turn off error reporting in this way. However, in the simplest case of an index pushing program, this MAY be a desirable simplification. 2.3 HTTP transport HTTP MAY also be used to transport CIP objects, since they are just MIME objects. A transaction is performed by using the POST method to send an application/index.cmd and returning an application/index.response or an application/index.obj in the HTTP reply. The URL that is the target of the post is a configuration parameter of the CIP-sender to CIP-receiver relationship. Example: { the client opens the connection and sends a POST } >>> POST / HTTP/1.1 >>> Host: cip.some.corp >>> Content-type: application/index.cmd.noop >>> Date: Thu, 6 Jun 1997 18:16:03 GMT >>> Content-Length: 2 >>> Connection: close >>> { the server processes the request } <<< HTTP/1.1 204 No Content { the server closes the connection } In addition to leveraging the security capabilities that come with HTTP, there are other HTTP features that MAY be useful in a CIP context. A CIP client MAY use the Accept-Charset and Accept-Language HTTP headers to express a desire to retrieve an index in a particular character set or natural language. It MAY use the Accept-Encoding header to (e.g.) indicate that it can handle compressed responses, which the CIP server MAY send in conjunction with the Transfer- Encoding header. It MAY use the If-Modified-Since header to prevent wasted transmission of an index that has not changed since the last poll. A CIP server can use the Retry-After header to request that the client retry later when the server is less busy. 3. Security considerations There are two levels at which the index information can be protected; the first is by use of the technology available for securing MIME [MIME-SEC] objects, and secondly by using the technology available for securing the transport. When it comes to transport the stream transport can be protected by the use of TLS [TLS] . For HTTP the Security is handled by using HTTP Basic Authentication [RFC 2068], HTTP Message Digest Authentication [RFC2069] or SSL/TLS. Extra protection for the SMTP exchange can be achieve by the use of Secure SMTP over TLS [SMTPTLS]. 4. References [RFC 2045] Freed, N., Borenstein, N., "Multipurpose Internet Mail Extensions (MIME) Part One: Format of Internet Message Bodies", November, 1996 [RFC 2068] Fielding, et.al., "Hypertext Transfer Protocol -- HTTP/1.1", January, 1997. [RFC 2069] Franks, et. al., "An Extension to HTTP: Digest Access Authentication", January, 1997. [CIP-ARCH] Allen, J., Mealling, M, "The Common Indexing Protocol", work in progress. [CIP-MIME] Allen, J., Mealling, M., "MIME Object Definitions for the Common Indexing Protocol", work in progress. [ABNF] Crocker, D. and P. Overell, "Augmented BNF for Syntax Specifications: ABNF", RFC 2234, November 1997. [CIP-TIO] Hedberg, et.al. "A Tagged Index Object for use in the Common Indexing Protocol", work in progress. [KEYWORDS] Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", RFC 2119, Harvard University, March 1997. [MIME-SEC] Galvin, Murphy, Crocker, Freed, "Security Multiparts for MIME: Multipart/Signed and Multipart/Encrypted", RFC 1847, Trusted Information Systems, CyberCash, Innosoft International, October 1995. [TLS] Dierks, T., Allen, C., "The TLS Protocol Version 1.0", RFC 2246, Certicom, January 1999. [SMTPTLS] Hoffman, P., "SMTP Service Extension for Secure SMTP over TLS", RFC 2487, Internet Mail Consortium, January 1999. [MHREG] Jacob, P., "Mail and Netnews Header Registration Procedure", work in progress. 5. Authors' Addresses Jeff R. Allen Paul J. Leach 246 Hawthorne St. Microsoft Palo Alto, CA 94301 1 Microsoft Way USA Redmond, WA 98052 EMail: jeff.allen@acm.org Email: paulle@microsoft.com Roland Hedberg Catalogix Dalsveien 53 0775 Oslo Norway EMail: roland@catalogix.ac.se