INTERNET-DRAFT Editor: Kurt D. Zeilenga Intended Category: Standard Track OpenLDAP Foundation Expires: 21 April 2001 21 October 2000 Obsoletes: RFC 2253 Lightweight Directory Access Protocol (v3bis): UTF-8 String Representation of Distinguished Names Status of Memo This document is an Internet-Draft and is in full conformance with all provisions of Section 10 of RFC2026. This document is intended to be, after appropriate review and revision, submitted to the RFC Editor as a Standard Track document. Distribution of this memo is unlimited. Technical discussion of this document will take place on the IETF LDAP Revision (Proposed) Working Group (LDAPbis) mailing list . Please send editorial comments directly to the author . Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as ``work in progress.'' The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html. Copyright 2000, The Internet Society. All Rights Reserved. Please see the Copyright section near the end of this document for more information. Abstract The X.500 Directory uses distinguished names as the primary keys to entries in the directory. Distinguished Names are encoded in ASN.1 in the X.500 Directory protocols. In the Lightweight Directory Access Protocol, a string representation of distinguished names is transferred. This specification defines the string format for Zeilenga [Page 1] INTERNET-DRAFT draft-zeilenga-ldapbis-rfc2253-00 21 October 2000 representing names, which is designed to give a clean representation of commonly used distinguished names, while being able to represent any distinguished name. The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119 [6]. 1. Background This specification assumes familiarity with X.500 [1], and the concept of Distinguished Name. It is important to have a common format to be able to unambiguously represent a distinguished name. The primary goal of this specification is ease of encoding and decoding. A secondary goal is to have names that are human readable. It is not expected that LDAP clients with a human user interface would display these strings directly to the user, but would most likely be performing translations (such as expressing attribute type names in one of the local national languages). 2. Converting DistinguishedName from ASN.1 to a String In X.501 [X.501] the ASN.1 structure of distinguished name is defined as: DistinguishedName ::= RDNSequence RDNSequence ::= SEQUENCE OF RelativeDistinguishedName RelativeDistinguishedName ::= SET SIZE (1..MAX) OF AttributeTypeAndValue AttributeTypeAndValue ::= SEQUENCE { type AttributeType, value AttributeValue } The following sections define the algorithm for converting from an ASN.1 structured representation to a UTF-8 string representation. 2.1. Converting the RDNSequence If the RDNSequence is an empty sequence, the result is the empty or zero length string. Otherwise, the output consists of the string encodings of each Zeilenga [Page 2] INTERNET-DRAFT draft-zeilenga-ldapbis-rfc2253-00 21 October 2000 RelativeDistinguishedName in the RDNSequence (according to 2.2), starting with the last element of the sequence and moving backwards toward the first. The encodings of adjoining RelativeDistinguishedNames are separated by a comma character (',' ASCII 44). 2.2. Converting RelativeDistinguishedName When converting from an ASN.1 RelativeDistinguishedName to a string, the output consists of the string encodings of each AttributeTypeAndValue (according to 2.3), in any order. Where there is a multi-valued RDN, the outputs from adjoining AttributeTypeAndValues are separated by a plus ('+' ASCII 43) character. 2.3. Converting AttributeTypeAndValue The AttributeTypeAndValue is encoded as the string representation of the AttributeType, followed by an equals character ('=' ASCII 61), followed by the string representation of the AttributeValue. The encoding of the AttributeValue is given in section 2.4. If the AttributeType is in a published table of attribute types associated with LDAP [RFC2252], then the type name string from that table is used, otherwise it is encoded as the dotted-decimal encoding of the AttributeType's OBJECT IDENTIFIER. The dotted-decimal notation is described in [RFC2251]. As an example, strings for a few of the attribute types frequently seen in RDNs include: String X.500 AttributeType ------------------------------ CN commonName L localityName ST stateOrProvinceName O organizationName OU organizationalUnitName C countryName STREET streetAddress DC domainComponent UID userid 2.4. Converting an AttributeValue from ASN.1 to a String Zeilenga [Page 3] INTERNET-DRAFT draft-zeilenga-ldapbis-rfc2253-00 21 October 2000 If the AttributeValue is of a type which does not have a string representation defined for it, then it is simply encoded as an octothorpe character ('#' ASCII 35) followed by the hexadecimal representation of each of the bytes of the BER encoding of the X.500 AttributeValue. This form SHOULD be used if the AttributeType is of the dotted-decimal form. Otherwise, if the AttributeValue is of a type which has a string representation, the value is converted first to a UTF-8 string according to its syntax specification (see for example section 6 of [RFC2252]). If the UTF-8 string does not have any of the following characters which need escaping, then that string can be used as the string representation of the value. - a space or "#" character occurring at the beginning of the string - a space character occurring at the end of the string - one of the characters ",", "+", """, " Implementations MAY escape other characters. If a character to be escaped is one of the list shown above, then it is prefixed by a backslash ('' ASCII 92). Otherwise the character to be escaped is replaced by a backslash and two hex digits, which form a single byte in the code of the character. Examples of the escaping mechanism are shown in section 5. 3. Parsing a String back to a Distinguished Name The structure of the string is specified using the following Augmented BNF [RFC2234] grammar. distinguishedName = [name] ; may be empty name = name-component *("," name-component) name-component = attrTypeAndValue *("+" attrTypeAndValue) attrTypeAndValue = attrType "=" attrValue attrType = (ALPHA 1*keychar) | oid Zeilenga [Page 4] INTERNET-DRAFT draft-zeilenga-ldapbis-rfc2253-00 21 October 2000 keychar = ALPHA | DIGIT | "-" oid = 1*DIGIT *("." 1*DIGIT) attributeValue = string | hexstring string = *( stringchar | pair ) quotechar = special = "," | "=" | "+" | "<" | ">" | "#" | ";" pair = ESC ( ESC | special | QUOTE | hexpair ) stringchar = ; decimal 65-90 and 97-122 DIGIT = ; decimal 48-57 QUOTE = ; decimal 34 ESC = or the LDAP Revision (proposed) Working Group (LDAPbis) mailing list: Appendix A. Changes made to RFC 2253 The following substantive changes were made to RFC 2253: - Removed IESG Note - Removed Relationship with RFC 1779 and LDAPv2 Section. - Removed all LDAPv2 requirements. - Rewrote DN grammer using ABNF - Rewrote examples In addition, mumerous editorial changes were made. Copyright 2000, The Internet Society. All Rights Reserved. This document and translations of it may be copied and furnished to others, and derivative works that comment on or otherwise explain it or assist in its implementation may be prepared, copied, published and distributed, in whole or in part, without restriction of any kind, provided that the above copyright notice and this paragraph are included on all such copies and derivative works. However, this document itself may not be modified in any way, such as by removing the copyright notice or references to the Internet Society or other Internet organizations, except as needed for the purpose of developing Internet standards in which case the procedures for copyrights defined in the Internet Standards process must be followed, Zeilenga [Page 8] INTERNET-DRAFT draft-zeilenga-ldapbis-rfc2253-00 21 October 2000 or as required to translate it into languages other than English. The limited permissions granted above are perpetual and will not be revoked by the Internet Society or its successors or assigns. This document and the information contained herein is provided on an "AS IS" basis and THE AUTHORS, THE INTERNET SOCIETY, AND THE INTERNET ENGINEERING TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Zeilenga [Page 9]