INTERNET-DRAFT                                        Paul Gauthier 
   Expires: December 1999                          Inktomi Corporation 
   Category: Standards Track                                Josh Cohen 
   draft-ietf-wrec-wpad-01.txt                   Microsoft Corporation 
                                                       Martin Dunsmuir 
                                                    RealNetworks, Inc. 
                                                       Charles Perkins 
                                                Sun Microsystems, Inc.
                                    
                                    
                     Web Proxy Auto-Discovery Protocol 
    
Status of This Memo 
    
   This document is a submission by the WREC Working Group of the 
   Internet Engineering Task Force (IETF).  Comments should be 
   submitted to the wrec@cs.utk.edu mailing list. 
    
   Distribution of this memo is unlimited. 
    
   This document is an Internet-Draft and is in full conformance with 
   all provisions of Section 10 of RFC2026. Internet-Drafts are working 
   documents of the Internet Engineering Task Force (IETF), its areas, 
   and its working groups.  Note that other groups may also distribute 
   working documents as Internet-Drafts. 
    
   Internet-Drafts are draft documents valid for a maximum of six 
   months and may be updated, replaced, or obsoleted by other documents 
   at any time.  It is inappropriate to use Internet-Drafts as 
   reference material or to cite them other than as "work in progress." 
    
     The list of current Internet-Drafts can be accessed at: 
          http://www.ietf.org/ietf/1id-abstracts.txt 
     The list of Internet-Draft Shadow Directories can be accessed at: 
          http://www.ietf.org/shadow.html. 
    
Abstract 
    
   A mechanism is needed to permit web clients to locate nearby web 
   proxy caches. Current best practice is for end users to hand 
   configure their web client (i.e., browser) with the URL of an "auto 
   configuration file". In large environments this presents a 
   formidable support problem.  It would be much more manageable for 
   the web client software to automatically learn the configuration 
   information for its web proxy settings. This is typically referred 
   to as a resource discovery problem. 
    
   Web client implementers are faced with a dizzying array of resource 
   discovery protocols at varying levels of implementation and 
   deployment. This complexity is hampering deployment of a "web proxy 
   auto-discovery "facility.  This document proposes a pragmatic 
   approach to web proxy auto-discovery.  It draws on a number of 
   proposed standards in the light of practical deployment concerns. It 
   proposes an escalating strategy of resource discovery attempts in 
   order to find a nearby web proxy server. It attempts to provide rich 
    
   Gauthier, Cohen, Dunsmuir, Perkins                         [Page 1] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

   mechanisms for supporting a complex environment, which may contain 
   multiple web proxy servers. 
    
Table of Contents 
    
Status of This Memo...................................................1 
Abstract..............................................................1 
Table of Contents.....................................................2 
1.   Conventions used in this document................................2 
2.   Introduction.....................................................2 
3.   Defining Web Proxy Auto-Discovery................................3 
4.   The Discovery Process............................................4 
 4.1.  WPAD Overview................................................4 
 4.2.  When to Execute WPAD.........................................6 
   4.2.1.  Upon Startup of the Web Client............................7 
   4.2.2.  Network Stack Events......................................7 
   4.2.3.  Expiration of the CFILE...................................7 
 4.3.  WPAD Protocol Specification..................................7 
 4.4.  Discovery Mechanisms.........................................9 
   4.4.1.  DHCP......................................................9 
   4.4.2.  SVRLOC/SLP...............................................10 
   4.4.3.  DNS A/CNAME  "Well Known Aliases”........................10 
   4.4.4.  DNS SRV Records..........................................10 
   4.4.5.  DNS TXT service: Entries.................................11 
   4.4.6.  Fallback.................................................11 
   4.4.7.  Timeouts.................................................11 
 4.5.  Composing a Candidate CURL..................................12 
 4.6.  Retrieving the CFILE at the CURL............................12 
 4.7.  Resuming Discovery..........................................12 
5.   Client Implementation Considerations............................12 
6.   Proxy Server Considerations.....................................13 
7.   Administrator Considerations....................................13 
8.   Conditional Compliance..........................................14 
 8.1.  Class 0 - Minimally compliant...............................15 
 8.2.  Class 1 - Compliant.........................................15 
 8.3.  Class 2 - Maximally compliant...............................15 
9.   Security Considerations.........................................15 
10.  Acknowledgements................................................16 
11.  Copyright.......................................................16 
12.  References......................................................16 
13.  Author Information..............................................17 
    
1.   Conventions used in this document 
    
   The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", 
   "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this 
   document are to be interpreted as described in "Key words for use in 
   RFCs to Indicate Requirement Levels" [KEYWORDS]. 
    
2.   Introduction 
    
   The problem of locating nearby web proxy cache servers can not wait 
   for the implementation and large scale deployment of various 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                         [Page 2] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

   upcoming resource discovery protocols. The widespread success of the 
   HTTP protocol and the recent popularity of streaming media has 
   placed unanticipated strains on the networks of corporations, ISPs 
   and backbone providers. There currently is no effective method for 
   these organizations to realize the obvious benefits of web caching 
   without tedious and error prone configuration by each and every end 
   user. 
    
   The de-facto mechanism for specifying a web proxy server 
   configuration in web clients is the download of a script or 
   configuration file named by a URL. Users are currently expected to 
   hand configure this URL into their Browser or other web client.  
   This mechanism suffers from a number of drawbacks: 
    
   - Difficulty in supporting a large body of end-users. Many users 
   misconfigure their proxy settings and are unable to diagnose the 
   cause of their problems. 
    
   - Lack of support for mobile clients who require a different proxy 
   as their point of access changes. 
    
   - Lack of support for complex proxy environments where there may 
   exist a number of proxy servers with different affinities for 
   different clients (based on network proximity, for example). 
   Currently, clients would have to "know" which proxy server was 
   optimal for their use. 
    
   Currently available methods for resource discovery need to be 
   exploited in the context of a well defined framework. Simple, 
   functional and efficient mechanisms stand a good chance of solving 
   this pressing and basic need. As new resource discovery mechanisms 
   mature they can be folded into this framework with little 
   difficulty. 
    
   This document is a specification for implementers of web client 
   software. It defines a protocol for automatically configuring those 
   clients to use a local proxy. It also defines how an administrator 
   should configure various resource discovery services in their 
   network to support WPAD compatible web clients. 
    
   While it does contain suggestions for web proxy server implementers, 
   it does not make any specific demands of those parties. 
    
3.   Defining Web Proxy Auto-Discovery 
    
   As mentioned above, currently web client software needs to be 
   configured with the URL of a proxy auto-configuration file or 
   script. The contents of this script are vendor specific and not 
   currently standardized. This document does not attempt to discuss 
   the contents of these files (see[8] for an example file format). 
    
   Thus, the Web Proxy Auto-Discovery (WPAD) problem reduces to 
   providing the web client a mechanism for discovering the URL of the 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                         [Page 3] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

   Configuration File. Once this Configuration URL (CURL) is known, the 
   client software already contains mechanisms for retrieving and 
   interpreting the Configuration File (CFILE) to enable access to the 
   specified proxy cache servers. 
    
   It is worth carefully noting that the goal of the WPAD process is to 
   discover the correct CURL at which to retrieve the CFILE. The client 
   is *not* trying to directly discover the name of the proxy server. 
   That would circumvent the additional capabilities provided by proxy 
   Configuration Files (such as load balancing, request routing to an 
   array of servers, automated fail-over to backup proxy server [6,8]). 
    
   It is worth noting that different clients requesting the CURL may 
   receive completely different CFILEs in response. The web server may 
   send back different CFILES based on a number of criteria such as the 
   "User-Agent" header, "Accept" headers, client IP address/subnet, 
   etc.  The same client could conceivably receive a different CFILE on 
   successive retrievals (as a method of round-robin load balancing, 
   for example). 
    
   This document will discuss a range of mechanisms for discovering the 
   Configuration URL. The client will attempt them in a predefined 
   order, until one succeeds. Existing widely deployed facilities may 
   not provide enough expressiveness to specify a complete URL. As 
   such, we will define default values for portions of the CURL which 
   may not be expressible by some discovery mechanisms: 
    
   http://<HOST>:<PORT><PATH> 
    
   <HOST> - There is no default for this potion. Any succeeding 
        discovery mechanism will provide a value for the <HOST> portion 
        of the CURL. The client MUST NOT provide a default. 
    
   <PORT> - The client MUST assume port 80 if the successful discovery 
        mechanism does not provide a port component. 
    
   <PATH> - The client MUST assume a path of "/wpad.dat" if the 
        successful discovery mechanism does not provide a path 
        component. 
    
    
4.   The Discovery Process 
    
4.1. WPAD Overview 
    
   This sub-section will present a descriptive overview of the WPAD 
   protocol. It is intended to introduce the concepts and flow of the 
   protocol. The remaining sub-sections (3.2-3.7) will provide the 
   rigorous specification of the protocol details. WPAD uses a 
   collection of pre-existing Internet resource discovery mechanisms to 
   perform web proxy auto-discovery. Readers may wish to refer to [1] 
   for a similar approach to resource discovery, since it was a basis 
   for this strategy.  The WPAD protocol specifies the following: 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                         [Page 4] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

    
   - how to use each mechanism for the specific purpose of web proxy  
        auto-discovery 
   - the order in which the mechanisms should be performed 
   - the minimal set of mechanisms which must be attempted by a WPAD  
        compliant web client 
    
   The resource discovery mechanisms utilized by WPAD are as follows. 
   - Dynamic Host Configuration Protocol (DHCP, [3,7]). 
   - Service Location Protocol (SLP, [4]). 
   - "Well Known Aliases” using DNS A records [5,9]. 
   - DNS SRV records [2,9]. 
   - "service: URLs" in DNS TXT records [10]. 
    
   Of all these mechanisms only the DHCP and “Well Known Aliases” are 
   required in WPAD clients. This decision is based on three reasons: 
   these facilities are currently widely deployed in existing vendor 
   hardware and software; they represent functionality that should 
   cover most real world environments; they are relatively simple to 
   implement. 
 
   DNS servers supporting A records are clearly the most widely 
   deployed of the services outlined above. It is reasonable to expect 
   API support inside most web client development environments (POSIX 
   C, Java, etc). The hierarchical nature of DNS makes it possible to 
   support hierarchies of proxy servers. 
    
   DNS is not suitable in every environment, unfortunately. 
   Administrators often choose a DNS domain name hierarchy that does 
   not correlate to network topologies, but rather with some 
   organizational model (for example, foo.development.bar.com and 
   foo.marketing.bar.com). DHCP servers, on the other hand, are 
   frequently deployed with concern for network topologies. DHCP 
   servers provide support for making configuration decisions based on 
   subnets, which are directly related to network topology. 
    
   Full client support for DHCP is not as ubiquitous as for DNS. That 
   is, not all clients are equipped to take advantage of DHCP for their 
   essential network configuration (assignment of IP address, network 
   mask, etc). APIs for DHCP are not as widely available. Luckily, 
   using DHCP for WPAD does not require either of these facilities. It 
   is relatively easy for web client developers to speak just the 
   minimal DHCP protocol to perform resource discovery. It entails 
   building a simple UDP packet, sending it to the subnet broadcast 
   address, and parsing the reply UDP packet(s) which are received to 
   extract the WPAD option field. A reference implementation of this 
   code in C is available [11]. 
    
   The WPAD client attempts a series of resource discovery requests, 
   using the discovery mechanisms mentioned above, in a specific order. 
   Clients only attempt mechanisms that they support (obviously). Each 
   time the discovery attempt succeeds; the client uses the information 
   obtained to construct a CURL. If a CFILE is successfully retrieved 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                         [Page 5] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

   at that CURL, the process completes. If not, the client resumes 
   where it left of in the predefined series of resource discovery 
   requests. If no untried mechanisms remain and a CFILE has not been 
   successfully retrieved, the WPAD protocol fails and the client is 
   configured to use no proxy server. 
    
   First the client tries DHCP, followed by SLP. If no CFILE has been 
   retrieved the client moves on to the DNS based mechanisms. The 
   client will cycle through the DNS SRV, “Well Known Aliases” and DNS 
   TXT record methods multiple times. Each time through the QNAME being 
   used in the DNS query is made less and less specific. In this manner 
   the client can locate the most specific configuration information 
   possible, but can fall back on less specific information. Every DNS 
   lookup has the QNAME prefixed with “wpad” to indicate the resource 
   type being requested. 
    
   As an example, consider a client with hostname johns-
   desktop.development.foo.com. Assume the web client software supports 
   all of the mechanisms listed above. This is the sequence of 
   discovery attempts the client would perform until one succeeded in 
   locating a valid CFILE: 
    
   - DHCP 
   - SLP 
   - DNS A lookup on QNAME=wpad.development.foo.com. 
   - DNS SRV lookup on QNAME=wpad.development.foo.com. 
   - DNS TXT lookup on QNAME=wpad.development.foo.com. 
   - DBS A lookup on QNAME=wpad.foo.com. 
   - DNS SRV lookup on QNAME=wpad.foo.com. 
   - DNS TXT lookup on QNAME=wpad.foo.com. 
    
4.2. When to Execute WPAD 
    
   Web clients need to perform the WPAD protocol periodically to 
   maintain correct proxy settings. This should occur on a regular 
   basis corresponding to initialization of the client software or the 
   networking stack below the client. As well, WPAD will need to occur 
   in response to expiration of existing configuration data.  The 
   following sections describe the details of these scenarios.  3.2.1. 
   Periodic Discovery 
    
   The web proxy auto-discovery process MUST occur at least as 
   frequently as one of the following two options. A web client can use 
   either option depending on which makes sense in their environment. 
   Clients MUST use at least one of the following options. They MAY 
   also choose to implement both options. 
   - Upon startup of the web client. 
   - Whenever there indication from the networking stack that the  IP 
   address of the client host either has, or could have, changed. 
    
   In addition, the client MUST attempt a discovery cycle upon 
   expiration of a previously downloaded CFILE in accordance with 
   HTTP/1.1. 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                         [Page 6] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

    
4.2.1.    Upon Startup of the Web Client 
    
   For many types of web client (like web browsers) there can be many 
   instances of the client operating for a given user at one time. This 
   is often to allow display of multiple web pages in different 
   windows, for example. There is no need to re-perform WPAD every time 
   a new instance of the web client is opened. WPAD MUST be performed 
   when the number of web client instances transitions from 0 to 1. It 
   SHOULD NOT be performed as additional instances are created. 
    
4.2.2.    Network Stack Events 
    
   Another option for clients is to tie the execution of WPAD to 
   changes in the networking environment. If the client can learn about 
   the change of the local host’s IP address, or the possible change of 
   the IP address, it MUST re-perform the WPAD protocol.  Many 
   operating systems provide indications of “network up” events, for 
   example. Those types of events and system-boot events might be the 
   triggers for WPAD in many environments. 
    
4.2.3.    Expiration of the CFILE 
    
   The HTTP retrieval of the CURL may return HTTP headers specifying a 
   valid lifetime for the CFILE returned. The client MUST obey these 
   timeouts and rerun the PAD process when it expires. A client MAY 
   rerun the WPAD process if it detects a failure of the currently 
   configured proxy (which is not otherwise recoverable via the 
   inherent mechanisms provided by the currently active Configuration 
   File). 
    
   Whenever the client decides to invalidate the current CURL or CFILE, 
   it MUST rerun the entire WPAD protocol to ensure it discovers the 
   currently correct CURL. Specifically, if the valid lifetime of the 
   CFILE ends(as specified by the HTTP headers provided when it was 
   retrieved),the complete WPAD protocol MUST be rerun. The client MUST 
   NOT simply re-use the existing CURL to obtain a fresh copy of the 
   CFILE. 
    
   A number of network round trips, broadcast and/or multicast 
   communications may be required during the WPAD protocol. The WPAD 
   protocol SHOULD NOT be invoked at a more frequent rate than 
   specified above (such as per-URL retrieval). 
    
4.3. WPAD Protocol Specification 
    
   The following pseudo-code defines the WPAD protocol.  If a 
   particular discovery mechanism is not supported, treat it as a 
   failed discovery attempt in the pseudo-code. 
    
   In addition, this logic is expressed below in pseudo-code. 
   The following pseudo-code fragment defines WPAD.  Unsupported 
   discovery mechanisms are treated as failure in the pseudo-code. 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                         [Page 7] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

    
   Two subroutines need explanation. The subroutine 
   strip_leading_component(dns_string) strips off the leading  
   characters, up to and including the first dot (`.') in the string 
   which is passed as a parameter, and is expected to contain DNS name.  
   The Boolean subroutine is_not_canonical(dns_string) returns FALSE if 
   dns_string is one of the canonical domain suffixes defined in RFC 
   1591 [13] (for example, "com"). 
    
   The slp_list and dns_list elements below are assumed to be linked 
   lists containing a data field and a pointer to the next element.  
   The data field contains the elements used to override the default 
   values in creating a CURL, as detailed in section 3.5. 
    
      load_CFILE() { 
        /* MUST use DHCP */ 
        curl = dhcp_query(/*WPAD option  (section 4.4.1) */); 
        if (curl != null) {  /* DHCP succeeded */ 
           if isvalid (read_CFILE(curl)) 
              return SUCCESS;  /* valid CFILE */ 
        } 
    
        /* Should use SLP */ 
        slp_list = slp_query(/*(WPAD attributes  (Section 4.4.2)*/); 
        while (slp_list != null) {  /* test each curl  */ 
           if isvalid(read_CFILE(slp_list.curl_data)) 
              return SUCCESS;   /* valid CFILE */ 
           else 
              slp_list = slp_list.next; 
        } 
    
        /* all the DNS mechanisms */ 
        TGTDOM = gethostbyname(me); 
        TGTDOM = strip_leading_component(TGTDOM); 
    
        while (is_not_canonical(TGTDOM)) { 
    
           /* SHOULD try DNS SRV records */ 
           dns_list = dns_query(/*QNAME=wpad.TGTDOM., 
                                        QTYPE=SRV (section 4.4.4)*/); 
           while (dns_list != null) { /* each TXT record */ 
              if isvalid(read_CFILE(dns_list, curl_data)) 
                 return SUCCESS;   /* valid CFILE */ 
              else 
                 dns_list = dns_list.next; 
           } 
    
           /* SHOULD try DNS TXT records */ 
           dns_list = dns_query(/*QNAME=wpad.TGTDOM., 
                                        QTYPE=TXT (section 4.4.5)*/); 
           while (dns_list != null) { /* each TXT record */ 
              if isvalid(read_CFILE(dns_list, curl_data)) 
                 return SUCCESS;   /* valid CFILE */ 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                         [Page 8] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

              else 
                 dns_list = dns_list.next; 
           } 
            
           /* MUST try DNS A records */ 
           dns_list = dns_query(/*QNAME=wpad.TGTDOM., 
                                    QTYPE=A  (Section 4.4.3)*/); 
    
           while (dns_list != null) { /* check each A record */ 
              if isvalid(read_CFILE(dns_list, curl_data)) 
                 return SUCCESS;   /* valid CFILE */ 
              else 
                 dns_list = dns_list.next; 
           } 
    
    
           /* Still no match, remove leading component and iterate */ 
           TGTDOM = strip_leading_component(TGTDOM); 
    
        } /* no A, TXT or SRV records for wpad.* */ 
    
        return FAILED;  /* could not locate valid CFILE */ 
      } 
    
4.4. Discovery Mechanisms 
    
   Each of the resource discovery methods will be marked as to whether 
   the client MUST, SHOULD, MAY, or MUST NOT implement them to be 
   compliant. Client implementers are encouraged to implement as many 
   mechanisms as possible, to promote maximum interoperability. 
    
   +-------------------------+--------+----------+   
   | Discovery               |        | Document |   
   | Mechanism               | Status | Section  |   
   +-------------------------+--------+----------+   
   | DHCP                    | MUST   | 4.4.1    |   
   | SLP                     | SHOULD | 4.4.2    |   
   | "Well Known Alias"      | MUST   | 4.4.3    |   
   | DNS SRV Records         | SHOULD | 4.4.4    |   
   | DNS TXT "service: URLs" | SHOULD | 4.4.5    |   
   +-------------------------+--------+----------+         
    
   SUMMARY OF DISCOVERY MECHANISMS 
    
4.4.1.    DHCP 
    
   Client implementations MUST support DHCP. DHCP has widespread 
   support innumerous vendor hardware and software implementations, and 
   is widely deployed. It is also perfectly suited to this task, and is 
   used to discover other network resources (such a time servers, 
   printers, etc). The DHCP protocol is detailed in RFC 2131 [3]. 
   We propose a new DHCP option with code 252 for use in web proxy 
   auto-discovery. See RFC 2132 [7] for a list of existing DHCP 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                         [Page 9] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

   options. See "Conditional Compliance" for more information on DHCP 
   requirements. 
    
   The client should obtain the value of the DHCP option code 252 as 
   returned by the DHCP server. If the client has already conducted 
   DHCP protocol during its initialization, the DHCP server may already 
   have supplied that value. If the value is not available through a 
   client OS API, the client SHOULD use a DHCPINFORM message to query 
   the DHCP server to obtain the value. 
    
   The DHCP option code for WPAD is 252 by agreement of the DHC working 
   group chair.  This option is of type STRING.  This string contains a 
   URL which points to an appropriate config file.  The STRING is of 
   arbitrary size. 
   An example STRING value would be: 
   "http://server.domain/proxyconfig.pac" 
    
    
4.4.2.    Service Location Protocol /SLP 
    
   The Service Location Protocol [RFC2608] is a Proposed Standard for 
   discovering services in the Internet.  SLP has several reference 
   implementations available; for details, check the following web 
   page: 
    
          http://www.svrloc.org/ 
    
   A service type for use with WPAD has been defined and is available 
   as an Internet Draft. 
    
   Client implementations SHOULD implement SLP.   SLP Service Replies 
   will provide one or more complete CURLs. Each candidate CURL so 
   created should be pursued as specified in section 4.5 and beyond. 
    
4.4.3.    DNS A/CNAME  "Well Known Aliases”  
    
   Client implementations MUST support this mechanism. This should be 
   straightforward since only basic DNS lookup of A records is 
   required. See RFC 2219 [5] for a description of using "well known" 
   DNS aliases for resource discovery. We propose the "well known 
   alias” of "wpad" for web proxy auto-discovery. 
    
   The client performs the following DNS lookup: 
   QNAME=wpad.TGTDOM., QCLASS=IN, QTYPE=A 
    
   Each A RR, which is returned, contains an IP address which is used 
   to replace the <HOST> default in the CURL. 
    
   Each candidate CURL so created should be pursued as specified in 
   section 4.5 and beyond. 
    
4.4.4.    DNS SRV Records 
    
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                        [Page 10] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

   Client implementations SHOULD support the DNS SRV mechanism. Details 
   of the protocol can be found in RFC 2052 [2]. If the implementation 
   language/environment provides the ability to perform DNS lookups on 
   QTYPEs other than A, client implementers are strongly encouraged to 
   provide this support. It is acknowledged that not all resolver APIs 
   provide this functionality. 
 
   The client issues the following DNS lookup: 
   QNAME=wpad.tcp.TGTDOM., QCLASS=IN, QTYPE=SRV 
    
   If it receives SRV RRs in response, the client should use each valid 
   RR in the order specified in RFC 2052 [2]. Each valid record will 
   specify both a <HOST> and a <PORT> to override the CURL defaults. 
    
   Each candidate CURL so created should be pursued as specified in 
   section 4.5 and beyond. 
    
4.4.5.    DNS TXT service: Entries 
    
   Client implementation SHOULD support this mechanism.  If the 
   implementation language/environment provides the ability to perform 
   DNS lookups on QTYPEs other than A, the vendor is strongly 
   encouraged to provide this support. It is acknowledged that not all 
   resolver APIs provide this functionality. 
   The client should attempt to retrieve TXT RRs from the DNS to obtain 
   “service: URLs” contained therein. The “service: URL” will be of the 
   following format, specifying a complete candidate CURL for each 
   record located: 
    
   service: wpad:http://<HOST>:<PORT><PATH> 
    
   The client should first issue the following DNS query: 
   QNAME=wpad.TGTDOM., QCLASS=IN, QTYPE=TXT 
    
   It should process each TXT RR it receives (if any) using each 
   service:URL found (if any) to generate a candidate CURL. These CURLs 
   should be pursued as described in section 3.5 and beyond. 
   Readers familiar with [1] should note that WPAD clients MUST NOT 
   perform the QNAME=TGTDOM., QCLASS=IN, QTYPE=TXT lookup which would 
   be suggested by that document. 
    
4.4.6.    Fallback 
    
   Clients MUST NOT implement the "Fallback" mechanism described in 
   [1]. It is unlikely that a client will find a web server prepared to 
   handle the CURL request at a random suffix of its FQDN.  This will 
   only increase the number of DNS probes and introduce an excess of 
   spurious "GET" requests on those hapless web servers. 
    
   Instead, the "Well Known Aliases” method of section 3.4.4 provides 
   equivalent functionality. 
    
4.4.7.    Timeouts 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                        [Page 11] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

    
   Implementers are encouraged to limit the time elapsed in each 
   discovery phase.  When possible, limiting each phase to 10 seconds 
   is considered reasonable.  Implementers may choose a different value 
   which is more appropriate to their network properties.  For example, 
   a device implementation, which operated over a wireless network, may  
   use a much larger timeout to account for low bandwidth or high 
   latency. 
    
4.5. Composing a Candidate CURL 
    
   Any successful discovery mechanism response will provide a 
   <HOST>(perhaps in the form of an IP address). Some mechanisms will 
   also provide a <PORT> and/or a <PATH>. The client should override 
   the default CURL fields with all of those supplied by the discovery 
   mechanism. 
    
4.6. Retrieving the CFILE at the CURL 
    
   The client then requests the CURL via HTTP. 
   When making the request it MUST transmit HTTP "Accept" headers 
   indicating what CFILE formats it is capable of accepting. For 
   example, Netscape Navigator browsers with versions 2.0 and beyond 
   might include the following line in the HTTP Request: 
    
   Accept: application/x-ns-proxy-autoconfig 
    
   The client MUST follow HTTP redirect directives (response codes 3xx) 
   returned by the server. The client SHOULD send a valid "User-Agent" 
   header. 
    
    
4.7. Resuming Discovery 
    
   If the HTTP request fails for any reason (fails to connect, server 
   error response, etc) the client MUST resume the search for a 
   successful CURL where it left off. It should continue attempting 
   other sub-steps in a specific discovery mechanism, and then move on 
   to the next mechanism or TGTDOM iteration, etc. 
    
5.   Client Implementation Considerations 
    
   The large number of discovery mechanisms specified in this document 
   may raise concerns about network traffic and performance. The DHCP 
   portion of the process will result in a single broadcast by the 
   client, and perhaps a few replies by listening DHCP servers. 
 
   The remaining mechanisms are all DNS based. All DNS queries should 
   have the QNAME terminated with a trailing '.' to indicate a FQDN and 
   expedite the lookup. As such each TGTDOM iteration will cause 3 DNS 
   lookups, each a unicast UDP packet and a reply. Most clients will 
   have fewer than 2TGTDOM iterations, limiting the total number of DNS 
   request/replies to6. 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                        [Page 12] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

    
   All total, 7 UDP request/reply packets on client startup is quite a 
   low overhead.  The first web page downloaded by the client will 
   likely dwarf that packet count. Each of the DNS lookups should stand 
   a high chance of hitting the cache in the client's DNS server, since 
   other clients will have likely looked them up recently, providing a 
   low total elapsed time. 
    
   This is of course the worst case, where no CURLS are obtained, and 
   assuming a long client FQDN. Often, a successful CURL will be found 
   early in the protocol, reducing the total packet count. 
   Client implementations are encouraged to overlap this protocol work 
   with other startup activities. Also, client implementers with 
   concerns about performance can choose to implement only the 
   discovery mechanisms listed as MUST in section 3.4. 
    
   A longer delay could occur if a CURL is obtained, but the hosting 
   web server is down. The client could spend considerable time waiting 
   for the TCP “connect ()” call to fail. Luckily this is an extremely 
   rare case where the web server hosting the CFILE has failed. See 
   section 5, where proxy server implementers are encouraged to provide 
   support for hosting CURLs on the proxy itself (acting as web 
   server).  Since proxy servers are often deployed with considerable 
   attention to fault tolerance, this corner case can be further 
   minimized. 
    
6.   Proxy Server Considerations 
    
   As mentioned in the previous section, it is suggested that proxy 
   servers be capable of acting as a web server, so that they can host 
   the CURL directly. 
    
   The implementers of proxy servers are most likely to understand the 
   deployment situations of proxy caches, the formats of proxy 
   configuration files, etc. They can also build in the ability select 
   a CFILE based on all the various inputs at the time of the CURL 
   request("User-Agent", "Accept", client IP address/subnet/hostname, 
   topological distribution of nearby proxy servers, etc). 
    
7.   Administrator Considerations 
    
   Administrators should configure at least one of the DHCP or DNS A RR 
   methods in their environment (since those are the only two all 
   compatible clients MUST implement). Beyond that, configuring to 
   support mechanisms earlier in the search order will improve client 
   startup time. 
    
   One of the major motivations for this protocol structure was to 
   support client location of "nearby" proxy servers. In many 
   environments there may be a number of proxy servers (workgroup, 
   corporate gateway, ISP, backbone). There are a number of possible 
   points at which "nearness" decisions can be made in this framework: 
    
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                        [Page 13] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

   - DHCP servers for different subnets can return different answers. 
        They can also base decisions on the client cipaddr field or the 
        client identifier option. 
    
   - DNS servers can be configured to return different SRV/A/TXT RRs 
        for Different domain suffixes (for example, QNAMEs 
        wpad.marketing.bigcorp.com and wpad.development.bigcorp.com). 
    
   - The web server handling the CURL request can make decisions based 
        on the "User-Agent", "Accept", client IP 
        address/subnet/hostname, and the topological distribution of 
        nearby proxy servers, etc.  This can occur inside a CGI 
        executable created to handle the CURL.  As mentioned above it 
        could be a proxy server itself handing the CURL request and 
        making those decisions. 
    
   - The CFILE may be expressive enough to select from a set of 
        alternatives at "runtime" on the client. CARP [6] is based on 
        this premise for an array of caches. It is not inconceivable 
        that the CFILE could compute some network distance or fitness 
        metrics to a set of candidate proxy servers and then select the 
        "closest" or "most responsive" server. 
    
   Note that it is valid to configure a DHCP daemon to respond only to 
   INFORM option queries in static IP environments 
    
   Not all of the above mechanisms can be supported in all currently 
   deployed vendor hardware and software. The hope is that enough 
   flexibility is provided in this framework that administrators can 
   select which mechanisms will work in their environments. 
    
8.   Conditional Compliance 
    
   In light of the fact that many of the discovery technologies 
   described in this document are not well deployed or not available on 
   all platforms, this specification permits conditional compliance.  
   Conditional compliance is designated by three class identifications. 
    
   Additionally, due to the possible security implications of a DHCP 
   broadcast request, it is onerous to REQUIRE an implementer to put 
   himself or his implementation at undue risk.  It is quite common to 
   have rogue DHCP servers on a network which may fool a DHCP broadcast 
   implementation into using a malicious configuration file.  On 
   platforms which do not support DHCP natively and cannot get the WPAD 
   option along with its IP address, and which cannot support the DHCP 
   INFORM unicast request, presumably to a known and trusted DHCP 
   server, the likelihood of an undetected spoofing attack is 
   increased.  Having an individual program, such as a browser, trying 
   to detect a DHCP server on a network is unreasonable, in the 
   authors' opinion.  On platforms which use DHCP for their system IP 
   address and have previously trusted a DHCP server, a unicast DHCP 
   INFORM to that same trusted server does not introduce any additional 
   trust to that server. 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                        [Page 14] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

    
8.1. Class 0 - Minimally compliant 
    
   A WPAD implementation which implements only the following discovery 
   mechanisms and interval schemes is considered class 0 compliant: 
    
   DNS A record queries 
   Browser or System session refresh intervals 
    
   Class 0 compliance is only applicable to systems or implementations 
   which do not natively support DHCP and or cannot securely determine 
   a trusted local DHCP server. 
    
8.2. Class 1 - Compliant 
    
   A WPAD implementation which implements only the following discovery 
   mechanisms and interval schemes is considered class 1 compliant: 
    
   DNS A record queries 
   DHCP INFORM Queries  
    
   Network stack change refresh intervals 
   CFILE expiration refresh intervals 
    
8.3. Class 2 - Maximally compliant 
    
   A WPAD implementation which implements only the following discovery 
   mechanisms and interval schemes is considered class 1 compliant: 
    
   DNS A record queries 
   DHCP INFORM Queries  
   DNS TXT service: queries 
   DNS SRV RR queries 
   SVRLOC Queries 
   Network stack change refresh intervals 
   CFILE expiration refresh intervals 
    
   To be considered compliant with a given class, an implementation 
   MUST support the features listed above corresponding to that class. 
    
9.   Security Considerations 
    
   This document does not address security of the protocols involved.  
   The WPAD protocol is vulnerable to existing identified weaknesses in 
   DHCP and DNS. The groups driving those standards, as well as the SLP 
   protocol standards, are addressing security. 
    
   When using DHCP discovery, clients are encouraged to use unicast 
   DHCP INFORM queries instead of broadcast queries which are more 
   easily spoofed in insecure networks. 
    
   Minimally, it can be said that the WPAD protocol does not create new 
   security weaknesses. 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                        [Page 15] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

    
10.  Acknowledgements 
    
   The authors' work on this specification would be incomplete without 
   the assistance of many people.  Specifically, the authors would like 
   the express their gratitude to the following people: 
    
   Chuck Neerdaels, Inktomi, for providing assistance in the design of 
   the WPAD protocol as well as for providing reference 
   implementations. 
    
   Arthur Bierer, Darren Mitchell, Sean Edmison, Mario Rodriguez, Danpo 
   Zhang, and Yaron Goland, Microsoft, for providing implementation 
   insights as well as testing and deployment.  
    
   Ari Luotonen, Netscape, for his role in designing the first web 
   proxy. 
    
   In addition, the authors are grateful for the feedback provided by 
   the following people: 
    
     Jeremy Worley - RealNetworks 
     Eric Twitchell - United Parcel Service 
      
11.  Copyright  
    
   Copyright (C) The Internet Society 1998. All Rights Reserved. This 
   document and translations of it may be copied and furnished to 
   others, and derivative works that comment on or otherwise explain it 
   or assist in its implementation may be prepared, copied, published 
   and distributed, in whole or in part, without restriction of any 
   kind, provided that the above copyright notice and this paragraph 
   are included on all such copies and derivative works. However, this 
   document itself may not be modified in any way, such as by removing 
   the copyright notice or references to the Internet Society or other 
   Internet organizations, except as needed for the purpose of 
   developing Internet standards in which case the procedures for 
   copyrights defined in the Internet Standards process must be 
   followed, or as required to translate it into languages other than 
   English. The limited permissions granted above are perpetual and 
   will not be revoked by the Internet Society or its successors or 
   assigns. This document and the information contained herein is 
   provided on an "AS IS" basis and THE INTERNET SOCIETY AND THE 
   INTERNET ENGINEERING TASK FORCE DISCLAIMS ALL WARRANTIES, EXPRESS OR 
   IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF 
   THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED 
   WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.  
    
12.  References 
    
   [1] Moats, R., Hamilton, M., and P. Leach, "Finding Stuff (How to    
        discover services)", Internet Draft, October 1997. 
    
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                        [Page 16] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

   [2] Gulbrandsen, A., and P. Vixie, "A DNS RR for specifying the 
        location    of services (DNS SRV)", RFC 2052, October 1996 
    
   [3] Droms, R., "Dynamic Host Configuration Protocol", RFC 2131, 
        March    1997. 
    
   [4] Veizades, J., Guttman, E., Perkins, C., and M. Day, "Service    
        Location Protocol," Internet Draft, October 1997. 
    
   [5] Hamilton, M., and R. Wright, "Use of DNS Aliases for Network    
        Services", RFC 2219, October 1997. 
    
   [6] Valloppillil, V., and K. Ross, "Cache Array Routing Protocol",    
        Internet Draft, October 1997. 
    
   [7] Alexander, S., and R. Droms, "DHCP Options and BOOTP Vendor    
        Extensions", RFC 2132, March 1997. 
    
   [8] Luotonen, A., "Navigator Proxy Auto-Config File Format", 
        Netscape    Corporation, 
        http://home.netscape.com/eng/mozilla/2.0/relnotes/    
        demo/proxy-live.html, March 1996. 
    
   [9] Mockapetris, P., "Domain Names - Concepts and Facilities",    
        RFC 1034, November 1987. 
    
   [10] Perkins, C., Guttman, E., and J. Kempf, "Service Templates and     
        service: Schemes", Internet Draft, December 1997. 
    
   [11] “A Sample DHCP Implementation for WPAD”, Inktomi Corporation,     
        http://www.inktomi.com/TBD.html, February 1998. 
    
   [KEYWORDS] Bradner, S., "Key words for use in RFCs to Indicate 
        Requirement Levels", BCP 14, RFC 2119, March 1997. 
      
13.  Author Information 
    
   Paul Gauthier 
   Inktomi Corporation 
   1900 South Norfolk Street Suite 310, San Mateo, CA 94403-1151 
   Phone: (650) 653-2800 
   Email: gauthier@inktomi.com 
    
   Josh Cohen 
   Microsoft Corporation 
   One Microsoft Way, Redmond, WA 98052 
   Phone: (425) 703-5812 
   Email: joshco@microsoft.com 
    
   Martin Dunsmuir 
   RealNetworks, Inc. 
   1111 3rd Ave, Suite 2900, Seattle, WA 98101 
   Phone: (206) 674-2237 
    
   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                        [Page 17] 


   INTERNET-DRAFT Web Proxy Auto-Discovery Protocol           7/28/99 

   Email: martind@real.com 
    
   Charles Perkins 
   Sun Microsystems, Inc. 
   15 Network Circle, Menlo Park, CA  94025 
   Phone: (650) 786-6464 
   Email: charles.perkins@Sun.COM 
    

   Category: Standards Track                    Expires: December 1999 
   Gauthier, Cohen, Dunsmuir, Perkins                        [Page 18]