Network Working Group M. Nottingham Internet-Draft August 5, 2010 Intended status: Informational Expires: February 6, 2011 Considerations for Captive Portals in HTTP draft-nottingham-http-portal-00 Abstract "Captive portals" are a commonly-deployed means of obtaining access credentials and/or payment for a network. This memo discusses issues of their use for HTTP applications, and proposes one possible mitigation strategy. This memo should be discussed on the ietf-http-wg@w3.org mailing list, although it is not a work item of the HTTPbis WG. Status of this Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at http://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on February 6, 2011. Copyright Notice Copyright (c) 2010 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Simplified BSD License text as described in Section 4.e of Nottingham Expires February 6, 2011 [Page 1] Internet-Draft HTTP Captive Portals August 2010 the Trust Legal Provisions and are provided without warranty as described in the Simplified BSD License. Table of Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3 2. HTTP Issues Encountered . . . . . . . . . . . . . . . . . . . . 3 3. Proposal . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 4. What about Non-HTTP Applications and Techniques? . . . . . . . 5 5. Security Considerations . . . . . . . . . . . . . . . . . . . . 6 6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . . 6 Appendix A. Acknowledgements . . . . . . . . . . . . . . . . . . . 6 Author's Address . . . . . . . . . . . . . . . . . . . . . . . . . 6 Nottingham Expires February 6, 2011 [Page 2] Internet-Draft HTTP Captive Portals August 2010 1. Introduction It has become common for networks to require authentication, payment and/or acceptance of terms of service before granting access. Typically, this occurs when accessing "public" networks such as those in hotels, trains, conference centres and similar networks. While there are several potential means of providing credentials to a network, these are not yet universally supported, and in some instances the network administrator requires that information (e.g., terms of service, login information) be displayed to end users. In such cases, it has become widespread practice to use a "captive portal" that diverts HTTP requests to the administrator's web page. Once the user has satisfied requirements (e.g., for payment, acceptance of terms), the diversion is ended and "normal" access to the network is allowed. Typically, this diversion is accomplished by one of several possible techniques; o IP interception - all requests on port 80 are intercepted and send to the portal. o HTTP redirects - all requests on port 80 are intercepted and an HTTP redirect to the portal's URL is returned. o DNS interception - all DNS lookups return the portal's IP address. In each case, the intent is that users connecting to the network will open a Web browser and see the portal. This memo examines the HTTP-related issues that these techniques raise, and proposes a potential mitigation strategy. 2. HTTP Issues Encountered Since clients cannot differentiate between a portal's response and that of the HTTP server that they intended to communicate with, a number of issues arise. One example is the "favicon.ico" commonly used by browsers to identify the site being accessed. If the favicon for a given site is fetched from a captive portal instead of the intended site (e.g., because the user is unauthenticated), it will often "stick" in the browser's cache (most implementations cache favicons aggressively) beyond the portal session, so that it seems as if the portal's favicon has "taken over" the legitimate site. Nottingham Expires February 6, 2011 [Page 3] Internet-Draft HTTP Captive Portals August 2010 Another browser-based issue comes about when P3P is supported. Depending on how it is implemented, it's possible a browser might interpret a portal's response for the p3p.xml file as the server's, resulting in the privacy policy (or lack thereof) advertised by the portal being interpreted as applying to the intended site. Other Web-based protocols such as WebFinger , CORS and OAuth may also be vulnerable to such issues. Although HTTP is most widely used with Web browsers, a growing number of non-browsing applications use it as a substrate protocol. For example, WebDAV and CalDAV both use HTTP as the basis (for network filesystem access and calendaring, respectively). Using these applications from behind a captive portal can result in spurious errors being presented to the user, and might result in content corruption, in extreme cases. Similarly, other non-browser applications using HTTP can be affected as well; e.g., widgets , software updates, and other specialised software such as Twitter clients and the iTunes Music Store. It should be noted that it's sometimes believed that using HTTP redirection to direct traffic to the portal addresses these issues. However, since many of these uses "follow" redirects, this is not a good solution. 3. Proposal The heart of the issues seen is that the client doesn't understand that a response from the portal does not represent the requested resource. As such, the response needs to indicate that it is non-authoritative. In HTTP, response status codes indicate the type of response, and therefore defining a new one is the most appropriate way to do this. Status codes are divided into general classes; 1xx - Informational 2xx - Successful 3xx - Redirection Nottingham Expires February 6, 2011 [Page 4] Internet-Draft HTTP Captive Portals August 2010 4xx - Client errors 5xx - Server errors Although it's common for captive portals to use redirection status codes, defining a new 3xx code for them isn't practical; current implementations won't recognise the new status code, and therefore won't follow it. Error status codes, on the other hand, have a nice property in that browsers will generally display the response content if they don't understand the status code. The one exception to this is Internet Explorer, which will display a "friendly" message if the response body is too small; however, this is easy enough to work around, by padding the response message as necessary. HTTP defines 4xx status codes as those where the error lies in the client; i.e., the client shouldn't retry the same request without changing something. This is arguably more appropriate than using a 5xx error, where the error is said to lie in the server's area of responsibility, because clients might automatically retry a request upon seeing a 5xx error. In fact, there's already an existing status code with similar (but not quite suitable) semantics; 407 Proxy Authentication Required. What's needed is a new status code with the semantics of "Network Authentication Required." As such, this memo proposes (but does not yet define) using a new HTTP response status code in the 4xx range with the semantics "Network Authentication Required" to mitigate the risks of captive portals. Captive portals that deploy this status code will return it for all requests other than those to the actual portal resources (e.g., images). Clients that are unaware of the specific semantics of the new status code will fall back to treating it as a generic 400 error, and browsers will display the portal page to users. Note that this would make the HTTP redirection technique described above obsolete; the portal page would be served directly with the new status code. 4. What about Non-HTTP Applications and Techniques? This memo does not address non-HTTP applications, such as IMAP, POP, or even TLS-encapsulated HTTP. Since captive portals almost always target Web browsers (has anyone ever seen one that inserts an e-mail Nottingham Expires February 6, 2011 [Page 5] Internet-Draft HTTP Captive Portals August 2010 into your inbox asking you to authenticate?), this is appropriate. Instead, it is anticipated that well-behaved portals will block all non-HTTP ports (especially port 443) until the user has successfully authenticated. Overall, there may also be an interesting discussion to be had about improving network access methods to the point where a user interface can be presented for the same purposes, without resorting to intercepting HTTP traffic. However, since such a mechanism would by necessity require modifying the network stack and operating system of the client, this memo takes a more modest approach. 5. Security Considerations This memo does not (yet) define any protocol elements, and therefore does not (yet) have any security considerations. 6. IANA Considerations This document has no actions for IANA. Appendix A. Acknowledgements The author takes all responsibility for errors and omissions. Author's Address Mark Nottingham Email: mnot@mnot.net URI: http://www.mnot.net/ Nottingham Expires February 6, 2011 [Page 6]