Internet Draft Johan Ihren draft-ihren-dnsop-interim-signed-root-00.txt Autonomica October 2002 Expires in six months An Interim Scheme for Signing the Public DNS Root Status of this Memo This document is an Internet-Draft and is in full conformance with all provisions of Section 10 of RFC2026. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html. Abstract This memo documents a proposed mechanism for a first stage of a transition from an unsigned DNS root to a signed root, such that the data in the root zone is accompanied by DNSSEC signatures to allow validation. The underlying reason for signing the root is to be able to provide a more secure DNS hierarchy, where it is possible to distinguish false answers from correct answers. For the special case of the DNS root zone, an interim scheme is proposed. This scheme is mostly aimed at securing the root zone itself for technical and operational reasons, and to give operational experience of DNSSEC. 1. Terminology The key words "MUST", "SHALL", "REQUIRED", "SHOULD", "RECOMMENDED", and "MAY" in this document are to be interpreted as described in RFC 2119. 2. Motivation for signing the DNS root. In the special case of the root zone there are very strong reasons to take a slow and conservative approach to any changes with operational impact. Signing the root is such a change. DNSSEC[RFC2535, RFC3090] has been in development for a number of years now and still has not reached the point where the last flag day is behind us. However, during the years of DNSSEC development and refinement[RFC2930, RFC3007, RFC3008, RFC3110, RFC3225, RFC3226, AD-secure, Opt-in, Wild-card-optimize], the Internet has matured and more and more businesses and other organizations have become dependent on the stability and constant availability of the Internet. It is therefore prudent to do everything in our power to ensure that the DNS infrastructure works as well as possible, as well as, when appropriate and possible, also enhancing the functionality of DNS. In the particular case of the maintenance of the distribution mechanisms for the DNS root zone this is exactly what the root server operators have been doing for more than fifteen years. The time is now right for yet another step of improvement by signing the root zone. By doing that any Internet user that so wish will obtain the ability of verifying the responses received from the root nameservers. Since this is new operational ground the objective is not maximum security but rather an "Internet-wide" controlled experiment with a signed root zone where the trade off is that we utilize the fact that there are operators in place that can help but they are not sufficiently staffed to do off-line signing in a 24x7 mode. For this reason it is fully possible to even do automatic signing, since the purpose is to ensure that DNSSEC works operationally with a signed root zone and gain experience from the exercise. 2.1. Motivation for signing the root zone now. The reason DNSSEC is not yet widely deployable is that the problem of key management remains unsolved, especially where communication between the zone administrators for a parent zone and child zone is required. However, during the past year a solution to this problem has been found (in the form of a new record type, DS aka Delegation Signer)[DS], discussed, implemented and tested. Therefore it is finally reasonable to consider DNSSEC deployment to be a real possibility within the next 12 months. In the case of the root zone the particular importance of managing the transition with zero operational mistakes is a strong reason to separate the signing of the zone itself, with the associated key management issues, from the future management of signed delegations (of top level domains). Although support for Delegation Signer has been implemented and tested it is not yet generally available except experimentally. For this reason signed delegations for productions zones will have to wait a bit longer. Furthermore, it will take some time to ensure that the entire RSS aka Root Server System is capable of supporting the protocol changes that accompany the new support for Delegation Signer. By signing now it will be possible to work through the operational issues with signing the zone itself without at the same time having to manage the additional complexity of signed delegations. Also, by explicitly not supporting any signed delegations, it is even possible recover in a graceful way should new operational problems turn up. Because of these operational and technical issues there is good reason to sign the root zone before the status of implementation of Delegation Signer support change to "generally available" thereby increasing the pressure for signed delegations. 3. Interim signing of the root zone. At present all the authoritative servers for the root zone pull the zone from a so-called hidden primary master. I.e. any changes at the hidden master will propagate via NOTIFY[RFC1996] and subsequent AXFR/IXFR[RFC1995, AXFR-clarify] to the slave servers. Between the hidden primary master and the slaves transactions are signed with TSIG[RFC2845] signatures. This mechanism is already in place, and there is operational experience with periodic key rollover of the TSIG keys. A workable interim scheme should fit into this existing structure. 3.1. Design philosophy. By introducing a signing step into the distribution of the zone file complexity is increased. To avoid introducing new single points of failure it is therefore important to ensure that any added or changed steps are as redundant as possible. The assumption is that the operators of the root servers are trusted and that consistency of the zone among the root servers is a crucial property that MUST be preserved in emergencies. To ensure that consistency is maintained new single points of failure SHOULD NOT be introduced by the signing step. Therefore a structure where several parties have the ability to sign the zone is proposed. Furthermore, such a design helps avoid any individual party becoming a de facto single zone signing authority. 3.2. Proposed new management structure for the root zone. Taking into account the already existing infrastructure for management of TSIG keys and rollover of these keys the following structure is proposed: * Day-to-day signing of the root zone is performed by a subgroup of the root server operators. For this task individual operator keys are used. * The set of operator keys are signed by one key-signing key (sometimes referred to as a "master key") at any particular time. The key-signing key in use has to be statically configured in all resolvers that want to be able to perform validation of responses. * Key rollover, the operation when an old key-signing key is exchanged for a new key-signing key, is managed with minimal overlap and on a frequent basis of no less than three times per year. The rollover schedule is chosen to be frequent enough to not make long-term trust possible while being low enough to not become operationally infeasible. 3.2.1. Management and distribution of the zone file. The present, unsigned zone is published by the official slaves, the "root nameservers" transferring the zone securely from a hidden primary master and subsequently using the data to respond to queries. This mechanism is changed into: * The unsigned root zone is transferred securely from the hidden primary master to a set of "signing primaries" managed by the operators participating in signing the zone. This is done via the traditional mechanisms NOTIFY + AXFR/IXFR + TSIG. * The root nameservers change their configuration so that they replace the present, single, hidden primary master as the source of the zone with a set of multiple possible "signing primaries" as masters that provide the signed zone. * When a new, unsigned zone, is published by the hidden primary master it will automatically be transferred to the pre-signing servers. The responsible operator will then sign the new zone and publish it from his signing primary. Since the serial number is higher than what the official root nameservers presently have the official root nameservers will all transfer the zone from this source (because of the redundant configuration with multiple possible masters for the signed zone). * The operators that participate in signing rotate the signing responsibility among themselves. Should the responsible operator be unable to do this for any reason then any of the others are able to do the signing instead. Traceability is maintained since the operator keys are individual. * To avoid having several "backup signing operators" possibly sign at exactly the same time backups are allocated "time windows" within which they are allowed to publish a signed zone. With N signers, each signer is assigned a unique number R in [1..N]. T = 2*k*((S-R) mod N) T is time to wait in seconds, S current serial number. The length of the window is k, thereby separating each signing window with an interval where signing is not allowed. The constant k is used to create sufficient separation of the signers with respect to time used to sign and time needed to distribute the zone. A reasonable value for k would be in the order of 1800 seconds. * Because the digital signatures in the signed root zone MUST NOT expire it is necessary to ensure that the unsigned zone actually changes sufficiently often to cause new signing to occur within the validity period of the old signatures. This is most easily accomplished by a dummy update that only increments the serial number in the SOA record. This new requirement will not cause any operational problems, since in fact the root zone is maintained this way since several years back. 3.2.2. Management of the key-signing keys. The key-signing keys are periodically issued by the Regional Internet Registries, RIRs, individually. The RIRs should all perform the task of generating individual key-signing keys and signing the "keyset". However, only one keyset should be included in the signed zone file. The RIRs were chosen on the grounds of * their proven record of service to the Internet community * not having the domain name system as their primary field of activity * their geographical diversity * the fact that they are, by definition, not a single entity, but rather a collective of organizations, thereby alleviating the risk of the "signing authority" sticking in one place. The requirement on the individual RIRs is that they must be able to * establish a secure out-of-band communication path in collaboration with the signing operators which will be used for authenticated exchange of the unsigned keyset. * periodically generate strong keys using a good random number generator * manage their keys (i.e. use them for signing the operator keyset and keeping the private key appropriately secret) Technically the operation of signing the operator keys works by the key-signing key signing the set of keys (i.e. itself plus all the operator keys). The result is called a "signed keyset". 3.2.3. Management of the operator keys and the signed "keyset". A subgroup of the root operators that choose to participate in signing the zone each hold an individual "operator key". The subgroup of operators may include all operators, but that is not necessary as long as a sufficient number to achieve redundancy in "signing ability" participates. * The set of operator keys should be sent as a signed, authenticated block of data to all of the RIRs via an out-of-band mechanism. * Each RIR should generate a new signed keyset and send this back to the signing operators via an out-of-band mechanism. * The signing operators should include one of the received signed keysets in the zone file according to an agreed upon rotation schedule. 3.2.4. Management of key rollover. The key-signing keys should be short-lived and rolled over frequently. The direct intent is that it should not be possible to put long term trust in the keys. Therefore, by design, every resolver that chooses to configure the key (to be able to validate lookups) will have to change the "trusted key" periodically. This is, after all, only an interim method of root zone signing. * Key rollover is executed by changing from one signed keyset to another. I.e. from a keyset signed by a key-signing key held by one RIR to a keyset signed by a key-signing key held by the next RIR in the chain. * Technically the signing operators are able to initiate key rollover, although except for the case of a suspected key compromise (with subsequent immediate key rollover) this ability should only be used according to an established schedule. * New key-signing keys will be published as widely as possible, however exactly how and where to publish the keys is by itself an area where it is necessary to acquire more experience. Especially for the case of individual hosts and other devices this will be a significant tissue to deal with. * Since the keys expire within a few months the aim is to make it impossible to configure an interim key and then forget about it long enough to still trust an interim key when a long term design for signing the root zone emerges. 4. Risk Analysis A change in the management of the root zone is always a risk. But that is all the more reason to do it carefully and after due consideration, rather than as a result of an immediate and urgent need. The gains of a signed root zone have to be judged against the added complexity of the signing step. However, added complexity, in one form or another, is basically unavoidable. It is possible to argue for or against implementation details, but in the end the benefits of a signed root will have to be compared against some amount of added complexity. If the cost or risk of this complexity is deemed to be too high then it is not possible to sign the DNS root zone. If that is the conclusion then it is obviously important to reach it as soon as possible. The same is true for the need for operational experience with a signed root zone. There is no method of acquiring this experience except by signing the root zone, so that is what is being proposed. Some identified scenarios: 4.1. What will the consequences of a signed root zone be for old resolver software? Nameservers that are authoritative for signed zones only give out these signatures when explicitly asked for them. Therefore the presence of signatures in the root zone will not have any impact at all on the majority of resolvers on the Internet that does not validate lookups. 4.2. What happens if a signing operator fails to sign the zone on time? Literally nothing. I.e. the root zone that is published will be the version prior to the last change. This is not believed to be a major problem, since all changes to the data in the root zone are characterized by long overlaps in time. Furthermore every change is preceded by an administrative process that takes several days. Because of this, a failure to install a new version of the root zone for even so long as a day will not noticeably change the turn-around time for changes in the root zone. 4.3. What happens if several signing operators by mistake sign the same version? Literally nothing. One signing operator will be first, according to the mechanism designed to separate the different backup signing operators described in 3.2.1. The signed zone published by this operator will be the version of the signed root zone that all the root nameservers pick up. When the second signing operator signs the zone the serial number in the zone will be unchanged, and therefore the root nameservers will not pick this zone up but instead stay with the first version. 4.4. What happens if the unsigned zone is compromised between the hidden primary master and the signing primaries? This is basically the same threat as the zone being compromised between the hidden primary master and the root servers in the traditional unsigned case. To guard against this possibility every zone transfer is already protected by a TSIG signature. Because of this the root zone file cannot get transferred to the signing primaries unless the signature matches thereby proving that the zone contents are uncompromised. The consequence is that if the zone transfers are somehow compromised in transit they will not get signed and therefore the published root zone will remain the signed version of the last uncompromised transfer. 4.5. What are the security implications for the new "signing primaries"? Will they not have to be as secure as the hidden primary master is now? Yes. However, the entire root server system is based upon trust, i.e. the entire Internet is trusting the present root server system because there is no alternative to doing that. This proposal is not about changing the need for trust in the root server system. It is about providing a method of determining authenticity of data, something that is presently missing. The root operators are already trusted to provide a root server system based upon secure servers lacking validation mechanisms. It is therefore a bit difficult to argue for not trusting them to provide an improved system that adds exactly the lacking validation mechanisms on a basis of not trusting them to secure the servers involved. In both cases trust is involved, the difference is that in the latter case validation is possible. Furthermore, the proposed signing primaries will not need to have publicly known addresses (just as the present hidden primary master is not publicly known), nor will they need to be able to communicate with anyone outside the well defined set of servers that are part of the root server system. Because of this it will be significantly easier to secure the signing primaries than the already present task of securing the DNS root nameservers. 4.6. What happens if an operator key is compromised? If this happens it is necessary to do an emergency rollover of the keyset including the compromised key. I.e. a new set of operator keys is generated, the new keyset is signed by the "RIR on duty" using the active key-signing key and then the root zone is re-signed using one of the new operator keys. This problem is not unique to a signed root zone, it is inherent in any activity involving cryptographic keys. 4.7. What happens if the key-signing key is compromised? If this happens it is necessary to do an emergency rollover of the keyset including the compromised key and the suggested method is by switching to a keyset signed by a key-signing key issued by the next RIR "in line", i.e. just re-schedule the next planned rollover to take place immediately.. The new signed keyset is added to the root zone and then the zone is re-signed using one of the new operator keys. In this case, since the key-signing key is changed, every resolver will have to configure the new key to be able to validate lookups. This problem is not unique to a signed root zone, it is inherent in any activity involving cryptographic keys. 4.8. Does the out-of-band communication between the signing operators and the RIRs holding the key-signing keys add a new failure mode? Yes. The traditional DNSSEC design assumes that (for an arbitrary zone, not particularly for the root zone) the key-signing key is managed by the same entity that manages the operator keys and hence no communication issue exists. In this case it is important that the signing operators do not hold the responsibility for the key-signing keys. The root server operators should only act as a "publishing service" for the root zone contents, not as the entity that verifies correctness of the published data. However, apart from established secure methods of out-of-band communication being available, there is also the very real possibility of managing these exchanges with actual eye-to-eye contact. Representatives for the RIRs and the root server operators already meet on a regular basis during IETF meetings and the different RIR meetings. 5. Security Considerations Signing the DNS root zone is all about security. A signed root zone will aid applications and organizations all over the Internet in improving their security. Signing the root zone does add a new management step and therefore it is important to ensure that possible failures in this management step does not leave the published root zone in a state that is actually worse than the original unsigned state. The various failure modes that have been identified all show this property of not being any worse than the unsigned case. There is however one scenario that changes drastically with the proposed distributed signing scheme and that is the consequences of one signing operator intentionally changing the contents of the root zone prior to the actual signing. In the unsigned case this will cause the root zone to become inconsistent (i.e. the responses vary depending upon which server it comes from), while in the signed case the root zone will remain consistent but with erroneous data in it. It is my belief (this is impossible to prove) that the risk of human error among the operators, however small, is still far greater than the risk of willful misconduct. Therefore the proposed design that enforces consistency among the roots is actually also an improvement stability and security wise. Se further section 4 for a more complete risk analysis. 6. IANA Considerations. NONE. 7. References 7.1. Normative. [RFC2535] Domain Name System Security Extensions. D. Eastlake. March 1999. [RFC3090] DNS Security Extension Clarification on Zone Status. E. Lewis. March 2001. [RFC1995] Incremental Zone Transfer in DNS. M. Ohta. August 1996. [RFC1996] A Mechanism for Prompt Notification of Zone Changes (DNS NOTIFY). P. Vixie. August 1996. [RFC2845] Secret Key Transaction Authentication for DNS (TSIG). P. Vixie, O. Gudmundsson, D. Eastlake, B. Wellington. May 2000. 7.2. Informative. [RFC2930] Secret Key Establishment for DNS (TKEY RR). D. Eastlake. September 2000. [RFC3007] Secure Domain Name System (DNS) Dynamic Update. B. Wellington. November 2000. [RFC3008] Domain Name System Security (DNSSEC) Signing Authority. B. Wellington. November 2000. [RFC3110] RSA/SHA-1 SIGs and RSA KEYs in the Domain Name System (DNS). D. Eastlake 3rd. May 2001. [RFC3225] Indicating Resolver Support of DNSSEC. D. Conrad. December 2001. [RFC3226] DNSSEC and IPv6 A6 aware server/resolver message size requirements. O. Gudmundsson. December 2001. [Delegation-Signer] Delegation Signer Resource Record. O. Gudmundsson. October 2002. Work In Progress. [AXFR-clarify] draft-ietf-dnsext-axfr-clarify-NN.txt DNS Zone Transfer Protocol Clarifications. A. Gustafsson. March 2002. Work In Progress. [AD-secure] draft-ietf-dnsext-ad-is-secure-NN.txt Redefinition of DNS AD bit. B. Wellington, O Gudmundsson. June 2002. Work In Progress. [Opt-In] draft-ietf-dnsext-dnssec-opt-in-NN.txt DNSSEC Opt-In R. Arends, M Kosters, D. Blacka. June 2002. Work In Progress. [Wild-card-optimize] draft-olaf-dnsext-dnssec-wildcard- optimization-NN.txt DNSSEC Wildcard optimization. O. Kolkman, J. Ihren, R. Arends. September 2002. Work In Progress. 8. Acknowledgments. To help me produce this document I have received lots and lots of comments from many people around the Internet with suggestions for lots and lots sorely needed improvements. Among the folks who helped out are, in no particular order: Paul Vixie, Bill Manning, Ôlafur Gumundsson, Rob Austein, Patrik F„ltstr÷m, Olaf Kolkman, Harald Alvestrand, Suzanne Woolf, John M. Brown and Lars-Johan Liman. 9. Authors' Address Johan Ihren Autonomica AB Bellmansgatan 30 SE-118 47 Stockholm, Sweden johani@autonomica.se