Internet Draft Abel Yang draft-chen-idn-ace-eval-tw-00.txt Erin Chen Jul 13, 2001 TWNIC Expires Jan 13, 2002 Evaluation of various ACEs with existing Chinese (BIG5) Domain Names Status of this memo This document is an Internet-Draft and is in full conformance with all provisions of Section 10 of RFC2026. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html. Abstract In order to help the ACE (ASCII Compatible Encoding) Design Team in IDN Working Group to selecting an appropriate ACE proposal. This document offer an overview of the results of various ACEs that being applied to Chinese (BIG5) Domain Names. And give a brief summary of the evaluation result. 1. Test data and tools The sample data were come from TWNIC registration database. All of them are in BIG5 encoding. The amount of the data is 28454. And the data distribution by the length is: Length of data quantity of BIG5 Domain Names 2 chars 3762 3 chars 3914 4 chars 4613 5 chars 1723 6 chars 1851 7 chars 4282 8 chars 1621 9 chars 1254 10 chars 2075 11 chars 1215 12 chars 753 13 chars 646 14 chars 745 amount=28454 TWNIC just accept 2 to 14 BIG5 chars in a Chinese Domain Name. Those data are all in local encoding of BIG5 ( or CP950 ). Base on those data, apply them to various ACEs. There are RACE-03 [RACE], BRACE-00 [BRACE], LACE-01 [LACE], UTF-6-00 [UTF6], DUDE-02 [DUDE], AMC-ACE-M-00 [AMCACEM], AltDUDE-01 [AltDUDE], AMC-ACE-O-00 [AMCACEO], and AMC-ACE-R-01 [AMCACER]. 2. Result of each ACE length The CHN_len is the length of Chinese (BIG5) domain name in byte before apply ACE encoding. All of the data in the same CHN_len, the max_len is the maximum length, the min_len is the minimum length, the aver_len is the averager length of them after apply ACE encoding. 2.1 AMC-ACE-M CHN_len max_len min_len aver_len ----------- -------- -------- -------- 2 8 5 8 3 11 6 11 4 17 8 14 5 17 11 17 6 21 11 20 7 24 13 22 8 27 15 25 9 30 16 28 10 33 18 31 11 36 22 34 12 39 19 37 13 42 21 40 14 45 23 43 2.2 AMC-ACE-O CHN_len max_len min_len aver_len ----------- -------- -------- -------- 2 8 5 8 3 12 6 11 4 20 9 15 5 20 11 18 6 24 11 22 7 27 13 25 8 31 15 29 9 35 16 32 10 39 18 36 11 42 24 38 12 46 19 41 13 50 21 44 14 54 23 48 2.3 AMC-ACE-R CHN_len max_len min_len aver_len ----------- -------- -------- -------- 2 8 5 8 3 12 6 12 4 20 9 16 5 20 11 19 6 24 11 23 7 28 13 26 8 32 15 30 9 36 16 34 10 40 18 38 11 44 24 41 12 48 19 44 13 52 21 48 14 56 24 51 2.4 AltDUDE CHN_len max_len min_len aver_len ----------- -------- -------- -------- 2 8 5 8 3 12 6 11 4 20 8 16 5 20 13 19 6 24 11 23 7 28 14 26 8 32 15 30 9 36 18 34 10 40 21 38 11 44 26 41 12 48 26 46 13 52 25 48 14 56 30 53 2.5 BRACE CHN_len max_len min_len aver_len ----------- -------- -------- -------- 2 7 5 7 3 10 7 10 4 17 8 14 5 17 11 17 6 20 11 20 7 23 13 23 8 28 14 26 9 30 15 30 10 33 17 33 11 36 22 36 12 39 18 39 13 42 20 42 14 46 23 46 2.6 DUDE CHN_len max_len min_len aver_len ----------- -------- -------- -------- 2 8 5 8 3 12 6 11 4 20 8 16 5 20 13 19 6 24 11 23 7 28 15 26 8 32 15 30 9 36 18 34 10 40 21 38 11 44 27 41 12 48 26 46 13 52 25 48 14 56 30 53 2.7 LACE CHN_len max_len min_len aver_len ----------- -------- -------- -------- 2 8 7 8 3 12 8 12 4 18 10 15 5 18 18 18 6 21 16 21 7 24 21 24 8 31 23 28 9 31 24 31 10 34 26 34 11 37 34 37 12 40 29 40 13 44 31 44 14 47 36 47 2.8 RACE CHN_len max_len min_len aver_len ----------- -------- -------- -------- 2 8 5 8 3 12 7 12 4 18 8 15 5 18 18 18 6 21 20 21 7 24 24 24 8 31 28 28 9 31 31 31 10 34 34 34 11 37 37 37 12 40 40 40 13 44 44 44 14 47 47 47 2.9 UTF-6 CHN_len max_len min_len aver_len ----------- -------- -------- -------- 2 8 5 8 3 12 6 12 4 20 9 16 5 20 14 20 6 24 14 24 7 28 18 28 8 32 20 32 9 36 21 36 10 40 23 40 11 44 30 44 12 48 28 48 13 52 29 52 14 56 33 56 3. Result of each ACE conversion speed UTF-6 RACE LACE BRACE DUDE ALTDUDE MAX 0.80 0.75 0.82 0.86 0.74 0.70 MIN 0.56 0.55 0.52 0.62 0.53 0.53 AVERAGE 0.68 0.64 0.62 0.73 0.64 0.62 AMC-ACE-M AMC-AMC-O AMC-ACE-R MAX 1.12 1.92 0.99 MIN 0.86 1.69 0.67 AVERAGE 0.98 1.81 0.82 4. Summary of the result All the ACE max_length is less than 64, because the Chinese Domain Name is limited from 2 to 14 BIG5 chars. For those sample data, the result of LENGTH and SPEED, it seems the LACE and RACE are better than others. 5. Acknowledgements This ACE evaluation testing tool was provided by JPNIC. TWNIC provide the real registered Chinese ( BIG5 ) domain names. Hsu Nai-Wen and TWNIC MDNS technical team give a lot of suggestion. 6. Author's Address Abel Yang Taiwan Network Information Center (TWNIC) 4F-2, No. 9, Sec. 2, Roosevelt Rd., Taipei, 100 Taiwan. Email: abelyang@twnic.net.tw Tel: +886-23411313#505 Erin Chen Taiwan Network Information Center (TWNIC) 4F-2, No. 9, Sec. 2, Roosevelt Rd., Taipei, 100 Taiwan. Email: erin@twnic.net.tw Tel: +886-23411313#502 7. References [RFC1035] "DOMAIN NAMES - IMPLEMENTATION AND SPECIFICATION", RFC1034, Nov 1987, P. Mockapetris [RACE] "RACE: Row-based ASCII Compatible Encoding for IDN", draft-ietf-idn-race-03.txt, Nov 2000, P Hoffman [BRACE] "BRACE: Bi-mode Row-based ASCII-Compatible Encoding for IDN version 0.1.2" draft-ietf-idn-brace-00.txti, Sep 2000, A Costello [LACE] "LACE: Length-based ASCII Compatible Encoding for IDN" draft-ietf-idn-lace-01.txt, Jan 2001, M Davis, P Hoffman [UTF6] "UTF-6 - Yet Another ASCII-Compatible Encoding for IDN" draft-ietf-idn-utf6-00, Nov 2000, M Welter, B Spolarich [DUDE] "Differential Unicode Domain Encoding (DUDE)" draft-ietf-idn-dude-02.txt, Jun 2001, M Welter, B Spolarich, A Costello [AMCACEM] "AMC-ACE-M version 0.1.0" draft-ietf-idn-amc-ace-m-00.txt, Feb 2001, A Costello [AltDUDE] "AltDUDE version 0.0.2" draft-ietf-idn-altdude-00.txt, Mar 2001, A Costello [AMCACEO] "AMC-ACE-O version 0.0.3" draft-ietf-idn-amc-ace-o-00.txt, Mar 2001, A Costello [AMCACER] "AMC-ACE-R version 0.2.1" draft-ietf-idn-amc-ace-r-01.txt, May 2001, A Costello [AMCACEV] "AMC-ACE-V version 0.1.0" draft-ietf-idn-amc-ace-v-00.txt, May 2001, A Costello [AMCACEW] "AMC-ACE-W version 0.1.0" draft-ietf-idn-amc-ace-w-00.txt, May 2001, A Costello [MACE] "MACE: Modal ASCII Compatible Encoding for IDN" draft-ietf-idn-mace-00.txt, Jun 2001, M Ishisone, Y Yoneya [LDUDE] "Improving ACE using code point reordering v0.9" draft-ietf-idn-lsb-ace-00.txt, Jun 2001, Soobok Lee [MDNKIT] "Multilingual Domain Name tool Kit", http://www.nic.ad.jp/jp/research/idn/mdnkit/download/ [NAMEPREP] "Preparation of Internationalized Host Names", draft-ietf-idn-nameprep-03.txt, Feb 2001, P Hoffman, M Blanchet