Internet DRAFT - draft-happel-sml-problem-statement

draft-happel-sml-problem-statement







Network Working Group                                       H.-J. Happel
Internet-Draft                                              audriga GmbH
Intended status: Informational                               C. Junghans
Expires: 31 July 2023                                    27 January 2023


         Structured Email: Problem Statement and Areas of Work
                 draft-happel-sml-problem-statement-00

Abstract

   This document discusses benefits of complementing existing email
   standards by means that allow to replace or extend text-based email
   messages with message parts that describe content (full or in parts)
   in a machine-readable way.  This would enable rich and structured
   interaction for its recipients - may it be human users or agents on
   their behalf.

Status of This Memo

   This Internet-Draft is submitted in full conformance with the
   provisions of BCP 78 and BCP 79.

   Internet-Drafts are working documents of the Internet Engineering
   Task Force (IETF).  Note that other groups may also distribute
   working documents as Internet-Drafts.  The list of current Internet-
   Drafts is at https://datatracker.ietf.org/drafts/current/.

   Internet-Drafts are draft documents valid for a maximum of six months
   and may be updated, replaced, or obsoleted by other documents at any
   time.  It is inappropriate to use Internet-Drafts as reference
   material or to cite them other than as "work in progress."

   This Internet-Draft will expire on 31 July 2023.

Copyright Notice

   Copyright (c) 2023 IETF Trust and the persons identified as the
   document authors.  All rights reserved.

   This document is subject to BCP 78 and the IETF Trust's Legal
   Provisions Relating to IETF Documents (https://trustee.ietf.org/
   license-info) in effect on the date of publication of this document.
   Please review these documents carefully, as they describe your rights
   and restrictions with respect to this document.  Code Components
   extracted from this document must include Revised BSD License text as
   described in Section 4.e of the Trust Legal Provisions and are
   provided without warranty as described in the Revised BSD License.



Happel & Junghans         Expires 31 July 2023                  [Page 1]

Internet-Draft   Structured Email: Problem Statement and    January 2023


Table of Contents

   1.  Introduction  . . . . . . . . . . . . . . . . . . . . . . . .   2
   2.  Problem statement . . . . . . . . . . . . . . . . . . . . . .   3
     2.1.  Email content is not machine-readable . . . . . . . . . .   3
     2.2.  Using email as a "personal API" . . . . . . . . . . . . .   3
     2.3.  Slow adoption of existing solutions . . . . . . . . . . .   3
   3.  Areas of work . . . . . . . . . . . . . . . . . . . . . . . .   4
     3.1.  Core areas of work  . . . . . . . . . . . . . . . . . . .   4
       3.1.1.  Enable email clients to understand email content  . .   4
       3.1.2.  Enable users to compose structured email  . . . . . .   4
     3.2.  Cross-cutting areas of work . . . . . . . . . . . . . . .   5
       3.2.1.  Internet Message Format extensions  . . . . . . . . .   5
       3.2.2.  Trust . . . . . . . . . . . . . . . . . . . . . . . .   5
       3.2.3.  Interaction with HTTP backend APIs  . . . . . . . . .   6
     3.3.  Optional areas of work  . . . . . . . . . . . . . . . . .   6
       3.3.1.  Email search and filtering  . . . . . . . . . . . . .   6
       3.3.2.  Interaction with other data types and external
               APIs  . . . . . . . . . . . . . . . . . . . . . . . .   6
       3.3.3.  Email storage formats . . . . . . . . . . . . . . . .   7
       3.3.4.  Common practices in email processing  . . . . . . . .   7
     3.4.  Out of scope areas of work  . . . . . . . . . . . . . . .   7
       3.4.1.  Data extraction . . . . . . . . . . . . . . . . . . .   7
       3.4.2.  UI design . . . . . . . . . . . . . . . . . . . . . .   8
   4.  Privacy considerations  . . . . . . . . . . . . . . . . . . .   8
   5.  Security considerations . . . . . . . . . . . . . . . . . . .   8
   6.  IANA Considerations . . . . . . . . . . . . . . . . . . . . .   8
   7.  Informative References  . . . . . . . . . . . . . . . . . . .   8
   Authors' Addresses  . . . . . . . . . . . . . . . . . . . . . . .  10

1.  Introduction

   Email is a successful and widespread technology which is likely to
   remain important, even if new vendors and technologies continuously
   challenge its role.

   Email, to some extent, can be considered a victim of its own success.
   Interrelations of various components have created a relatively stable
   software ecosystem, which makes it difficult to introduce larger
   improvements.  While many RFCs have updated and extended initial
   email specifications, the major inner workings of email remain widely
   unchanged since their inception.









Happel & Junghans         Expires 31 July 2023                  [Page 2]

Internet-Draft   Structured Email: Problem Statement and    January 2023


   This documents lines out certain issues with email content that might
   be addressed by standardization work.  The proposed approach aims to
   enable novel ways of how users and automated programs can interact
   via email while retaining downwards compatibility with existing email
   standards.  For additional background, see also [I-D.happel-
   structured-dynamic-email-00].

2.  Problem statement

2.1.  Email content is not machine-readable

   A large share of today's emails is of transactional nature, i.e.,
   sent by automated agents or processes to human users.  While those
   emails often have relatively clear semantics (such as _Invoice_ or
   _Reservation_), the medium of those emails is still human-readable
   text.

   In order to help users with managing and processing their emails more
   efficiently, a machine-readable representation of email content can
   be an important building block.

2.2.  Using email as a "personal API"

   Beyond improving the handling of existing email interactions, it is
   worthwhile to note that email can be considered a prime technology in
   the recent discussion about data sovereignty, which aims to give back
   control to users over their personal data.

   Email is inherently open and decentralized.  It is probably the only
   widely-used and standardized technology which seamlessly connects the
   local data space of users (i.e., emails on their PCs, mobile devices,
   or private hosted mailboxes) with the public internet.

   It might even be considered a ubiquitous "personal API" of internet
   users, which to date is mostly based on text-based instructions
   ("John, could you please send me this presentation?").  Along these
   lines, structured email might serve as an enabling technology for
   granting users additional sovereignty in date storage and exchange
   when used to interact with internet services.

2.3.  Slow adoption of existing solutions

   In a mature technological space such as email, any novel approach
   such as structured email needs to address the issue of user adoption.

   Existing attempts to structure email interaction can be distinguished
   in _standards-based_ and _vendor-driven_ approaches.  Several RFCs
   address very particular problems such as meeting workflows [RFC2447],



Happel & Junghans         Expires 31 July 2023                  [Page 3]

Internet-Draft   Structured Email: Problem Statement and    January 2023


   message delivery notifications [RFC8098], mailing list subscriptions
   [RFC8058], message metadata [RFC6477] or message content
   interpretation [RFC9078].  Creating independent RFCs for each
   possible type of structured email interaction might not be a feasible
   approach for both standards makers and client developers.

   With respect to vendors, Google ([EMarkup], [AMPemail]) and Microsoft
   ([AM]) have made attempts to structure email interactions.  Even
   though some other vendors support, e.g., [EMarkup], a closer look
   shows that those implementations are mostly incompatible among each
   other ([SmartInbox], [YahooPS], [ZohoQAES]).  Besides a lack of
   standardization (and according tool support), widespread adoption
   also suffers from various sender restrictions, including manual
   approval processes.

3.  Areas of work

   This section tries to identify and structure areas of work to address
   the aforementioned topics.  We distinguish some core work,
   additional, and optional topics.

3.1.  Core areas of work

3.1.1.  Enable email clients to understand email content

   As a primary building block for structured mail, there needs to be a
   specification on how to represent email content or metadata about
   email content in a machine-readable form.

   Such an approach needs to be downwards compatible with existing
   practices and clients, probably in a similar way the "multipart/
   alternative" MIME part type is used for HTML email ("text/html").
   [AMPemail] for instance, is leveraging this approach by adding an
   additional MIME part in conjunction with "text/plain" and "text/
   html".

   Regarding content semantics, [EMarkup] is using a subset of
   [SchemaOrg] types in a JSON-LD [W3C.REC-json-ld11-20200716]
   representation.  Ideally an open approach to structured email would
   allow for decentralized extensions of content semantics.

3.1.2.  Enable users to compose structured email

   To sustain the decentralized and interactive nature of email, it is
   insufficient to just allow professional email senders to send
   structured emails to users.  Instead, users need to be enabled to
   answer and also to initiate structured email exchanges.




Happel & Junghans         Expires 31 July 2023                  [Page 4]

Internet-Draft   Structured Email: Problem Statement and    January 2023


   [AMPemail] (initiated by Google) and Microsoft Actionable Messages
   [AM] are approaches that allow users to reply with a structurd answer
   based on input forms.  However, the required input is not machine-
   understandable, making it difficult to assist users in data entry.
   Both approaches allow responses to be sent to a dedicated HTTP API
   only, but not as a regular email response.

   Both approaches also do not allow users to initiate a structured
   email exchange.  For this purpose, a discovery mechanism would be
   required, which allows users resp. their email clients to determine
   which types of structured email content an intented email recipient
   is willing to accept besides standard email.

3.2.  Cross-cutting areas of work

3.2.1.  Internet Message Format extensions

   While structured email should be designed to maintain downwards-
   compatibility with the existing email technology stack, there might
   be some smaller helpful extensions:

   *  Additional email header information might inform clients about the
      presence of structured email content.
   *  It might be helpful to distinguish structured email markup which
      represents the complete message content (in the sense of
      "multipart/alternative") versus markup which provides partial
      annotations or metadata to regular email content.  For the latter
      case, a standardized option to help clients identify such MIME
      parts (e.g., based on a MIME part header) might be useful.
   *  A distinct characteristic of email technology is the multitude of
      processing options on both server-side or on client-side with
      multiple possible email clients operating on one account.  This
      challenge is, e.g., also present in the email filtering design
      space [RFC5228]/[RFC5804].  Accordingly, there might be use cases
      in which users may want to direct structured email processing to
      certain clients (e.g., travel information to the mobile email app)
      in which additional header information might help guide
      processing.

3.2.2.  Trust

   (Semi-)automated processing of structured email requires increased
   scrutiny with respect to security issues.  Accordingly, current
   vendor-specific approaches ([EMarkup]/[AMPemail]/[AM]) require DKIM
   [RFC6376] and/or SPF [RFC7208] set up in addition to a manual sender
   registration process.





Happel & Junghans         Expires 31 July 2023                  [Page 5]

Internet-Draft   Structured Email: Problem Statement and    January 2023


   While the latter is already a clear inhibitor of adoption, this
   process will not work if users shall be enabled to send structured
   mail by themselves - a scenario without a central gatekeeper.
   Accordingly, suitable measures of trust need to be in place.
   Candidate approaches might be a consolidation of existing email trust
   indicators, specific concepts of "trusted senders", or drawing
   inspiration from sources such as the ACME protocol [RFC8555].

3.2.3.  Interaction with HTTP backend APIs

   Structured email may benefit from certain server-side (HTTP) APIs,
   serving different purposes.  An API for the discovery if an intended
   receiver supports structured email has already been discussed (see
   Section 3.1.2).

   Additional APIs could be useful to allow synchronous responses of
   structured data in certain uses cases in which an asynchronous
   message response cycle would be infeasible.  All current vendor-
   specific approaches ([EMarkup]/[AMPemail]/[AM]) allow for some sort
   of direct data transmission against an HTTP backend API.

3.3.  Optional areas of work

3.3.1.  Email search and filtering

   Machine-understandable interpretation of email content could also be
   used to improve some existing practices based on text/keyword-based
   processing of email content.  In particular, the IMAP SEARCH command
   (Section 6.4.4 of [RFC9051]) and email filtering [RFC5228] might
   benefit from corresponding extensions.  However, those might also be
   subject to work of other IETF working groups.

3.3.2.  Interaction with other data types and external APIs

   Email clients have grown over years to support various types of user
   data beyond email.  This typically involves contacts/address books,
   calendars, tasks, notes, or files.

   Besides the case of iMIP [RFC2447], interrelation between email and
   other data types is vendor-specific, if present at all.  Structured
   email could help to enable easier interaction within email clients,
   with other applications on the same machine and with services on the
   internet (see also Section 2.2).

   Ongoing work in the JMAP working group [RFC8620] could overlap with
   some of these aspects.  The W3C Solid initiative [Solid] is another
   loosely related




Happel & Junghans         Expires 31 July 2023                  [Page 6]

Internet-Draft   Structured Email: Problem Statement and    January 2023


3.3.3.  Email storage formats

   Several vendors use machine learning or data extraction approaches
   (see also Section 3.4.1) to derive information about email content
   which is similar to or complementary to structured email content.

   For interoperability and data portability, it would be helpful to
   standardize storage for such metadata.  This might also help with
   current portability issues stemming from user-defined metadata such
   as IMAP flags (Section 2.3.2 of [RFC9051]).

3.3.4.  Common practices in email processing

   While structured email aims to provide a user-driven approach to make
   email content machine-understandable, there may be a number of more
   technical issues related to email-processing which might be addressed
   en route.

   Besides consideration of already existing RFCs (see Section 2.3),
   this may involve currently unstandardized issues such as email
   signatures, vacation notices or "noreply" addresses.  Those are
   probably more specific application areas of structured email, since
   they can impact multiple layers of email processing and corresponding
   standards.

3.4.  Out of scope areas of work

   This section lists aspects which are relevant to structured mail but
   which might not be addressed by formal standardization work in the
   scope of this proposal.

3.4.1.  Data extraction

   Several service providers or tools (e.g., [KDEItinerary]) apply data
   extraction techniques to derive structured data from textual email
   content.  Data extraction is therefore an interesting approach to
   help bootstrapping the adoption of structured email, until it has
   become common practice among email senders.

   While cooperation on shared data extractors might be a useful
   activity for parties interested in structured email, it is probably
   not a subject for standardization.









Happel & Junghans         Expires 31 July 2023                  [Page 7]

Internet-Draft   Structured Email: Problem Statement and    January 2023


3.4.2.  UI design

   Prescription of how user interfaces should display structured email
   or organize interaction with it are - besides illustration -
   typically out of scope of RFCs.  As in the case of data extraction,
   there might however be a shared interest among vendors to collaborate
   on certain best practices.

4.  Privacy considerations

   Since email content often contains personal data, it is subject to
   privacy considerations.  From a high level perspective, privacy
   issues in structured email should not significantly differ to privacy
   issues in existing email standards.

   On a more fine granuar level, structured email might both raise
   certain novel privacy issues (i.e., since structured data is more
   easy to process and share), but it could also improve certain privacy
   concerns.  For instance, it could make certain data extraction
   practives (see Section 3.4.1) obsolete, which currently cannot
   distingish senstive from non-sentitive parts of an email.

5.  Security considerations

   As a problem statement document, no particular security
   considerations apply as such.  Similar to privacy considerations,
   security issues might also widely overlap with those of existing
   email standards.

   See Section 3.2.2 for the discussion of some general security aspects
   with respect to structured email.

6.  IANA Considerations

   This document has no IANA actions at this time.

7.  Informative References

   [AM]       Microsoft Inc., "Actionable Messages",
              <https://learn.microsoft.com/en-us/outlook/actionable-
              messages/>.

   [AMPemail] OpenJS Foundation, "AMP email",
              <https://amp.dev/about/email/>.

   [EMarkup]  Google Inc., "Email Markup",
              <https://developers.google.com/gmail/markup>.




Happel & Junghans         Expires 31 July 2023                  [Page 8]

Internet-Draft   Structured Email: Problem Statement and    January 2023


   [KDEItinerary]
              KDE e.V., "KDE Itinerary",
              <https://apps.kde.org/itinerary/>.

   [RFC2447]  Dawson, F., Mansour, S., and S. Silverberg, "iCalendar
              Message-Based Interoperability Protocol (iMIP)", RFC 2447,
              DOI 10.17487/RFC2447, November 1998,
              <https://www.rfc-editor.org/info/rfc2447>.

   [RFC5228]  Guenther, P., Ed. and T. Showalter, Ed., "Sieve: An Email
              Filtering Language", RFC 5228, DOI 10.17487/RFC5228,
              January 2008, <https://www.rfc-editor.org/info/rfc5228>.

   [RFC5804]  Melnikov, A., Ed. and T. Martin, "A Protocol for Remotely
              Managing Sieve Scripts", RFC 5804, DOI 10.17487/RFC5804,
              July 2010, <https://www.rfc-editor.org/info/rfc5804>.

   [RFC6376]  Crocker, D., Ed., Hansen, T., Ed., and M. Kucherawy, Ed.,
              "DomainKeys Identified Mail (DKIM) Signatures", STD 76,
              RFC 6376, DOI 10.17487/RFC6376, September 2011,
              <https://www.rfc-editor.org/info/rfc6376>.

   [RFC6477]  Melnikov, A. and G. Lunt, "Registration of Military
              Message Handling System (MMHS) Header Fields for Use in
              Internet Mail", RFC 6477, DOI 10.17487/RFC6477, January
              2012, <https://www.rfc-editor.org/info/rfc6477>.

   [RFC7208]  Kitterman, S., "Sender Policy Framework (SPF) for
              Authorizing Use of Domains in Email, Version 1", RFC 7208,
              DOI 10.17487/RFC7208, April 2014,
              <https://www.rfc-editor.org/info/rfc7208>.

   [RFC8058]  Levine, J. and T. Herkula, "Signaling One-Click
              Functionality for List Email Headers", RFC 8058,
              DOI 10.17487/RFC8058, January 2017,
              <https://www.rfc-editor.org/info/rfc8058>.

   [RFC8098]  Hansen, T., Ed. and A. Melnikov, Ed., "Message Disposition
              Notification", STD 85, RFC 8098, DOI 10.17487/RFC8098,
              February 2017, <https://www.rfc-editor.org/info/rfc8098>.

   [RFC8555]  Barnes, R., Hoffman-Andrews, J., McCarney, D., and J.
              Kasten, "Automatic Certificate Management Environment
              (ACME)", RFC 8555, DOI 10.17487/RFC8555, March 2019,
              <https://www.rfc-editor.org/info/rfc8555>.






Happel & Junghans         Expires 31 July 2023                  [Page 9]

Internet-Draft   Structured Email: Problem Statement and    January 2023


   [RFC8620]  Jenkins, N. and C. Newman, "The JSON Meta Application
              Protocol (JMAP)", RFC 8620, DOI 10.17487/RFC8620, July
              2019, <https://www.rfc-editor.org/info/rfc8620>.

   [RFC9051]  Melnikov, A., Ed. and B. Leiba, Ed., "Internet Message
              Access Protocol (IMAP) - Version 4rev2", RFC 9051,
              DOI 10.17487/RFC9051, August 2021,
              <https://www.rfc-editor.org/info/rfc9051>.

   [RFC9078]  Crocker, D., Signes, R., and N. Freed, "Reaction:
              Indicating Summary Reaction to a Message", RFC 9078,
              DOI 10.17487/RFC9078, August 2021,
              <https://www.rfc-editor.org/info/rfc9078>.

   [SchemaOrg]
              W3C Schema.org Community Group, "Schema.org",
              <https://schema.org/>.

   [SmartInbox]
              1&1 Mail & Media GmbH, "WEB.DE Smart Mailbox",
              <https://postmaster.web.de/en/smartinbox>.

   [Solid]    W3C Solid Community Group, "Solid Protocol",
              <https://solidproject.org/TR/protocol>.

   [W3C.REC-json-ld11-20200716]
              Longley, D., Ed., Kellogg, G., Ed., and P. Champin, Ed.,
              "JSON-LD 1.1", W3C REC REC-json-ld11-20200716, W3C REC-
              json-ld11-20200716, 16 July 2020,
              <https://www.w3.org/TR/2020/REC-json-ld11-20200716/>.

   [YahooPS]  Yahoo Inc., "Promotions & Schema",
              <https://senders.yahooinc.com/promotions-and-schema/>.

   [ZohoQAES] Zoho Corporation Pvt. Ltd., "Quick Actions and Email
              Snippets",
              <https://www.zoho.com/mail/help/quick-actions.html>.

Authors' Addresses

   Hans-Joerg Happel
   audriga GmbH
   Email: happel@audriga.com
   URI:   https://www.audriga.com


   Conny Junghans
   Email: conny.junghans@1und1.de



Happel & Junghans         Expires 31 July 2023                 [Page 10]