Mailing List Manager (MLM) Transformations

v. L. Anelli 13 20122 Milano MI Italy vesely@tana.it

DMARC DKIM Mailing List The widespread adoption of Domain-based Message Authentication, Reporting, and Conformance (DMARC) led Mailing List Managers (MLM) to rewrite the From: header field as a workaround. This document proposes two methods to restore the original From: value of transformed messages, to be effected by the receiving Message Delivery Agent (rMDA). One method requires receivers to identify a set of trusted MLM operators who set the Author: header field. The other method tries to revert MLM transformations in order to verify DomainKeys Identified Mail (DKIM) signatures that were applied by the author domain at submission time.

Introduction Domain-based Message Authentication, Reporting, and Conformance (DMARC) () hinges on the alignment of the domain in the From: header field with an authenticated identifier. For that reason, MLMs that transform messages, however lightly, have to rewrite From:; an operation also known as From: munging. Depending on the kind of mailing list, From: munging can annoy participants or not. For lists paired by web fora, for example, it is almost unnoticed. For other lists, where personal knowledge plays a role, it can become a nuisance as it hinders off-list messaging. In this experiment, we try and restore the end-to-end nature of the From: email field after DMARC rewriting. That way, messages are presented to final recipients with the From: line restored to its original value. We present two methods to obtain that result. The first, simple method is described in . It only works for Mailing List Managers (MLMs) or author domains that save the original value of From: using the Author: header field , which is not current practice. This method is based on the trust that receivers place in those MLMs. The second, complex method is described in . It works with MLMs configured to add just a footer and a subject tag to the messages that they redistribute, which is what "classic" MLMs currently do. In addition, the method requires that author domains produce compatible signatures, which only some domains do. Author domains and MLMs can adopt either or both methods. Both provide that MLMs continue From: munging, but enable receivers to revert it at the receiving Mail Delivery Agent stage; that is, where local filters run.

Terms Definitions Signers and verifiers are defined by DKIM (). The use of the term Mailing List Manager, almost always abbreviated MLM follows . A MLM is a kind of Mediator in parlance. Message is defined in . It consists of a header made up of one or more fields, and a body possibly composed of various MIME entities, the latter being defined in and companions. The term original is used here to refer to the Author or parts of the Author's message as it was sent out by the author's domain, where Author is defined in and . We use colon (:) to indicate header field names, as in From:, Author: and the like. The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 when, and only when, they appear in all capitals, as shown here.

The simple method This method outlines a protocol on top of DMARC whereby the From: header field is saved as Author:. MLMs modify From: in order to comply with DMARC. Mail Delivery Agents (MDAs) restore the original value. Author domains that DKIM-sign outgoing messages SHOULD copy the value of From: to Author:, at least when one or more recipients are MLMs. Omission to do so limits the success of this method to MLMs that add the Author: field themselves. A mailbox provider can decide to not set Author: if its users seldom post to mailing lists. The Author: field can be set by the DKIM signing module. Signing Author: denotes an interest in this experiment. In this case, DMARC aggregate results are reported to the Author: domain as well. MLMs who modify From: MUST add an Author: header field, copying the value of the pristine From:, unless the Author: field is already present. When Author: is present, it MUST NOT be modified. However, MLMs who modify From: SHALL apply the DMARC mechanism also to the Author: domain. MLMs MAY copy the pristine value of From: also to other fields such as Reply-To: or Cc: in order to ease messaging for recipients whose providers don't apply de-munging. DMARC verifiers or downstream modules at receivers MUST check whether the From: domain having dmarc=pass is configured as a trusted MLM. In that case, if an Author: field exist and has a different domain, the module signals this result to downstream agents. How to signal it is a question of local settings and convenience. It can consist of an apposite reason or comment in Authentication-Results: (see example toward the end of ), or it can just write dmarc=pass. It can also add an Original-From: field as a signal that From: can be restored to that value. Receivers MUST NOT change From: at a stage where external forwarding is still possible. MDAs, or better yet the part calls rMDA, that is the receiver part, after any external forwarding has taken place, use the local signal to restore the pristine value of From:. The kind of signal can be designed so as to reduce the work of the rMDA module, which is executed for each local recipient. Using this method, an author domain can eliminate the disruption caused by From: munging, at the cost of configuring known MLM domains. The method will work at least for messages originating internally, which have the Author: field, irrespective of Mail User Agents (MUAs) and MLMs. Better results are obtained with MLMs that participate by adding or checking the Author: field. For increased security, MLM and receiver can also deploy the Authenticated Received Chain (ARC) protocol . A malicious actor can post list messages with fake From: or Author: values. Although a participating MLM checks those values, if the corresponding domains have loose DMARC policies (p=none) they can pass. Using ARC, a receiver knows what was the authentication status when the message arrived at the MLM. A verifier MAY omit to restore the value of From: if it wasn't authenticated by the MLM, or if it is deemed to be suspicious for whatever reason.

The complex method The scheme of this method is similar to the simple one, but there is more work for the DKIM verifier. In exchange, the place where MLMs save the original value of From: doesn't have to be Author:, and there is no need to trust MLMs as the method verifies the original author domain signatures. So it works out of the box with many existing MLMs and several signers that don't sign MLM-specific header fields. The method is based on the revertibility of the transformations a MLM applies to a message. These are described in . After reversion, the original DKIM signatures verify. That proves that the reversion is good, in particular for the original value of From:, which MLMs copy to Reply-To:, Cc: or similar. While the definition of revertible transformation implies the way to revert it, an informal outline of an implementation is presented in . This method is quite fragile as it needs compliance from multiple actors to interoperate. Asking users' domains to sign reasonably and limiting transformation to the essential sounds quite reasonable. However, if participants have to take special steps to be compatible, they'd probably opt for the simple method. When each actor complies with the requirements in , this method is reliable.

Revertible Transformations Message modifications can affect the header and/or the body of a message. This document only considers a very limited set of transformations, described in the following subsections. They turn out to be revertible.

Header Transformations

Subject MLM MAY modify the Subject: field by inserting a tag at the beginning of its value. A tag consists of a short text delimited by square brackets. For example: This transformation is easily reverted by removing the tag. For security reasons, subject tags MUST NOT exceed 20 characters. Note that some MLMs carry out further changes to this field. For example: can be transformed to: That's why, if the field is signed, it is RECOMMENDED to save a copy of it as Original-Subject:.

From From: rewriting is necessary for DMARC. That way, the MLM domain becomes the primary identifier of a message, in the DMARC sense. It is often achieved by transforming a field like this: ]]> into one like the following: ]]> While the simple method requires the original value to be copied to Author:, many MLMs save it in a variety of places, including Reply-To:, Cc:, X-Original-From:. When the original value is known, the transformation is revertible.

Body Transformations We only consider footer addition. It MUST be performed in one of three ways, according to the format of the original message.

Single-part plain text: When the original message is not structured, a footer can be appended at the end of the original text. See example in
Multipart added: The footer stands in its own MIME entity, which is appended as the last part of an original multipart/mixed structure. See example in
Multipart wrapped: The footer stands in the second entity of a new multipart/mixed MIME structure whose first entity consists of the original body. See example in

The footer begins with a line consisting exclusively of underscore ("_", ASCII 95) characters, at least four of them. Alternatively, a footer can consist of the three characters "-- " (dash, dash, space), the Usenet signature convention (see for example ). For security reasons, the footer MUST belong to an entity of Content-Type: text/plain in all cases. In addition, footers cannot exceed 10 lines of text, each shorter than 80 characters. If these restrictions are not met, the transformation cannot be reverted safely.

Outline of a Reverting Verifier This subsection is informative. The algorithm described here is implemented in a mail filter . These kind of filters usually read the input message twice —first pass, verify; last pass, rewrite the message to insert Authentication-Results:. When enabling MLM transformation reversion, there can be a retry pass in between those two. The result is yielded during the SMTP dialogue with no noticeable delay. Implementing reversion changed the software from 22730 lines of C code to 26762. The bulk of such ~18% increase is due to the addition of encoding conversion functions. Changes involve both verifying and signing functions (see for the latter). While reading the header in the first pass, the verifier looks for specific fields:

From:
Author:
Original-From:
X-Original-From:
Reply-To:
Cc:

These are candidates to the original mailbox. Note that Reply-To: and Cc: may contain multiple mailboxes. The verifier also collects the Subject: and any field named Original-* that the original signer might have set to ease the reversion. On reaching the end of the header, during the first pass, the verifier sorts the candidate original mailboxes according to the display name, which MLMs try and keep unaltered. The best candidate is then added to the collected set of Original-* fields. If the Subject: begins with a tag, its version without tag is added to that set as well, unless one was already found as Original-Subject:. Next, before reading the body, the verifier looks for prospect signatures; that is, signatures whose "d=" domain is not aligned with SPF credentials (), List-Post: (), Sender:, or the munged From: (if deemed to have been munged). If any such signature exist, along with MLM or other signatures, then the verifier enables parsing the body to look for a footer. Reversing verifiers also have to watch out for idiosyncrasies used to mask DKIM signatures. For example, a MLM introduced a header field named X-Mailman-Original-DKIM-Signature, because some receivers took the habit to downgrade messages with failed signatures, despite recommendation to consider an unauthenticated message regardless of whether or not it looks like it was signed. Body parsing is done in parallel with body canonicalization during the first pass. For multipart, track top level entities. Set transformation type to "wrapped" if there are exactly two entities, "added" otherwise. However, some lists, perhaps out of misconfiguration, insert an empty attachment before the one containing the footer. As it is unlikely that a mail client sends an empty attachment, heuristically it may be preferable to just not count it. For single-part, body parsing must avail of encoding conversions as needed. Assume identity encoding, 7bit or 8bit, unless otherwise directed by an Original-Content-Transfer-Encoding: field. At the end of the first pass, the verifier knows how prospect signatures did. Let's recall that DKIM signature verification results from two independent operations, steps 3 and 4 in . The signature in the "b=" tag depends on the header, while the body hash in the "bh=" tag depends on the body:

If the signature "b=" did not verify and the set of Original-* fields is not empty, then it is worth to try and re-canonicalize the header using the values in the set of Original-* fields.
If the body hash "bh=" did not match and a footer was found, then it is worth to try and re-canonicalize the body excluding the footer.

None, one, or both of the above operations are performed in the retry pass. On writing Authentication-Results, if a prospect signature verifies after reversion, the verifier signals this fact as described in . Zdkimfilter writes a prominent, documented "reason" in the relevant resinfo stanza (). For example: That way, reversion elements can be easily recognized and parsed by downstream agents.

Actors Roles and Compliance

Original Signer Like the simple method (), Author domains who DKIM-sign outgoing messages SHOULD copy the value of From: to Author:, at least when one or more recipients are MLMs. Omission to do so limits the success of this method to MLMs that add the Author: field themselves. A mailbox provider can decide to not set Author: if its users seldom post to mailing lists. The Author: field can be set by the DKIM signing module. Signing Author: denotes an interest in this experiment. In this case, DMARC aggregate results are reported to the Author: domain as well. In addition, Author domains who DKIM-sign outgoing messages MUST NOT sign header fields that MLMs will change, namely:

MIME-Version:
Content-Type:
Content-Transfer-Encoding:
Resent-Date:, Resent-From:, Resent-To:, Resent-Cc:
List-Id:, List-Help:, List-Unsubscribe:, List-Subscribe:, List-Post:, List-Owner:, List-Archive:

Not signing Content-Type: implies that author domains MUST NOT use the l= signature tag, according to . Furthermore, the original value of the signed fields SHOULD be mirrored by corresponding fields, From: copied to Author:, the other fields to an Original-*: field, that is Reply-To: copied to Original-Reply-To:, Subject: to Original-Subject: and so forth. Copying Date: is actually not necessary. Copying Reply-To:, To: and Cc: is only useful if there are multiple recipients and the MLM changes their order. Original-Subject: is necessary if it starts with a tag that can be removed when attempting to recover the original value; this field is defined by , where similar considerations hold. Mailbox providers ignore this requirement if they are not aware of this experiment or don't participate. In many cases, the method succeeds anyway. Other generic rules to ease reversion are as follows:

DKIM signatures MUST use the "relaxed" canonicalization, at least for the header, since MLMs may reflow header fields.
The quoted-printable encoding MUST NOT be used for the body of single-part text/plain messages, as it is impossible to guess original soft line breaks after re-encoding. Base64 is much more robust.
Single-part text/plain messages encoded as base64 MUST follow a constant column width of 76 characters (which is what most encoders do.) The encoding MUST be advertised by adding a new header field as follows:

Original-*: fields with an empty value stand for non-existing counterparts.

MLM MLMs MUST limit message changes to the revertible transformations described in . Since DKIM is MIME-agnostic, attention must be paid to preserve the exact preamble and epilogue of the original MIME structure. Several "classic" mailing lists behave in that way. MLMs MUST apply their own DKIM signature. It is RECOMMENDED that MLMs insert a mailbox entry to Reply-To: or Cc: in order to ease off-list replies as well as to allow transformation reversion. MLMs which collect posts from other MLMs must avoid to add their own footer and subject tag. Transformation reversion cannot be stacked. A second-level MLM can modify or replace the content of previous transformations. Attention must be paid to not exceed tag and footer length limits.

Verifier Attempts to verify original signatures can be done as outlined in . The reversion MUST NOT alter the messages signed and distributed by MLMs, except for adding an Authentication-Results: header field, and possibly an Original-From: or other header field used as a signal to downstream agents. If an original signature with rewritten From: is recovered, the verifier MUST make sure that the original value of From: is written out in a field agreed upon by downstream agents, typically Original-From:, which suggests for a similar use. However, suggests that Original-From: be added by mediators as well. Whatever field is used, the filter SHALL make sure it doesn't already exist. An MDA downstream MAY combine the Authentication-Results: with that field to restore the original value of From:. Replacing From: can invalidate the message, therefore, it must be done after any dot-forward processing, so that external verifiers receive the message as distributed by the MLM, and can revert transformations by themselves. If the Author: field is found and if it is included in the h= tag of the original signature, the corresponding DMARC record SHOULD be looked up and its "rua=" and "ruf=" tags considered for feedback reports, whatever the result. Omitting feedback can hamper the tuning of DKIM signatures at remote sites. A verifier can ignore reporting if it hasn't yet enabled it at all. If applying DMARC policies is considered, it is the From: field which rules. The policy of the Author: domain SHALL only be considered if From: is going to be changed in order to forward a modified version of the message.

Security Considerations Rewriting the From: header field is a treacherous modification to messages. It fosters the belief that the display name of a mailbox is more true than the angle address. A belief further consented by the tendency to not even display the latter. Bad actors take advantage of this belief by displaying the names of trusted institutions paired with trash email addresses hidden between angle brackets. That trick defeats DMARC's purpose. It is out of this document's scope to suggest how mail user agents (MUAs) could counter phishing by highlighting security indicators (for the extent that indicators can actually help preventing phishing attacks). Let's just note that MUAs have to cope with MLMs and phishing alike, which makes it hard to devise a pattern to tell apart one from the other without getting involved with the reputation of the specific domains. By safely restoring munged From: to the original value, that contrast is eliminated. Then, perhaps, deceptive From: lines might become amenable to some kind of efficient indication. Of course, MLM role can be played by miscreants as well. However, replaying a signed message, even with revertible transformations, has more limits than forging scam messages anew. Therefore, the risk introduced by easing transformation reversion is considerably lower than that of not signing, or of keeping DMARC policy at "none". Using the simple method () with an unaware MLM configured as trusted implies the risk of bad actors writing fake Author: fields for phishing. This risk can be mitigated if the MLM applies ARC seals (). In that case the reputation of the original author can be taken into account. An unlikely risk is that of a fake MLM sending messages with Author: signed by a broken signature in order to trick a reverting verifier into sending fake feedback reports. Compared with the use of "l=" tag (), the fact that footers are written in plain text removes the main security objection about footer additions. Namely, footers cannot completely replace the original content in the end recipient's eyes by exploiting lax HTML parsing in the MUA. Still, a footer can contain dangerous URLs and deceiving text. That possibility has to be countered by usual mail filtering and savvy behavior.

IANA Considerations IANA maintains the "Message Header" registry with several subregistries. IANA is asked to make the assignments set out in the following section.

Provisional Message Header Field Names IANA is asked to create new entries in the "Provisional Message Header Field Names" registry as follows.

Header Field Name	Applicable Protocol	Status	Author/Change controller	Reference
Original-Content-Transfer-Encoding	mail	provisional	IETF	this I-D
Original-Reply-To	mail	provisional	IETF	this I-D
Original-Cc	mail	provisional	IETF	this I-D

Experimental Goals Although mailing lists account for a minor part of the global email traffic, they are a tool of the trade in a number of communities, including IETF. In these communities, every body complains about how From: munging ruined their habits. DMARC authors want to stress that it wasn't their intention to have hard policies such as p=reject sported by domains that have human users who may want to participate in mailing list. One way to see if From: munging is really disturbing is to gauge what people is willing to do to fix it. There are mailing lists that reacted by omitting any message transformation. Some other lists cannot do it for legal reasons. The next possibility is to follow one or both the experimental methods proposed here. On the other hand, there are also lists that always munge From:, even when the author domain has p=none. And there are domains who publish p=quarantine; pct=0 in order to force munging and thereby reduce failures in feedback reports. So maybe From: munging is not such a disruption. The simple method can be experimented by a single domain, which would then be able to publish a hard DMARC policy and still deliver de-munged MLM messages among its own users. Or it can be set up by a mailing list, possibly followed by a (growing) number of participants. The complex method requires no MLM changes, so a single domain can experiment with it, gaining the possibility to de-munge some of the mailing list messages it receives. Deploying both methods makes sense, using one as a fallback for the other. The risk of applying either method or both is minimal, as the number of bad messages in a list is not going to increase because of this experiment. When the de-munging fails, recipients will just suffer munged From:s, as if the experiment wasn't tried. The success of this experiment can be measured by the appearance of Author: fields in email messages. A positive outcome will have solved the DMARC vs. MLMs problem. On the other hand, if we gauge zero interest in the experiment, we can conclude that the much waved dissatisfaction with From: munging is not really a hindrance. So in either case we'll have eliminated part of the hesitations that prevent widespread full usage of DMARC.

References Normative References Informative References zdkimfilter

Examples In the examples that follow, the first character of each wrapped line of DKIM-Signature: fields should be a TAB. For editorial reasons, it is rendered as four spaces. While visually there is little difference, those signatures won't verify unless replacing them with a TAB. To verify the examples, public keys can be set as follows:

Single-part plain text Base64 encoding has to be decoded in order to locate the footer. The original encoding was text/plain, this can be inferred by the verifier from the absence of an Original-Content-Transfer-Encoding: field. The original body hash will match after decoding and removing the footer. Note that an "l=" tag couldn't have done the trick in this case. Received: from mua.example.com by mail.example.com with ESMTPA Message-ID: <123456@author.example> Date: Mon, 28 Oct 2020 13:12:55 +0100 From: Author MIME-Version: 1.0 To: MLM@lists.example Subject: [example] Check simple MLM message Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: base64 VGhpcyBpcyBhIHBsYWluIHRleHQgbWVzc2FnZSBzdWJtaXR0ZWQgdG8gYSBtYWlsaW5nIGxpc3Qu ClRoZSBtYWlsaW5nIGxpc3QgaXMgZXhwZWN0ZWQgdG8gYWRkIGEgZm9vdGVyIGFuZCBhIHN1Ympl Y3QgdGFnLgoKQmVzdApBdXRob3IKCl9fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19f X19fX18KdGhpcyBtZXNzYWdlIHdhcyBtb2RpZmllZCBieSBNTE0gZXhhbXBsZQphZGRpbmcgdGhp cyBmb290ZXIgYW5kIHRoZSBzdWJqZWN0IHRhZwoobm90ZSB0aGF0IGw9IGlzIG5vdCBzZXQpCg== ]]>

Multipart added When the original message has a MIME structure, MLMs can append an entity. Received: from mua.example.com by mail.example.com with ESMTPA Message-ID: <123456@author.example> Date: Mon, 28 Oct 2020 13:12:55 +0100 From: Author via MLM MIME-Version: 1.0 To: MLM@lists.example Subject: [example] Check simple MLM message Content-Type: multipart/mixed; boundary=original-boundary Original preamble must be preserved! --original-boundary Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit This is a plain text message submitted to a mailing list. The mailing list is expected to add a footer and a subject tag. Best Author --original-boundary Content-Type: image/png Content-Transfer-Encoding: base64 iVBORw0KGgoAAAANSUhEUgAAAAYAAAAGCAYAAADgzO9IAAAABHNCSVQICAgIfAhkiAAAAAlwSFlz AAAHKgAAByoB49HU1wAAABl0RVh0U29mdHdhcmUAd3d3Lmlua3NjYXBlLm9yZ5vuPBoAAAB+SURB VAiZNcGxDYUgAEXRhxTMYWLFVlDTOAUjOIEzWDqEC1igCQ0LSLi/+ueotUZKieu6uO+bdV2ptaLz PDHGsG0b+74jieM40Pd91Fr5K6UAMC3LImutxhgaY8g5p3meNcUYFULQ+756nkchBMUYpd47OWe8 93jvyTnTe+cHXqRZbKSV4EoAAAAASUVORK5CYII= --original-boundary Content-Tyep: text/plain ________________________________________ this message was modified by MLM example adding this footer and the subject tag (note that l= cannot work in this case) --original-boundary-- ]]>

Multipart wrapped When the original body is multipart/alternative, MLMs have to wrap the whole body into the first entity of a multipart/mixed structure. Indeed, appending an entity to a multipart/alternative would result in it either hiding or being hidden by the existing ones. Received: from mua.example.com by mail.example.com with ESMTPA Message-ID: <123456@author.example> Date: Mon, 28 Oct 2020 13:12:55 +0100 From: Author via MLM MIME-Version: 1.0 To: MLM@lists.example Subject: [example] Check simple MLM message Content-Type: multipart/mixed; boundary=MLM-boundary This is the MLM preamble, not signed by Author. --MLM-boundary Content-Type: multipart/alternative; boundary=original-boundary Original preamble must be preserved! --original-boundary Content-Type: text/plain; This is a plain text message submitted to a mailing list. The mailing list is expected to add a footer and a subject tag. Best Author --original-boundary Content-Type: text/html;

This is a plain text message submitted to a mailing list. The mailing list is expected to add a footer and a subject tag.

Best
Author
--original-boundary-- Original epilogue --MLM-boundary Content-Type: text/plain ________________________________________ this message was modified by MLM example adding this footer and the subject tag (note that l= is not set) --MLM-boundary-- MLM epilogue ]]>