(original) (raw)

Quick Mail Transfer Protocol (QMTP) D. J. Bernstein, djb@pobox.com 19970201 1. Introduction The Quick Mail Transfer Protocol (QMTP) is a replacement for the Simple Mail Transfer Protocol (SMTP). QMTP eliminates any need for end-of-line scanning between hosts with the same end-of-line convention. It features automatic pipelining and chunking, 8-bit transmission, prior declaration of the message size, and efficient batching. It is designed to be very easy to implement. QMTP is supported by the qmail-qmtpd and maildir2qmtp programs in the qmail package. In this document, a string of 8-bit bytes may be written in two different forms: as a series of hexadecimal numbers between angle brackets, or as a sequence of ASCII characters between double quotes. For example, <68 65 6c 6c 6f 20 77 6f 72 6c 64 21> is a string of length 12; it is the same as the string "hello world!". Note that these notations are part of this document, not part of the protocol. 2. Protocol A QMTP client connects to a QMTP server, as discussed in section 7, over a reliable stream protocol allowing transmission of 8-bit bytes. Protocol outline: the client sends one or more packages; after each package, the server sends back some responses. The client begins by sending a package. A package contains a mail message, an envelope sender address, and one or more envelope recipient addresses. See section 4 for the format of a package. When the server sees the end of the package, it sends back a series of responses, one response for each envelope recipient address, in the same order as given by the client. The server is not permitted to change the order under any circumstances, even if two addresses are the same. See section 5 for the format of a response. The server is not permitted to send any portion of its responses to a package until the client has sent the final byte of the package. The client is permitted to close the connection before sending the final byte of the package; in this case, the server must throw away the package without attempting to deliver the message. However, the server must not throw away previously accepted messages. The client does NOT need to wait for a server response before sending another package. The server must NOT throw away incoming data when it sends a response. It is the client's responsibility to avoid deadlock: if it sends a package before receiving all expected server responses, it must continuously watch for those responses. The server is permitted to delay its responses if further data has already shown up from the client; while it is delaying responses, it must not pause to wait for further data for the client. The server is permitted to close the connection at any time, although high-quality servers will try to avoid doing so. Any response not received by the client indicates a temporary failure. A QMTP session should take at most 1 hour. Both sides are expected to close the connection after this time. 3. Messages In this document, an ``8-bit mail message'' means a sequence of lines. Each line is a string of zero or more 8-bit bytes. A message is called ``safe'' if none of its bytes are <0a>. Implementation note: Here is the intended interpretation of text files as messages under some current operating systems. Under DOS, a message is stored on disk as first line, <0d 0a>, second line, <0d 0a> ... <0d 0a>, last line. Under UNIX, a message is stored on disk as first line, <0a>, second line, <0a> ... <0a>, last line. Notice that both of these encodings are reversible for safe messages. In practice, it is very common for the last line to be empty. Many existing utilities refer to the last line as a ``partial line'' and ignore it whether or not it is empty. 4. Packages A package is the concatenation of three strings: first, an encoded 8-bit mail message; second, an encoded envelope sender address; third, an encoded series of encoded envelope recipient addresses. Each envelope address is a string of 8-bit bytes. The interpretation of addresses depends on the environment in which QMTP is used and is outside the scope of this document. Each address is encoded as a netstring, as discussed in section 6. The series of encoded recipient addresses is in turn encoded as a netstring. A message is encoded as a string of 8-bit bytes in one of two ways: Encoding #1 is <0d>, the first line, <0d 0a>, the second line, <0d 0a>, the third line, ..., <0d 0a>, the last line. Encoding #2 is <0a>, the first line, <0a>, the second line, <0a>, the third line, ..., <0a>, the last line. This string of 8-bit bytes is in turn encoded as a netstring, as discussed in section 6. Every server must be prepared to handle encoding #1 and encoding #2. A server must not reject a message merely because of its encoding. Implementation note: The intent of encoding #1 and encoding #2 is to allow very straightforward handling of text files under DOS and UNIX respectively. The programmer can print <0d> or <0a> and then simply copy the file. 5. Responses Each response is a nonempty string of 8-bit bytes, encoded as a netstring. The first byte of the string is one of the following: "K" The message has been accepted for delivery to this envelope recipient. This is morally equivalent to the 250 response to DATA in SMTP; it is subject to the reliability requirements of RFC 1123, section 5.3.3. "Z" Temporary failure. The client should try again later. "D" Permanent failure. The remaining bytes are a description of what happened. It is expected that the description, when interpreted as UTF-2 characters, (1) will be human-readable, (2) will not repeat the envelope recipient address, and (3) will not include formatting characters other than <20>. However, these expectations are not requirements, and the client should be ready for arbitrary bytes from the server. Descriptions beginning with <20> are reserved for future extensions. In descriptions not beginning with <20>, the character "#" must not appear except in HCMSSC codes. A server must NOT accept a safe message unless it can store the message without corruption. More precisely: if the encoded message sent by the client matches the encoding of some safe message M, then acceptance means that the server is accepting responsibility to deliver M to the envelope recipient. (There is at most one possibility for M, since encodings are reversible on safe messages.) Deletion of nulls is NOT permissible; a server that deletes nulls must reject any message containing nulls. Folding of long lines and high-bit stripping are also NOT permissible. Servers are permitted to change unsafe messages. 6. Netstrings Any string of 8-bit bytes may be encoded as [len]":"[string]",". Here [string] is the string and [len] is a nonempty sequence of ASCII digits giving the length of [string] in decimal. The ASCII digits are <30> for 0, <31> for 1, and so on up through <39> for 9. Extra zeros at the front of [len] are prohibited: [len] begins with <30> exactly when [string] is empty. For example, the string "hello world!" is encoded as <31 32 3a 68 65 6c 6c 6f 20 77 6f 72 6c 64 21 2c>, i.e., "12:hello world!,". The empty string is encoded as "0:,". [len]":"[string]"," is called a netstring. [string] is called the interpretation of the netstring. 7. Encapsulation QMTP may be used on top of TCP. A QMTP-over-TCP server listens for TCP connections on port 209. 8. Examples A client opens a connection and sends the concatenation of the following strings: "246:" <0a> "Received: (qmail-queue invoked by uid 0);" " 29 Jul 1996 09:36:40 -0000" <0a> "Date: 29 Jul 1996 11:35:35 -0000" <0a> "Message-ID: 19960729113535.375.qmail@heaven.af.mil" <0a> "From: God@heaven.af.mil" <0a> "To: djb@silverton.berkeley.edu (D. J. Bernstein)" <0a> <0a> "This is a test." <0a> "," "24:" "God-DSN-37@heaven.af.mil" "," "30:" "26:djb@silverton.berkeley.edu," "," "356:" <0d> "From: MAILER-DAEMON@heaven.af.mil" <0d 0a> "To:" <0d 0a> " Hate." <22> "The Quoting" <22> "@SILVERTON.berkeley.edu," <0d 0a> " " <22> "\\Backslashes!" <22> "@silverton.BERKELEY.edu" <0d 0a> <0d 0a> "The recipient addresses here could" " have been encoded in SMTP as" <0d 0a> "" <0d 0a> " RCPT TO:<hate.the\ quoting@silverton.berkeley.edu="">" <0d 0a> " RCPT TO:<\\Backslashes!@silverton.berkeley.edu>" <0d 0a> <0d 0a> "This ends with a partial last line, right here" "," "0:" "," "83:" "39:Hate.The Quoting@silverton.berkeley.edu," "36:\Backslashes!@silverton.berkeley.EDU," "," The server sends the following response, indicating acceptance: "21:Kok 838640135 qp 1390," "21:Kok 838640135 qp 1391," "21:Kok 838640135 qp 1391," The client closes the connection. </hate.the\>