1 \section{Standard Module
\sectcode{rfc822
}}
4 \renewcommand{\indexsubitem}{(in module rfc822)
}
6 This module defines a class,
\code{Message
}, which represents a
7 collection of ``email headers'' as defined by the Internet standard
8 RFC
822. It is used in various contexts, usually to read such headers
11 A
\code{Message
} instance is instantiated with an open file object as
12 parameter. Instantiation reads headers from the file up to a blank
13 line and stores them in the instance; after instantiation, the file is
14 positioned directly after the blank line that terminates the headers.
16 Input lines as read from the file may either be terminated by CR-LF or
17 by a single linefeed; a terminating CR-LF is replaced by a single
18 linefeed before the line is stored.
20 All header matching is done independent of upper or lower case;
21 e.g.
\code{m
['From'
]},
\code{m
['from'
]} and
\code{m
['FROM'
]} all yield
24 \subsection{Message Objects
}
26 A
\code{Message
} instance has the following methods:
28 \begin{funcdesc
}{rewindbody
}{}
29 Seek to the start of the message body. This only works if the file
33 \begin{funcdesc
}{getallmatchingheaders
}{name
}
34 Return a list of lines consisting of all headers matching
35 \var{name
}, if any. Each physical line, whether it is a continuation
36 line or not, is a separate list item. Return the empty list if no
37 header matches
\var{name
}.
40 \begin{funcdesc
}{getfirstmatchingheader
}{name
}
41 Return a list of lines comprising the first header matching
42 \var{name
}, and its continuation line(s), if any. Return
\code{None
}
43 if there is no header matching
\var{name
}.
46 \begin{funcdesc
}{getrawheader
}{name
}
47 Return a single string consisting of the text after the colon in the
48 first header matching
\var{name
}. This includes leading whitespace,
49 the trailing linefeed, and internal linefeeds and whitespace if there
50 any continuation line(s) were present. Return
\code{None
} if there is
51 no header matching
\var{name
}.
54 \begin{funcdesc
}{getheader
}{name
}
55 Like
\code{getrawheader(
\var{name
})
}, but strip leading and trailing
56 whitespace (but not internal whitespace).
59 \begin{funcdesc
}{getaddr
}{name
}
60 Return a pair (full name, email address) parsed from the string
61 returned by
\code{getheader(
\var{name
})
}. If no header matching
62 \var{name
} exists, return
\code{None, None
}; otherwise both the full
63 name and the address are (possibly empty )strings.
65 Example: If
\code{m
}'s first
\code{From
} header contains the string\\
66 \code{'jack@cwi.nl (Jack Jansen)'
}, then
67 \code{m.getaddr('From')
} will yield the pair
68 \code{('Jack Jansen', 'jack@cwi.nl')
}.
69 If the header contained
70 \code{'Jack Jansen <jack@cwi.nl>'
} instead, it would yield the
74 \begin{funcdesc
}{getaddrlist
}{name
}
75 This is similar to
\code{getaddr(
\var{list
})
}, but parses a header
76 containing a list of email addresses (e.g. a
\code{To
} header) and
77 returns a list of (full name, email address) pairs (even if there was
78 only one address in the header). If there is no header matching
79 \var{name
}, return an empty list.
81 XXX The current version of this function is not really correct. It
82 yields bogus results if a full name contains a comma.
85 \begin{funcdesc
}{getdate
}{name
}
86 Retrieve a header using
\code{getheader
} and parse it into a
9-tuple
87 compatible with
\code{time.mktime()
}. If there is no header matching
88 \var{name
}, or it is unparsable, return
\code{None
}.
90 Date parsing appears to be a black art, and not all mailers adhere to
91 the standard. While it has been tested and found correct on a large
92 collection of email from many sources, it is still possible that this
93 function may occasionally yield an incorrect result.
96 \code{Message
} instances also support a read-only mapping interface.
97 In particular:
\code{m
[name
]} is the same as
\code{m.getheader(name)
};
98 and
\code{len(m)
},
\code{m.has_key(name)
},
\code{m.keys()
},
99 \code{m.values()
} and
\code{m.items()
} act as expected (and
102 Finally,
\code{Message
} instances have two public instance variables:
104 \begin{datadesc
}{headers
}
105 A list containing the entire set of header lines, in the order in
106 which they were read. Each line contains a trailing newline. The
107 blank line terminating the headers is not contained in the list.
111 The file object passed at instantiation time.