mutt, ISO-8859-1, and Terminal.app
Jens Troeger
savage at light-speed.de
Tue Sep 9 14:23:42 PDT 2008
On Tue, Sep 09, 2008 at 04:05:18PM -0500, Ryan Schmidt wrote:
> On Sep 9, 2008, at 1:03 PM, Jens Troeger wrote:
>
>> On Tue, Sep 09, 2008 at 07:49:23AM -1000, Baron Fujimoto wrote:
>>
>>> On Tue, 9 Sep 2008, Jens Troeger wrote:
>>>
>>>> I'm running on OS-X 10.4.11 and Terminal version 1.5.1 (133-1). Just
>>>> recently I installed MacPorts, including mutt-devel so that I can access
>>>> my gmail.com account. It all works nicely; however, there's one big
>>>> annoyance: ISO-8859-1 text. Some strings in the email header contain
>>>> such characters:
>>>>
>>>> From: "=?ISO-8859-1?Q?Bla Bl=ECn?=" < bla at bla >
>>>>
>>>> and that's exactly what I see in my email list in mutt. No actual name
>>>> with language specific characters, but instead this blob of undecoded
>>>> string.
>>>
>>>
>>> Is that a properly encoded header? Section 2 of RFC 2047 says,
>>>
>>> IMPORTANT: 'encoded-word's are designed to be recognized as 'atom's
>>> by an RFC 822 parser. As a consequence, unencoded white space
>>> characters (such as SPACE and HTAB) are FORBIDDEN within an
>>> 'encoded-word'. For example, the character sequence
>>>
>>> =?iso-8859-1?q?this is some text?=
>>>
>>> would be parsed as four 'atom's, rather than as a single 'atom' (by
>>> an RFC 822 parser) or 'encoded-word' (by a parser which understands
>>> 'encoded-words'). The correct way to encode the string "this is some
>>> text" is to encode the SPACE characters as well, e.g.
>>>
>>> =?iso-8859-1?q?this=20is=20some=20text?=
>>>
>>> and later in section 6.3:
>>>
>>> A mail reader need not attempt to display the text associated with an
>>> 'encoded-word' that is incorrectly formed.
>>>
>>> <http://www.ietf.org/rfc/rfc2047.txt>
>>>
>>>> Does anybody have an idea what's going on here, and how I can fix that?
>>>
>>> I don't know if this is the source of the problem, but it caught my eye.
>>
>> Sorry, I wanted to protect the name encoded in that string, and messed
>> it up. The string that's failing is this one:
>>
>> "=?ISO-8859-1?Q?Mia_Det=ECn?="
>>
>> So that would be a correct string then?
>
> Yes. It decodes to "Mia_Detìn". See:
>
> php -r 'echo
> utf8_encode(mb_decode_mimeheader("=?ISO-8859-1?Q?Mia_Det=ECn?=")) . "\n";'
>
> So could it be that mutt does not support RFC 822-/2047-style subject
> encoding? You'll have to ask the developers of mutt what's going on here.
The mutt that's running on my Linux box does the encoding right, that's
why I originally thought it's a Terminal problem on my Mac. Maybe
there's a compiler option I didn't set, or an env variable?
Jens
--
Jens Troeger
http://savage.light-speed.de/
More information about the macports-users
mailing list