Discussion:
[Bug 7657] New: Certain messages get mangled by double UTF-8 encoding
b***@bugzilla.spamassassin.org
2018-11-20 20:05:56 UTC
Permalink
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7657

Bug ID: 7657
Summary: Certain messages get mangled by double UTF-8 encoding
Product: Spamassassin
Version: 3.4.2
Hardware: PC
OS: Linux
Status: NEW
Severity: major
Priority: P2
Component: spamassassin
Assignee: ***@spamassassin.apache.org
Reporter: ***@caletka.cz
Target Milestone: Undefined

Created attachment 5628
--> https://bz.apache.org/SpamAssassin/attachment.cgi?id=5628&action=edit
Minimum working example

I use Debian 9 (Stretch) with spamassassin script run by procmail in the user's
mailbox. Last week, spamassassin got updated from 3.4.1 to 3.4.2 and since
then, some messages are mangled – the message body that was received UTF-8
encoded and transferred in 8bit mode is treated as ISO-8859-1 and reencoded to
UTF-8 again, resulting in totally garbled accented characters.

I'm attaching a minimum working example. To reproduce the issue, one has to put

add_header all Report _REPORT_

into ~/.spamassassin/user_prefs and call spamassassin < mwe.eml
mwe-mangled.eml
In the output file, the e-mail body will get double encoded, showing garbage
instead of accented characters.

Disabling Report header insertion works around the issue.
--
You are receiving this mail because:
You are the assignee for the bug.
b***@bugzilla.spamassassin.org
2018-11-20 20:07:01 UTC
Permalink
https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7657

Ondřej Caletka <***@caletka.cz> changed:

What |Removed |Added
----------------------------------------------------------------------------
CC| |***@caletka.cz

--- Comment #1 from Ondřej Caletka <***@caletka.cz> ---
Created attachment 5629
--> https://bz.apache.org/SpamAssassin/attachment.cgi?id=5629&action=edit
Mangled minimum working example
--
You are receiving this mail because:
You are the assignee for the bug.
Loading...