[thelist] Stripping HTML from Emails

admin at antonakis.co.uk admin at antonakis.co.uk
Mon May 24 17:55:02 CDT 2004


Liam,

Thanks for that, however it won't work.

Perhaps I didn't make myself clear in what I meant by stripping out the 
HTML form emails. I didn't just mean the HTML itself, I meant everything 
else associated with it as well.

For example, here is an HTML email minus it's headers:
Basically I just want the LX0008 & LX0010 part, and not all the 
content-type, etc that's added by some email browsers.

MIME-Version: 1.0
Content-Type: multipart/alternative; 
boundary="-----------------------------1085435574"
X-Mailer: 9.0 for Windows sub 630


-------------------------------1085435574
Content-Type: text/plain; charset="US-ASCII"
Content-Transfer-Encoding: 7bit

LX0008
LX0010

-------------------------------1085435574
Content-Type: text/html; charset="US-ASCII"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=3DContent-Type content=3D"text/html; charset=3DUS-ASCII">
<META content=3D"MSHTML 6.00.2737.800" name=3DGENERATOR></HEAD>
<BODY id=3Drole_body style=3D"FONT-SIZE: 10pt; COLOR: #000000; 
FONT-FAMILY:=20=
Arial"=20
bottomMargin=3D7 leftMargin=3D7 topMargin=3D7 rightMargin=3D7><FONT 
id=3Drol=
e_document=20
face=3DArial color=3D#000000 size=3D2>
<DIV>LX0008</DIV></FONT></BODY></HTML>

Regards
Alexis
-------------------------------1085435574--

Liam Delahunty wrote:
> on 24/05/2004 22:04 Alexis Antonakis wrote:
> 
>> Hi,
>>
>> I have a script written in PHP which extracts details from emails.
>> Everything works fine for plain text emails, but HTML ones are a 
>> nightmare.
>> Can anybody point me in the right direction as to how I can detail 
>> with these emails.
> 
> 
> Clear HTML with something like http://php.net/strip_tags ?
> 
> 



More information about the thelist mailing list