[thesite] Tip Harvester question (was: [***] Formatting tips )

Dean Mah dsmah at home.com
Fri Mar 30 14:05:02 CST 2001


When I was working on the Perl version I grabbed tips where the tag
appeared as the first non-whitespace characters.  I would also skip
over the part of a message that followed the "--- Original Message
---" text that Outlook tends to put in for quoted messages.  I would
also handle tips within tips, i.e., if you had something like:

<tip>
<tip>
</tip>
</tip>

it would pull it out as two separate tips, which would happen from time
to time.

Dean


Seth Bienek writes:

> The tip parser that I wrote looks for a line break followed by  <tip> and
> includes everything between the closing bracket and the first bracket of
> </tip>.  If it doesn't find the closing tag, it throws the whole thing out.
> 
> The drawback to this approach is that if someone replies and their email
> software doesn't support "quoting", the tip would go in twice.
> 
> I have put some thought into this, but have not been able to come up with a
> real solution other than a review process, or doing exact comparisons of
> every tip (which seems like it would inhibit performance, and I'm not sure
> if you can even compare two whole text fields..)
> 
> I'm open to suggestions..




More information about the thesite mailing list