[thelist] CF -- scrubbing special characters

Joshua Olson joshua at waetech.com
Mon Sep 9 19:41:00 CDT 2002


----- Original Message -----
From: "rudy" <r937 at interlog.com>
Sent: Monday, September 09, 2002 5:47 PM


> it appears to me, a mere cf grasshopper, that you are just substituting a
> numeric entity for an ascii character, and while this is great as far as
it
> goes, it's not a completely satisfactory solution for "invalid" characters
> like the infamous &#0149; or word's curly quotes, etc.

Rudy,

Using the code I provided, that particular character is replaced
appropriately.  You are correct in noticing that not all unicode characters
will be replaced.  Only characters between 127 and 255 slots will be
replaced.  In this case, the character you suggested is 149 and will be
replaced.

-joshua




More information about the thelist mailing list