[thelist] <TIP> FIXING MS WORD TO HTML CRAP CODE

Bob Davis bobd at members.evolt.org
Thu May 25 17:40:32 2000


on 05/25/2000 3:31 PM, Villano, Paul at VillanoP@usachcs-emh1.army.mil
wrote:

> Finally!!  Finally I get to post a <tip>!!!
> 
> <tip>
> Dealing with the infamous Word 2000 conversion to HTML files.  When last we
> checked with our hero, he was looking for sharp objects to hurt himself
> because trying to use the "Compact HTML" fix that MS provided did nothing to
> fix the problem.  However, he discovered (okay, someone told him) that you
> must run the MS HTML Filter 2.0 from OUTSIDE of Word.  (That is, choose it
> from the MS Office Tools menu off of the Start menu).  This will bring up a
> box that prompts you for the file to add (the MS Word 2000 converted to HTML
> file you want to clean up).  BE SURE TO CLICK THE OPTIONS BUTTON ON THE
> BOTTOM OF THE PROMPT BOX.  This will allow you to specify what you want to
> clean up (CSS tags, MS-specific tags, useless Meta tags, etc.)  It's not
> pretty, and it's not perfect...but it's an 90% solution.
> </tip>
> 

<tip type="HTML optimization and validation" author="bob davis">
Another option for cleaning and validation that works on a lot of platforms
(and I do mean a *lot*) is HTML Tidy.  It's fast, good and, best of all,
free!

http://www.w3.org/People/Raggett/tidy/

There are options for cleaning Word generated HTML and there's a plugin for
BBEdit (works very well) and there is a list of editors that work with it by
design.

Hey, it's from W3, how bad can it be?

</tip>
-- 

bob davis
bobd@members.evolt.org