Cleaning Up Word's HTML

web pages, but Word uses it to insert product and content information that is not strictly necessary.

Example 16-1. Header Information

    <meta http-equiv=Content-Type content="text/html; charset=windows-1252">
    <meta name=ProgId content=Word.Document>
    <meta name=Generator content="Microsoft Word 9">
    <meta name=Originator content="Microsoft Word 9">
    <link rel=File-List href="./Document19_files/filelist.xml">

Example 16-2 shows the first several lines of code that appear dimmed in the HTML Source Viewer in Figure 16-11. This code contains various document information, such as the author, revision times, and document statistics, that is pulled from Word's document properties (File → Properties). I'll bet you didn't know all of this stuff was going onto the Web when you saved a document.

Example 16-2. Document Property Information

    <!--[if gte mso 9]><xml>
     <o:DocumentProperties>
      <o:Author>Walter J. Glenn</o:Author>
      <o:Revision>1</o:Revision>
      <o:TotalTime>0</o:TotalTime>
      <o:LastPrinted>1601-01-01T00:00:00Z</o:LastPrinted>
      <o:Created>2000-05-18T03:12:00Z</o:Created>
      <o:LastSaved>1601-01-01T00:00:00Z</o:LastSaved>
      <o:Pages>1</o:Pages>