From Word to XML (24 tags)
In the year's last Q&A column John E. Simpson discusses some of the issues surrounding the conversion of MS Word documents to XML.
Manipulating Word Documents with Perl (9 tags)
Unix hackers love their text editors for plain-text manipulatey goodness--especially Emacs and Vim with their wonderful extension languages (and sometimes Perl bindings). Don't fret, defenestrators-to-be. Andrew Savikas demonstrates how to use Perl for your string-wrangling when you have to suffer through using Word.
Word to XML and Back Again (7 tags)
Peter Sefton introduces a technique, using Python and XSLT, to convert MS Word XML output into something useful.
Lightweight XML Editing in Word 2003 (6 tags)
Strictly speaking, you can edit custom XML in Word, but there are limitations that make the process needlessly complex. This article presents a lightweight approach to XML editing in Word that works in all editions of Word 2003. All you need besides Word is an XSLT processor. Evan Lenz, coauthor of Office 2003 XML, shows you how.
Opening Microsoft File Formats to Java (6 tags)
Microsoft's file formats were once black arts to developers on non-sanctioned platforms. Thanks to Jakarta POI, however, it's possible to read and write them from Java. Apache's Andrew C. Oliver and Avik Sengupta explain the basics of the project in the first of a series of three articles.