From Caolan McNamara <Caolan.McNamara@ul.ie>

Announcing

Mswordview, a MS Word 8 Decoder

mswordview is a program that can understand the Microsoft word 8 binary file format (office97), it currently converts word into html, which can then be read with a browser. its based on the word 8 format documentation that ms released, and uses laola (included) to split an OLE files into its constituent streams.

Features include :

1) ability to understand fastsaved files as well as non-fastsaved files.

2) conversion of word header paragraph styles into appropriate header levels of html.

3) conversion of basic font attributes such as italic, bold and font size into html tags

4) conversion of word tables into html tables.

5) a fair understanding of lists.

Non Supported Features include :

1) embedded graphics or other embedded types.

2) headers and footers.

3) fully correct conversion of tab stops and other formatting done by the user done with whitespace, because you cant really do this in html.

4) correct conversion of lists, all lists become bullet pointed lists (<UL>), got list format is a toughy.

5) other extraneous stuff like multi columns, table of contents, and those special fields in general

6) word 6 and 7 etc aren't currently supported, just word 8

Defects are :

1) mswordview uses laola to extract the OLE streams from the document, and on occasion laola cant cope with some files, i.e. corrupt docs and some large docs.

2) I've only tested it with whatever word 8 files I've at hand, if you have some that blow mswordview up, or get wrong output out of it then you can submit your file at the web gateway listed below.

Web Page for download and information at

http://www.csn.ul.ie/~caolan/docs/MSWordView.html

Web gateway to mswordview at

http://www.csn.ul.ie/~caolan/docs/MSWordView-Demo.html

I've submitted it to sunsite, and its in incoming there, i'd imagine it'll end up at

ftp://sunsite.unc.edu/pub/Linux/utils/file/mswordview-0.0.14.tar.gz

its also available at the website mentioned above and at ftp ://skynet.csn.ul.ie/pub/linux/utilsmswordview-0.0.14.tar.gz

I'm of course interested to hear if it works for you, if it doesn't, and if you have bug fixes for it.

(P.S. if someone like applixware or stardivision would like this to convert from word to their format, they can get in touch :-) )

C.

Real Life : Caolan McNamara

* Doing : MSc in HCIWork : Caolan.McNamara@ul.ie

* Phone : +353-61-202699 URL : http://skynet.csn.ul.ie/~caolan

* Sig : an oblique strategy Infinitesimal gradations

- --

This article has been digitally signed by the moderator, using PGP.

http://www.iki.fi/mjr/cola-public-key.asc has PGP key for validating signature.

Send submissions for comp.os.linux.announce to : linux-announce@news.ornl.gov PLEASE remember a short description of the software and the LOCATION.

This group is archived at http://www.iki.fi/mjr/linux/cola.html