[PATCH] Word 2 import (provisional)


Subject: [PATCH] Word 2 import (provisional)
From: Martin Vermeer (mv@liisa.pp.fi)
Date: Sat Sep 02 2000 - 20:02:38 CDT


Hi,

Now that CVS is back up -- or at least communicating at more that 50 bytes
a second -- I finally got made a patch. The code added allows wv to import
an MS Word 2 document, and output it in legible form. It appears (based
on what I have seen, half a dozen docs from all over the world) that
Word 2 docs are quite a lot simpler structured, with a linear text layout,
no OLE.

Of course nothing is perfect, and this outputs the text only in text form,
without "properties" or even basic formatting such as paragraphs (<p> or
\par). Doing that requires getting the FIB right (does anybody know the
FIB layout for Word 2?) and understanding what is happening on lines
188-399 of decode_simple.c (not very legible to an amateur :-( )

Any help or info would be appreciated.

As this patch adds functionality that was competely lacking earlier, it
should IMHO be applied after (necessary) regression testing. I don't
think there are very many Word 2 users any more, but still this may be
good to have.

-- 
Martin Vermeer mv@liisa.pp.fi
:wq




This archive was generated by hypermail 2b25 : Sat Sep 02 2000 - 14:59:36 CDT