Re: Abiword XML format description

From: F J Franklin (F.J.Franklin@sheffield.ac.uk)
Date: Fri Nov 08 2002 - 05:25:01 EST

  • Next message: Hubert Figuiere: "Re: Abiword XML format description"

    On Fri, 8 Nov 2002, David Buddrige wrote:
    > The gnumeric manual (as with many free-software
    > products), is in html format. I plan to write a perl
    > script that will convert the html gnumeric manual into
    > an abiword document, which I can then adjust/format if
    > necessary.
    >
    > To do this, I was wanting to get a complete
    > description of the XML tags that Abiword
    > uses/recognises. Is there a document anywhere that
    > describes the Abiword XML format?

    There is a DTD but it's woefully out of date. However,...

    AbiWord has an XHTML importer that is unfortunately too fussy for its own
    good, but you could try running the HTML docs through tidy to turn them
    into valid XHTML and then import them into AbiWord like that. Also,
    AbiWord has an HTML importer plugin that's more lenient but is also only
    half-complete and will ignore text in lists and tables (less than ideal).

    I'm planning to finish/rewrite the HTML importer at some point, not sure
    how soon, though.

    Regards, Frank

    ps. The development series of AbiWord (1.1.x) now has a much improved
        XML-DocBook import plugin, so there's a third possible route for you
        to take: HTML->DocBook->AbiWord...

    Francis James Franklin
    F.J.Franklin@shef.ac.uk

      `Medium atomic weights are available: Gold, Lead, Copper, Jet, Diamond,
    Radium, Sapphire, Silver and Steel.
      `Sapphire and Steel have been assigned...'



    This archive was generated by hypermail 2.1.4 : Fri Nov 08 2002 - 05:34:12 EST