Hi Petr,
> It seems like this would be done in a similar way to what OCR
> programs do... maybe some code could be reused (it would anyway
> be wonderful if AbiWord had integrated OCR some day...)
For what it's worth, we already have decent PDF OCR
input. The Poppler/Xpdf TextOutputDev class
re-orders text blocks into lines, columns, etc. as
appropriate. We then import the UTF-8 text as it has been
arranged for us.
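
To give a rough idea of what "re-ordering blocks into lines" means, here is a minimal Python sketch. It is not Poppler's code or algorithm: the Fragment type, the fragments_to_lines function and the y_tolerance parameter are all made-up names for illustration, and column detection is left out entirely. It only assumes each text fragment carries a left edge and a baseline position.

```python
from dataclasses import dataclass


@dataclass
class Fragment:
    """Hypothetical positioned text run, as a PDF page might yield."""
    text: str
    x: float  # left edge of the fragment
    y: float  # baseline position, increasing down the page


def fragments_to_lines(fragments, y_tolerance=2.0):
    """Group fragments into lines by baseline, then read each line left to right.

    Assumption: fragments whose baselines differ by at most y_tolerance
    from the first fragment of a line belong to that line.
    """
    lines = []
    # Visit fragments top to bottom so that each line is built in one pass.
    for frag in sorted(fragments, key=lambda f: (f.y, f.x)):
        if lines and abs(frag.y - lines[-1][0].y) <= y_tolerance:
            lines[-1].append(frag)
        else:
            lines.append([frag])
    # Within a line, order by the left edge and join with single spaces.
    return [
        " ".join(f.text for f in sorted(line, key=lambda f: f.x))
        for line in lines
    ]


if __name__ == "__main__":
    page = [
        Fragment("world", 50.0, 10.5),
        Fragment("Hello", 0.0, 10.0),
        Fragment("Second", 0.0, 30.0),
        Fragment("line", 60.0, 29.5),
    ]
    for line in fragments_to_lines(page):
        print(line)
```

A real implementation also has to split the page into columns and cope with rotated or overlapping text, which this sketch does not attempt.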
Best,
Dom
Received on Tue Sep 20 19:43:51 2005