From: Dom Lachowicz (domlachowicz@yahoo.com)
Date: Thu May 01 2003 - 19:30:10 EDT
I'd like to take this opportunity to introduce
everyone to Nadav Rotem. Nadav, meet everyone.
Nadav is the primary author of the Open Text
Summarizer tool (Ots), which now lives in our CVS.
`cvs co ots` if you're interested. I've been helping
him out a bunch where I can.
What is Ots?
"The open text summarizer is an open source tool for
summarizing texts. The program reads a text and
decides which sentences are important and which are
not."
What can Ots do?
Given a source document and a summarization percent,
it will do one of:
1) Output the summary as text
2) Output the summary+original text as color-coded
HTML
3) Output only the keywords and their summaries
4) Output a short summary of the n highest ranked
terms
Its only requirements are Glib2 (for some elementary
list and string structures) and Popt (for command line
processing). It should compile out of the box on Win32
using either Mingw or Cygwin, provided you have Glib2
installed. It also ships with a RPM spec and
Pkg-Config file.
It includes:
*) A dynamic and static library
*) A command line processing tool
and currently works on the English and Hebrew
languages. Work is in-progress for both Russian and
German additions. Ots also has a good amount of
documentation and examples, including the script of
"When Harry Met Sally" ;-)
After we extend the API a bit to include parsing an
UTF-8 input stream in addition to FILE*'s, I intend to
make an auto-summarization plugin for AbiWord based on
ots.
Parties interested in helping should get in touch with
Nadav.
Cheers,
Dom
__________________________________
Do you Yahoo!?
The New Yahoo! Search - Faster. Easier. Bingo.
http://search.yahoo.com
This archive was generated by hypermail 2.1.4 : Thu May 01 2003 - 19:42:11 EDT