Announce: The Open Text Summarizer

From: Dom Lachowicz (domlachowicz@yahoo.com)
Date: Thu May 01 2003 - 19:30:10 EDT

  • Next message: Martin Sevior: "Re: Announce: The Open Text Summarizer"

    I'd like to take this opportunity to introduce
    everyone to Nadav Rotem. Nadav, meet everyone.

    Nadav is the primary author of the Open Text
    Summarizer tool (Ots), which now lives in our CVS.
    `cvs co ots` if you're interested. I've been helping
    him out a bunch where I can.

    What is Ots?
    "The open text summarizer is an open source tool for
    summarizing texts. The program reads a text and
    decides which sentences are important and which are
    not."

    What can Ots do?
    Given a source document and a summarization percent,
    it will do one of:
    1) Output the summary as text
    2) Output the summary+original text as color-coded
    HTML
    3) Output only the keywords and their summaries
    4) Output a short summary of the n highest ranked
    terms

    Its only requirements are Glib2 (for some elementary
    list and string structures) and Popt (for command line
    processing). It should compile out of the box on Win32
    using either Mingw or Cygwin, provided you have Glib2
    installed. It also ships with a RPM spec and
    Pkg-Config file.

    It includes:

    *) A dynamic and static library
    *) A command line processing tool

    and currently works on the English and Hebrew
    languages. Work is in-progress for both Russian and
    German additions. Ots also has a good amount of
    documentation and examples, including the script of
    "When Harry Met Sally" ;-)

    After we extend the API a bit to include parsing an
    UTF-8 input stream in addition to FILE*'s, I intend to
    make an auto-summarization plugin for AbiWord based on
    ots.

    Parties interested in helping should get in touch with
    Nadav.

    Cheers,
    Dom

    __________________________________
    Do you Yahoo!?
    The New Yahoo! Search - Faster. Easier. Bingo.
    http://search.yahoo.com



    This archive was generated by hypermail 2.1.4 : Thu May 01 2003 - 19:42:11 EDT