From: Nadav Rotem (nadavrotem_at_mail.ru)
Date: Tue Nov 25 2003 - 08:48:13 EST
The Inner product of two text is defined as the number of topics they
share. One of my professors is doing research in this field and needed a
matrix of the inner product of cunks of text. Here is a quick example,
in a Bash script, of how to use OTS to generate this list of topics.
Usage of the script:
[nadav_at_gringo articles]$ ./inner.sh sacbee1.txt sacbee2.txt
<sacbee1.txt,sacbee2.txt>= 0
[nadav_at_gringo articles]$ ./inner.sh test1.txt test2.txt
<test1.txt,test2.txt>= 3
>From your c Code you can get the list of topics through this call:
word = ots_word_in_list(Doc->ImpWords,i);
-Nadav
This archive was generated by hypermail 2.1.4 : Tue Nov 25 2003 - 08:49:54 EST