Treffer: A cooccurrence-based thesaurus and two applications to information retrieval

Title:
A cooccurrence-based thesaurus and two applications to information retrieval
Source:
Intelligent multimedia information retrieval systems and management (New York NY, October 11-12, 1994). :266-274
Publisher Information:
Paris: CID, 1994.
Publication Year:
1994
Physical Description:
print, 20 ref
Original Material:
INIST-CNRS
Document Type:
Konferenz Conference Paper
File Description:
text
Language:
English
Author Affiliations:
Xerox Palo Alto res. cent., Palo Alto CA 94304, United States
Rights:
Copyright 1995 INIST-CNRS
CC BY 4.0
Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS
Notes:
Sciences of information and communication. Documentation

FRANCIS
Accession Number:
edscal.3553780
Database:
PASCAL Archive

Weitere Informationen

This paper presents a new method for computing a thesaurus form a text corpus. Each word is represented as a vector in a multi-dimensional space that captures cooccurence information. Words are defined to be similar if they have similar cooccurence patterns. Two different methods for using these thesaurus vectors in information retrieval are shown to significantly improve performance over the ARPA Tipster evaluation corpus as compared to a tf.idf baseline.