Result: Vector expansion in a large collection

Title:
Vector expansion in a large collection
Source:
TREC-1: Text Retrieval Conference. NIST special publication. (500207):343-351
Publisher Information:
Gaithersburg, MD: National Institute of Standards and Technology, 1993.
Publication Year:
1993
Physical Description:
print, 7 ref
Original Material:
INIST-CNRS
Subject Terms:
Science technology, industry, Sciences et technologies, industries, Sciences exactes et technologie, Exact sciences and technology, Sciences et techniques communes, Sciences and techniques of general use, Sciences de l'information. Documentation, Information science. Documentation, Traitement et recherche de l'information, Information processing and retrieval, Structure et analyse des documents et de l'information, Information and document structure and analysis, Analyse des contenus, Content analysis, Indexation. Classification. Résumé. Synthèses, Indexing. Classification. Abstracting. Syntheses, Sciences de l'information et de la communication, Information and communication sciences, Traitement et recherche d'information, Informatique documentaire, Documentation data processing, Información documental, Ambiguité, Ambiguity, Ambiguedad, Analyse lexicale, Lexical analysis, Análisis lexical, Assistance ordinateur, Computer aid, Asistencia ordenador, Contrôle automatique, Automatic monitoring, Control automático, Espace vectoriel, Vector space, Espacio vectorial, Essai, Test, Ensayo, Evaluation système, System evaluation, Evaluación sistema, Expansion, Expansión, Indexation, Indexing, Indización, Langage naturel, Natural language, Lenguaje natural, Modèle statistique, Statistical model, Modelo estadístico, Modèle, Models, Modelo, Pondération, Weighting, Ponderación, Prototype, Prototipo, Question documentaire, Query, Pregunta documental, Recherche documentaire, Document retrieval, Recuperación documental, Recherche développement, Research and development, Investigación desarrollo, Synonymie, Synonymy, Sinonimia, Système documentaire, Document retrieval system, Sistema recuperación documental, Texte, Text, Texto, Vocabulaire contrôlé, Controlled vocabulary, Vocabulario controlado, Orienté concept, Concept oriented, SMART, TREC, WORDNET
Document Type:
Conference Paper
File Description:
text
Language:
English
Author Affiliations:
Siemens Corp. Research, Inc., Princeton NJ 08540, United States
ISSN:
1048-776X
Rights:
Copyright 1994 INIST-CNRS
CC BY 4.0
Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS
Notes:
Sciences of information and communication. Documentation

FRANCIS
Accession Number:
edscal.3772400
Database:
PASCAL Archive

Further information

This paper investigates whether a completely automatic, statistical expansion technique that uses a general-purpose thesaurus as a source of related concepts is viable for a large collection. The retrieval results indicate that the particular expansion technique used here improves the performance of some queries but degrades the performance of others. The variability of the method is attributable to two main factors: the choice of concepts that are expanded and the confounding effects expansion has on concept weights. Addressing these problems will require both a better method for determining the important concepts of a text and a better method for determining the correct sense of an ambiguous word.
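
The effect described in the abstract, where expansion helps some matches and hurts others because it shifts concept weights, can be illustrated with a minimal vector-space sketch. This is not the paper's method: the toy thesaurus, the `added_weight` value, the example documents, and the function names `expand` and `cosine` are all hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict

# Hypothetical toy thesaurus (an assumption for this sketch; the paper
# used a general-purpose thesaurus, not this dictionary).
TOY_THESAURUS = {
    "car": ["automobile", "auto"],
}


def expand(query_terms, thesaurus, added_weight=0.5):
    """Build a term -> weight vector, adding related terms at a reduced weight.

    The value of added_weight is an illustrative assumption.
    """
    vector = defaultdict(float)
    for term in query_terms:
        vector[term] += 1.0
        for related in thesaurus.get(term, []):
            vector[related] += added_weight
    return dict(vector)


def cosine(a, b):
    """Cosine similarity between two sparse term -> weight vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


query = ["car", "loan"]
plain = expand(query, {})                # no expansion
expanded = expand(query, TOY_THESAURUS)  # thesaurus-based expansion

doc_synonym = {"automobile": 1.0, "loan": 1.0}  # uses a synonym of "car"
doc_exact = {"car": 1.0, "loan": 1.0}           # uses the query's own words

# Expansion raises the score of the synonym document ...
print(cosine(plain, doc_synonym), cosine(expanded, doc_synonym))
# ... but lowers the score of the exact-match document, because the
# added terms dilute the relative weight of the original concepts.
print(cosine(plain, doc_exact), cosine(expanded, doc_exact))
```

In this toy setting the added terms enlarge the query vector's norm, so a document that already matched the original terms exactly scores lower after expansion, while a document that uses a synonym scores higher. Sense ambiguity, the second problem the abstract names, is not modelled here.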