Result: Indexing structures derived from Syntax in TREC-3: system description

Title:

Indexing structures derived from Syntax in TREC-3: system description

Authors:

SMEATON, A. F.; O'DONNELL, R.; KELLEDY, F.

Source:

TREC-3: text retrieval conference. NIST special publication. (500225):55-67

Publisher Information:

Gaithersburg, MD: National Institute of Standards and Technology, 1995.

Publication Year:

1995

Physical Description:

print, 8 ref

Original Material:

INIST-CNRS

Subject Terms:

Science technology, industry, Exact sciences and technology, Sciences and techniques of general use, Information science. Documentation, Information retrieval systems. Information and document management system, Information retrieval systems, Information and communication sciences, Document retrieval system. Information and document management system, Documentation data processing, Syntactic analysis, Case study, Critical study, Evaluation, Indexing, Natural language, Foreign language, Tree structured method, Prototype, Information retrieval, Search system

Document Type:

Conference Paper

File Description:

text

Language:

English

Author Affiliations:

Dublin City University, School of Computer Applications, Dublin 9, Ireland

ISSN:

1048-776X

Access URL:

http://pascal-francis.inist.fr/vibad/index.php?action=search&terms=2484598

Rights:

Copyright 1997 INIST-CNRS
CC BY 4.0
Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS

Notes:

Sciences of information and communication. Documentation

FRANCIS

Accession Number:

edscal.2484598

Database:

PASCAL Archive

Further Information

This paper describes an approach to information retrieval based on a syntactic analysis of document texts and user queries and, from that analysis, the construction of tree structures (TSAs) that encode and capture language ambiguities. TSAs are constructed at the clause level; each document is represented by many TSAs and each query may be represented by several TSAs. The TSAs from documents and from queries are then matched, and the degrees of overlap between individual TSAs are computed and aggregated to yield a score for each document, which is then used to rank the collection. This paper presents the system description used when benchmarking our retrieval strategy on category B of TREC-3, i.e. on c. 550 Mbytes of Wall Street Journal (WSJ) newspaper text. The implementation is based on two-stage retrieval, in which a statistically based pre-fetch retrieval selects the set of WSJ articles for the more computationally expensive language-based processing. The results of our retrieval system in terms of precision and recall are disappointing, and an analysis of why is also included. Part of this analysis is a direct comparison between our system and some mainstream IR approaches. In addition to performing ad hoc retrieval on texts in English, we have also performed ad hoc retrieval on texts in Spanish using a weighted trigram approach; this is outlined, with performance results, in an appendix.
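
The two-stage pipeline in the abstract (statistical pre-fetch, then structure matching with aggregated overlap scores) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: each TSA is stood in for by a flat set of (head, modifier) pairs, overlap is measured with the Jaccard coefficient, aggregation sums the best match per query structure, and the function names, sample documents, and pre-fetch scoring are all hypothetical.

```python
from collections import Counter

# Hypothetical toy collection: "terms" feed the pre-fetch stage, "tsas" the matching stage.
# Each structure is a frozenset of (head, modifier) pairs, a stand-in for a real TSA.
docs = {
    "d1": {"terms": ["oil", "price", "rise"],
           "tsas": [frozenset({("price", "oil"), ("rise", "price")})]},
    "d2": {"terms": ["oil", "spill"],
           "tsas": [frozenset({("spill", "oil")})]},
    "d3": {"terms": ["bank", "merger"],
           "tsas": [frozenset({("merger", "bank")})]},
}
query_terms = ["oil", "price"]
query_tsas = [frozenset({("price", "oil")})]


def prefetch(query_terms, docs, k):
    """Stage 1 (assumed scoring): keep the top-k documents by shared term count."""
    q = Counter(query_terms)
    scored = []
    for doc_id, doc in docs.items():
        d = Counter(doc["terms"])
        scored.append((sum(min(q[t], d[t]) for t in q), doc_id))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for score, doc_id in scored[:k] if score > 0]


def overlap(tsa_a, tsa_b):
    """Degree of overlap between two structures (assumed: Jaccard coefficient)."""
    if not tsa_a or not tsa_b:
        return 0.0
    return len(tsa_a & tsa_b) / len(tsa_a | tsa_b)


def score_document(query_tsas, doc_tsas):
    """Aggregate (assumed): sum, over query structures, of the best overlap in the document."""
    return sum(max((overlap(q, d) for d in doc_tsas), default=0.0)
               for q in query_tsas)


def rank(query_terms, query_tsas, docs, k=2):
    """Stage 2: run the costlier structure matching only on pre-fetched candidates."""
    candidates = prefetch(query_terms, docs, k)
    scored = [(score_document(query_tsas, docs[c]["tsas"]), c) for c in candidates]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(doc_id, score) for score, doc_id in scored]


print(rank(query_terms, query_tsas, docs))
```

The point of the split is cost: the cheap term-based stage trims the collection so that the structure comparison, which grows with the number of query and document structures, runs on only a few candidates.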
