Treffer: Natural language information retrieval : TREC-3 report

Title:

Natural language information retrieval : TREC-3 report

Authors:

STRZALKOWSKI, T, CARBALLO, J. P, MARINESCU, M

Source:

TREC-3: text retrieval conferenceNIST special publication. (500225):39-53

Publisher Information:

Gaithersburg, MD: National Institute of Standards and Technology, 1995.

Publication Year:

1995

Physical Description:

print, 24 ref

Original Material:

INIST-CNRS

Subject Terms:

Science technology, industry, Sciences et technologies, industries, Sciences exactes et technologie, Exact sciences and technology, Sciences et techniques communes, Sciences and techniques of general use, Sciences de l'information. Documentation, Information science. Documentation, Systèmes de recherche d'informations. Système de gestion documentaire et d'information, Information retrieval systems. Information and document management system, Systèmes de recherche d'information, Information retrieval systems, Sciences de l'information et de la communication, Information and communication sciences, Système de recherche documentaire. Système de gestion documentaire et d'information, Informatique documentaire, Documentation data processing, Información documental, Etude cas, Case study, Estudio caso, Langage naturel, Natural language, Lenguaje natural, Produit recherche, Search result, Resultado búsqueda, Prototype, Prototipo, Recherche documentaire, Document retrieval, Recuperación documental, Système recherche, Search system, Sistema investigación, Traitement information, Information processing, Procesamiento información, Linguistique informatique, Computational linguistics, NYU, TREC-3

Document Type:

Konferenz Conference Paper

File Description:

text

Language:

English

Author Affiliations:

New York univ., courant inst. mathematical sci., New York NY 10003, United States

ISSN:

1048-776X

Access URL:

http://pascal-francis.inist.fr/vibad/index.php?action=search&terms=2484565

Rights:

Copyright 1997 INIST-CNRS
CC BY 4.0
Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS

Notes:

Sciences of information and communication. Documentation

FRANCIS

Accession Number:

edscal.2484565

Database:

PASCAL Archive

Weitere Informationen

In this paper we report on the recent developments in NYU's natural language information retrieval system especially as related to the 3rd Text Retrieval conference (TREC-3). The main characteristic of this system is the use of advanced natural language processing to enhance the effectiveness of term-based document retrieval. The system is designed around a traditional statiscal backbone consisting of the indexer module, which builds inverted index files from pre-processed documents, and a retrieval engine which searches and ranks the documents in response to user queries. Natural language processing is used to (1) preprocess the documents in order to extract content-carrying terms, (2) discover inter-term dependencies and build a conceptual hierarchy specific to the database domain, and (3) process user's natural language requests into effective search queries. For the present TREC-3 effort, the total of 3.3 GBytes of text articles have been processed (Tipster disks 1 through 3), including material from the Wall Street Journal, the Associated Press newswire, the Federal Register, Ziff Communication's Computer Library, Department of Energy abstract, U.S. Patents and the San Jose Mercury News, totaling more than 500 million words of English. Since the TREC-2 conference, many components of the system have been redesigned to facilitate its scalability to deal with ever increasing amounts of data. In particular, a randomized index-splitting mechanism has been installed which allows the system to create a number of smaller indexes that can be independently searched.

Treffer: Natural language information retrieval : TREC-3 report

Weitere Informationen

Links

Zusatz-Funktionen