Treffer: Parallel implementation of information retrieval clustering models

Title:

Parallel implementation of information retrieval clustering models

Authors:

Source:

High performance computing for computational science (Valencia, 28-30 june 2004, revised selected and invited papers)Lecture notes in computer science. :129-141

Publisher Information:

Berlin: Springer, 2005.

Publication Year:

2005

Physical Description:

print, 13 ref

Original Material:

INIST-CNRS

Subject Terms:

Computer science, Informatique, Sciences exactes et technologie, Exact sciences and technology, Sciences appliquees, Applied sciences, Informatique; automatique theorique; systemes, Computer science; control theory; systems, Logiciel, Software, Systèmes informatiques et systèmes répartis. Interface utilisateur, Computer systems and distributed systems. User interface, Algorithme k moyenne, K means algorithm, Algoritmo k media, Algorithme parallèle, Parallel algorithm, Algoritmo paralelo, Analyse amas, Cluster analysis, Analisis cluster, Calcul réparti, Distributed computing, Cálculo repartido, Classification, Clasificación, Haute performance, High performance, Alto rendimiento, Information utile, Useful information, Información útil, Information électronique, Electronic information, Información electronica, Internet, Modélisation, Modeling, Modelización, Parallélisation, Parallelization, Paralelisacíon, Performance algorithme, Algorithm performance, Resultado algoritmo, Recherche information, Information retrieval, Búsqueda información, Système information, Information system, Sistema información

Document Type:

Konferenz Conference Paper

File Description:

text

Language:

English

Author Affiliations:

Department of Computer Systems and Computation, Polytechnic University of Valencia, Spain

ISSN:

0302-9743

Access URL:

http://pascal-francis.inist.fr/vibad/index.php?action=search&terms=16895477

Rights:

Copyright 2005 INIST-CNRS
CC BY 4.0
Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS

Notes:

Computer science; theoretical automation; systems

Accession Number:

edscal.16895477

Database:

PASCAL Archive

Weitere Informationen

Information Retrieval (IR) is fundamental nowadays, and more since the appearance of the Internet and huge amount of information in electronic format. All this information is not useful unless its search is efficient and effective. With large collections parallelization is important because the data volume is enormous. Hence, usually, only one computer is not sufficient to manage all data, and more in a reasonable time. The parallelization also is important because in many situations the document collection is already distributed and its centralization is not a good idea. This is the reason why we present parallel algorithms in information retrieval systems. We propose two parallel clustering algorithms: α-Bisecting K-Means and α-Bisecting Spherical K-Means. Moreover, we have prepared a set of experiments to compare the computation performance of the algorithms. These studies have been accomplished in a cluster of PCs with 20 bi-processor nodes and two different collections.

Treffer: Parallel implementation of information retrieval clustering models

Weitere Informationen

Links

Zusatz-Funktionen