Serviceeinschränkungen vom 12.-22.02.2026 - weitere Infos auf der UB-Homepage

Treffer: An instruction-systolic programmable shader architecture for multi-threaded 3D graphics processing

Title:
An instruction-systolic programmable shader architecture for multi-threaded 3D graphics processing
Source:
Journal of parallel and distributed computing (Print). 70(11):1110-1118
Publisher Information:
Amsterdam: Elsevier, 2010.
Publication Year:
2010
Physical Description:
print, 20 ref
Original Material:
INIST-CNRS
Subject Terms:
Document Type:
Fachzeitschrift Article
File Description:
text
Language:
English
Author Affiliations:
Department of Computer Science, C532, Yonsei University, 134 Shinchon-dong, Seoul, 120-749, Korea, Republic of
Department of Computer Engineering, Sejong University, 98 Kunja-Dong, Kwangjin-Ku, Seoul, 143-747, Korea, Republic of
Department of Computer Science, University of Massachusetts Amherst, MA 01003-4610, United States
ISSN:
0743-7315
Rights:
Copyright 2015 INIST-CNRS
CC BY 4.0
Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS
Notes:
Computer science; theoretical automation; systems
Accession Number:
edscal.23293339
Database:
PASCAL Archive

Weitere Informationen

In order to guarantee both performance and programmability demands in 3D graphics applications, vector and multithreaded SIMD architectures have been employed in recent graphics processing units. This paper introduces a novel instruction-systolic array architecture, which transfers an instruction stream in a pipelined fashion to efficiently share the expensive functional resources of a graphics processor. Specifically, cache misses and dynamic branches can cause additional latencies and complicated management in these parallel architectures. To address this problem, we combine a systolic execution scheme with on-demand warp activation that handles cache miss latency and branch divergence efficiently without significantly increasing hardware resources, either in terms of logic or register space. Simulation indicates that the proposed architecture offers 25% better performance than a traditional SIMD architecture with the same resources, and requires significantly fewer resources to match the performance of a typical modern vector multi-threaded GPU architecture.