Advances in GPU research and practice / edited by Hamid Sarbazi-Azad.

Advances in GPU Research and Practice focuses on research and practices in GPU based systems. The topics treated cover a range of issues, ranging from hardware and architectural issues, to high level issues, such as application systems, parallel programming, middleware, and power and energy issues....

Full description

Saved in:
Bibliographic Details
Online Access: Full Text (via ScienceDirect)
Other Authors: Sarbazi-Azad, Hamid (Editor)
Format: eBook
Language:English
Published: Cambridge, MA : Morgan Kaufmann, [2017]
Series:Emerging trends in computer science & applied computing.
Subjects:

MARC

LEADER 00000cam a2200000ui 4500
001 b9057888
003 CoU
005 20170311071113.6
006 m o d
007 cr |||||||||||
008 160924s2017 mau o 000 0 eng d
019 |a 960211532  |a 962007727 
020 |a 9780128037881  |q (electronic bk.) 
020 |a 0128037881  |q (electronic bk.) 
020 |z 9780128037386 
035 |a (OCoLC)scd962064304 
035 |a (OCoLC)961309123  |z (OCoLC)960211532  |z (OCoLC)962007727 
040 |a YDX  |b eng  |e rda  |e pn  |c YDX  |d OCLCO  |d UMI  |d STF  |d OCLCQ  |d TOH  |d MERUC  |d IDEBK  |d DEBBG  |d OPELS  |d YDX  |d N$T  |d MERER  |d UPM 
049 |a GWRE 
050 4 |a T385  |b .A38 2017 
245 0 0 |a Advances in GPU research and practice /  |c edited by Hamid Sarbazi-Azad. 
264 1 |a Cambridge, MA :  |b Morgan Kaufmann,  |c [2017] 
300 |a 1 online resource (776 pages) 
336 |a text  |b txt  |2 rdacontent. 
337 |a computer  |b c  |2 rdamedia. 
338 |a online resource  |b cr  |2 rdacarrier. 
490 1 |a Emerging Trends in Computer Science and Applied Computing. 
505 0 |a Front Cover; Advances in GPU Research and Practice; Copyright; Dedication; Contents; List of Contributors; Preface; Acknowledgments; Part 1: Programming and tools; Chapter 1: Formal analysis techniques for reliable GPU programming: current solutions and call to action; 1 GPUs in Support of Parallel Computing; Bugs in parallel and GPU code; 2 A quick introduction to GPUs; Organization of threads; Memory spaces; Barrier synchronization; Warps and lock-step execution; Dot product example; 3 Correctness issues in GPU programming; Data races; Lack of forward progress guarantees. 
505 8 |a Floating-point accuracy4 The need for effective tools; 4.1 A Taxonomy of Current Tools; 4.2 Canonical Schedules and the Two-Thread Reduction; Race freedom implies determinism; Detecting races: ̀̀all for one and one for all''; Restricting to a canonical schedule; Reduction to a pair of threads; 4.3 Symbolic Bug-Finding Case Study: GKLEE; 4.4 Verification Case Study: GPUVerify; 5 Call to Action; GPUs will become more pervasive; Current tools show promise; Solving basic correctness issues; Equivalence checking; Clarity from vendors and standards bodies; User validation of tools; Acknowledgments. 
504 |a ReferencesChapter 2: SnuCL: A unified OpenCL framework for heterogeneous clusters; 1 Introduction; 2 OpenCL; 2.1 Platform Model; 2.2 Execution Model; 2.3 Memory Model; 2.4 Synchronization; 2.5 Memory Consistency; 2.6 OpenCL ICD; 3 Overview of SnuCL framework; 3.1 Limitations of OpenCL; 3.2 SnuCL CPU; 3.3 SnuCL Single; 3.4 SnuCL Cluster; 3.4.1 Processing synchronization commands; 4 Memory management in SnuCL Cluster; 4.1 Space Allocation to Memory Objects; 4.2 Minimizing Copying Overhead; 4.3 Processing Memory Commands; 4.4 Consistency Management. 
505 8 |a 4.5 Detecting Memory Objects Written by a Kernel5 SnuCL extensions to OpenCL; 6 Performance evaluation; 6.1 Evaluation Methodology; 6.2 Performance; 6.2.1 Scalability on the medium-scale GPU cluster; 6.2.2 Scalability on the large-scale CPU cluster; 7 Conclusions; Acknowledgments; References; Chapter 3: Thread communication and synchronization on massively parallel GPUs; 1 Introduction; 2 Coarse-Grained Communication and Synchronization; 2.1 Global Barrier at the Kernel Level; 2.2 Local Barrier at the Work-Group Level; 2.3 Implicit Barrier at the Wavefront Level. 
505 8 |a 3 Built-In Atomic Functions on Regular Variables4 Fine-Grained Communication and Synchronization; 4.1 Memory Consistency Model; 4.1.1 Sequential consistency; 4.1.2 Relaxed consistency; 4.2 The OpenCL 2.0 Memory Model; 4.2.1 Relationships between two memory operations; 4.2.2 Special atomic operations and stand-alone memory fence; 4.2.3 Release and acquire semantics; 4.2.4 Memory order parameters; 4.2.5 Memory scope parameters; 5 Conclusion and Future Research Direction; References; Chapter 4: Software-level task scheduling on GPUs; 1 Introduction, Problem Statement, and Context. 
500 |a 2 Nondeterministic behaviors caused by the hardware. 
520 |a Advances in GPU Research and Practice focuses on research and practices in GPU based systems. The topics treated cover a range of issues, ranging from hardware and architectural issues, to high level issues, such as application systems, parallel programming, middleware, and power and energy issues. Divided into six parts, this edited volume provides the latest research on GPU computing. Part I: Architectural Solutions focuses on the architectural topics that improve on performance of GPUs, Part II: System Software discusses OS, compilers, libraries, programming environment, languages, and paradigms that are proposed and analyzed to help and support GPU programmers. Part III: Power and Reliability Issues covers different aspects of energy, power, and reliability concerns in GPUs. Part IV: Performance Analysis illustrates mathematical and analytical techniques to predict different performance metrics in GPUs. Part V: Algorithms presents how to design efficient algorithms and analyze their complexity for GPUs. Part VI: Applications and Related Topics provides use cases and examples of how GPUs are used across many sectors. 
588 0 |a Online resource; title from digital title page (viewed on October 27, 2016) 
650 0 |a Graphics processing units  |x Programming. 
650 0 |a Computer graphics. 
700 1 |a Sarbazi-Azad, Hamid,  |e editor. 
776 0 8 |i Print version:  |a Azad, Hamid Sarbazi.  |t Advances in GPU Research and Practice.  |d Saint Louis : Elsevier Science, ©2016  |z 9780128037386. 
830 0 |a Emerging trends in computer science & applied computing. 
856 4 0 |u https://colorado.idm.oclc.org/login?url=http://www.sciencedirect.com/science/book/9780128037386  |z Full Text (via ScienceDirect) 
907 |a .b90578880  |b 11-29-21  |c 02-22-17 
998 |a web  |b 03-23-17  |c f  |d b   |e -  |f eng  |g mau  |h 0  |i 2 
956 |a ScienceDirect ebooks 
956 |b ScienceDirect All Books 
999 f f |i 0e2f8019-878c-5fa9-802d-ee6c81aa56ad  |s a5b38c1d-20f8-52ec-9c9e-99a7304be146 
952 f f |p Can circulate  |a University of Colorado Boulder  |b Online  |c Online  |d Online  |e T385 .A38 2017  |h Library of Congress classification  |i web  |n 1