High Performance Parallelism Pearls : Multicore and Many-core Programming Approaches / James Reinders, Jim Jeffers, Intel Corporation, United States.
High Performance Parallelism Pearls Volume 2 offers another set of examples that demonstrate how to leverage parallelism. Similar to Volume 1, the techniques included here explain how to use processors and coprocessors with the same programming - illustrating the most effective ways to combine Xeon...
Online Access: | Full Text (via O'Reilly/Safari) |
---|---|
Main Authors: | Reinders, James; Jeffers, Jim |
Format: | eBook |
Language: | English |
Published: | Waltham, MA : Morgan Kaufmann, [2015] |
Edition: | First edition. |
Subjects: | Parallel programming (Computer science); Computer programming |
MARC
LEADER | 00000cam a2200000 i 4500 | ||
---|---|---|---|
001 | b10294682 | ||
006 | m o d | ||
007 | cr ||||||||||| | ||
008 | 141211s2015 maua ob 001 0 eng d | ||
005 | 20240829145017.6 | ||
019 | |a 921845676 |a 1259185271 | ||
020 | |a 9780128021996 | ||
020 | |a 0128021993 | ||
020 | |a 0128021187 | ||
020 | |a 9780128021187 | ||
020 | |a 9780128038901 | ||
020 | |a 012803890X | ||
020 | |a 0128038195 | ||
020 | |a 9780128038192 | ||
020 | |z 9780128021187 | ||
020 | |z 9780128038192 | ||
029 | 1 | |a DEBBG |b BV042490380 | |
029 | 1 | |a DEBBG |b BV043020296 | |
029 | 1 | |a DEBSZ |b 434835595 | |
029 | 1 | |a DEBSZ |b 455698678 | |
029 | 1 | |a GBVCP |b 882744445 | |
029 | 1 | |a GBVCP |b 882843001 | |
035 | |a (OCoLC)safo898036682 | ||
035 | |a (OCoLC)898036682 |z (OCoLC)921845676 |z (OCoLC)1259185271 | ||
037 | |a safo9780128021187 | ||
040 | |a UMI |b eng |e rda |e pn |c UMI |d OCLCF |d DEBBG |d DEBSZ |d NLE |d CEF |d OCLCQ |d AU@ |d OCLCQ |d OCLCO |d OCLCQ |d QGK |d OCLCO |d OCLCL |d OCLCQ | ||
049 | |a GWRE | ||
050 | 4 | |a QA76.642 | |
100 | 1 | |a Reinders, James, |e author. | |
245 | 1 | 0 | |a High Performance Parallelism Pearls : |b Multicore and Many-core Programming Approaches / |c James Reinders, Jim Jeffers, Intel Corporation, United States. |
250 | |a First edition. | ||
264 | 1 | |a Waltham, MA : |b Morgan Kaufmann, |c [2015] | |
264 | 4 | |c ©2015 | |
300 | |a 1 online resource (1 volume) : |b illustrations | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a volume |b nc |2 rdacarrier | ||
347 | |a text file | ||
588 | 0 | |a Online resource; title from title page (Safari, viewed December 3, 2014). | |
504 | |a Includes bibliographical references and indexes. | ||
520 | |a High Performance Parallelism Pearls Volume 2 offers another set of examples that demonstrate how to leverage parallelism. Similar to Volume 1, the techniques included here explain how to use processors and coprocessors with the same programming - illustrating the most effective ways to combine Xeon Phi coprocessors with Xeon and other multicore processors. The book includes examples of successful programming efforts, drawn from across industries and domains such as biomed, genetics, finance, manufacturing, imaging, and more. Each chapter in this edited work includes detailed explanations of t. | ||
505 | 0 | |a Front Cover; High Performance Parallelism Pearls: Multicore and Many-core Programming Approaches; Copyright; Contents; Contributors; Acknowledgments; Foreword; Humongous computing needs: Science years in the making; Open standards; Keen on many-core architecture; Xeon Phi is born: Many cores, excellent vector ISA; Learn highly scalable parallel programming; Future demands grow: Programming models matter; Preface; Inspired by 61 cores: A new era in programming; Chapter 1: Introduction; Learning from successful experiences; Code modernization; Modernize with concurrent algorithms | |
505 | 8 | |a Modernize with vectorization and data locality; Understanding power usage; ISPC and OpenCL anyone?; Intel Xeon Phi coprocessor specific; Many-core, neo-heterogeneous; No "Xeon Phi" in the title, neo-heterogeneous programming; The future of many-core; Downloads; Chapter 2: From "Correct" to "Correct & Efficient": A Hydro2D Case Study with Godunov's Scheme; Scientific computing on contemporary computers; Modern computing environments; CEA's Hydro2D; A numerical method for shock hydrodynamics; Euler's equation; Godunov's method; Where it fits; Features of modern architectures | |
505 | 8 | |a Performance-oriented architecture; Programming tools and runtimes; Our computing environments; Paths to performance; Running Hydro2D; Hydro2D's structure; Computation scheme; Data structures; Measuring performance; Optimizations; Memory usage; Thread-level parallelism; Arithmetic efficiency and instruction-level parallelism; Data-level parallelism; Summary; The coprocessor vs the processor; A rising tide lifts all boats; Performance strategies; Chapter 3: Better Concurrency and SIMD on HBM; The application: HIROMB-BOOS-Model; Key usage: DMI; HBM execution profile | |
505 | 8 | |a Overview for the optimization of HBM; Data structures: Locality done right; Thread parallelism in HBM; Data parallelism: SIMD vectorization; Trivial obstacles; Premature abstraction is the root of all evil; Results; Profiling details; Scaling on processor vs. coprocessor; Contiguous attribute; Summary; References; Chapter 4: Optimizing for Reacting Navier-Stokes Equations; Getting started; Version 1.0: Baseline; Version 2.0: ThreadBox; Version 3.0: Stack memory; Version 4.0: Blocking; Version 5.0: Vectorization; Intel Xeon Phi coprocessor results; Summary | |
505 | 8 | |a Chapter 5: Plesiochronous Phasing Barriers; What can be done to improve the code?; What more can be done to improve the code?; Hyper-Thread Phalanx; What is nonoptimal about this strategy?; Coding the Hyper-Thread Phalanx; How to determine thread binding to core and HT within core?; The Hyper-Thread Phalanx hand-partitioning technique; A lesson learned; Back to work; Data alignment; Use aligned data when possible; Redundancy can be good for you; The plesiochronous phasing barrier; Let us do something to recover this wasted time; A few "left to the reader" possibilities | |
546 | |a English. | ||
650 | 0 | |a Parallel programming (Computer science) | |
650 | 0 | |a Computer programming. | |
650 | 7 | |a Computer programming |2 fast | |
650 | 7 | |a Parallel programming (Computer science) |2 fast | |
700 | 1 | |a Jeffers, Jim |c (Computer engineer), |e author. |1 https://id.oclc.org/worldcat/entity/E39PCjyGcqMBt36Gx4QRD8JPpK | |
776 | |z 0-12-802118-7 | ||
856 | 4 | 0 | |u https://go.oreilly.com/UniOfColoradoBoulder/library/view/~/9780128021187/?ar |z Full Text (via O'Reilly/Safari) |
856 | 4 | 0 | |u https://go.oreilly.com/UniOfColoradoBoulder/library/view/~/9780128038901/?ar |z Full Text (via O'Reilly/Safari) |
915 | |a - | ||
956 | |a O'Reilly-Safari eBooks | ||
956 | |b O'Reilly Online Learning: Academic/Public Library Edition | ||
994 | |a 92 |b COD | ||
998 | |b Subsequent record output | ||
999 | f | f | |i fd187f6e-b70c-5205-9a98-fd6742301c9d |s 206b4d8d-89a6-542c-9caf-bcf677fca4a8 |
952 | f | f | |p Can circulate |a University of Colorado Boulder |b Online |c Online |d Online |e QA76.642 |h Library of Congress classification |i web |n 1 |