2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Items from 1 to 20 out of 35 results

book

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

ACM

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

chapter

Multi-Optimization power management for chip multiprocessors

Ke Meng, Russ Joseph, Robert P. Dick, Li Shang

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 177 - 186

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

The emergence of power as a first-class design constraint has fueled the proposal of a growing number of run-time power optimizations. Many of these optimizations trade-off power saving opportunity for a variable performance loss which depends on application characteristics and program phase. Furthermore, the potential benefits of these optimizations are sometimes non-additive, and it can be difficult...

chapter

(How) can programmers conquer the multicore menace?

Saman Amarasinghe

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 133

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

The document was not made available for publication as part of the conference proceedings.

chapter

Skewed redundancy

Gordon B. Bell, Mikko H. Lipasti

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 62 - 71

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Technology scaling in integrated circuits has consistently provided dramatic performance improvements in modern microprocessors. However, increasing device counts and decreasing on-chip voltage levels have made transient errors a first-order design constraint that can no longer be ignored. Several proposals have provided fault detection and tolerance through redundantly executing a program on an additional...

chapter

Prediction models for multi-dimensional power-performance optimization on many cores

Matthew Curtis-Maury, Ankur Shah, Filip Blagojevic, Dimitrios S. Nikolopoulos, more

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 250 - 259

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Power has become a primary concern for HPC systems. Dynamic voltage and frequency scaling (DVFS) and dynamic concurrency throttling (DCT) are two software tools (or knobs) for reducing the dynamic power consumption of HPC systems. To date, few works have considered the synergistic integration of DVFS and DCT in performance-constrained systems, and, to the best of our knowledge, no prior research has...

chapter

Multi-mode energy management for multi-tier server clusters

Tibor Horvath, Kevin Skadron

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 270 - 279

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

This paper presents an energy management policy for reconfigurable clusters running a multi-tier application, exploiting DVS together with multiple sleep states. We develop a theoretical analysis of the corresponding power optimization problem and design an algorithm around the solution. Moreover, we rigorously investigate selection of the optimal number of spare servers for each power state, a problem...

chapter

Analysis and approximation of optimal co-scheduling on Chip Multiprocessors

Yunlian Jiang, Xipeng Shen, Chen Jie, Rahul Tripathi

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 220 - 229

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Cache sharing among processors is important for Chip Multiprocessors to reduce inter-thread latency, but also brings cache contention, degrading program performance considerably. Recent studies have shown that job co-scheduling can effectively alleviate the contention, but it remains an open question how to efficiently find optimal co-schedules. Solving the question is critical for determining the...

chapter

Adaptive insertion policies for managing shared caches

Aamer Jaleel, William Hasenplaugh, Moinuddin Qureshi, Julien Sebot, more

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 208 - 219

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Chip Multiprocessors (CMPs) allow different applications to concurrently execute on a single chip. When applications with differing demands for memory compete for a shared cache, the conventional LRU replacement policy can significantly degrade cache performance when the aggregate working set size is greater than the shared cache. In such cases, shared cache performance can be significantly improved...

chapter

Author index

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 315

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

chapter

COMIC: A coherent shared memory interface for cell BE

Jaejin Lee, Sangmin Seo, Chihun Kim, Junghyun Kim, more

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 303 - 314

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

The Cell BE processor is a heterogeneous multicore that contains one PowerPC Processor Element (PPE) and eight Synergistic Processor Elements (SPEs). Each SPE has a small software-managed local store. Applications must explicitly control all DMA transfers of code and data between the SPE local stores and the main memory, and they must perform any coherence actions required for data transferred. The...

chapter

Hybrid access-specific software cache techniques for the cell BE architecture

Marc Gonzalez, Nikola Vujic, Xavier Martorell, Eduard Ayguade, more

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 292 - 302

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Ease of programming is one of the main impediments for the broad acceptance of multi-core systems with no hardware support for transparent data transfer between local and global memories. Software cache is a robust approach to provide the user with a transparent view of the memory architecture; but this software approach can suffer from poor performance. In this paper, we propose a hierarchical, hybrid...

chapter

Title pages

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > c1

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

chapter

MCAMP: Communication optimization on Massively Parallel Machines with hierarchical scratch-pad memory

Hiroshige Hayashizaki, Yutaka Sugawara, Mary Inaba, Kei Hiraki

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 102 - 111

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Massively parallel machines that integrate a large number of simple processors and small scratch-pad memories (SPMs) into a single chip can achieve a high peak performance per watt of power. In these machines, communication optimizations are important because the communication bandwidth tends to be a bottleneck. Previously proposed communication optimizations using copy candidates, which have been...

chapter

Feature selection and policy optimization for distributed instruction placement using reinforcement learning

Katherine E. Coons, Behnam Robatmili, Matthew E. Taylor, Betrand A. Maher, more

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 32 - 42

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Communication overheads are one of the fundamental challenges in a multiprocessor system. As the number of processors on a chip increases, communication overheads and the distribution of computation and data become increasingly important performance factors. Explicit Dataflow Graph Execution (EDGE) processors, in which instructions communicate with one another directly on a distributed substrate,...

chapter

GPU evolution: Will graphics morph into compute?

Norm Rubin

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 1

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

chapter

Mars: A MapReduce Framework on graphics processors

Bingsheng He, Wenbin Fang, Qiong Luo, Naga K. Govindaraju, more

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 260 - 269

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

We design and implement Mars, a MapReduce framework, on graphics processors (GPUs). MapReduce is a distributed programming framework originally proposed by Google for the ease of development of web search applications on a large number of commodity CPUs. Compared with CPUs, GPUs have an order of magnitude higher computation power and memory bandwidth, but are harder to program since their architectures...

chapter

Leveraging on-chip networks for data cache migration in chip multiprocessors

Noel Eisley, Li-Shiuan Peh, Li Shang

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 197 - 207

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Recently, chip multiprocessors (CMPs) have arisen as the de facto design for modern high-performance processors, with increasing core counts. An important property of CMPs is that remote, but on-chip, L2 cache accesses are less costly than off-chip accesses; this is in contrast to earlier chip-to-chip or board-to-board multiprocessors, where an access to a remote node is just as costly if not more...

chapter

Distributed Cooperative Caching

Enric Herrero, Jose Gonzalez, Ramon Canal

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 134 - 143

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

This paper presents the Distributed Cooperative Caching, a scalable and energy-efficient scheme to manage chip multiprocessor (CMP) cache resources. The proposed configuration is based in the Cooperative Caching framework [3] but it is intended for large scale CMPs. Both centralized and distributed configurations have the advantage of combining the benefits of private and shared caches. In our proposal,...

chapter

Scalable and reliable communication for hardware transactional memory

Seth H. Pugsley, Manu Awasthi, Niti Madan, Naveen Muralimanohar, more

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 144 - 154

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

In a hardware transactional memory system with lazy versioning and lazy conflict detection, the process of transaction commit can emerge as a bottleneck. This is especially true for a large-scale distributed memory system where multiple transactions may attempt to commit simultaneously and co-ordination is required before allowing commits to proceed in parallel. In this paper, we propose novel algorithms...

chapter

Profiler and compiler assisted adaptive I/O prefetching for shared storage caches

Seung Woo Son, Sai Prashanth Muralidhara, Ozcan Ozturk, Mahmut Kandemir, more

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 112 - 121

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

I/O prefetching has been employed in the past as one of the mechanisms to hide large disk latencies. However, I/O prefetching in parallel applications is problematic when multiple CPUs share the same set of disks due to the possibility that prefetches from different CPUs can interact on shared memory caches in the I/O nodes in complex and unpredictable ways. In this paper, we (i) quantify the impact...

Publication date

Set your own date range

Content availability

Available (34)
None (1)

Keywords

PROGRAM PROCESSORS (12)
MULTICORE PROCESSING (11)
HARDWARE (10)
OPTIMIZATION (9)
ALGORITHM DESIGN AND ANALYSIS (5)
COHERENCE (5)
COMPUTER ARCHITECTURE (5)
INSTRUCTION SETS (5)
REGISTERS (5)
PROGRAMMING (4)
RUNTIME (4)
GRAPHICS PROCESSING UNITS (3)
MEMORY MANAGEMENT (3)
PARALLEL PROCESSING (3)
PIPELINES (3)
PROPOSALS (3)
RADIATION DETECTORS (3)
SOFTWARE (3)
SYSTEM-ON-CHIP (3)
THROUGHPUT (3)
BANDWIDTH (2)
BENCHMARK TESTING (2)
CACHE STORAGE (2)
CHIP MULTIPROCESSORS (2)
COMPUTER SCIENCE (2)
DYNAMIC POWER MANAGEMENT (2)
DYNAMIC SCHEDULING (2)
ENERGY EFFICIENCY (2)
GENERATORS (2)
MATHEMATICAL MODEL (2)
MEMORY HIERARCHY (2)
MICROARCHITECTURE (2)
OPENMP (2)
PARALLEL PROGRAMMING (2)
PARTITIONING ALGORITHMS (2)
PREFETCHING (2)
PROTOCOLS (2)
REDUNDANCY (2)
RELIABILITY (2)
SCALABILITY (2)
SCHEDULES (2)
SHAPE (2)
SWITCHES (2)
SYNCHRONIZATION (2)
ADAPTATION MODELS (1)
ADAPTIVE (1)
ALGORITHMS FOR TRANSACTION COMMIT (1)
ANALYTICAL MODEL (1)
ANALYTICAL MODELS (1)
ANIMATION (1)
APPROXIMATION ALGORITHMS (1)
ARMCO (1)
ARRAYS (1)
BENCHMARK SUITE (1)
BOOLEAN FUNCTIONS (1)
BUFFER STORAGE (1)
CACHE COHERENCE (1)
CACHE CONTENTION (1)
CACHE PARTITIONING (1)
CACHE RESIZING (1)
CATHODE RAY TUBES (1)
CELL BE (1)
CHIP-MULTIPROCESSOR (1)
CHIPMULTI-PROCESSOR (1)
CIRCUIT FAULTS (1)
CMP (1)
CMP SCHEDULING (1)
CO-SCHEDULING (1)
COARSE-GRAINED RECONFIGURABLE ARCHITECTURE (1)
COMMUNICATION CODE GENERATION (1)
COMPILER (1)
COMPILER HEURISTICS (1)
COMPILER OPTIMIZATIONS (1)
COMPUTATIONAL MODELING (1)
COMPUTERS (1)
CONCURRENT COMPUTING (1)
CONTEXT (1)
CONTEXT MODELING (1)
CONTEXT SWITCH MISSES (1)
COOPERATIVE CACHING (1)
COPY CANDIDATES (1)
CRITICAL THREADS (1)
DATA PARALLELISM (1)
DATA REUSE (1)
DATA VISUALIZATION (1)
DEGRADATION (1)
DISTANCE MEASUREMENT (1)
DISTRIBUTED COOPERATIVE CACHING (1)
DISTRIBUTED PROCESSING (1)
DYNAMIC VOLTAGE SCALING (1)
ELECTRONICS PACKAGING (1)
END-TO-END LATENCY (1)
ENERGY CONSUMPTION (1)
ENERGY MANAGEMENT (1)
ENERGY-AWARE (1)
ENGINES (1)
ERROR TOLERANCE (1)
EXPRESSION OPTIMIZATION (1)
FACE (1)
FAULT TOLERANCE (1)
more

INFONA - science communication portal

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)