2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Items from 1 to 20 out of 68 results

book

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

IEEE

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

chapter

Radiation-Induced Error Criticality in Modern HPC Parallel Accelerators

Daniel Alfonso Goncalves De Oliveira, Laercio Lima Pilla, Mauricio Hanzich, Vinicius Fratin, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 577 - 588

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

In this paper, we evaluate the error criticality of radiation-induced errors on modern High-Performance Computing~(HPC) accelerators (Intel Xeon Phi and NVIDIA K40) through a dedicated set of metrics. We show that, as long as imprecise computing is concerned, the simple mismatch detection is not sufficient to evaluate and compare the radiation sensitivity of HPC devices and algorithms. Our analysis...

chapter

Pilot Register File: Energy Efficient Partitioned Register File for GPUs

Mohammad Abdel-Majeed, Alireza Shafaei, Hyeran Jeon, Massoud Pedram, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 589 - 600

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

GPU adoption for general purpose computing hasbeen accelerating. To support a large number of concurrentlyactive threads, GPUs are provisioned with a very large registerfile (RF). The RF power consumption is a critical concern. Oneoption to reduce the power consumption dramatically is touse near-threshold voltage(NTV) to operate the RF. However, operating MOSFET devices at NTV is fraught with stabilityand...

chapter

KAML: A Flexible, High-Performance Key-Value SSD

Yanqin Jin, Hung-Wei Tseng, Yannis Papakonstantinou, Steven Swanson

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 373 - 384

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Modern solid state drives (SSDs) unnecessarily confine host programs to the conventional block I/O interface, leading to suboptimal performance and resource under-utilization. Recent attempts to replace or extend this interface with a key-value-oriented interface and/or built-in support for transactions offer some improvements, but the details of their implementations make them a poor match for many...

chapter

Cold Boot Attacks are Still Hot: Security Analysis of Memory Scramblers in Modern Processors

Salessawi Ferede Yitbarek, Misiker Tadesse Aga, Reetuparna Das, Todd Austin

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 313 - 324

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Previous work has demonstrated that systems with unencrypted DRAM interfaces are susceptible to cold boot attacks – where the DRAM in a system is frozen to give it sufficient retention time and is then re-read after reboot, or is transferred to an attacker's machine for extracting sensitive data. This method has been shown to be an effective attack vector for extracting disk encryption keys out of...

chapter

Cooperative Path-ORAM for Effective Memory Bandwidth Sharing in Server Settings

Rujia Wang, Youtao Zhang, Jun Yang

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 325 - 336

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Path ORAM (Oblivious RAM) is a recently proposed ORAM protocol for preventing information leakage from memory access sequences. It receives wide adoption due to its simplicity, practical efficiency and asymptotic efficiency. However, Path ORAM has extremely large memory bandwidth demand, leading to severe memory competition in server settings, e.g., a server may service one application that uses Path...

chapter

Design and Evaluation of AWGR-Based Photonic NoC Architectures for 2.5D Integrated High Performance Computing Systems

Paolo Grani, Roberto Proietti, Venkatesh Akella, S. J. Ben Yoo

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 289 - 300

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

In future performance improvement of the basic building block of supercomputers has to come through increased integration enabled by 3D (vertical) and 2.5D (horizontal) die-stacking. But to take advantage of this integration we need an interconnection network between the memory and compute die that not only can provide an order of magnitude higher bandwidth but also consume an order of magnitude less...

chapter

Secure Dynamic Memory Scheduling Against Timing Channel Attacks

Yao Wang, Benjamin Wu, G. Edward Suh

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 301 - 312

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

This paper presents SecMC, a secure memory controller that provides efficient memory scheduling with a strong quantitative security guarantee against timing channel attacks. The first variant, named SecMC-NI, eliminates timing channels while allowing a tight memory schedule by interleaving memory requests that access different banks or ranks. Experimental results show that SecMC-NI significantly (45%...

chapter

A Split Cache Hierarchy for Enabling Data-Oriented Optimizations

Andreas Sembrant, Erik Hagersten, David Black-Schaffer

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 133 - 144

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Today's caches tightly couple data with metadata (Address Tags) at the cache line granularity. The co-location of data and its identifying metadata means that they require multiple approaches to locate data (associative way searches and level-by-level searches), evict data (coherent writebacks buffers and associative level-by-level searches) and keep data coherent (directory indirections and associative...

chapter

SWAP: Effective Fine-Grain Management of Shared Last-Level Caches with Minimum Hardware Support

Xiaodong Wang, Shuang Chen, Jeff Setter, Jose F. Martinez

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 121 - 132

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Performance isolation is an important goal in server-class environments. Partitioning the last-level cache of a chip multiprocessor (CMP) across co-running applications has proven useful in this regard. Two popular approaches are (a) hardware support for way partitioning, or (b) operating system support for set partitioning through page coloring. Unfortunately, neither approach by itself is scalable...

chapter

Defect Analysis and Cost-Effective Resilience Architecture for Future DRAM Devices

Sanguhn Cha, Seongil O, Hyunsung Shin, Sangjoon Hwang, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 61 - 72

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Technology scaling has continuously improved the density, performance, energy efficiency, and cost of DRAM-based main memory systems. Starting from sub-20nm processes, however, the industry began to pay considerably higher costs to screen and manage notably increasing defective cells. The traditional technique, which replaces the rows/columns containing faulty cells with spare rows/columns, has been...

chapter

Vulnerabilities in MLC NAND Flash Memory Programming: Experimental Analysis, Exploits, and Mitigation Techniques

Yu Cai, Saugata Ghose, Yixin Luo, Ken Mai, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 49 - 60

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Modern NAND flash memory chips provide high density by storing two bits of data in each flash cell, called a multi-level cell (MLC). An MLC partitions the threshold voltage range of a flash cell into four voltage states. When a flash cell is programmed, a high voltage is applied to the cell. Due to parasitic capacitance coupling between flash cells that are physically close to each other, flash cell...

chapter

Architecting an Energy-Efficient DRAM System for GPUs

Niladrish Chatterjee, Mike OConnor, Donghyuk Lee, Daniel R. Johnson, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 73 - 84

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

This paper proposes an energy-efficient, high-throughput DRAM architecture for GPUs and throughput processors. In these systems, requests from thousands of concurrent threads compete for a limited number of DRAM row buffers. As a result, only a fraction of the data fetched into a row buffer is used, leading to significant energy overheads. Our proposed DRAM architecture exploits the hierarchical organization...

chapter

Enabling Effective Module-Oblivious Power Gating for Embedded Processors

Hari Cherupalli, Henry Duwe, Weidong Ye, Rakesh Kumar, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 157 - 168

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

The increasingly-stringent power and energy requirements of emerging embedded applications have led to a strong recent interest in aggressive power gating techniques. Conventional techniques for aggressive power gating perform module-based power gating in processors, where power domains correspond to RTL modules. We observe that there can be significant power benefits from module-oblivious power gating,...

chapter

BRAVO: Balanced Reliability-Aware Voltage Optimization

Karthik Swaminathan, Nandhini Chandramoorthy, Chen-Yong Cher, Ramon Bertran, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 97 - 108

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Defining a processor micro-architecture for a targeted productspace involves multi-dimensional optimization across performance, power and reliability axes. A key decision in sucha definition process is the circuit-and technology-driven parameterof the nominal (voltage, frequency) operating point. This is a challenging task, since optimizing individually orpair-wise amongst these metrics usually results...

chapter

Controlled Kernel Launch for Dynamic Parallelism in GPUs

Xulong Tang, Ashutosh Pattnaik, Huaipan Jiang, Onur Kayiran, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 649 - 660

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Dynamic parallelism (DP) is a promising feature for GPUs, which allows on-demand spawning of kernels on the GPU without any CPU intervention. However, this feature has two major drawbacks. First, the launching of GPU kernels can incur significant performance penalties. Second, dynamically-generated kernels are not always able to efficiently utilize the GPU cores due to hardware-limits. To address...

chapter

Exploring Hyperdimensional Associative Memory

Mohsen Imani, Abbas Rahimi, Deqian Kong, Tajana Rosing, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 445 - 456

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Brain-inspired hyperdimensional (HD) computing emulates cognition tasks by computing with hypervectors as an alternative to computing with numbers. At its very core, HD computing is about manipulating and comparing large patterns, stored in memory as hypervectors: the input symbols are mapped to a hypervector and an associative search is performed for reasoning and classification. For every classification...

chapter

Program Committee

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > xiv - xv

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Provides a listing of current committee members and society officers.

chapter

SoftMC: A Flexible and Practical Open-Source Infrastructure for Enabling Experimental DRAM Studies

Hasan Hassan, Nandita Vijaykumar, Samira Khan, Saugata Ghose, more

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 241 - 252

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

DRAM is the primary technology used for main memory in modern systems. Unfortunately, as DRAM scales down to smaller technology nodes, it faces key challenges in both data integrity and latency, which strongly affects overall system reliability and performance. To develop reliable and high-performance DRAM-based main memory in future systems, it is critical to characterize, understand, and analyze...

chapter

Near-Ideal Networks-on-Chip for Servers

Pejman Lotfi-Kamran, Mehdi Modarressi, Hamid Sarbazi-Azad

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 277 - 288

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Server workloads benefit from execution on many-core processors due to their massive request-level parallelism. A key characteristic of server workloads is the large instruction footprints. While a shared last-level cache (LLC) captures the footprints, it necessitates a low-latency network-on-chip (NOC) to minimize the core stall time on accesses serviced by the LLC. As strict quality-of-service requirements...

Publication date

Set your own date range

Content availability

Available (67)
None (1)

Keywords

HARDWARE (16)
BANDWIDTH (15)
COMPUTER ARCHITECTURE (15)
RANDOM ACCESS MEMORY (15)
MEMORY MANAGEMENT (9)
SERVERS (8)
GRAPHICS PROCESSING UNITS (7)
PROGRAM PROCESSORS (7)
KERNEL (6)
PERFORMANCE EVALUATION (6)
THROUGHPUT (6)
BENCHMARK TESTING (5)
MEASUREMENT (5)
METADATA (5)
ORGANIZATIONS (5)
SOFTWARE (5)
DRAM (4)
INTERFERENCE (4)
MONITORING (4)
NETWORK TOPOLOGY (4)
PIPELINES (4)
POWER DEMAND (4)
REGISTERS (4)
RESOURCE MANAGEMENT (4)
SECURITY (4)
TOPOLOGY (4)
CLOCKS (3)
COHERENCE (3)
COMPLEXITY THEORY (3)
DELAYS (3)
GPU (3)
INSTRUCTION SETS (3)
MICROARCHITECTURE (3)
MICROPROCESSORS (3)
NON-VOLATILE MEMORY (3)
QUALITY OF SERVICE (3)
RELIABILITY (3)
TIMING (3)
ACCELERATION (2)
ANALYTICAL MODELS (2)
APPROXIMATE COMPUTING (2)
ARRAYS (2)
CACHE (2)
CAPACITORS (2)
COMPUTER CRASHES (2)
CRYPTOGRAPHY (2)
DATABASES (2)
DECODING (2)
ENERGY EFFICIENCY (2)
FLASH MEMORIES (2)
FREQUENCY MODULATION (2)
HYBRID MEMORY CUBE (2)
LOGIC GATES (2)
MEMORY ARCHITECTURE (2)
MULTICORE PROCESSING (2)
NONVOLATILE MEMORY (2)
OPTIMIZATION (2)
PARALLEL PROCESSING (2)
PREFETCHING (2)
PROCESSING-IN-MEMORY (2)
PROPOSALS (2)
PROTOCOLS (2)
REGISTER FILE (2)
RESILIENCE (2)
ROUTING (2)
RUNTIME (2)
SERVER (2)
SUPERCOMPUTERS (2)
SYSTEM RECOVERY (2)
SYSTEM-ON-CHIP (2)
TESTING (2)
THREE-DIMENSIONAL DISPLAYS (2)
THRESHOLD VOLTAGE (2)
TRANSISTORS (2)
3D RENDERING (1)
3D-STACKED MEMORY (1)
ACCELERATOR (1)
ACCESS PARTITIONING (1)
ANDROID (1)
ANDROIDS (1)
ARCHITECTURE (1)
ARRAYED WAVEGUIDE GRATINGS (1)
ASSOCIATIVE MEMORY (1)
ASYMMETRIC DRAM BANK ORGANIZATIONS (1)
ATOMIC LAYER DEPOSITION (1)
ATOMIC MEASUREMENTS (1)
ATOMICITY (1)
BIOLOGICAL NEURAL NETWORKS (1)
BRAIDS (1)
BRANCH-PREDICTOR-DIRECTED PREFETCHING (1)
BTB PREFETCHING (1)
CACHE COHERENCE (1)
CACHE MODELING (1)
CACHE PARTITIONING (1)
CACHE REPLACEMENT (1)
CAPACITANCE (1)
CENTRAL PROCESSING UNIT (1)
CHANNEL ALLOCATION (1)
CIRCUIT FAULTS (1)
CLOUD COMPUTING (1)
more

INFONA - science communication portal

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)