With the advent of the big data era, the trend of unsustainable power consumption and large memory-bandwidth demands in massively parallel multicore systems has prompted alternative computation paradigms that exploit heterogeneity, specialization, processor-in-memory, and approximation. Approximate Computing is being touted as a viable solution for high performance computation by relaxing...
Caches are traditionally organized as a rigid hierarchy, with multiple levels of progressively larger and slower memories. Hierarchy allows a simple, fixed design to benefit a wide range of applications, since working sets settle at the smallest (i.e., fastest and most energy-efficient) level they fit in. However, rigid hierarchies also add overheads, because each level adds latency and energy even...
There has been a growing trend in recent years to outsource various aspects of the semiconductor design and manufacturing flow to different parties spread across the globe. Such outsourcing increases the risk of adversaries adding malicious logic, referred to as hardware Trojans, to the original design. In this paper, we introduce a run-time hardware Trojan detection method for microprocessor cores...
Multicore architectures are increasingly becoming prone to transient faults. In this paper we present Shield, a middleware to provide transactional applications with resiliency to those faults that can happen anytime during the execution of a processor but do not cause any hardware interruption. Shield is inspired by the state machine replication approach, where computational resources are partitioned,...
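The state machine replication idea behind Shield can be illustrated with a minimal sketch (ours, not the paper's implementation): a deterministic operation runs on several logical replicas, and a majority vote on the results masks a transient fault that silently corrupts one replica's output. The `fault_injector` hook is a hypothetical device for modeling such a fault.

```python
# Minimal sketch of state machine replication with majority voting.
# Not the Shield middleware; an illustration of the general approach.
from collections import Counter

def replicated_apply(op, state, replicas=3, fault_injector=None):
    """Run op(state) on `replicas` logical copies and majority-vote.

    fault_injector(i, result) optionally corrupts replica i's result,
    modeling a transient fault that flips an output without crashing.
    """
    results = []
    for i in range(replicas):
        r = op(state)
        if fault_injector is not None:
            r = fault_injector(i, r)
        results.append(r)
    winner, votes = Counter(results).most_common(1)[0]
    if votes <= replicas // 2:
        raise RuntimeError("no majority: too many faulty replicas")
    return winner

# A transient bit-flip in one replica is outvoted by the other two.
flip_one = lambda i, r: r ^ 0x1 if i == 1 else r
print(replicated_apply(lambda s: s * 2, 21, fault_injector=flip_one))  # 42
```

The key property, as in the abstract, is that the fault never causes a hardware interruption: it is detected purely by comparing replica outputs.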
Today, the rapid improvement of process technology and the arrival of new embedded systems with high-performance requirements have shifted the prevailing trend in processor manufacturing from single-core to multi-core processors. This trend has raised several challenges for reliability in safety-critical systems that operate in high-risk environments, making them more vulnerable...
In today's high performance computing (HPC) environments, analyzing and predicting the performance of multiple-processor systems (clusters of cores) on critical workloads remains a challenge, as a result of the many metrics that influence system behavior. Bursty arrivals in HPCs demand either a shared-memory parallel architecture or a pipelined dataflow architecture. At present, a processor model...
Virtualization technology is well established in the server and desktop spaces, and has been spreading across the embedded systems market. This technology allows the coexistence and execution of multiple operating systems on top of the same hardware platform, with proven technological and economic benefits. Hardware extensions for easing virtualization have been added to several commercial off-the-shelf...
Traditionally GPUs focused on streaming, data-parallel applications, with little data reuse or sharing and coarse-grained synchronization. However, the rise of general-purpose GPU (GPGPU) computing has made GPUs desirable for applications with more general sharing patterns and fine-grained synchronization, especially for recent GPUs that have a unified address space and coherent caches. Prior work...
High performance computing systems will need to operate with certain power budgets while maximizing performance in the exascale era. Such systems are built with power aware components, whose collective peak power may exceed the specified power budget. Cluster level power bounded computing addresses this power challenge by coordinating power among components within compute nodes and further adjusting...
The need for faster and more energy efficient computing has led us to the multicore era with distributed shared memory hierarchies. The primary goal is to distribute parallel tasks onto multiple processing elements to collectively achieve shorter execution times at lower frequencies and supply voltages when compared to a single-core architecture. Major challenges of this approach are how to achieve...
Embedded multi-core processors improve performance significantly and are desirable in many application-fields. This in particular includes safety-critical real-time systems, which typically require a deterministic temporal behavior. However, even tasks without dependencies running on different cores can interfere due to, sometimes hidden, shared hardware resources, such as common memories or buses...
In real-time and safety-critical systems, the move towards multicores is becoming unavoidable in order to keep pace with the increasing required processing power and to meet the high integration trend while maintaining a reasonable power consumption. However, multicore platforms may not deliver the expected benefit, and real-time constraints can easily be violated. Indeed, an efficient...
In embedded systems there is a class of Multicore System on Chip devices (MSoC devices) in which not all the computing elements (processor cores) are equal. The differences between the cores of these devices range from different hardware architectures sharing the same instruction set to completely different processors working together inside the same device. These SoCs are called “Asymmetric Multi Processing...
We describe our approach to extend the BEAGLE library for high-performance statistical phylogenetic inference (maximum likelihood estimation and Bayesian analysis) in order to support a wider range of modern accelerators and multicore CPUs, and present the corresponding performance results from these platforms. Our solution includes a shared code design providing a uniform interface for a variety...
In this study, we develop a thermal-aware job scheduling strategy called tDispatch tailored for MapReduce applications running on Hadoop clusters. The scheduling idea of tDispatch is motivated by a profiling study of CPU-intensive and I/O-intensive jobs from the perspective of thermal efficiency. More specifically, we investigate the thermal behaviors of these two types of jobs running on a Hadoop...
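The thermal-aware dispatch idea can be sketched as follows. This is a hedged illustration under our own assumptions, not the tDispatch algorithm: CPU-intensive jobs, which heat a node quickly, go to the coolest eligible node, while I/O-intensive jobs go to the warmest node still under a temperature cap, evening out cluster temperatures.

```python
# Hypothetical thermal-aware dispatcher (names and policy are ours):
# route "cpu" jobs to the coolest node and "io" jobs to the warmest
# node that is still below the temperature cap.
def dispatch(job_kind, node_temps, cap=75.0):
    """Return the index of the chosen node, or None if all are too hot."""
    eligible = [i for i, t in enumerate(node_temps) if t < cap]
    if not eligible:
        return None                       # throttle: no node may take work
    if job_kind == "cpu":
        return min(eligible, key=lambda i: node_temps[i])  # coolest node
    return max(eligible, key=lambda i: node_temps[i])      # warmest node

temps = [68.0, 72.5, 61.2]
print(dispatch("cpu", temps))  # 2 (coolest node)
print(dispatch("io", temps))   # 1 (warmest eligible node)
```

A real scheduler would profile jobs to classify them, as the abstract describes, rather than take the job kind as a given label.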
Power is a critical factor that limits the performance and scalability of modern high performance computer systems. Considering power as a first-order constraint and a scarce system resource, power-bounded computing represents a new perspective to address the power challenge in HPC. In this work we present an application-aware, multi-dimensional power allocation framework to support power-bounded parallel...
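The flavor of power-bounded allocation can be shown with a minimal sketch (our own simplification, not the paper's framework): each component first receives its minimum power, and the remaining budget is spread by weight, clipped at each component's peak.

```python
# Simplified power-budget allocation across node components.
# Assumed policy, not the paper's: minimum first, then weighted share,
# clipped at each component's peak power.
def allocate_power(budget, components):
    """components: list of (min_w, peak_w, weight) tuples; returns watts."""
    alloc = [c[0] for c in components]
    remaining = budget - sum(alloc)
    assert remaining >= 0, "budget below sum of minimum power levels"
    total_weight = sum(c[2] for c in components)
    for i, (mn, peak, w) in enumerate(components):
        alloc[i] = min(peak, mn + remaining * w / total_weight)
    return alloc

# CPU, GPU, DRAM as (min, peak, weight) under a 200 W node budget.
print(allocate_power(200, [(40, 120, 2), (30, 100, 1), (20, 40, 1)]))
```

Note that power freed by clipping (the DRAM hits its 40 W peak here) is not redistributed in this sketch; an application-aware framework would reassign it to the components that benefit most.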
Recent developments in multicore technology have enabled processors with hundreds or thousands of cores. However, on such multicore processors, an efficient hardware cache coherence scheme becomes very complex and expensive to develop. This paper proposes a parallelizing-compiler-directed software coherence scheme for shared memory multicore systems without hardware cache coherence control...
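The general mechanism of software coherence can be sketched in a few lines (a hedged illustration with our own names, not the paper's scheme): without hardware coherence, the compiler inserts explicit write-backs before a synchronization release and cache self-invalidations after an acquire, so cores observe each other's updates at synchronization points.

```python
# Toy model of software-managed coherence: each core has a private,
# incoherent cache; write-back on release and self-invalidate on acquire
# stand in for the operations a parallelizing compiler would insert.
class Core:
    def __init__(self, shared):
        self.shared = shared   # main memory: addr -> value
        self.cache = {}        # private cache, never kept coherent

    def load(self, addr):
        if addr not in self.cache:
            self.cache[addr] = self.shared[addr]
        return self.cache[addr]

    def store(self, addr, value):
        self.cache[addr] = value           # stays private until release

    def release(self):                     # inserted before an unlock
        self.shared.update(self.cache)     # write back dirty lines

    def acquire(self):                     # inserted after a lock
        self.cache.clear()                 # self-invalidate stale lines

mem = {"flag": 0}
producer, consumer = Core(mem), Core(mem)
consumer.load("flag")                # consumer caches the old value, 0
producer.store("flag", 1)
producer.release()
consumer.acquire()                   # without this, the stale 0 survives
print(consumer.load("flag"))         # 1
```

The compiler's job, per the abstract, is to place these operations only where the parallelization analysis shows sharing actually occurs.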
In this paper, we explore the pessimistic voltage guardbands of two multicore x86-64 microprocessor chips that belong to different microarchitectures (one ultra-low power and one high-performance microprocessor), when programs are executed on individual cores of the CPU chips. We also examine the energy and temperature gains as positive effects of lowering the voltage in both chips while preserving...
The architectures of large-scale Internet servers are becoming more complex each year in order to store and process a large amount of Internet data (Big Data) as efficiently as possible. One consequence of this continually growing complexity is that individual servers consume a significant amount of energy even when they are idle. In this paper we experimentally investigate the scope and usefulness...
Transactional memory (TM) promises to make parallel programming easier. Many hardware implementations of transactional memory (HTM) have been proposed to improve performance, but they still incur overheads when a transaction either commits or aborts. We have therefore been developing a novel HTM design, called Delayed-Committing TM (DCTM), which enables transactions of arbitrary size...
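The commit/abort costs the abstract refers to can be seen in a simple software sketch of buffered (lazy-versioning) transactions. DCTM itself is a hardware design and is not shown here; this is only an illustration of why commit must publish a write buffer atomically while abort merely discards it.

```python
# Toy lazy-versioning transaction: writes go to a private buffer and
# are published at commit; abort discards the buffer with no rollback.
class Transaction:
    def __init__(self, memory):
        self.memory = memory      # shared memory: addr -> value
        self.writes = {}          # private write buffer

    def read(self, addr):
        # Read-your-own-writes, else fall through to shared memory.
        return self.writes.get(addr, self.memory.get(addr))

    def write(self, addr, value):
        self.writes[addr] = value

    def commit(self):
        self.memory.update(self.writes)   # publish buffered writes
        self.writes = {}

    def abort(self):
        self.writes = {}                  # nothing in memory to undo

mem = {"x": 1}
t = Transaction(mem)
t.write("x", 2)
print(mem["x"])   # 1: the write is still buffered
t.commit()
print(mem["x"])   # 2: published at commit
```

In hardware the publish step is the expensive part for large transactions, which is presumably what a delayed-committing design targets.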