Advanced search

From:

To:

Items from 1 to 20 out of 57 results

chapter

End-to-end scalable FPGA accelerator for deep residual networks

Yufei Ma, Minkyu Kim, Yu Cao, Sarma Vrudhula, more

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

2017 IEEE International Symposium on Circuits and Systems (ISCAS)

This work presents an efficient hardware accelerator design of deep residual learning algorithms, which have shown superior image recognition accuracy (>90% top-5 accuracy on ImageNet database). Two key objectives of the acceleration strategy are to (1) maximize resource utilization and minimize data movements, and (2) employ scalable and reusable computing primitives to optimize physical design...

chapter

Stream-dataflow acceleration

Tony Nowatzki, Vinay Gangadhar, Newsha Ardalani, Karthikeyan Sankaralingam

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 416 - 429

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA)

Demand for low-power data processing hardware continues to rise inexorably. Existing programmable and “general purpose” solutions (eg. SIMD, GPGPUs) are insufficient, as evidenced by the order-of-magnitude improvements and industry adoption of application and domain-specific accelerators in important areas like machine learning, computer vision and big data. The stark tradeoffs between efficiency...

chapter

FPGA-based HW/SW co-simulation system for mixed-signal circuits

A. Fernandez-Alvarez, M. Portela-Garcia, M. Garcia-Valderas

2016 Conference on Design of Circuits and Integrated Systems (DCIS) > 1 - 6

2016 Conference on Design of Circuits and Integrated Systems (DCIS)

The integration of mixed signal circuits in Systems on Chip is a trend in modern systems and applications with important challenges. In particular, the simulation of this kind of systems is a very time-consuming process that is becoming more and more complex due to the size of current designs. This paper describes a HW/SW co-simulation environment for mixed-signal circuits. The analog components are...

chapter

Ouessant: Microcontroller approach for flexible accelerator integration and control in System-on-Chip

Pierre-Henri Horrein, Benoit Porteboeuf, Andre Lalevee

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

When designing hardware accelerators for System on Chips, hardware and software integration can quickly become difficult. Heterogeneity in the interfaces prevents developers from efficiently using available hardware. In this paper, we propose an improved microcontroller approach to Intellectual Property (IP) core integration in System on Chips. This approach is based on an instruction set designed...

chapter

Task parallel programming model + hardware acceleration = performance advantage

Tamer Dallou, Divino Cesar Soares Lucas, Guido Araujo, Lucas Morais, more

2016 IEEE Hot Chips 28 Symposium (HCS) > 1

2016 IEEE Hot Chips 28 Symposium (HCS)

This article consists only of a collection of slides from the author's conference presentation.

chapter

A system for vehicle collision and rollover detection

Hamdy A. Ibrahim, Ahmed K. Aly, Behrouz H. Far

2016 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE) > 1 - 6

2016 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)

Traffic accidents negatively affect the lives of human beings. Accidents may result in deaths, severe injuries, and loss of income to the impacted families. Accident detection and prevention is a keystone in improving road safety. In this paper, a system for detecting vehicle collision and rollover is presented. The proposed system includes three key phases. Data acquisition where accelerometer and...

chapter

Energy efficient video fusion with heterogeneous CPU-FPGA devices

Peng Sun, Alin Achim, Ian Hasler, Paul Hill, more

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1399 - 1404

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE)

This paper presents a complete video fusion system with hardware acceleration and investigates the energy trade-offs between computing in the CPU or the FPGA device. The video fusion application is based on the Dual-Tree Complex Wavelet Transforms (DT-CWT). Video fusion combines information from different spectral bands into a single representation and advanced algorithms based on wavelet transforms...

chapter

A Data Layout Transformation (DLT) accelerator: Architectural support for data movement optimization in accelerated-centric heterogeneous systems

Tung Thanh-Hoang, Amirali Shambayati, Andrew A. Chien

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1489 - 1492

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Technology scaling and growing use of accelerators make optimization of data movement of increasing importance in all computing systems. Further, growing diversity in memory structures makes embedding such optimization in software non-portable. We propose a novel architectural solution called Data Layout Transformation (DLT) associated with a simple set of instructions that enable software to describe...

chapter

Transforming VHDL descriptions into formal component-based models

Ayoub Nouri, Rahma Ben Atitallah, Anca Molnos, Christian Fabre, more

2016 International Symposium on Rapid System Prototyping (RSP) > 1 - 8

2016 International Symposium on Rapid System Prototyping (RSP)

In this work, we investigate a transformation of VHDL descriptions into equivalent formal models. The targeted equivalence is at the level of the functional behavior. That is, we aim at producing formal models that have the same functional simulation behavior as the original VHDL implementation. We rely on the BIP component-based modeling language as the underlying formalism for this transformation...

chapter

Vectorised SIMD implementations of morphology algorithms

Michael J. Cree

2015 International Conference on Image and Vision Computing New Zealand (IVCNZ) > 1 - 6

2015 International Conference on Image and Vision Computing New Zealand (IVCNZ)

We explore vectorised implementations, exploiting single instruction multiple data (SIMD) CPU instructions on commonly used architectures, of three efficient algorithms for morphological dilation and erosion. We discuss issues specific to SIMD implementation and describe how they guide algorithm choice. We compare our implementations to a commonly used opensource SIMD accelerated machine vision library...

chapter

Malicious hypervisor and hidden virtualization of operation systems

Anton Sergeev, Victor Minchenkov, Vladimir Bashun

2015 9th International Conference on Application of Information and Communication Technologies (AICT) > 178 - 182

2015 9th International Conference on Application of Information and Communication Technologies (AICT)

Today virtualization technology is the focus of many new potential threats and introduces new security challenges that we must meet. The key problem is that malware can utilize the virtualization techniques of modern CPUs for “hidden virtualization” (invisible for user): to execute as a hypervisor and transform the working operation system (OS) into a “guest” state. In this work we analyzed and compared...

chapter

Decision tree ensemble hardware accelerators for embedded applications

R. Struharik

2015 IEEE 13th International Symposium on Intelligent Systems and Informatics (SISY) > 101 - 106

2015 IEEE 13th International Symposium on Intelligent Systems and Informatics (SISY)

This paper presents four different architectures for the hardware acceleration of axis-parallel, oblique and non-linear decision tree ensemble classifier systems. Hardware architectures for the implementation of a number of ensemble combination rules are also presented. The proposed architectures are optimized for size, making them particularly interesting for embedded applications where the size...

article

Architecture Support for Tightly-Coupled Multi-Core Clusters with Shared-Memory HW Accelerators

Masoud Dehyadegari, Andrea Marongiu, Mohammad Reza Kakoee, Siamak Mohammadi, more

IEEE Transactions on Computers > 2015 > 64 > 8 > 2132 - 2144

Coupling processors with acceleration hardware is an effective manner to improve energy efficiency of embedded systems. Many-core is nowadays a dominating design paradigm for SoCs, which opens new challenges and opportunities for designing HW blocks. Exploring acceleration solutions that naturally fit into well-established parallel programming models and that can be incrementally added on top of existing...

chapter

Customizable Heterogeneous Acceleration for Tomorrow's High-Performance Computing

Alessandro Cilardo, Jose Flich, Mirko Gagliardi, Rafael T. Gavila

2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems > 1181 - 1185

2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS) and 2015 IEEE 12th International Conf on Embedded Software and Systems (ICESS)

High-performance computing as we know it today is experiencing unprecedented changes, encompassing all levels from technology to use cases. This paper explores the adoption of customizable, deeply heterogeneous manycore systems for future QoS-sensitive and power-efficient high-performance computing. At the heart of the proposed architecture is a NoC-based manycore system embracing medium-end CPUs,...

chapter

Does arithmetic logic dominate data movement? a systematic comparison of energy-efficiency for FFT accelerators

Tung Thanh-Hoang, Amirali Shambayati, Henry Hoffmann, Andrew A. Chien

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 66 - 67

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

In this paper, we perform a systematic comparison to study the energy cost of varying data formats and data types w.r.t. arithmetic logic and data movement for accelerator-based heterogeneous systems in which both compute-intensive (FFT accelerator) and data-intensive accelerators (DLT accelerator) are added. We explore evaluation for a wide range of design processes (e.g. 32nm bulk-CMOS and projected...

chapter

SIMAAH: RTL simulation accelerator for complex SoC's

Ipsita Biswas Mahapatra, Santhi Natarajan, Nalesh S, S. K. Nandy

2015 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT) > 1 - 6

2015 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)

The EDA industry has recently witnessed the growing popularity of densely populated, IP rich SoC designs targeting high performance computing platforms. Such SoCs require effective logic simulation, with high levels of accuracy and throughput, for a fault free design and faster time to market. Hardware-Assisted Simulation (HAS) is the appropriate choice, while simulating such designs. Existing HAS...

chapter

A brief evaluation of Intel®MPX

Christian W. Otterstad

2015 Annual IEEE Systems Conference (SysCon) Proceedings > 1 - 7

2015 9th Annual IEEE International Systems Conference (SysCon)

MPX implements hardware accelerated support for detection and prevention of memory corruption. This paper will examine the effectiveness of MPX. Herein we attempt to find false positives and false negatives, and to determine what attacks may still be feasible. In particular we wish to see if a system protected by MPX is still exploitable. Intel MPX appears to provide a solid mitigation technique,...

chapter

A case for Core-Assisted Bottleneck Acceleration in GPUs: Enabling flexible data compression with assist warps

Nandita Vijaykumar, Gennady Pekhimenko, Adwait Jog, Abhishek Bhowmick, more

2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA) > 41 - 53

2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA)

Modern Graphics Processing Units (GPUs) are well provisioned to support the concurrent execution of thousands of threads. Unfortunately, diUerent bottlenecks during execution and heterogeneous application requirements create imbalances in utilization of resources in the cores. For example, when a GPU is bottlenecked by the available oU-chip memory bandwidth, its computational resources are often overwhelmingly...

chapter

Modified line algorithm based on ORGFX

Lu Zheng, De-Xue Zhang, Feng-Yu Xiao, Wen Chen, more

2014 11th International Computer Conference on Wavelet Actiev Media Technology and Information Processing(ICCWAMTIP) > 42 - 45

2014 11th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)

In the open hardware graphics accelerator (ORGFX), there are rectangle, line, triangle and curve rasterization modules. This paper is only focused on the improvement of line rasterization speed. Besides modifying the algorithm itself, hardware implementation and resource consumption are put into consideration here. Originally, ORGFX uses the classic Bresenham line algorithm with high precision and...

chapter

IPPro: FPGA based image processing processor

Fahad Manzoor Siddiqui, Matthew Russell, Burak Bardak, Roger Woods, more

2014 IEEE Workshop on Signal Processing Systems (SiPS) > 1 - 6

2014 IEEE Workshop on Signal Processing Systems (SiPS)

The paper presents IPPro which is a high performance, scalable soft-core processor targeted for image processing applications. It has been based on the Xilinx DSP48E1 architecture using the ZYNQ Field Programmable Gate Array and is a scalar 16-bit RISC processor that operates at 526MHz, giving 526MIPS of performance. Each IPPro core uses 1 DSP48, 1 Block RAM and 330 Kintex-7 slice-registers, thus...

Keywords:
HARDWARE
REGISTERS
ACCELERATION

Publication date

Set your own date range

INFONA - science communication portal

Advanced search

Advanced search

End-to-end scalable FPGA accelerator for deep residual networks

Stream-dataflow acceleration

FPGA-based HW/SW co-simulation system for mixed-signal circuits

Ouessant: Microcontroller approach for flexible accelerator integration and control in System-on-Chip

Task parallel programming model + hardware acceleration = performance advantage

A system for vehicle collision and rollover detection

Energy efficient video fusion with heterogeneous CPU-FPGA devices

A Data Layout Transformation (DLT) accelerator: Architectural support for data movement optimization in accelerated-centric heterogeneous systems

Transforming VHDL descriptions into formal component-based models

Vectorised SIMD implementations of morphology algorithms

Malicious hypervisor and hidden virtualization of operation systems

Decision tree ensemble hardware accelerators for embedded applications

Architecture Support for Tightly-Coupled Multi-Core Clusters with Shared-Memory HW Accelerators

Customizable Heterogeneous Acceleration for Tomorrow's High-Performance Computing

Does arithmetic logic dominate data movement? a systematic comparison of energy-efficiency for FFT accelerators

SIMAAH: RTL simulation accelerator for complex SoC's

A brief evaluation of Intel®MPX

A case for Core-Assisted Bottleneck Acceleration in GPUs: Enabling flexible data compression with assist warps

Modified line algorithm based on ORGFX

IPPro: FPGA based image processing processor

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Advanced search

Advanced search

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options