2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

chapter

A Key Size Configurable High Speed RSA Coprocessor

E Castillo, J Castillo, J Cano, P Huerta, more

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 250

The RSA algorithm is the most used public key cipher algorithm. Since it was presented in 1977, designers have proposed several architectures and implementations to improve its performance using different techniques and devices. In all cases, Montgomery's modular multiplication algorithm is the best option when trying to design an efficient RSA. In this paper, we present a high-radix Montgomery's...

chapter

Hybrid Data Structure for IP Lookup in Virtual Routers Using FPGAs

O Erdem, Hoang Le, V K Prasanna, C F Bazlamaçci

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 253

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

This paper makes the following contributions: 1) A compact trie representation and a hybrid data structure for IP lookup that reduces the memory consumption while eliminating backtracking. 2) A merging algorithm that eliminates leaf pushing and simplifies table updates in virtual routers.

chapter

Extending Force-Directed Scheduling with Explicit Parallel and Timed Constructs for High-Level Synthesis

R Sinha, H D Patel

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 214 - 217

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

This work extends force-directed scheduling (FDS) to support specification constructs that express parallelism and timing behaviours. We select the FDS algorithm because it maximizes the amount of resource sharing, and it naturally supports constructs for parallelism. However, timed constructs are not supported. As a result, we propose timed FDS (TFDS) that optimizes over parallel, timed and untimed...

chapter

Programming Real-Time Autofocus on a Massively Parallel Reconfigurable Architecture Using Occam-pi

Zain-ul-Abdin, A Ahlander, B Svensson

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 194 - 201

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

Recently we proposed occam-pi as a high-level language for programming massively parallel reconfigurable architectures. The design of occam-pi incorporates ideas from CSP and pi-calculus to facilitate expressing parallelism and reconfigurability. The feasability of this approach was illustrated by building three occam-pi implementations of DCT executing on an Ambric. However, because DCT is a simple...

chapter

Title Page i

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > i

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

chapter

Program Committee

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > xiii - xiv

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

chapter

Cover Art

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > C1

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

chapter

Panel Session Summary

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > xvii

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

chapter

A Sparse Matrix Personality for the Convey HC-1

K K Nagar, J D Bakos

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 1 - 8

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

In this paper we describe a double precision floating point sparse matrix-vector multiplier (SpMV) and its performance as implemented on a Convey HC-1 reconfigurable computer. The primary contributions of this work are a novel streaming reduction architecture for floating point accumulation, a novel on-chip cache optimized for streaming compressed sparse row (CSR) matrices, and end-to-end integration...

chapter

Hecto-Scale Frame Rate Face Detection System for SVGA Source on FPGA Board

Zheng Ding, Feng Zhao, Tinghui Wang, Wei Shu, more

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 37 - 40

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

This paper proposes techniques for face detection and gives the implementation details for an FPGA development board. We analyze and discuss the relation between the system computation cost and selection of the image scaling factor. We give a new method to select the stop threshold for the image reduction process, which reduces the total computation by half. We also provide a color image output mode...

chapter

Implementation and Performance Comparison of the Motion Compensation Kernel of the AVS Video Decoder on FPGA, GPU and Multicore Processors

M Owaida, N Bellas, C D Antonopoulos, K Daloukas, more

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 255

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

Next generation video standards have strict and increasing performance demands due to real-time requirements and the trend towards higher frame resolutions and bit rates. Leveraging the advantages of reconfigurable logic and emerging multi-core processor architectures to exploit all levels of parallelism of such workloads is necessary to achieve real time functionality at a reasonable cost.

chapter

FPGA Architecture of Generalized Laguerre-Volterra MIMO Model for Neural Population Spiking Activities

W X Y Li, R C C Cheung, Wei Zhang, R H M Chan, more

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 254

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

We present a parallelized and pipelined architecture for a generalized Laguerre-Volterra MIMO system to identify the time-varying neural dynamics underlying spike activities. The proposed architecture consists of a first stage containing a vector convolution and MAC (Multiply and Accumulation) component, a second stage containing a pre-threshold potential updating unit with an error approximation...

chapter

Reconsideration of Computing Paradigms and a Novel Reconfigurable Architecture

Ming Yan, Ziyu Yang, Sikun Li, Liu Yang

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 252

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

With the on-chip resources largely increased, the computing paradigm of modern architectures are much different from traditional ones. The relation between temporal computing and spatial computing is getting more and more intricate. In this paper, a coarse-grained reconfigurable architecture is proposed named as programmable dataflow computing architecture: ProDFA. With both fine-grained control ability...

chapter

A Model for Peak Matrix Performance on FPGAs

C Y Lin, H K So, P H W Leong

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 251

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

Computations involving matrices form the kernel of a large spectrum of computationally demanding applications for which FPGAs have actively been utilized as accelerators. The performances of such matrix operations on FPGAs are related to underlying architectural parameters such as computational resources, memory and I/O bandwidth. A model that gives bounds on the peak performance of matrix-vector...

chapter

Using Functional Programming to Generate an LDPC Forward Error Corrector

A Gill, T Bull, D DePardo, A Farmer, more

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 133 - 140

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

FPGAs as commodities offer a resource for high-performance computation that is unmatched in flexibility and price/performance. As a lab, we are interested in high-level descriptions of computation and data, and how they may be customized to map effectively on FPGA fabrics. This paper describes our tool-chain, approach and methodology to FPGA utilization. We give a case study of the generation of a...

chapter

FPGA-Based Solid-State Drive Prototyping Platform

Yu Cai, E F Haratsch, M McCartney, Ken Mai

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 101 - 104

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

NAND flash memory has been widely used for data storage due to its high density, high throughput, low cost, and low power. However, as flash memory manufacturers scale to smaller process technologies and store more bits per cell, the reliability and endurance of flash memory are decreasing. Wear-leveling and error correction coding can significantly improve both reliability and endurance, but finding...

chapter

Accelerating Statistical LOR Estimation for a High-Resolution PET Scanner Using FPGA Devices and a High Level Synthesis Tool

Zhong-Ho Chen, A W Y Su, Ming-Ting Sun, S Hauck

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 105 - 108

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

In this paper, we use an FPGA platform and a high level synthesis tool, called Impulse C, to speedup a statistical Line Of Reaction (LOR) estimation for a high-resolution Positron Emission Tomography (PET) scanner. The estimation algorithm provides a significant improvement over conventional methods, but the execution time is too long to be practical for clinic applications. Impulse C allows us to...

chapter

HMFlow: Accelerating FPGA Compilation with Hard Macros for Rapid Prototyping

C Lavin, M Padilla, J Lamprecht, P Lundrigan, more

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 117 - 124

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

The FPGA compilation process (synthesis, map, place, and route) is a time consuming task that severely limits designer productivity. Compilation time can be reduced by saving implementation data in the form of hard macros. Hard macros consist of previously synthesized, placed and routed circuits that enable rapid design assembly because of the native FPGA circuitry (primitives and nets)which they...

chapter

Automatic HDL-Based Generation of Homogeneous Hard Macros for FPGAs

S Korf, D Cozzi, M Koester, J Hagemeyer, more

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 125 - 132

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

The regularity of resources found in FPGAs is a unique feature, which can be utilized in a number of applications, e.g., in timing critical applications or applications with a demand for homogeneous routing. Current synthesis tools do not support an automatic generation of homogeneous FPGA designs, such that a time-consuming hand-crafted design is required. We present a tool flow, which automatically...

INFONA - science communication portal

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

A Key Size Configurable High Speed RSA Coprocessor

Hybrid Data Structure for IP Lookup in Virtual Routers Using FPGAs

Extending Force-Directed Scheduling with Explicit Parallel and Timed Constructs for High-Level Synthesis

Programming Real-Time Autofocus on a Massively Parallel Reconfigurable Architecture Using Occam-pi

Title Page i

Program Committee

Cover Art

Panel Session Summary

A Sparse Matrix Personality for the Convey HC-1

Hecto-Scale Frame Rate Face Detection System for SVGA Source on FPGA Board

Implementation and Performance Comparison of the Motion Compensation Kernel of the AVS Video Decoder on FPGA, GPU and Multicore Processors

FPGA Architecture of Generalized Laguerre-Volterra MIMO Model for Neural Population Spiking Activities

Reconsideration of Computing Paradigms and a Novel Reconfigurable Architecture

A Model for Peak Matrix Performance on FPGAs

Table of Contents

Using Functional Programming to Generate an LDPC Forward Error Corrector

FPGA-Based Solid-State Drive Prototyping Platform

Accelerating Statistical LOR Estimation for a High-Resolution PET Scanner Using FPGA Devices and a High Level Synthesis Tool

HMFlow: Accelerating FPGA Compilation with Hard Macros for Rapid Prototyping

Automatic HDL-Based Generation of Homogeneous Hard Macros for FPGAs

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)