2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Items from 1 to 20 out of 68 results

book

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

IEEE

chapter

A Nanosecond–Level Hybrid Table Design for Financial Market Data Generators

Haohuan Fu, Conghui He, Wayne Luk, Weijia Li, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 227 - 234

This paper proposes a hybrid sorted table design for minimizing electronic trading latency, with three main contributions. First, a hierarchical sorted table with two levels, a fast cache table in reconfigurable hardware storing megabytes of data items and a master table in software storing gigabytes of data items. Second, a full set of operations, including insertion, deletion, selection and sorting,...

chapter

[Publisher's information]

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 238

Provides a listing of current committee members and society officers.

chapter

Centaur: A Framework for Hybrid CPU-FPGA Databases

Muhsen Owaida, David Sidler, Kaan Kara, Gustavo Alonso

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 211 - 218

Accelerating relational databases in general and SQL in particular has become an important topic given the challenges arising from large data collections and increasingly complex workloads. Most existing work, however, has been focused on either accelerating a single operator (e.g., a join) or in data reduction along the data path (e.g., from disk to CPU). In this paper we focus instead on the system...

chapter

A Real-Time Embedded FPGA Processor for a Stand-Alone Dual-Mode Assistive Device

Ali Jafari, Maysam Ghovanloo, Tinoosh Mohsenin

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 199

This paper presents a stand-alone Dual-mode Tongue Drive System (sdTDS) which is designed for people with severe disabilities to control their environment using their tongue motion and speech. The sdTDS detects user's tongue motion using a magnetic tracer placed on tongue and an array of magnetic sensors embedded in a wireless headset and at the same time it can capture the user's voice using a small...

chapter

Exploring High Efficiency Hardware Accelerator for the Key Algorithm of Square Kilometer Array Telescope Data Processing

Qian Wu, Yongxin Zhu, Xu Wang, Mengjun Li, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 195

The SKA (Square Kilometer Array) radio telescope under construction will become the largest telescope in the world by integrating the sampled data from a huge number of small antenna nodes in the array to emulate a giant antenna. Due to the limited storage space, the SKA needs to process massive data in real-time, which makes the SKA scientific data processing become a bottleneck of the computational...

chapter

CAPSL: A Tool for Automatic Generation of Hardware Sandboxes for IP Security

Taylor JL Whitaker, Christophe Bobda

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 200

We propose a design flow for automatic generation of hardware sandboxes. Our tool, the Component Authentication Process for Sandboxed Layouts (CAPSL), generates sandboxes capable of detecting trojan activation and nullifying potential damage to a system at run-time. Our approach captures the behavioral properties of non-trusted IPs with formal models that are translated to checker automata and implemented...

chapter

Minimalist Design for Accelerating Convolutional Neural Networks for Low-End FPGA Platforms

Raghid Morcel, Haitham Akkary, Hazem Hajj, Mazen Saghir, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 196

Deep neural networks have gained tremendous attention in both the academic and industrial communities due to their performance in many artificial intelligence applications, particularly in computer vision. However, these algorithms are known to be computationally very demanding for both scoring and model learning applications. State-of-the-art recognition models use tens of millions of parameters...

chapter

Dynamic Module Partitioning for Library Based Placement on Heterogeneous FPGAs

Fubing Mao, Wei Zhang, Bingsheng He, Siew-Kei Lam

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 194

Library based design and IP reuse have been previously proposed to speed up the synthesis for large-scale FPGA designs. However, previous library based design flow faces several unresolved challenges. Firstly, there may result in large waste area between the modules due to the difference in module sizes. While utilizing multiple ratio modules can help to reduce the waste area, pre-synthesis each module...

chapter

ParaDiMe: A Distributed Memory FPGA Router Based on Speculative Parallelism and Path Encoding

Chin Hau Hoo, Akash Kumar

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 172 - 179

The increase in speed and capacity of FPGAs is faster than the development of effective design tools to fully utilize it, and routing of nets remains as one of the most time-consuming stages of the FPGA design flow. While existing works have proposed methods of accelerating routing through parallelization, they are limited by the memory architecture of the system that they target. In this paper, we...

chapter

Fine-Grained Acceleration of Binary Neural Networks Using Intel® Xeon® Processor with Integrated FPGA

Philip Colangelo, Randy Huang, Enno Luebbers, Martin Margala, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 135

Binary weighted networks (BWN) for image classification reduce computation for convolutional neural networks (CNN) from multiply-adds to accumulates with little to no accuracy loss. Hardware architectures such as FPGA can take full advantage of BWN computations because of their ability to express weights represented as 0 and 1 efficiently through customizable logic. In this paper, we present an implementation...

chapter

FP-DNN: An Automated Framework for Mapping Deep Neural Networks onto FPGAs with RTL-HLS Hybrid Templates

Yijin Guan, Hao Liang, Ningyi Xu, Wenqiang Wang, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 152 - 159

DNNs (Deep Neural Networks) have demonstrated great success in numerous applications such as image classification, speech recognition, video analysis, etc. However, DNNs are much more computation-intensive and memory-intensive than previous shallow models. Thus, it is challenging to deploy DNNs in both large-scale data centers and real-time embedded systems. Considering performance, flexibility, and...

chapter

Efficient GPGPU Computing with Cross-Core Resource Sharing and Core Reconfiguration

Ashutosh Dhar, Deming Chen

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 48 - 55

GPUs are capable of running a variety of applications, however their generic parallel-architecture can lead to inefficient use of resources and reduced power efficiency, due to algorithmic or architectural constraints. In this work, taking inspiration from CGRAs (coarse-grained reconfigurable architectures), we demonstrate resource sharing and re-distribution as a solution that can be leveraged by...

chapter

An Architecture for the Acceleration of a Hybrid Leaky Integrate and Fire SNN on the Convey HC-2ex FPGA-Based Processor

Emmanouil Kousanakis, Apostolos Dollas, Euripides Sotiriades, Ioannis Papaefstathiou, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 56 - 63

Neuromorphic computing is expanding by leaps and bounds through custom integrated circuits (digital and analog), and large scale platforms developed by industry or government-funded projects (e.g. TrueNorth and BrainScaleS, respectively). Whereas the trend is for massive parallelism and neuromorphic computation in order to solve problems, such as those that may appear in machine learning and deep...

chapter

Bonded Force Computations on FPGAs

Qingqing Xiong, Martin C. Herbordt

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 72 - 75

While acceleration of Molecular Dynamics has received much attention, a significant part of that application, the bonded force calculation, has not. We present what we believe to be the first description and analysis of bonded force calculations outside of ASICs. We characterize the computational requirements. We find that a naive direct implementation requires FPGA resources out of proportion with...

chapter

FPGA Delay Model Considering Logic-Level and Transistor-Level Parameters

Qiang Liu, Hanjing Qian

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 29

Field programmable gate arrays (FPGAs) have been adopted in various fields, due to the design flexibility and customizability. Different applications have different requirements in performance, hardware resources and cost, leading to demands of diverse FPGA architectures. Delay is an important metric to evaluate different alternatives during FPGA architecture development. The existing analytical delay...

chapter

An FPGA Design Framework for CNN Sparsification and Acceleration

Sicheng Li, Wei Wen, Yu Wang, Song Han, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 28

Convolutional neural networks (CNNs) have recently broken many performance records in image recognition and object detection problems. The success of CNNs, to a great extent, is enabled by the fast scaling-up of the networks that learn from a huge volume of data. The deployment of big CNN models can be both computation-intensive and memory-intensive, leaving severe challenges to hardware implementations...

chapter

On Bit-Serial NoCs for FPGAs

Nachiket Kapre

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 32 - 39

We can build lightweight bit-serial FPGA NoC routers that cost 20 LUT, 17 FF per router and operate at 800–900 MHz speeds. Each bit-serial router implements deflection-routing on a unidirectional torus topology requiring 1b-wide connection per port. The key ideas that enable this implementation are (1) reformulation of the dimension-ordered routing (DOR) function using compact 1 LUT, 1 FF streaming pattern...

chapter

Terabyte Sort on FPGA-Accelerated Flash Storage

Sang-Woo Jun, Shuotao Xu, Arvind

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 17 - 24

Sorting is one of the most fundamental and useful applications in computer science, and continues to be an important tool in analyzing large datasets. An important and challenging subclass of sorting problems involves sorting terabyte scale datasets with hundreds of billions of records. The conventional method of sorting such large amounts of data is to distribute the data and computation over a cluster...

chapter

Scalable Network Function Virtualization for Heterogeneous Middleboxes

Xuzhi Zhang, Xiaozhe Shao, George Provelengios, Naveen Kumar Dumpala, et al.

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 219 - 226

Over the past decade, a wide-ranging collection of network functions in middleboxes has been used to accommodate the needs of network users. Although the use of general-purpose processors has been shown to be feasible for this purpose, the serial nature of microprocessors limits network functional virtualization (NFV) performance. In this paper, we describe a new heterogeneous hardware-software approach...
