Search results

Items from 1 to 20 out of 633 results

chapter

Research and design of add-based length-scalable dual-field modular multiplication-addition-subtraction

Jiamin Li, Zibin Dai, Wei Li, Suwen Yi, more

2017 2nd IEEE International Conference on Integrated Circuits and Microsystems (ICICM) > 48 - 52

2017 2nd IEEE International Conference on Integrated Circuits and Microsystems (ICICM)

Modular multiplication, addition, and subtraction being the core operation of Elliptic curve public(ECC) system, the decrease of area and the merging of structure have been a hot topic in recent years. This paper first analyzes the difference between multiplication type and addition type of modular multiplier. Then, Combined with the structural characteristics of the modular adder, and mixing modular...

chapter

Designing a High-Throughput Pipeline for Digitizing Pinned Insects

Mark Hereld, Nicola J. Ferrier, Nitin Agarwal, Petra Sierwald

2017 IEEE 13th International Conference on e-Science (e-Science) > 542 - 550

2017 IEEE 13th International Conference on e-Science (e-Science)

This paper presents the design and prototyping of hardware and software to address the problem of rapid and reliable 3D digitization of very large collections of pinned insects. Using the collection at the Field Museum of Natural History (FMNH) as a use case, a pipeline to ingest the entire collection of 4.5 million specimens in circa 1-2 years imposes a few second limit on average processing time...

chapter

In situ video encoding of floating-point volume data using special-purpose hardware for a posteriori rendering and analysis

Nick Leaf, Bob Miller, Kwan-Liu Ma

2017 IEEE 7th Symposium on Large Data Analysis and Visualization (LDAV) > 64 - 73

2017 IEEE 7th Symposium on Large Data Analysis and Visualization (LDAV)

Scientific simulations typically store only a small fraction of computed timesteps due to storage and I/O bandwidth limitations. Previous work has demonstrated the compressibility of floating-point volume data, but such compression often comes with a tradeoff between computational complexity and the achievable compression ratio. This work demonstrates the use of special-purpose video encoding hardware...

chapter

A compilation method for zero overhead loop in DSPs with VLIW

Rui Chang, Jun Wu, Haoqi Ren

2017 9th International Conference on Wireless Communications and Signal Processing (WCSP) > 1 - 7

2017 9th International Conference on Wireless Communications and Signal Processing (WCSP)

The increasing use of digital signal processors (DSPs) in wireless communications and signal processing necessitates the optimization of compilers to support special hardware features. In this paper, we propose a compiler transformation method for zero overhead loop (ZOL). It supports very long instruction word (VLIW), internal branches and the loops whose iterative times are known at runtime and...

chapter

Encrypted computing: Speed, security and provable obfuscation against insiders

Peter T. Breuer, Jonathan P. Bowen, Esther Palomar, Zhiming Liu

2017 International Carnahan Conference on Security Technology (ICCST) > 1 - 6

2017 International Carnahan Conference on Security Technology (ICCST)

Over the past few years we have articulated theory that describes ‘encrypted computing’, in which data remains in encrypted form while being worked on inside a processor, by virtue of a modified arithmetic. The last two years have seen research and development on a standards-compliant processor that shows that near-conventional speeds are attainable via this approach. Benchmark performance with the...

chapter

Exploring computation-communication tradeoffs in camera systems

Amrita Mazumdar, Thierry Moreau, Sung Kim, Meghan Cowan, more

2017 IEEE International Symposium on Workload Characterization (IISWC) > 177 - 186

2017 IEEE International Symposium on Workload Characterization (IISWC)

Cameras are the defacto sensor. The growing demand for real-time and low-power computer vision, coupled with trends towards high-efficiency heterogeneous systems, has given rise to a wide range of image processing acceleration techniques at the camera node and in the cloud. In this paper, we characterize two novel camera systems that use acceleration techniques to push the extremes of energy and performance...

chapter

Toward a programmable FIB caching architecture

Garegin Grigoryan, Yaoqing Liu

2017 IEEE 25th International Conference on Network Protocols (ICNP) > 1 - 2

2017 IEEE 25th International Conference on Network Protocols (ICNP)

The current Internet routing ecosystem is neither sustainable nor economical. More than 711K IPv4 routes and more than 41K IPv6 routes exist in current global Forwarding Information Base (FIBs) with growth rates increasing. This rapid growth has serious consequences, such as creating the need for costly FIB memory upgrades and increased potential for Internet service outages. And while FIB memories...

chapter

Optimising packet forwarding in multi-tenant networks using rule compilation

Stefan Hommes, Petko Valtchev, Khalil Blaiech, Salaheddine Hamadi, more

2017 IEEE 16th International Symposium on Network Computing and Applications (NCA) > 1 - 9

2017 IEEE 16th International Symposium on Network Computing and Applications (NCA)

Packet forwarding in Software-Defined Networks (SDN) relies on a centralised network controller which enforces network policies expressed as forwarding rules. Rules are deployed as sets of entries into network device tables. With heterogeneous devices, deployment is strongly bounded by the respective table constraints (size, lookup time, etc.) and forwarding pipelines. Hence, minimising the overall...

chapter

Reconfiguring the Imaging Pipeline for Computer Vision

Mark Buckler, Suren Jayasuriya, Adrian Sampson

2017 IEEE International Conference on Computer Vision (ICCV) > 975 - 984

2017 IEEE International Conference on Computer Vision (ICCV)

Advancements in deep learning have ignited an explosion of research on efficient hardware for embedded computer vision. Hardware vision acceleration, however, does not address the cost of capturing and processing the image data that feeds these algorithms. We examine the role of the image signal processing (ISP) pipeline in computer vision to identify opportunities to reduce computation and save energy...

chapter

Broken-Karatsuba multiplication and its application to Montgomery modular multiplication

Jinnan Ding, Shuguo Li

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Large number multiplication has always been an essential operation in cryptographic algorithms. In this paper, we propose Broken-Karatsuba multiplication by applying the non-least-positive form to represent large numbers and dig the parallelism hidden in conventional Karatsuba multiplication. Further, we modify Montgomery modular multiplication algorithm with Broken-Karatsuba multiplication to make...

chapter

A pythonic approach for rapid hardware prototyping and instrumentation

John Clow, Georgios Tzimpragos, Deeksha Dangwal, Sammy Guo, more

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 7

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

We introduce PyRTL, a Python embedded hardware design language that helps concisely and precisely describe digital hardware structures. Rather than attempt to infer a good design via HLS, PyRTL provides a wrapper over a well-defined "core" set of primitives in a way that empowers digital hardware design teaching and research. The proposed system takes advantage of the programming language...

chapter

Rapid implementation of a partially reconfigurable video system with PYNQ

Brad Hutchings, Mike Wirthlin

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 8

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Undergraduate students rapidly implement a partially-reconfigured, real-time video processor on the Xilinx PYNQ board. The video processor performs various real-time operations including Sobel edge detection, embossing, averaging, an interactive Pong game, etc., using a separate partially-reconfigurable bit-stream for each distinct function. Selection of image-processing functions is accomplished...

chapter

OpenCL for HPC with FPGAs: Case study in molecular electrostatics

Chen Yang, Jiayi Sheng, Rushi Patel, Ahmed Sanaullah, more

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 8

2017 IEEE High Performance Extreme Computing Conference (HPEC)

FPGAs have emerged as a cost-effective accelerator alternative in clouds and clusters. Programmability remains a challenge, however, with OpenCL being generally recognized as a likely part of the solution. In this work we seek to advance the use of OpenCL for HPC on FPGAs in two ways. The first is by examining a core HPC application, Molecular Dynamics. The second is by examining a fundamental design...

chapter

POSTER: BACM: Barrier-Aware Cache Management for Irregular Memory-Intensive GPGPU Workloads

Yuxi Liu, Xia Zhao, Zhibin Yu, Zhenlin Wang, more

2017 26th International Conference on Parallel Architectures and Compilation Techniques (PACT) > 140 - 141

2017 26th International Conference on Parallel Architectures and Compilation Techniques (PACT)

General-purpose workloads running on modern graphics processing units (GPGPUs) rely on hardware-based barriers to synchronize warps within a thread block (TB). However, imbalance may exist before reaching a barrier if a GPGPU workload contains irregular memory accesses, i.e., some warps may be critical while others may not. Ideally, cache space should be reserved for the critical warps. Unfortunately,...

chapter

Auto-SI: An adaptive reconfigurable processor with run-time loop detection and acceleration

Tanja Harbaum, Christoph Schade, Marvin Damschen, Carsten Tradowsky, more

2017 30th IEEE International System-on-Chip Conference (SOCC) > 153 - 158

2017 30th IEEE International System-on-Chip Conference (SOCC)

Modern computer architectures have an ever-increasing demand for performance, but are constrained in power dissipation and chip area. To tackle these demands, architectures with application-specific accelerators have gained traction in research and industry. While this is a very promising direction, hard-wired accelerators fall short when too many applications need to be supported or flexibility is...

chapter

Implementation of application specific instruction-set processor for the artificial neural network acceleration using LISA ADL

Damjan Rakanovic, Rastislav Struharik

2017 IEEE East-West Design & Test Symposium (EWDTS) > 1 - 6

2017 IEEE East-West Design & Test Symposium (EWDTS)

In fields like embedded vision, where algorithms are computationally expensive, hardware accelerators play a major role in high throughput applications. These accelerators could be implemented as hardwired IP cores or Application Specific Instruction-set Processors (ASIPs). While hardwired solutions often provide the best possible performance, they are less flexible then ASIP implementation. In this...

chapter

Implementing an ISR defense on a MIPS architecture

Loriana Sanabria Sancho, Elena Gabriela Barrantes

2017 XLIII Latin American Computer Conference (CLEI) > 1 - 7

2017 XLIII Latin American Computer Conference (CLEI)

Code injection attacks are an undeniable threat in today's cyberworld. Instruction Set Randomization (ISR) was initially proposed in 2003. This technique was designed to protect systems against code injection attacks by creating an unique instruction set for each machine, thanks to randomization. It is a promising technique in the growing embedded system and Internet of Things (IoT) devices ecosystem,...

chapter

Incremental high throughput network traffic classifier

H. R. Loo, Alireza Monemi, Trias Andromeda, M. N. Marsono

2017 4th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI) > 1 - 6

2017 4th International Conference on Electrical Engineering, Computer Science and Informatics (EECSI)

Today's network traffic are dynamic and fast. Conventional network traffic classification based on flow feature and data mining are not able to process traffic efficiently. Hardware based network traffic classifier is needed to be adaptable to dynamic network state and to provide accurate and updated classification at high speed. In this paper, a hardware architecture of online incremental semi-supervised...

chapter

(Invited) Software-guided greybox design methodology with integrated power and clock management

Tianyu Jia, Yuanbo Fan, Russ Joseph, Jie Gu

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS) > 894 - 897

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS)

In this paper, we propose a cross-layer integrated microprocessor design methodology where instructions in software programs drive the design down to the gate level netlists. Based on in-depth exploration of the dynamic timing behavior of each instruction in the program, a fully integrated design approach is proposed with ultra-dynamic clock and power management circuits and software driven design...

chapter

Packet Classification with Limited Memory Resources

Michal Kekely, Jan Korenek

2017 Euromicro Conference on Digital System Design (DSD) > 179 - 183

2017 Euromicro Conference on Digital System Design (DSD)

Network security and monitoring devices use packet classification to match packet header fields in a set of rules. Many hardware architectures have been designed to accelerate packet classification and achieve wire-speed throughput for 100 Gbps networks. The architectures are designed for high throughput even for the shortest packets. However, FPGA SoC and Intel Xeon with FPGA have limited resources...

Keywords:
HARDWARE
PIPELINES

Publication date

Set your own date range

Content availability

Available (630)
None (3)

Keywords

COMPUTER ARCHITECTURE (199)
FIELD PROGRAMMABLE GATE ARRAYS (181)
REGISTERS (163)
PIPELINE PROCESSING (86)
CLOCKS (84)
FPGA (84)
SOFTWARE (77)
THROUGHPUT (63)
ALGORITHM DESIGN AND ANALYSIS (62)
INSTRUCTION SETS (53)
PROGRAM PROCESSORS (53)
RANDOM ACCESS MEMORY (47)
MICROPROCESSOR CHIPS (46)
PARALLEL PROCESSING (42)
BENCHMARK TESTING (40)
COMPUTATIONAL MODELING (38)
DECODING (32)
PIPELINE (31)
REAL-TIME SYSTEMS (31)
PARALLEL ARCHITECTURES (30)
SWITCHES (30)
LOGIC GATES (27)
MULTIPROCESSING SYSTEMS (27)
ADDERS (26)
EMBEDDED SYSTEMS (26)
KERNEL (26)
DELAY (25)
GRAPHICS PROCESSING UNITS (25)
DATA MINING (24)
DELAYS (24)
OPTIMIZATION (24)
VIDEO CODING (23)
ENGINES (22)
HARDWARE DESIGN LANGUAGES (22)
SYSTEM-ON-CHIP (22)
TIMING (22)
CRYPTOGRAPHY (21)
ENCODING (21)
HARDWARE DESCRIPTION LANGUAGES (21)
PIXEL (20)
RECONFIGURABLE ARCHITECTURES (20)
RENDERING (COMPUTER GRAPHICS) (20)
ARRAYS (19)
COMPUTERS (19)
MICROPROCESSORS (19)
ACCELERATION (18)
CONTEXT (18)
SYNCHRONIZATION (18)
COPROCESSORS (17)
FIELD PROGRAMMABLE GATE ARRAY (17)
LOGIC DESIGN (17)
MEMORY MANAGEMENT (17)
MONITORING (17)
MULTICORE PROCESSING (17)
PERFORMANCE EVALUATION (17)
REAL TIME SYSTEMS (17)
REDUCED INSTRUCTION SET COMPUTING (17)
GRAPHICS (16)
DIGITAL SIGNAL PROCESSING (15)
IMAGE PROCESSING (15)
RADIATION DETECTORS (15)
SECURITY (15)
MATHEMATICAL MODEL (14)
MULTI-THREADING (14)
POWER DEMAND (14)
PROGRAMMING (14)
TABLE LOOKUP (14)
GRAPHICS PROCESSING UNIT (13)
INDEXES (13)
OFDM (13)
PROTOCOLS (13)
REDUNDANCY (13)
SIGNAL PROCESSING ALGORITHMS (13)
SYSTEM-ON-A-CHIP (13)
ANALYTICAL MODELS (12)
BANDWIDTH (12)
EQUATIONS (12)
FAST FOURIER TRANSFORMS (12)
GPU (12)
IP NETWORKS (12)
MICROARCHITECTURE (12)
OUT OF ORDER (12)
CAMERAS (11)
ENCRYPTION (11)
FEATURE EXTRACTION (11)
LIBRARIES (11)
PREFETCHING (11)
PROCESS CONTROL (11)
TRANSFORMS (11)
VHDL (11)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (10)
GENERATORS (10)
PROCESSOR SCHEDULING (10)
RELIABILITY (10)
RESOURCE MANAGEMENT (10)
SCHEDULING (10)
STREAMING MEDIA (10)
VECTORS (10)
more

INFONA - science communication portal

Search results

Research and design of add-based length-scalable dual-field modular multiplication-addition-subtraction

Designing a High-Throughput Pipeline for Digitizing Pinned Insects

In situ video encoding of floating-point volume data using special-purpose hardware for a posteriori rendering and analysis

A compilation method for zero overhead loop in DSPs with VLIW

Encrypted computing: Speed, security and provable obfuscation against insiders

Exploring computation-communication tradeoffs in camera systems

Toward a programmable FIB caching architecture

Optimising packet forwarding in multi-tenant networks using rule compilation

Reconfiguring the Imaging Pipeline for Computer Vision

Broken-Karatsuba multiplication and its application to Montgomery modular multiplication

A pythonic approach for rapid hardware prototyping and instrumentation

Rapid implementation of a partially reconfigurable video system with PYNQ

OpenCL for HPC with FPGAs: Case study in molecular electrostatics

POSTER: BACM: Barrier-Aware Cache Management for Irregular Memory-Intensive GPGPU Workloads

Auto-SI: An adaptive reconfigurable processor with run-time loop detection and acceleration

Implementation of application specific instruction-set processor for the artificial neural network acceleration using LISA ADL

Implementing an ISR defense on a MIPS architecture

Incremental high throughput network traffic classifier

(Invited) Software-guided greybox design methodology with integrated power and clock management

Packet Classification with Limited Memory Resources

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options