Items from 1 to 20 out of 43 results

chapter

A method for synchronous dataflow retiming

Anatolij Sergiyenko, Anastasia Serhienko, Andrij Simonenko

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON) > 1015 - 1018

2017 IEEE First Ukraine Conference on Electrical and Computer Engineering (UKRCON)

A method of retiming the spatial synchronous dataflow graph (SDF) is proposed, which is based on the SDF representation in the multidimensional space. The dimensions of this space are the spatial coordinate of the processing unit, coordinate of the operator firing and operator type. At the first stage of the datapath synthesis, the operator nodes are placed in the space according to a set of rules...

chapter

Edge-centric modulo scheduling for coarse-grained reconfigurable architectures

Hyunchul Park, Kevin Fan, Scott Mahlke, Taewook Oh, et al.

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT) > 166 - 176

2008 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Coarse-grained reconfigurable architectures (CGRAs) present an appealing hardware platform by providing the potential for high computation throughput, scalability, low cost, and energy efficiency. CGRAs consist of an array of function units and register files often organized as a two dimensional grid. The most difficult challenge in deploying CGRAs is compiler scheduling technology that can efficiently...

chapter

Taming warp divergence

Jayvant Anantpur, R. Govindarajan

2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) > 50 - 60

2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)

Graphics Processing Units (GPUs) are designed to exploit large amounts of parallelism. However, warp-level divergence caused by differing amounts of work, differing memory access latencies, etc., results in warps of a thread block (TB) finishing kernel execution at different points in time. This, in effect, reduces utilization of the resources of SMs and hence the performance of the GPU. We propose...

chapter

Efficient hardware architecture of deterministic MPA decoder for SCMA

Chao Yang, Chuan Zhang, Shunqing Zhang, Xiaohu You

2016 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS) > 293 - 296

2016 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS)

Sparse code multiple access (SCMA) is a new multiple access (MA) technology that ranks as one of the most promising candidates for 5G wireless because of its outstanding performance. SCMA enjoys stronger overloading tolerance compared with traditional MA technologies. It also takes advantage of the sparsity property to achieve lower complexity when using the message passing algorithm (MPA). In this...

chapter

Efficient hardware architecture of deterministic MPA decoder for SCMA

Chao Yang, Chuan Zhang, Shunqing Zhang, Xiaohu You

2016 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS) > 392 - 395

2016 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS)

chapter

VarySched: A Framework for Variable Scheduling in Heterogeneous Environments

Tim Süß, Nils Döring, Ramy Gad, Lars Nagel, et al.

2016 IEEE International Conference on Cluster Computing (CLUSTER) > 489 - 492

2016 IEEE International Conference on Cluster Computing (CLUSTER)

Despite many efforts to better utilize the potential of GPUs and CPUs, it is far from being fully exploited. Although many tasks can be easily sped up by using accelerators, most of the existing schedulers are not flexible enough to really optimize the resource usage of the complete system. The main reasons are (i) that each processing unit requires a specific program code and that this code is often...

chapter

Scheduling instruction effects for a statically pipelined processor

B. Davis, R. Baird, P. Gavin, M. Sjalander, et al.

2015 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES) > 167 - 176

2015 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES)

Statically pipelined processors have a fully exposed datapath where all portions of the pipeline are directly controlled by effects within an instruction, which simplifies hardware and enables a new level of compiler optimizations. This paper describes an effect scheduling strategy to aggressively compact instructions, which has a critical impact on code size and performance. Unique scheduling challenges...

chapter

Improving the interface performance of synthesized structural FAME simulators through scheduling

David A. Penry

2015 33rd IEEE International Conference on Computer Design (ICCD) > 70 - 77

2015 33rd IEEE International Conference on Computer Design (ICCD)

Computer designers rely upon near-cycle-accurate microarchitectural simulators to explore the design space of new systems. Hybrid simulators which offload simulation work onto FPGAs (also known as FAME simulators) can overcome the speed limitations of software-only simulators. However, such simulators must be automatically synthesized or the time to design them becomes prohibitive. Previous work has...

chapter

Scheduling on a superscalar processor using the chain technique

Lin Meng, Nobihiro Moriwaki, Shigeru Oyanagi

Proceedings of the 2014 International Conference on Advanced Mechatronic Systems > 398 - 403

2014 International Conference on Advanced Mechatronic Systems (ICAMechS)

Instruction level parallelism is one of the basic ways of increasing the performance of current processors. One method to improve instruction parallelism is the chain technique, which bypasses execution results from one Arithmetic Logic Unit (ALU) to others. However, this technique cannot be used with the current superscalar processor scheduling method. We develop a scheduling method for the chain...

chapter

FlexPRET: A processor platform for mixed-criticality systems

Michael Zimmer, David Broman, Chris Shaver, Edward A. Lee

2014 IEEE 19th Real-Time and Embedded Technology and Applications Symposium (RTAS) > 101 - 110

2014 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS)

Mixed-criticality systems, in which multiple tasks of varying criticality execute on a single hardware platform, are an emerging research area in real-time embedded systems. High-criticality tasks require spatial and temporal isolation guarantees for independent verification, and the task set should efficiently utilize hardware resources. Hardware-based isolation is desirable but often underutilizes...

chapter

Resiliency-aware scheduling: Resource allocation for hardened computation on configurable devices

Jeremy Abramson, Pedro C. Diniz

2012 International Conference on Field-Programmable Technology > 129 - 134

2012 International Conference on Field-Programmable Technology (FPT)

The number of configurable systems deployed in hostile environments continues to rise. This, along with decreasing geometries and lower operating voltages, leads to an expected increase in transient errors. This paper presents Resiliency-aware Scheduling, a novel approach to resource allocation for hardening computations on configurable systems. Using modular and replicated functional units called...

chapter

Real-time scheduling coprocessor for NIOS II processor

M. Varela, R. Cayssials, E. Ferro, E. Boemo

2012 VIII Southern Conference on Programmable Logic > 1 - 6

2012 VIII Southern Conference on Programmable Logic (SPL)

In this paper we describe and analyze the main features of the Hardware Real-Time Scheduler Coprocessor unit (HRTSC) for the NIOS II processor. We describe how the HRTSC supports time, events, tasks and priorities. The HRTSC was designed as a SOPC component to incorporate real-time features for embedded real-time applications. The hardware architecture integrates easily with the IDE programming environment...

chapter

Dynamic Communication in a Coarse Grained Reconfigurable Array

R Panda, S Hauck

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines > 25 - 28

2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2011)

Coarse Grained Reconfigurable Arrays (CGRAs) are typically very efficient for a single task. However, all functional units are required to perform in lock step, wasting resources and making complex programming flows difficult. Massively Parallel Processor Arrays (MPPAs) excel at executing unrelated tasks simultaneously, but limit the amount of resources dedicated to a single task. We propose an architecture...

chapter

Hardware architectures for successive cancellation decoding of polar codes

Camille Leroux, Ido Tal, Alexander Vardy, Warren J. Gross

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1665 - 1668

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The recently-discovered polar codes are widely seen as a major breakthrough in coding theory. These codes achieve the capacity of many important channels under successive cancellation decoding. Motivated by the rapid progress in the theory of polar codes, we propose a family of architectures for efficient hardware implementation of successive cancellation decoders. We show that such decoders can be...

chapter

A multithreaded processor core with low overhead context switch for IP-packet processing

Kang Li, Hong Zhang, Jiandong Li, Yue Hao, et al.

2010 10th IEEE International Conference on Solid-State and Integrated Circuit Technology > 272 - 274

2010 10th IEEE International Conference on Solid-State and Integrated Circuit Technology (ICSICT)

In this paper, a multithreaded processor with a hardware context switch mechanism driven by external events is presented for multi-processor systems on chip (MPSoC). Combining this mechanism with asynchronous memory access, the proposed processor implements non-preemptive thread scheduling, which can assure fairness among threads and optimization for a single thread. The overhead of hardware thread switch is...

chapter

Optimized communication architecture of MPSoCs with a hardware scheduler: A system view

Diandian Zhang, Han Zhang, Jeronimo Castrillon, Torsten Kempf, et al.

2010 International Symposium on System on Chip > 163 - 168

2010 International Symposium on System-on-Chip (SOC)

With the increasing complexity of MPSoCs, efficient runtime management of system resources becomes vitally important for improving system performance and energy efficiency. OSIP, an operating system application-specific instruction-set processor, provides a promising solution to this. It delivers high computational performance to deal with dynamic task scheduling and mapping, while still being programmable...

chapter

Using SMT to Hide Context Switch Times of Large Real-Time Tasksets

J Mische, S Uhrig, F Kluge, T Ungerer

2010 IEEE 16th International Conference on Embedded and Real-Time Computing Systems and Applications > 255 - 264

2010 IEEE 16th International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA 2010)

Theoretical real-time research generally neglects context switch times. But in recent embedded applications which consist of dozens of threads with very short execution times, their impact is too serious to be ignored. We present a hard real-time scheduling algorithm that perfectly hides the context switch times of an arbitrary number of threads. It requires a Simultaneous Multithreaded (SMT) processor...

chapter

Reconfigurable custom functional unit generation and exploitation in multiple-issue processors

Hui-Shan Wang, I-Wei Wu, Jean Jyh-Jiun Shann, Chung-Ping Chung

2010 IEEE 8th Symposium on Application Specific Processors (SASP) > 115 - 118

2010 IEEE 8th Symposium on Application Specific Processors (SASP 2010)

Next-generation digital entertainment and mobile communication devices are driving the demand for high-performance processing solutions. To meet this demand, multiple-issue processors such as very long instruction word (VLIW) architectures augmented with a reconfigurable hardware accelerator have been proposed in many papers. The reconfigurable hardware accelerator is usually...

chapter

ARM7TDMI Optimization Based on GCC

Den Wenjian

2010 Second International Conference on Computer Research and Development > 639 - 642

Second International Conference on Computer Research and Development (ICCRD 2010)

The paper discusses optimization of hardware architecture from the angle of compiling. The angle of compiling, to which the paper refers, is the located compiling technology. That is to say, the paper analyzes how to optimize the instruction set, register allocation and pipelining of the hardware architecture using GCC compiling techniques such as peephole optimization, graph coloring and instruction scheduling.

chapter

A Register Transfer Level Approach for Intermittent Semi-Concurrent Error Detection

Yang Donghu, Jiang Jianhui, Yin Jie, Huang Jipeng

2010 2nd International Conference on E-business and Information System Security > 1 - 6

2010 2nd International Conference on E-business and Information System Security (EBISS 2010)

With increasing chip density and continually rising reliability requirements, concurrent error detection techniques at the register transfer level have become an increasing concern. Because the actual probability of a failure occurring is low, a number of semi-concurrent error detection techniques are feasible. The circuits can be checked once every N iterations. So the recomputations will...

Keywords:
HARDWARE
REGISTERS
PROCESSOR SCHEDULING

Publication type

book (33)
article (10)

Keywords

SCHEDULING (15)
CLOCKS (11)
COMPUTER ARCHITECTURE (10)
HIGH-LEVEL SYNTHESIS (8)
MICROPROCESSOR CHIPS (8)
PIPELINE PROCESSING (8)
PIPELINES (8)
SCHEDULING ALGORITHM (7)
FIELD PROGRAMMABLE GATE ARRAYS (6)
HIGH LEVEL SYNTHESIS (6)
MULTIPROCESSING SYSTEMS (5)
OPTIMIZATION (5)
PARALLEL PROCESSING (5)
REAL TIME SYSTEMS (5)
SOFTWARE (5)
VLSI (5)
CONTEXT (4)
COST FUNCTION (4)
DECODING (4)
PIPELINING (4)
SCHEDULES (4)
SWITCHES (4)
SYSTEM-ON-CHIP (4)
ALGORITHM DESIGN AND ANALYSIS (3)
EMBEDDED SYSTEM (3)
HARDWARE ARCHITECTURE (3)
INSTRUCTION SETS (3)
MEMORY MANAGEMENT (3)
MICROPROCESSORS (3)
OPTIMISATION (3)
PROGRAM PROCESSORS (3)
REAL-TIME SYSTEMS (3)
RECONFIGURABLE ARCHITECTURES (3)
TIMING (3)
VLIW (3)
APPLICATION SOFTWARE (2)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (2)
BANDWIDTH (2)
CHECKPOINTING (2)
CIRCUIT CAD (2)
CONTROL SYSTEM SYNTHESIS (2)
CONTROL SYSTEMS (2)
DATA FLOW GRAPHS (2)
DATA MINING (2)
DELAY (2)
DETERMINISTIC MESSAGE PASSING ALGORITHM (DMPA) (2)
EMBEDDED SYSTEMS (2)
ENERGY CONSUMPTION (2)
ERROR CORRECTION (2)
ERROR DETECTION (2)
FPGA (2)
FUNCTIONAL UNITS (2)
HARDWARE SCHEDULER (2)
INTEGRATED CIRCUIT DESIGN (2)
KERNEL (2)
LOGIC DESIGN (2)
LOGIC GATES (2)
LOOP BOUND ANALYSIS (2)
LOW-POWER ELECTRONICS (2)
MPSOC (2)
OPERATING SYSTEMS (2)
PERMISSION (2)
PROBABILITY (2)
PROCESS CONTROL (2)
PROCESS DESIGN (2)
RADIO FREQUENCY (2)
RANDOM ACCESS MEMORY (2)
REDUCED INSTRUCTION SET COMPUTING (2)
REGISTER TRANSFER LEVEL (2)
SPARSE CODE MULTIPLE ACCESS (SCMA) (2)
STAGE-LEVEL FOLDING (2)
THROUGHPUT (2)
VERY LARGE SCALE INTEGRATION (2)
4-CORE PROCESSOR SYSTEM (1)
ABSTRACTION TECHNIQUES (1)
ACCELERATOR (1)
ACCURACY (1)
ADA (1)
ADAPTIVE HARDWARE (1)
ADAPTIVE HARDWARE REAL-TIME TASK SCHEDULER MANAGEMENT (1)
ALLOCATION PHASE (1)
ALTERNATIVE INTERCONNECT STRATEGY (1)
ANGLE OF COMPILING (1)
APPLICATION SCHEDULING (1)
APPLICATION SPECIFIC CORE (1)
APPLICATION-SPECIFIC EMBEDDED SYSTEMS (1)
APPLICATION-SPECIFIC INSTRUCTION-SET PROCESSOR (1)
APPLICATION-TURNED PROCESSOR ARCHITECTURE (1)
ARBITRARY SEQUENTIAL CONSTRAINTS (1)
ARCHITECTURE (1)
ARITHMETIC (1)
ARM7TDMI (1)
ARM7TDMI OPTIMIZATION (1)
ARRAYS (1)
ASIC (1)
ASYNCHRONOUS CIRCUITS (1)
ASYNCHRONOUS MEMORY ACCESS (1)

INFONA - science communication portal
