Search results

chapter

Task-based execution of synchronous dataflow graphs for scalable multicore computing

Georgios Georgakarakos, Sudeep Kanur, Johan Lilius, Karol Desnos

2017 IEEE International Workshop on Signal Processing Systems (SiPS) > 1 - 6

Dataflow models of computation have early on been acknowledged as an attractive methodology to describe parallel algorithms, hence they have become highly relevant for programming in the current multicore processor era. While several frameworks provide tools to create dataflow descriptions of algorithms, generating parallel code for programmable processors is still sub-optimal due to the scheduling...

chapter

Maximum likelihood network localization using range estimation and GPS measurements

Hongwei Yu, Yi Jiang

2017 9th International Conference on Wireless Communications and Signal Processing (WCSP) > 1 - 6

Localization of wireless networks has remained an active research topic for more than a decade, as it finds more and more applications in numerous scenarios, including environment surveillance, asset tracking, and healthcare monitoring, etc. In this paper, we consider the maximum likelihood localization of a network where some of the nodes are GPS-capable while the others attempt to achieve self localization...

chapter

Understanding and overcoming parallelism bottlenecks in ForkJoin applications

Gustavo Pinto, Anthony Canino, Fernando Castor, Guoqing Xu, more

2017 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE) > 765 - 775

ForkJoin framework is a widely used parallel programming framework upon which both core concurrency libraries and real-world applications are built. Beneath its simple and user-friendly APIs, ForkJoin is a sophisticated managed parallel runtime unfamiliar to many application programmers: the framework core is a work-stealing scheduler, handles fine-grained tasks, and sustains the pressure from automatic...

chapter

Synchronized UML diagrams for object-oriented program comprehension

Jeong Yang, Young Lee, Deep Gandhi, Sruthi Ganesan Valli

2017 12th International Conference on Computer Science and Education (ICCSE) > 12 - 17

We propose a novel approach for visualizing reverse-engineered Unified Modeling Language (UML) diagrams (class, object, and sequence) to improve Object-Oriented Program (OOP) comprehension on a web-based programming environment, JaguarCode. It aims to help students better understand static structure and dynamic behavior of Java programs and object-oriented programming concepts. This paper presents...

chapter

MPI-GDS: High Performance MPI Designs with GPUDirect-aSync for CPU-GPU Control Flow Decoupling

Akshay Venkatesh, Khaled Hamidouche, Sreeram Potluri, Davide Rosetti, more

2017 46th International Conference on Parallel Processing (ICPP) > 151 - 160

While GPUs are becoming common in HPC systems, the CPU is still responsible for managing both GPU-side and CPU-side compute, communication, and synchronization operations. For instance, if a result from a GPU-side computation is to be transferred to a remote destination, then the CPU must synchronize on GPU compute completion issuing a communication operation. Both CPU cycles and energy are consumed...

chapter

Practical Experience with Transactional Lock Elision

Tingzhe Zhou, Pante A Zardoshti, Michael Spear

2017 46th International Conference on Parallel Processing (ICPP) > 81 - 90

Transactional Memory (TM) promises both to provide a scalable mechanism for synchronization in concurrent programs, and to offer ease-of-use benefits to programmers. The most straightforward use of TM in real-world programs is in the form of Transactional Lock Elision (TLE). In TLE, critical sections are attempted as transactions, with a fall-back to a lock if conflicts manifest. Thus TLE expects...

chapter

Visualization of Open Community Runtime Task Graphs

Jiri Dokulil, Jana Katreniakova

2017 21st International Conference Information Visualisation (IV) > 236 - 241

The emergence of new types of high performance hardware also drives the need for new programming models. The Open Community Runtime (OCR) proposal uses a task-based programming model to target some of these architectures. In OCR, the whole program from start to end needs to be expressed using tasks and synchronized using task-to-task dependences, significantly limiting the applicability and usefulness...

chapter

Towards Variability Management in Bidirectional Model Transformation

Xiao He, Zhenjiang Hu, Yi Liu

2017 IEEE 41st Annual Computer Software and Applications Conference (COMPSAC) > 1 > 224 - 233

The bidirectional model transformation (BX) comprises a forward transformation get and a backward transformation put. Given that get may be an information-loss transformation, the behavior of put may be uncertain. An uncertain put produces many valid outputs that fit different application scenarios. This paper proposes an approach to variability management in BX to enable put to generate an output...

chapter

Publish-subscribe programming for a NoC-based multiprocessor system-on-chip

Jean Carlo Hamerski, Geancarlo Abich, Ricardo Reis, Luciano Ost, more

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

Shared memory and message passing are traditional parallel programming models used on multiprocessor system-on-chip environments. Underlying models are traditionally meant for static scenarios where all communicating entities and their intercommunication patterns are known a priori by the software engineer. The systems design following such programming models became complex due to dynamic behavior...

chapter

A pulse-based memristor programming circuit

Olufemi A. Olumodeji, Massimo Gottardi

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

In this paper, we present a novel and simple circuit for accurately programming memristors in both an incremental and a decremental fashion. One of the main constituting blocks of the circuit is an inverting voltage amplifier block within which the memristor forms a gain stage with a reference resistor. Memristor resistance modulation is achieved by means of auto-tuning operational amplifier's gain...

chapter

Implementation and Evaluation of One-Sided PGAS Communication in XcalableACC for Accelerated Clusters

Akihiro Tabuchi, Masahiro Nakao, Hitoshi Murai, Taisuke Boku, more

2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) > 625 - 634

Clusters equipped with accelerators such as graphics processing unit (GPU) and Many Integrated Core (MIC) are widely used. For such clusters, programmers write programs for their applications by combining MPI with one of the available accelerator programming models. In particular, OpenACC enables programmers to develop their applications easily, but with lower productivity owing to complex MPI programming...

chapter

Comparison of Threading Programming Models

Solmaz Salehian, Jiawen Liu, Yonghong Yan

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 766 - 774

In this paper, we provide comparison of language features and runtime systems of commonly used threading parallel programming models for high performance computing, including OpenMP, Intel Cilk Plus, Intel TBB, OpenACC, Nvidia CUDA, OpenCL, C++11 and PThreads. We then report our performance comparison of OpenMP, Cilk Plus and C++11 for data and task parallelism on CPU using benchmarks. The results show...

chapter

Improving the Integration of Task Nesting and Dependencies in OpenMP

Josep M. Perez, Vicenc Beltran, Jesus Labarta, Eduard Ayguade

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 809 - 818

The tasking model of OpenMP 4.0 supports both nesting and the definition of dependences between sibling tasks. A natural way to parallelize many codes with tasks is to first taskify the high-level functions and then to further refine these tasks with additional subtasks. However, this top-down approach has some drawbacks since combining nesting with dependencies usually requires additional measures...

chapter

Coverage-Driven Test Code Generation for Concurrent Classes

Valerio Terragni, Shing-Chi Cheung

2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE) > 1121 - 1132

Previous techniques on concurrency testing have mainly focused on exploring the interleaving space of manually written test code to expose faulty interleavings of shared memory accesses. These techniques assume the availability of failure-inducing tests. In this paper, we present AutoConTest, a coverage-driven approach to generate effective concurrent test code that achieve high interleaving coverage...

chapter

Prodirect Manipulation: Bidirectional Programming for the Masses

Ravi Chugh

2016 IEEE/ACM 38th International Conference on Software Engineering Companion (ICSE-C) > 781 - 784

Software interfaces today generally fall at either end of a spectrum. On one end are programmable systems, which allow expert users (i.e. programmers) to write software artifacts that describe complex abstractions, but programs are disconnected from their eventual output. On the other end are domain-specific graphical user interfaces (GUIs), which allow end users (i.e. non-programmers) to easily create...

chapter

Reusable Self-Adaptation through Bidirectional Programming

Kevin Colson, Robin Dupuis, Lionel Montrieux, Zhenjiang Hu, more

2016 IEEE/ACM 11th International Symposium on Software Engineering for Adaptive and Self-Managing Systems (SEAMS) > 4 - 15

In self-adaptive systems, an adaptation strategy can apply to several implementations of a target system. Reusing this strategy requires models of the target system that are independent of its implementation. In particular, configuration files must be transformed into abstract configurations, but correctly synchronizing these two representations is not trivial. We propose an approach that uses putback-based...

chapter

A Language, Framework, and SDK for Robotic Communications, Integration, and Interoperability

Jeffrey Wesley Wallace, Sara Jane Kambouris

2016 International Conference on Computational Science and Computational Intelligence (CSCI) > 472 - 477

A framework to integrate different artificial intelligence and machine learning algorithms is combined with an execution framework to create a powerful cloud computing system development platform. By providing an execution framework and control software that is native to cloud architectures and supports interactivity and time synchronization, the true utility of cloud computing and "big data...

chapter

Deep parallelization of parallel FP-growth using parent-child MapReduce

Adetokunbo Makanju, Zahra Farzanyar, Aijun An, Nick Cercone, more

2016 IEEE International Conference on Big Data (Big Data) > 1422 - 1431

MapReduce is an important programming model for processing in distributed environments. Compared to other distributed programming models, MapReduce reduces communication overheads between computers and improves fault tolerance. However, the MapReduce model does not allow for automatic synchronization between jobs. A large number of data analytics algorithms use a recursive divide-and-conquer approach,...

chapter

Reducing the Communication Costs of Graph Analysis by Read-Only Replicas and Prioritized Execution

Yun Gao, Wei Zhou, Jizhong Han, Dan Meng

2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS) > 102 - 109

Graph mining is widely used in fields like social network analysis. The synchronous vertex-centric frameworks strike better balance between the performance and ease-of-use, so they are widely used in realistic. However, traditional architectures of this type, like the vertex-based push architecture and GAS, are encumbered by high communication costs. In this paper we proposed a new replica-based push...

chapter

Experiences of Applying One-Sided Communication to Nearest-Neighbor Communication

Hongzhang Shan, Samuel Williams, Yili Zheng, Weiqun Zhang, more

2016 PGAS Applications Workshop (PAW) > 17 - 24

Nearest-neighbor communication is one of the most important communication patterns appearing in many scientific applications. In this paper, we discuss the results of applying UPC++, a library-based partitioned global address space (PGAS) programming extension to C++, to an adaptive mesh framework (BoxLib), and a full scientific application GTC-P, whose communications are dominated by the nearest-neighbor...

INFONA - science communication portal
