Search results

chapter

Automatic Scan Parallelization in OpenMP

Maicol Zegarra, Marcio Pereira, Xavier Martorell, Guido Araujo

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) > 85 - 90

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)

Prefix Scan (or simply scan) is an operator that computes all the partial sums of a vector. A scan operation results in a vector where each element is the sum of the preceding elements in the original vector up to the corresponding position. Scan is a key operation in many relevant problems like sorting, lexical analysis, string comparison, image filtering among others. Although there are libraries...

chapter

Compression experiments on term-document index

Murat Cihan Sorkun, Can Ozbey

2017 International Conference on Computer Science and Engineering (UBMK) > 435 - 439

2017 International Conference on Computer Science and Engineering (UBMK)

The increase in the size of the data used in natural language processing activities brings with it time and space constraints. Thus, it is important to both store and access data efficiently. This study includes experiments for storing the term-document index, which will be used in a natural language processing project, effectively in memory. For this purpose, the indexed data is compressed using...

chapter

Diagnostics of linear phased array from near-field data using iterative regularization

A. B. Khashimov

2017 2nd International Ural Conference on Measurements (UralCon) > 330 - 335

2017 2nd International Ural Conference on Measurements (UralCon)

This paper considers a method for detecting faulty elements in a linear phased array by means of set near-field probes. Using the asymptotic correspondence for two- and three-dimensional problems of the antenna theory leads to simpler mathematical models for the vertical dipoles array. A numerical study of mathematical models is based on the iterative regularization of the ill-posed problem of reconstructing...

chapter

Coherence tuning of pulsed photonic crystal VCSEL arrays

Harshil Dave, Stewart T. M. Fryslie, Zihe Gao, Bradley J. Thompson, more

2017 IEEE Photonics Conference (IPC) > 531 - 532

2017 IEEE Photonics Conference (IPC)

Characterization of coherence tuning range for 2×1 photonic crystal VCSEL arrays under pulsed excitation is reported. Far field data show the coherence range is larger under pulsed conditions compared to cw operation due to reduction of resistive diode heating.

chapter

Autonomous, independent management of dynamic graphs on GPUs

Martin Winter, Rhaleb Zayer, Markus Steinberger

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

In this paper, we present a new, dynamic graph data structure, built to deliver high update rates while keeping a low memory footprint using autonomous memory management directly on the GPU. By transferring the memory management to the GPU, efficient updating of the graph structure and fast initialization times are enabled as no additional memory allocation calls or reallocation procedures are necessary...

chapter

A novel ReRAM-based processing-in-memory architecture for graph computing

Lei Han, Zhaoyan Shen, Zili Shao, H. Howie Huang, more

2017 IEEE 6th Non-Volatile Memory Systems and Applications Symposium (NVMSA) > 1 - 6

2017 IEEE 6th Non-Volatile Memory Systems and Applications Symposium (NVMSA)

Graph algorithms such as breadth-first search (BFS) have been gaining ever-increasing importance in the era of Big Data. However, the memory bandwidth remains the key performance bottleneck for graph processing. To address this problem, we utilize processing-in-memory (PIM), combined with non-volatile metal-oxide resistive random access memory (ReRAM), to improve the performance of both computation...

chapter

Performance analysis of superdirective beamforming of circular hydrophone array

Zhengyao He, Qiang Shi, Yuanliang Ma

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS) > 144 - 147

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS)

The performance of superdirective beamforming of a practical circular hydrophone array is investigated. The received signal of a 16-element circular hydrophone array with radius of 0.25m are measured in the anechoic water tank. The array manifold at the frequency of 1.11kHz is obtained. The superdirective beamforming of the circular array is designed and realized. The correctness and effectiveness...

chapter

On decoding rank-metric codes over large fields

Ron M. Roth

2017 IEEE International Symposium on Information Theory (ISIT) > 2756 - 2760

2017 IEEE International Symposium on Information Theory (ISIT)

A decoding algorithm is presented for rank-metric array codes that are based on diagonal interleaving of MDS codes. W.r.t. this metric, such array codes are known to be optimal when the underlying field is algebraically closed. It is also shown that for any list decoding radius that is smaller than the minimum rank distance, the list size can be bounded from above by an expression that is independent...

chapter

GenServ: Genome Sequencing Services on Scalable Energy Efficient Accelerators

Chao Wang, Haijie Fang, Shiming Lei, Lei Gong, more

2017 IEEE International Conference on Web Services (ICWS) > 814 - 817

2017 IEEE International Conference on Web Services (ICWS)

As a traditional algorithm, the string match meets a challenge with the development of the massive volume of data be-cause of gene sequencing. Surveys show that there will be a huge amount of short read segments during the process of gene sequencing and the need for a highly efficient is urgent. The BWA is an effective algorithm to deal with the short read mapping. Compared with other short read mapping...

chapter

Boosting Performance of Map Matching Algorithms by Parallelization on Graphics Processors

Markus Auer, Hubert Rehborn, Sven-Eric Molzahn, Klaus Bogenberger

2017 IEEE Intelligent Vehicles Symposium (IV) > 462 - 467

2017 IEEE Intelligent Vehicles Symposium (IV)

In this paper existing map matching algorithms are combined and modified such, that the resulting algorithm is suitable for the implementation on the graphics processing unit (GPU). The map matching algorithm implemented on GPU consists of a geometrical and topological processing step, which provides high accuracy with high efficiency at the same time. An important building block of the implementation...

chapter

A cache-friendly concurrent lock-free queue for efficient inter-core communication

Xianghui Meng, Xuewen Zeng, Xiao Chen, Xiaozhou Ye

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 538 - 542

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN)

Buffer sharing based on pipeline parallelism is quite susceptible to inter-core communication overhead. Existing work on concurrent lock-free (CLF) queue algorithm did not take full advantage of CPU cache features to improve performance. In order to implement a fast single-producer-single-consumer (SPSC) buffer scheduling queue, this paper proposes a cache-friendly CLF queue scheduling algorithm (CFCLF),...

chapter

Robust and efficient algorithms for storage and retrieval of disk based data structures

Kathiravan Srinivasan, Ravinder Kumar, Sahil Singla

2017 International Conference on Applied System Innovation (ICASI) > 934 - 937

2017 International Conference on Applied System Innovation (ICASI)

Data sets are often too immense to fit completely inside the computer's main memory and must instead reside on disk. If data set will be kept in main memory it will be very costly. A computer must retrieve required data and place it in internal memory to process it. Efficient data structures, like B-tree, B+ tree, are used to process large datasets. Nodes of these data structures are buffered in memory...

chapter

Scalable Lock-Free Vector with Combining

Ivan Walulya, Philippas Tsigas

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 917 - 926

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Dynamic vectors are among the most commonly used data structures in programming. They provide constant time random access and resizable data storage. Additionally, they provide constant time insertion (pushback) and deletion (popback) at the end of the sequence. However, in a multithreaded system, concurrent pushback and popback operations attempt to update the same shared object, creating a synchronization...

chapter

Aces4: A Platform for Computational Chemistry Calculations with Extremely Large Block-Sparse Arrays

Beverly A. Sanders, Jason N. Byrd, Nakul Jindal, Victor F. Lotrich, more

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 555 - 564

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Aces4 is a parallel programming platform comprising a DSL for Computational Chemistry and its runtime system. It offers a convenient way to express parallelism together with extensive support for extremely large, possibly sparse, distributed arrays. It aids scientists in the creation of performant, scalable, massively parallel programs that can effectively take advantage of leadership class computing...

chapter

Stabbing Colors in One Dimension

Arnab Ganguly, Wing-Kai Hon, Rahul Shah

2017 Data Compression Conference (DCC) > 280 - 289

2017 Data Compression Conference (DCC)

Given n horizontal segments, each associated with a color from [σ], the Categorical Segment Stabbing problem is to find the distinct K colors stabbed by a vertical line. When the end-points of the segments are distinct and lie in [1, 2n], we present an (2 + ε)n log σ + O(n)-bit index with O(K/ε) query time, where ε∈ (0, 1].When the end-points are arbitrary real numbers, a standard reduction to the...

chapter

Full Compressed Affix Tree Representations

Rodrigo Canovas, Eric Rivals

2017 Data Compression Conference (DCC) > 102 - 111

2017 Data Compression Conference (DCC)

The Suffix Tree, a crucial and versatile data structure for string analysis of large texts, is often used in pattern matching and in bioinformatics applications. The Affix Tree generalizes the Suffix Tree in that it supports full tree functionalities in both search directions. The bottleneck of Affix Trees is their space requirement for storing the data structure. Here, we discuss existing representations...

chapter

A Scalable Storage System for Structured Data Based on Higher Order Index Array

Mehnuma Tabassum Omar, K.M. Azharul Hasan

2016 IEEE/ACM 3rd International Conference on Big Data Computing Applications and Technologies (BDCAT) > 247 - 252

2016 IEEE/ACM 3rd International Conference on Big Data Computing Applications and Technologies (BDCAT)

Conventional computing techniques extensively diverge over large scale computing. In order to store and operate structured data, most data scientists suggest higher dimensional arrays, especially in linearization of higher order data. However, with the developing size of datasets, the structures become prone to performance degradation for inability of maintaining expanded data velocity. Besides, reallocation...

chapter

A Compact-Trie-Based Structure for K-Nearest-Neighbour Searching

Peng Gong, Wendy Osborn

2017 IEEE 31st International Conference on Advanced Information Networking and Applications (AINA) > 578 - 585

2017 IEEE 31st International Conference on Advanced Information Networking and Applications (AINA)

This paper proposes a k-nearest neighbour search method inspired by grid space partitioning and the compact-trie structure. A compact trie structure, and a k-nearest neighbour search strategy are presented. Then, a k-nearest neighbour search performance comparison is carried out against two well-known methods, using one million two-dimensional spatial points and finding up to 1000 nearest neighbours...

chapter

Verifying Concurrent Programs Using Contracts

Ricardo J. Dias, Carla Ferreira, Jan Fiedor, Joao M. Lourenco, more

2017 IEEE International Conference on Software Testing, Verification and Validation (ICST) > 196 - 206

2017 IEEE International Conference on Software Testing, Verification and Validation (ICST)

The central notion of this paper is that of contracts for concurrency, allowing one to capture the expected atomicity of sequences of method or service calls in a concurrent program. The contracts may be either extracted automatically from the source code, or provided by developers of libraries or software modules to reflect their expected usage in a concurrent setting. We start by extending the so-far...

chapter

Tessellating memory space for parallel access

Juan Escobedo, Mingjie Lin

2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC) > 75 - 80

2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC)

Modern reconfigurable computing chips, such as FPGAs, offer an unprecedented opportunity to achieving both multifunctionality and real-time responsiveness for memoryintensive embedded applications. However, how to cost-effectively synthesize application-specific hardware constructs that fully exploit memory-level parallelism remains to be a key challenge. To address this problem, we propose a new...

INFONA - science communication portal

Search results

Automatic Scan Parallelization in OpenMP

Compression experiments on term-document index

Diagnostics of linear phased array from near-field data using iterative regularization

Coherence tuning of pulsed photonic crystal VCSEL arrays

Autonomous, independent management of dynamic graphs on GPUs

A novel ReRAM-based processing-in-memory architecture for graph computing

Performance analysis of superdirective beamforming of circular hydrophone array

On decoding rank-metric codes over large fields

GenServ: Genome Sequencing Services on Scalable Energy Efficient Accelerators

Boosting Performance of Map Matching Algorithms by Parallelization on Graphics Processors

A cache-friendly concurrent lock-free queue for efficient inter-core communication

Robust and efficient algorithms for storage and retrieval of disk based data structures

Scalable Lock-Free Vector with Combining

Aces4: A Platform for Computational Chemistry Calculations with Extremely Large Block-Sparse Arrays

Stabbing Colors in One Dimension

Full Compressed Affix Tree Representations

A Scalable Storage System for Structured Data Based on Higher Order Index Array

A Compact-Trie-Based Structure for K-Nearest-Neighbour Searching

Verifying Concurrent Programs Using Contracts

Tessellating memory space for parallel access

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options