Search results

chapter

Automatic Scan Parallelization in OpenMP

Maicol Zegarra, Marcio Pereira, Xavier Martorell, Guido Araujo

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW) > 85 - 90

2017 International Symposium on Computer Architecture and High Performance Computing Workshops (SBAC-PADW)

Prefix Scan (or simply scan) is an operator that computes all the partial sums of a vector. A scan operation results in a vector where each element is the sum of the preceding elements in the original vector up to the corresponding position. Scan is a key operation in many relevant problems like sorting, lexical analysis, string comparison, image filtering among others. Although there are libraries...

chapter

Correlation clustering: A parallel approach?

Laszlo Aszalcos, Maria Bako

2017 Federated Conference on Computer Science and Information Systems (FedCSIS) > 403 - 406

2017 Federated Conference on Computer Science and Information Systems (FedCSIS)

Correlation clustering is a NP-hard problem, and for large graphs finding even just a good approximation of the optimal solution is a hard task. In previous articles we have suggested a contraction method and its divide and conquer variant. In this article we present several improvements of this method (preprocessing, quasi-parallelism, etc.) and prepare it for parallelism. Based on speed tests we...

chapter

Concept of parallel graph processing system for large-scale network science

Mikhail Chernoskutov

2017 International Multi-Conference on Engineering, Computer and Information Sciences (SIBIRCON) > 206 - 208

2017 International Multi-Conference on Engineering, Computer and Information Sciences (SIBIRCON)

The paper describes the concept of parallel graph processing system, which allows to develop architecture independent applications for processing large graphs. Also, the system allows the user to abstract from the implementation details of the graph storage format. This system can be used in such fast growing research fields as network science.

chapter

Efficient lowest density MDS array codes of column distance 4

Zhijie Huang, Hong Jiang, Nong Xiao

2017 IEEE International Symposium on Information Theory (ISIT) > 834 - 838

2017 IEEE International Symposium on Information Theory (ISIT)

The extremely strict code length constraint is the main drawback of lowest density, maximum-distance separable (MDS) array codes of distance greater than 3. To break away from the status quo, we proposed in [5] a family of lowest density MDS array codes of (column) distance 4, called XI-Code. Compared with the previous alternatives, XI-Code has lower encoding and decoding complexities, and much looser...

chapter

A performance, power, and energy efficiency analysis of load balancing techniques for GPUs

Federico Busato, Nicola Bombieri

2017 12th IEEE International Symposium on Industrial Embedded Systems (SIES) > 1 - 8

2017 12th IEEE International Symposium on Industrial Embedded Systems (SIES)

Load balancing is a key aspect to face when implementing any parallel application for Graphic Processing Units (GPUs). It is particularly crucial if one considers that it strongly impacts on performance, power and energy efficiency of the whole application. Many different partitioning techniques have been proposed in the past to deal with either very regular workloads (static techniques) or with irregular...

chapter

Parallel Construction of Suffix Trees and the All-Nearest-Smaller-Values Problem

Patrick Flick, Srinivas Aluru

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 12 - 21

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

A Suffix tree is a fundamental and versatile string data structure that is frequently used in important application areas such as text processing, information retrieval, and computational biology. Sequentially, the construction of suffix trees takes linear time, and optimal parallel algorithms exist only for the PRAM model. Recent works mostly target low core-count shared-memory implementations but...

chapter

A Compact-Trie-Based Structure for K-Nearest-Neighbour Searching

Peng Gong, Wendy Osborn

2017 IEEE 31st International Conference on Advanced Information Networking and Applications (AINA) > 578 - 585

2017 IEEE 31st International Conference on Advanced Information Networking and Applications (AINA)

This paper proposes a k-nearest neighbour search method inspired by grid space partitioning and the compact-trie structure. A compact trie structure, and a k-nearest neighbour search strategy are presented. Then, a k-nearest neighbour search performance comparison is carried out against two well-known methods, using one million two-dimensional spatial points and finding up to 1000 nearest neighbours...

chapter

Correlation Analysis among Java Nano-Patterns and Software Vulnerabilities

Kazi Zakia Sultana, Ajay Deo, Byron J. Williams

2017 IEEE 18th International Symposium on High Assurance Systems Engineering (HASE) > 69 - 76

2017 IEEE 18th International Symposium on High Assurance Systems Engineering (HASE)

Ensuring software security is essential for developing a reliable software. A software can suffer from security problems due to the weakness in code constructs during software development. Our goal is to relate software security with different code constructs so that developers can be aware very early of their coding weaknesses that might be related to a software vulnerability. In this study, we chose...

chapter

Dask & Numba: Simple libraries for optimizing scientific python code

James Crist

2016 IEEE International Conference on Big Data (Big Data) > 2342 - 2343

2016 IEEE International Conference on Big Data (Big Data)

Python is a high level language that is used by scientists for numeric computations. However, the performance of the language can be a hindrance when scaling to larger data sets, requiring some operations to be rewritten in a lower level language. To address this problem, we propose two libraries to allow numeric Python code to be optimized incrementally, requiring minimal changes. Here we describe...

chapter

DCA: A DRAM-cache-Aware DRAM Controller

Cheng-Chieh Huang, Vijay Nagarajan, Arpit Joshi

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis > 887 - 897

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis

3D-stacking technology has enabled the option of embedding a large DRAM cache onto the processor. Since the DRAM cache can be orders of magnitude larger than a conventional SRAM cache, the size of its cache tags can also be large. Recent works have proposed storing these tags in the stacked DRAM array itself. However, this increases the complexity of a DRAM cache request, which now translates into...

chapter

A practical CRCs-ADSCL decoding scheme for systematic polar codes

Hongjun Feng Jing, Lei Erbao Li

2016 8th International Conference on Wireless Communications & Signal Processing (WCSP) > 1 - 5

2016 8th International Conference on Wireless Communications & Signal Processing (WCSP)

The systematic polar codes under successive cancellation list (SCL) decoding suffers from very high time and space complexity when list size becomes larger. Aimed at getting the tradeoff between error performance and algorithm complexity, a practical CRCs-ADSCL(Adaptive SCL) decoding scheme is proposed for systematic polar codes, in which CRC values will be held by the bit-pair arrays in the decoding...

chapter

Propagator-based algorithm for localization of coherently distributed sources

Renzheng Cao, Xiaofei Zhang, Feifei Gao

2016 8th International Conference on Wireless Communications & Signal Processing (WCSP) > 1 - 5

2016 8th International Conference on Wireless Communications & Signal Processing (WCSP)

In this work, the direction finding of coherently distributed (CD) sources using a uniform linear array (ULA) is investigated. Conventional DSPE algorithm for localization of CD sources requires an extensive two-dimensional search of nominal DOAs and angular spreads. The proposed algorithm exploits the propagator, which is a linear operator that can be easily estimated from the received data, to identify...

chapter

Predicting buffer overflow using semi-supervised learning

Qingkun Meng, Wen Shameng, Feng Chao, Tang Chaojing

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) > 1959 - 1963

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)

As everyone knows vulnerability detection is a very difficult and time consuming work, so taking advantage of the unlabeled data sufficiently is needed and helpful. According the above reality, in this paper a method is proposed to predict buffer overflow based on semi-supervised learning. We first employ Antlr to extract AST from C/C++ source files, then according to the 22 buffer overflow attributes...

chapter

SuperGlue: Standardizing Glue Components for HPC Workflows

Jay Lofstead, Alexis Champsaur, Jai Dayal, Matthew Wolf, more

2016 IEEE International Conference on Cluster Computing (CLUSTER) > 170 - 171

2016 IEEE International Conference on Cluster Computing (CLUSTER)

chapter

In-memory representations for mining big graphs

Shruti Goyal, P V Bindu, P Santhi Thilagam

2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN) > 163 - 168

2016 Second International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)

Graphs are ubiquitous and are the best data structure for representing linked data because of their flexibility, scalability, and power to deal with complexity. Storing big graphs in graph databases leads to difficult computation and increased time complexity. The best alternative is to use inmemory representations such as compact data structures. They compress the graph sufficiently such that it...

chapter

Contrastive analysis of bubble & merge sort proposing hybrid approach

Sehrish Munawar Cheema, Nadeem Sarwar, Fatima Yousaf

2016 Sixth International Conference on Innovative Computing Technology (INTECH) > 371 - 375

2016 Sixth International Conference on Innovative Computing Technology (INTECH)

A sorting algorithm is one that puts elements of a list in a certain order. It makes easy searching and locating the information. The most-used orders are numerical order and lexicographical order. An efficient sorting algorithm is that takes less time and space complexity. In this paper I make contrastive analysis of bubble sort and merge sort and tried to show why required some new approach to get...

chapter

A new approach to speed up combinatorial search strategies using stack and hash table

Bestoun S. Ahmed, Luca M. Gambardella, Kamal Z. Zamli

2016 SAI Computing Conference (SAI) > 1217 - 1222

2016 SAI Computing Conference (SAI)

Owing to the significance of combinatorial search strategies both for academia and industry, the introduction of new techniques is a fast growing research field these days. These strategies have really taken different forms ranging from simple to complex strategies in order to solve all forms of combinatorial problems. Nonetheless, despite the kind of problem these approaches solve, they are prone...

chapter

Connecting points to a set of line segments in Infrastructure Design Problems

Niels Neumann, Frank Phillipson

2016 21st European Conference on Networks and Optical Communications (NOC) > 147 - 151

2016 21st European Conference on Networks and Optical Communications (NOC)

Connecting points to the nearest point belonging to a set of lines is an interesting problem that arises in many practical problems, especially Infrastructure Design Problems. In this paper an algorithm is presented for this problem. This algorithm is based on enumeration. To test the performance we define and explain several heuristic approaches. The algorithms is then, in comparison to the other...

chapter

Design and implementation of 16 bit systolic multiplier using modular shifting algorithm

S. Jayarajkumar, K. Sivanandam

2016 Second International Conference on Science Technology Engineering and Management (ICONSTEM) > 532 - 537

2016 Second International Conference on Science Technology Engineering And Management (ICONSTEM)

The finite field multipliers consuming high-throughput rate and low-latency having grown excessive attention in recent cryptographic systems, and coding theory but such multipliers above Galois field GF(2^m) for National institute standard technology (NIST) pentanomials are not so plentiful. We introduce two pairs of low latency and high throughput bit-parallel and digit-serial systolic multipliers...

chapter

The 3-POCs Structure Based GPU Acceleration in Computational Spectral Imaging

Xuewen Geng, Yufei Guo, Lin Liu, Yi Niu, more

2016 15th International Symposium on Parallel and Distributed Computing (ISPDC) > 88 - 91

2016 15th International Symposium on Parallel and Distributed Computing (ISPDC)

In this paper, the computational spectral imaging system is reexamined, and the most time-consuming module, namely the two-step iterative shrinkage/thresholding (TwIST) reconstruction, is accelerated by GPU. The acceleration can be roughly divided into two level: 1) data parallelization and 2)operation parallelization. Data parallelization: by discovering that the observation data array in computational...

INFONA - science communication portal

Search results

Automatic Scan Parallelization in OpenMP

Correlation clustering: A parallel approach?

Concept of parallel graph processing system for large-scale network science

Efficient lowest density MDS array codes of column distance 4

A performance, power, and energy efficiency analysis of load balancing techniques for GPUs

Parallel Construction of Suffix Trees and the All-Nearest-Smaller-Values Problem

A Compact-Trie-Based Structure for K-Nearest-Neighbour Searching

Correlation Analysis among Java Nano-Patterns and Software Vulnerabilities

Dask & Numba: Simple libraries for optimizing scientific python code

DCA: A DRAM-cache-Aware DRAM Controller

A practical CRCs-ADSCL decoding scheme for systematic polar codes

Propagator-based algorithm for localization of coherently distributed sources

Predicting buffer overflow using semi-supervised learning

SuperGlue: Standardizing Glue Components for HPC Workflows

In-memory representations for mining big graphs

Contrastive analysis of bubble & merge sort proposing hybrid approach

A new approach to speed up combinatorial search strategies using stack and hash table

Connecting points to a set of line segments in Infrastructure Design Problems

Design and implementation of 16 bit systolic multiplier using modular shifting algorithm

The 3-POCs Structure Based GPU Acceleration in Computational Spectral Imaging

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options