2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

The following is a very common question in numerous theoretical and application-related domains: given a graph G, does it satisfy some given property? For example, is G connected? Is its diameter smaller than a given threshold? Is its average degree larger than a certain threshold? Traditionally, algorithms to quickly answer such questions were developed for static and centralized graphs (i.e. G is...

chapter

Parallel Construction of Suffix Trees and the All-Nearest-Smaller-Values Problem

Patrick Flick, Srinivas Aluru

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 12 - 21

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

A Suffix tree is a fundamental and versatile string data structure that is frequently used in important application areas such as text processing, information retrieval, and computational biology. Sequentially, the construction of suffix trees takes linear time, and optimal parallel algorithms exist only for the PRAM model. Recent works mostly target low core-count shared-memory implementations but...

chapter

The Reverse Cuthill-McKee Algorithm in Distributed-Memory

Ariful Azad, Mathias Jacquelin, Aydin Buluc, Esmond G. Ng

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 22 - 31

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Ordering vertices of a graph is key to minimize fill-in and data structure size in sparse direct solvers, maximize locality in iterative solvers, and improve performance in graph algorithms. Except for naturally parallelizable ordering methods such as nested dissection, many important ordering methods have not been efficiently mapped to distributed-memory architectures. In this paper, we present the...

chapter

SlimSell: A Vectorizable Graph Representation for Breadth-First Search

Maciej Besta, Florian Marending, Edgar Solomonik, Torsten Hoefler

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 32 - 41

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Vectorization and GPUs will profoundly change graph processing. Traditional graph algorithms tuned for 32- or 64-bit based memory accesses will be inefficient on architectures with 512-bit wide (or larger) instruction units that are already present in the Intel Knights Landing (KNL) manycore CPU. Anticipating this shift, we propose SlimSell: a vectorizable graph representation to accelerate Breadth-First...

chapter

SWhybrid: A Hybrid-Parallel Framework for Large-Scale Protein Sequence Database Search

Haidong Lan, Weiguo Liu, Yongchao Liu, Bertil Schmidt

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 42 - 51

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Computer architectures continue to develop rapidly towards massively parallel and heterogeneous systems. Thus, easily extensible yet highly efficient parallelization approaches for a variety of platforms are urgently needed. In this paper, we present SWhybrid, a hybrid computing framework for large-scale biological sequence database search on heterogeneous computing environments with multi-core or...

chapter

PUNAS: A Parallel Ungapped-Alignment-Featured Seed Verification Algorithm for Next-Generation Sequencing Read Alignment

Yuandong Chan, Kai Xu, Haidong Lan, Weiguo Liu, more

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 52 - 61

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

The progress of next-generation sequencing has a major impact on medical and genomic research. This technology can now produce billions of short DNA fragments (reads) in a single run. One of the most demanding computational problems used by almost every sequencing pipeline is short-read alignment; i.e. determining where each fragment originated from in the original genome. Most current solutions are...

chapter

Eliminating Irregularities of Protein Sequence Search on Multicore Architectures

Jing Zhang, Sanchit Misra, Hao Wang, Wu-chun Feng

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 62 - 71

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Finding regions of local similarity between biological sequences is a fundamental task in computational biology. BLAST is the most widely-used tool for this purpose, but it suffers from irregularities due to its heuristic nature. To achieve fast search, recent approaches construct the index from the database instead of the input query. However, database indexing introduces more challenges in the design...

chapter

Communication Optimization on GPU: A Case Study of Sequence Alignment Algorithms

Jie Wang, Xinfeng Xie, Jason Cong

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 72 - 81

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Data movement is increasingly becoming the bottleneck of both performance and energy efficiency in modern computation. Until recently, it was the case that there is limited freedom for communication optimization on GPUs, as conventional GPUs only provide two types of methods for inter-thread communication: using shared memory or global memory. However, a new warp shuffle instruction has been introduced...

chapter

Elastic-Cache: GPU Cache Architecture for Efficient Fine- and Coarse-Grained Cache-Line Management

Bingchao Li, Jizhou Sun, Murali Annavaram, Nam Sung Kim

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 82 - 91

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

GPUs provide high-bandwidth/low-latency on-chip shared memory and L1 cache to efficiently service a large number of concurrent memory requests (to contiguous memory space). To support warp-wide accesses to L1 cache, GPU L1 cache lines are very wide. However, such L1 cache architecture cannot always be efficiently utilized when applications generate many memory requests with irregular access patterns...

chapter

Content-Aware Non-Volatile Cache Replacement

Qi Zeng, Jih-Kwon Peir

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 92 - 101

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Spin-Transfer Torque Magnetoresistive Random-Access Memory (STT-MRAM) is a promising memory technology, which has high density, fast read speed, low leakage power, and non-volatility, and is suitable for multi-core on-chip last-level caches. However, the high write energy and latency, as well as less-than-desirable write endurance of STT-MRAM remain challenges. This paper proposes a new encoded content-aware...

INFONA - science communication portal

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Cover Art

Title Page i

Title Page iii

Copyright Page

Table of Contents

Technical Program

Message from the Program Chair

Message from the General Chair

Conference Organization

Computational Challenges in Constructing the Tree of Life

Monitoring Properties of Large, Distributed, Dynamic Graphs

Parallel Construction of Suffix Trees and the All-Nearest-Smaller-Values Problem

The Reverse Cuthill-McKee Algorithm in Distributed-Memory

SlimSell: A Vectorizable Graph Representation for Breadth-First Search

SWhybrid: A Hybrid-Parallel Framework for Large-Scale Protein Sequence Database Search

PUNAS: A Parallel Ungapped-Alignment-Featured Seed Verification Algorithm for Next-Generation Sequencing Read Alignment

Eliminating Irregularities of Protein Sequence Search on Multicore Architectures

Communication Optimization on GPU: A Case Study of Sequence Alignment Algorithms

Elastic-Cache: GPU Cache Architecture for Efficient Fine- and Coarse-Grained Cache-Line Management

Content-Aware Non-Volatile Cache Replacement

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)