ultigranularity partition and scheduling for stream
programs based on multiCPU and multiGPU
heterogeneous architectures

Computer Engineering & Science

Previous Articles Next Articles

ultigranularity partition and scheduling for stream

programs based on multiCPU and multiGPU

heterogeneous architectures

CHEN Wenbin,YANG Ruirui,YU Junqing

(School of Computer Science and Technology,Huazhong University of Science and Technology,Wuhan 430074,China)

Received:2016-09-05 Revised:2016-11-01 Online:2017-01-25 Published:2017-01-25

Abstract

Abstract:

Dataflow programming language simplifies the domain programming and offers an attractive way to express the parallelism of mission computing and data communication on task level and data level. For the problems such as too much data parallelism, task parallelism and pipeline parallelism in multiCPU and multiGPU architectures, we propose an efficient data flow compilation framework. The framework takes the synchronous data flow graph as the beginning input, and uses many partition methods to distribute the tasks to multiCPU and multiGPU. According to the parallelism of tasks and communication, the tasks classification method assigns the tasks to GPU or CPU. We propose a GPU task horizontal splitting method to divide the tasks distributed to GPU into many blocks, and one GPU executes one block. The GPU task horizontal splitting method avoids the communication between GPU and GPU. The CPU dispersed task balancing partition method chooses appropriate CPU cores and balances the tasks distributed to these CPU cores. The method satisfies load balancing and raises the utilization rate of CPU cores. We choose a multiCPU and multiGPU heterogeneous architecture as the experiment platform and the common algorithms in media processing applications as benchmarks. Our experiments verify the effectiveness of the proposed methods.

Key words: heterogeneous architecture, dataflow program, task partition, storage optimization

CHEN Wenbin,YANG Ruirui,YU Junqing.

ultigranularity partition and scheduling for stream

programs based on multiCPU and multiGPU

heterogeneous architectures

[J]. Computer Engineering & Science.

[1]	LI Ren-gang, REN Zhi-xin, HUANG Guang-kui, SUN Jie, WANG Feng, ZHANG Chuang, . Design and implementation of heterogeneous architecture for database query acceleration#br# #br# [J]. Computer Engineering & Science, 2020, 42(12): 2169-2178.
[2]	BAI Yan1,2，REN Qingchang2. Research on storage optimization strategy for central air conditioning monitoring system in intelligent buildings [J]. J4, 2014, 36(03): 558-565.

ultigranularity partition and scheduling for stream

programs based on multiCPU and multiGPU

heterogeneous architectures

PDF

Knowledge

Abstract

Cite this article

share this article

Related Articles 2

Recommended Articles

Metrics

Comments