[1] |
Hofstee H P. Power Efficient Processor Architecture and the Cell Processor[C]∥Proc of HPCA, 2005:258262.
|
[2] |
Thakkar S, Huff T. The Internet Streaming SIMD Extensions[M]. Intel Technology Journal Q2, 1999,32(12):18.
|
[3] |
Advanced Micro Devices, AMD CPU Roadmap[EB/OL].[20101016].http://developer.amd.com/assets/Develop_Brighton_Justin_Boggs1.pdf.
|
[4] |
Fan Z, Qiu F, Kaufman A, et al. Gpu Cluster for High Performance Computing[C]∥Proc of the 2004 ACM/IEEE Conference on Supercomputing,2004:47.
|
[5] |
Top500[EB/OL].[20100317].http://www.top500.org/lists/2010/11.
|
[6] |
Kunzman D, Zheng G, Bohm E, et al. Charm++, Offload API, and the Cell Processor[C]∥Proc of the Workshop on Programing Models for Vbiquitous Paralelism,2006.
|
[7] |
Lastovetsky A. Adaptive Parallel Computing on Heterogeneous Networks with MPC[J]. Parallel Computing, 2002,28(10):13691407.
|
[8] |
Gaissaryan S, Lastovetsky A. An ANSI C Superset for Vector and Superscalar Computers and Its Retargetable Compiler[J].The Journal of C Language Translation, 1994,5(3):183198.
|
[9] |
Hou Qiming, Zhou Kun, Guo Baining. BSGP: BulkSynchronous GPU Programming[C]∥Proc of SIGGRAPH’08,2008.
|
[10] |
AMD Brook[EB/OL].[20080901].http://ati.amd.com/technology/streamcomputing/AMDBrookplus.
|
[11] |
Kirk D. NVIDIA Cuda Software and GPU Parallel Computing Architecture[C]∥Proc of the 6th International Symposium on Memory Management,2007:103104.
|
[12] |
Open Computing Language[EB/OL].[20090716].http://www.khronos.org/.
|
[13] |
Buck I, Foley T, Horn D, et al. Brook for GPUS: Stream Computing on Graphics Hardware[J]. ACM Transactions on Graphics, 2004,23(3):777786.
|
[14] |
Buck I. Brook Spec v0.2[R]. TechReport, Stanford University, 2003.
|
[15] |
Hou Qiming, Zhou Kun, Guo Baining. SPAP:A Programming Language for Heterogeneous ManyCore Systems[R]. Technical Report, 2010.
|
[16] |
Ayguade E. A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures[C]∥Proc of the 5th International Workshop on OpenMP: Evolving OpenMP in an Age of Extreme Parallelism,2009:154167.
|
[17] |
O’Brien K, O’Brien K, Sura Z, et al. 2008. Supporting OpenMP on Cell[J]. International Journal of Parallel Program,2008,36(3):289311.
|
[18] |
Bellens P, Perez J M, Badia R M,et al. CellSs: A Programming Model for the Cell BE Architecture[C]∥Proc of the 2006 ACM/IEEE Conference on Supercomputing,2006.
|
[19] |
Lee Seyong, Eigenmann R. OpenMPC: Extended OpenMP Programming and Tuning for GPUs[C]∥Proc of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis. IEEE Computer Society, 2010:111.
|
[20] |
Lee Seyong, Min SeungJai, Eigenmann R. OpenMP to GPGPU: A Compiler Framework for Automatic Translation and Optimization[C]∥Proc of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming,2009:101110.
|
[21] |
Wang P H, Collins J D, Chinya G N, et al. EXOCHI: Architecture and Programming Environment for a Heterogeneous MultiCore Multithreaded System[C]∥Proc of the 2007 ACM SIGPLAN Conference on Programming Language Design and Implementation, 2007:156166.
|
[22] |
Dolbeau R, Bihan S, Bodin F. HMPP: A Hybrid MultiCore Parallel Programming Environment[C]∥Proc of the Workshop on General Purpose Processing on Graphics Processing Units, 2007.
|
[23] |
The Portland Group. PGI Fortran and C Accelerator Compilers and Programming Model Technology Preview[R].2008.
|
[24] |
Limberg T, Ristau B, Fettweis G. A RealTime Programming Model for Heterogeneous MPSoCs[C]∥Proc of the 8th International Workshop on Embedded Computer Systems: Architectures, Modeling, and Simulation, 2008:7584.
|
[25] |
Saha B, Zhou Xiaocheng, Chen Hu, et al. Programming Model for a Heterogeneous x86 Platform[C]∥Proc of the 2009 ACM SIGPLAN Conference on Programming Language Design and Implementation, 2009:431440.
|
[26] |
Seiler L, Carmean D, Sprangle E, et al. Larrabee: A ManyCore x86 Architecture for Visual Computing[J]. ACM Transactions on Graphics, 2008,27(3):115.
|
[27] |
Linderman M D, Collins J D, Wang H, et al. Merge: A Programming Model for Heterogeneous MultiCore Systems[J]. SIGARCH Comput Archit News, 2008,42(2):287296.
|
[28] |
Dean J, Ghemawat S. MapReduce: Simplified Data Processing on Large Clusters[J]. Commun ACM, 2008,51(1):107113.
|
[29] |
Lastovetsky A, Reddy R. HeteroMPI: Towards a MessagePassing Library for Heterogeneous Networks of Computers[J]. J Parallel Distrib Comput, 2006,66(2):197220.
|
[30] |
Peisert S, Baden S. A Programming Model for Automated Decomposition on Heterogeneous Clusters of Multiprocessors[R]. Technical Report, UCSD CSE, 2001.
|
[31] |
Intel. Next Generation Processors, in Manufacturing, Chip design Expertise Driving Innovation and Integration, Historic Change to Computers[R]. 2009.
|