|Name:||Implementation & Optimization of Three-Dimensional UPML-FDTD Algorithm on GPU Clusters|
|Time:||Wednesday, June 25, 2014
10:00 am - 10:15 am
CCL - Congress Center Leipzig
|Breaks:||07:30 am - 10:00 am Welcome Coffee|
|Speaker:||Lei Xu, Shanghai Supercomputer Center|
|Abstract:||Co-processors with powerful foating-point operation capability have been used to study the electromagnetic simulations using the Finite Difference Time Domain (FDTD) method. This work focuses on the implementation and optimization of 3D UPML-FDTD parallel algorithm on GPU clusters. A set of techniques are utilized to optimize the FDTD algorithm, such as the application of GPU texture memory, asynchronization of data transfer between CPU and GPU. The performance of the parallel FDTD algorithm is tested on K20m GPU clusters. The alability of the algorithm is tested for up to 80 NVIDIA Tesla K20m GPUs with the parallel efficiency up to 95%, and the optimization techniques explored in this study are found to improve the performance.
Lei Xu & Ying Xu, Shanghai Supercomputer Center