Share Email Print
cover

Proceedings Paper

Using Intel Xeon Phi to accelerate the WRF TEMF planetary boundary layer scheme
Format Member Price Non-Member Price
PDF $14.40 $18.00

Paper Abstract

The Weather Research and Forecasting (WRF) model is designed for numerical weather prediction and atmospheric research. The WRF software infrastructure consists of several components such as dynamic solvers and physics schemes. Numerical models are used to resolve the large-scale flow. However, subgrid-scale parameterizations are for an estimation of small-scale properties (e.g., boundary layer turbulence and convection, clouds, radiation). Those have a significant influence on the resolved scale due to the complex nonlinear nature of the atmosphere. For the cloudy planetary boundary layer (PBL), it is fundamental to parameterize vertical turbulent fluxes and subgrid-scale condensation in a realistic manner. A parameterization based on the Total Energy – Mass Flux (TEMF) that unifies turbulence and moist convection components produces a better result that the other PBL schemes. For that reason, the TEMF scheme is chosen as the PBL scheme we optimized for Intel Many Integrated Core (MIC), which ushers in a new era of supercomputing speed, performance, and compatibility. It allows the developers to run code at trillions of calculations per second using the familiar programming model. In this paper, we present our optimization results for TEMF planetary boundary layer scheme. The optimizations that were performed were quite generic in nature. Those optimizations included vectorization of the code to utilize vector units inside each CPU. Furthermore, memory access was improved by scalarizing some of the intermediate arrays. The results show that the optimization improved MIC performance by 14.8x. Furthermore, the optimizations increased CPU performance by 2.6x compared to the original multi-threaded code on quad core Intel Xeon E5-2603 running at 1.8 GHz. Compared to the optimized code running on a single CPU socket the optimized MIC code is 6.2x faster.

Paper Details

Date Published: 28 May 2014
PDF: 10 pages
Proc. SPIE 9124, Satellite Data Compression, Communications, and Processing X, 91240T (28 May 2014); doi: 10.1117/12.2055040
Show Author Affiliations
Jarno Mielikainen, Univ. of Wisconsin-Madison (United States)
Bormin Huang, Univ. of Wisconsin-Madison (United States)
Allen Huang, Univ. of Wisconsin-Madison (United States)


Published in SPIE Proceedings Vol. 9124:
Satellite Data Compression, Communications, and Processing X
Bormin Huang; Chein-I Chang; José Fco. López, Editor(s)

© SPIE. Terms of Use
Back to Top