/LES

Large Eddy Simulator for Study of Urban Boundary-Layer Flows

Primary LanguageFORTRAN

Large Eddy Simulator for the Study of Urban Boundary-layer Flows

This is an OpenCL version of the Large Eddy Simulator developed by Hiromasa Nakayama and Haruyasu Nagai at the Japan Atomic Energy Agency and Prof. Tetsuya Takemi at the Disaster Prevention Research Institute of Kyoto University.

NOTE: This is not the original Fortran-77 source code used in the included papers, but a Fortran-95 version where the complete time iteration loop is implemented in OpenCL. The papers are included to explain the physics behind the simulator. NOTE: The included file F95Sources/GIS/Tokyo_20mgrid.txt does not contain the actual GIS data for Tokyo, but randomised values in the same range.

Caveats

  • At the moment the only way to configure the simulator is to edit the code.
  • Also, the OpenCL code is entirely fixed, in terms of numbers of threads and compile time options. You can only change it by hacking the OpenCL kernel code in OpenC/Kernels and the OpenCL interfacing module, module_LES_combined.f95 in F95Sources.

I will release the code generator for creating custom configurations soon, as part of a tool chain for facilitating conversion of Fortran code to OpenCL.

Prerequisites

To compile and run this code, you need to install the OclWrapper library from https://github.com/wimvanderbauwhede/OpenCLIntegration

You also need:

  • a Fortran-95 compiler (tested with gfortran 4.8 and 4.9, ifort 12.0.0 and pgf95 12.5-0)
  • a C++11-compliant C++ compiler for the OclWrapper (tested with g++ 4.8 and 4.9)
  • The scons build tool (I use v2.3) and therefore python v2.7

Compilation

Assuming you have installed the OclWrapper library correctly, compilation is simply:

$ cd F95Sources
$ scons

You can configure the device and platform, e.g. to explicitly select the GPU with id 1:

$ scons plat=NVIDIA dev=GPU gpu=1

or to select the Xeon Phi with id 0:

$ scons plat=MIC dev=ACC acc=0

To compile the Fortran-95 code without OpenCL for comparison:

$ cd F95Sources
$ scons ocl=0

To compile the Fortran-95 code without the OclWrapper library,

$ cd F95Sources
$ scons -f SConstruct.F95_only

Running

The OpenCL code:

$ ./les_main_ocl

The Fortran-only code:

$ ./les_main   

Performance

I've tested this on a GeForce GTX 480 and a 12-core Intel Xeon, the OpenCL code is 7x faster on the GPU and 2x faster on the multicore CPU compared to the fastest compilation I could achieve, which was using the Intel Fortran compiler using auto-vectorisation and auto-parallelisation.

Copyright and License

This work is (c) Wim Vanderbauwhede, School of Computing Science, University of Glasgow The source code may be used under the terms of either the

or the