Hardware Benchmarking Topic Single Precision VS Double Precision

From Tuflow
Jump to navigation Jump to search

Page Under Construction

Introduction

Both TUFLOW Classic and TUFLOW HPC can run using either a single precision or double precision. Please refer to the manual for a description on the differences between single precision (SP) and double precision. (DP) versions of TUFLOW and also a discussion on which might be appropriate for a model.
This page discuss the relative difference in performance of the SP and DP versions of TUFLOW. This includes comparisons for TUFLOW Classic, TUFLOW HPC on CPU hardware and TUFLOW HPC on GPU hardware.
When running a double precision version of TUFLOW this will require significantly more memory available to run a simulation. The memory requirement of DP is almost twice that of SP.
Note Single precision calculations are also referred to as FP32 (32 bit floating point) and double precision as FP64 (64 bit floating point) calculations. This seems to be a more common terminology in GPU benchmarks.

TUFLOW Classic

The table below has some runtimes for the benchmark model at 20m cell size. The same model has been run for both the SP and DP versions of TUFLOW using the Classic solution scheme on CPU hardware. This same test has been performed on a number of CPU chips.

CPU SP Runtime (mins) DP Runtime (mins) % Change
Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz ??? ??? 120
AMD Ryzen Threadripper 2990WX 32-Core Processor ??? ??? 120

As can be seen from the table above the change in run time varies from ??? to ??? and the average is approximately ???

TUFLOW HPC on CPU hardware

The table below has some runtimes for the benchmark model at 20m cell size. The same model has been run for both the SP and DP versions of TUFLOW using the HPC solution scheme on CPU hardware. This same test has been performed on a number of CPU chips.

CPU SP Runtime (mins) DP Runtime (mins) % Change
Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz ??? ??? 120
AMD Ryzen Threadripper 2990WX 32-Core Processor ??? ??? 120

As can be seen from the table above the change in run time varies from ??? to ??? and the average is approximately ???

TUFLOW HPC on GPU hardwre

For GPU devices, the quoted performance of GPU devices can be very different for single and double precision calculations. The table below has some runtimes for the benchmark model at 20m cell size. The same model has been run for both the SP and DP versions of TUFLOW using the HPC solution scheme on GPU hardware. This same test has been performed on a number of different GPU cards.

GPU Card SP Runtime (mins) DP Runtime (mins) % Change
GTX 1080ti ??? ??? 120
Tesla ??? ??? ??? 120
Quadro ??? ??? ??? 120

As can be seen from the table above the change in run time varies from ??? to ??? and the average is approximately ???

Discussion

To be completed.