Benchmark IO #606

youldrouis · 2015-06-22T10:02:14Z

testing the pull request procedure with Alexandre (no code is included)

prudhomm · 2015-06-29T11:49:07Z

@youldrouis could you update the pull request so that we can see what you are doing and also test your changes ?

prudhomm · 2015-06-29T11:49:31Z

you just have to commit to your repository and push to github to do that

youldrouis · 2015-06-29T14:38:37Z

Yes, as fast as I produce the last tests and optimizations today

youldrouis · 2015-06-29T15:48:39Z

IO PERF OPTIMIZATION

ENSIGHTGOLD EXPORTER
The IO perf tests revealed serious problems on the ensightgold exporter execution times.
As an example, on 512 procs, 7e6 dof , a factor 15 to 30 between ensightgold export and single-file-per-proc ensight format

further analysis, through score-p instrumentation and profiling, allowed to identify the bottlenecks : The problem was the choice and use of the MPIIO writing procedure.

The MPIIO procedure used was the collective operation, with collective implicit pointer MPI_File_write_ordered(). In IO, it is recommended to use collective operations, but in our code, more than 60% of the calls, only the Master rank has something to write. This uselessly multiplied the accesses to the writing pointers.

I - A first step consisted in benchmarking the IOs, using different MPIIO options at file opening (collective buffering, data sieving, striping factor and striping unit). The results showed an improvement of 10 or 20% when collective buffering option is enabled. When reading is not necessary, the execution is also a little faster when using a write only mode. This was not enough to solve the problem.

II - A second step consisted in refactoring some contiguous writing calls where "only master rank had something to write". This confirmed the observations, making the code 10% faster. This kind of optimization is limited, and a lot of time was still wasted.

III - A third step consisted in reconsidering the choice of writing operation : individual operations clearly fit better the writing algorithm. Choice was then :
1 - individual operation, with individual explicit pointers MPI_File_write_at
2 - individual operation, with a shared implicit pointer MPI_File_write_shared
3 - a mix between individual and global pointers (not sure it works)

The solution 1 was implemented, with an explicit management of the offsets for each process. The resulting writing times are much better :

overall improvement of factor *10
as an example, writeGeoHeader() where only the master rank writes, is improved, on the use case (*), from 7 seconds (2.8 after step II) to 0.00 seconds
on the use case (*), the resulting exporter, compared to single-file-per-proc ensight format, has now an average factor of 2

The modifications were applied in :
feelfilters/exporterensightgold.hpp
feelfilters/exporterensightgold_impl.hpp
feelfilters/detail/fileindex.cpp

Some tests are still needed, especially on multi-timestep cases that I did not try.

youldrouis added 2 commits June 22, 2015 11:55

test pull request

d771fb3

test pullrequest

9ecd1b7

aancel changed the title ~~testing my first pull request~~ Benchmark IO Jun 22, 2015

youldrouis added 2 commits June 29, 2015 17:03

IO optimization (exporter ensightgold)

47b2f1f

resolved merge conflict after IP optimization

f666044

youldrouis added 5 commits July 1, 2015 10:50

IO optimization (exporter ensightgold)

39d775d

IO optimization (exporter ensightgold)

7e5da7a

IO optimization > bug fix

c3871d1

Merge branch 'develop' of https://github.com/feelpp/feelpp into develop

ea174c0

Merge branch 'develop' of https://github.com/feelpp/feelpp into develop

9cae584

prudhomm merged commit 9cae584 into feelpp:develop Jul 11, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmark IO #606

Benchmark IO #606

youldrouis commented Jun 22, 2015

prudhomm commented Jun 29, 2015

prudhomm commented Jun 29, 2015

youldrouis commented Jun 29, 2015

youldrouis commented Jun 29, 2015

Benchmark IO #606

Benchmark IO #606

Conversation

youldrouis commented Jun 22, 2015

prudhomm commented Jun 29, 2015

prudhomm commented Jun 29, 2015

youldrouis commented Jun 29, 2015

youldrouis commented Jun 29, 2015