Add Observable edges #3608

jngrad · 2020-03-30T21:06:59Z

Partially fixes #3599

Description of changes:

convert CylindricalProfile to CylindricalProfileObservable to make the observables framework homogeneous
for all four *ProfileObservable classes, add a method edges() to calculate the coordinates of the bins, producing the same output as numpy.histogramdd()
fix broken LB sample
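
The equivalence with numpy.histogramdd() mentioned in the description can be sketched in plain NumPy. This is only an illustration under stated assumptions, not the ESPResSo implementation: the helper name `profile_edges`, the bin counts and the limits are all hypothetical.

```python
import numpy as np


def profile_edges(n_bins, limits):
    """Return one array of bin edges per dimension (hypothetical helper).

    n_bins: number of bins along each dimension
    limits: (min, max) pair for each dimension
    """
    return [np.linspace(lo, hi, n + 1) for n, (lo, hi) in zip(n_bins, limits)]


# Hypothetical histogram layout: 4 x 5 x 6 bins in a rectangular region.
n_bins = (4, 5, 6)
limits = ((0.0, 2.0), (0.0, 5.0), (-3.0, 3.0))

# Random positions inside the region, used only to feed histogramdd().
rng = np.random.default_rng(42)
positions = rng.uniform([0.0, 0.0, -3.0], [2.0, 5.0, 3.0], size=(100, 3))

# numpy.histogramdd() returns the histogram and a list of edge arrays.
hist, np_edges = np.histogramdd(positions, bins=n_bins, range=limits)

edges = profile_edges(n_bins, limits)
for ours, theirs in zip(edges, np_edges):
    # Each dimension has n + 1 edges for n bins.
    np.testing.assert_allclose(ours, theirs)
```

Each dimension yields one more edge than it has bins, which is why the edge arrays generally have different lengths.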

Make CylinderProfile a subclass of Observable, cleanup include statements, add a Base member in the script interface classes.

src/core/observables/CylindricalProfileObservable.hpp

fweik · 2020-03-30T21:32:06Z

src/core/observables/ProfileObservable.hpp

  std::vector<size_t> shape() const override {
    return {n_x_bins, n_y_bins, n_z_bins};
  }
+
+  /** Calculate the bin edges for each dimension */
+  std::vector<std::vector<double>> edges() {


This whole class seems to be identical to CylindricalProfileObservable except for the variable names, maybe this repetition can be avoided?

I spent 10 days refactoring these observables classes (#3599), but the AutoParameters framework keeps blocking me no matter what I try. I gave up using class templates, otherwise I won't get anything done.

But this is the core code, no? Also I don't think even a template is needed, they can just be the same class, the only difference is the name of the data members for the size and the limits?

class ProfileObservable : virtual public Observable {
protected: // accessible to the derived classes below
  std::array<std::pair<double, double>, 3> m_limits;
  std::array<size_t, 3> m_bins;

public:
  ProfileObservable = ...

  std::vector<size_t> shape() const override {
    return {m_bins.begin(), m_bins.end()};
  }

  /** Calculate the bin edges for each dimension */
  std::array<std::vector<double>, 3> edges() {
    std::array<std::vector<double>, 3> ret;
    for (auto i : {0, 1, 2}) {
      boost::copy(Utils::make_lin_space(m_limits[i].first, m_limits[i].second,
                                        m_bins[i]),
                  std::back_inserter(ret[i]));
    }
    return ret;
  }
};

class CartesianProfileObservable : public ProfileObservable {
public:
  size_t n_bins_x() const { return m_bins[0]; }
  ...
};

class CylindricalProfileObservable : public ProfileObservable {
public:
  size_t n_bins_phi() const { return m_bins[0]; }
  ...
};

I'm not even sure if the derived classes are really needed. If not
you can use the same interface. Otherwise you don't have to use
the fact that they have a common base in the script interface, so
it should not matter there. Disclaimer: Just for illustration, but I
think something like this should work?

I had another solution in mind, with a template class that takes a coordinate system struct as template parameter, and where the n_bins_*... are aliases (e.g. size_t &n_bins_x = n_bins_0) to avoid writing getters. I would prefer to defer this change to another PR to keep this one simple, and because such a change would have to be carried out on the LBProfileObservable classes as well, which will then collide with #3607.

If you don't want to change this now, you could also put the implementation of edges() into a free function and reuse it without messing with the class hierarchy.

If you add reference data members, the class is no longer default constructible, because references cannot be left unbound. Just use functions, your construction looks fishy anyway.

I've converted them to getters in jngrad/espresso:observable-edges-5 but I still get the same compiler errors, plus new ones that are so long they don't even fit on my fullscreen terminal window. Probably caused by a typo, but I don't have the patience to dissect these 18136-character long compiler warnings.

This is interesting. From what I can tell the problem is already with the core classes, e.g. Observables::DensityProfile can not be constructed. This seems to be related to virtual
inheritance, or the lack thereof to be more precise. The following very simple unit test
shows the issue:

#define BOOST_TEST_MODULE Observables::DensityProfile
#define BOOST_TEST_DYN_LINK
#include <boost/test/unit_test.hpp>

#include "observables/DensityProfile.hpp"

BOOST_AUTO_TEST_CASE(ctor) {
  Observables::DensityProfile densityProfile({}, {}, {}, {}, {}, {}, {}, {}, {}, {});
}

Leading to

[...] /ssd/fweik/espresso/src/core/observables/DensityProfile.hpp:33:31: error: no matching function for call to ‘Observables::ProfileObservableBase::ProfileObservableBase()’

which makes me suspect that there is a second base ProfileObservableBase, in addition to the one whose ctor is explicitly called from ProfileObservableBase, and that the compiler tries to default
construct it, which fails because there is no default ctor.

I don't understand how this happens, nor why there is virtual inheritance in this hierarchy in the first place?!

At least for DensityProfile, the issue can be fixed by not deriving ProfileBase from Observable, and implementing shape in the derived classes (most of them by either just calling ProfileBase::shape, or adding something to the result).

the issue can be fixed by not deriving ProfileBase from Observable

This is also what I ended up doing in c3c9afc.

src/script_interface/observables/CylindricalLBProfileObservable.hpp

Co-authored-by: Florian Weik <[email protected]>

Optional parameter `density=True` is not allowed in np.histogramdd() in numpy 1.11.

codecov · 2020-03-30T22:43:35Z

Codecov Report

Merging #3608 into python will increase coverage by 0%.
The diff coverage is 85%.

@@          Coverage Diff           @@
##           python   #3608   +/-   ##
======================================
  Coverage      88%     88%           
======================================
  Files         521     521           
  Lines       22430   22463   +33     
======================================
+ Hits        19836   19867   +31     
- Misses       2594    2596    +2

Impacted Files	Coverage Δ
...ace/observables/CylindricalLBProfileObservable.hpp	`9% <0%> (ø)`
.../core/observables/CylindricalProfileObservable.hpp	`87% <84%> (ø)`
...ore/observables/CylindricalLBProfileObservable.hpp	`100% <100%> (ø)`
...re/observables/CylindricalPidProfileObservable.hpp	`100% <100%> (ø)`
src/core/observables/ProfileObservable.hpp	`100% <100%> (ø)`
...ce/observables/CylindricalPidProfileObservable.hpp	`21% <100%> (+9%)`	⬆️
...ript_interface/observables/LBProfileObservable.hpp	`17% <100%> (+2%)`	⬆️
...ipt_interface/observables/PidProfileObservable.hpp	`22% <100%> (+9%)`	⬆️
src/core/polymer.cpp	`92% <0%> (-6%)`	⬇️
src/core/electrostatics_magnetostatics/p3m.cpp	`85% <0%> (-1%)`	⬇️
... and 2 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 06e8d3f...3a6ccc9. Read the comment docs.

fweik · 2020-04-01T13:34:11Z

Maybe we should start by first doing the core changes, with corresponding tests, and then doing the interface. I understand that this is a frustrating experience for you, but going forward with the messed-up class hierarchy in the core would probably come back to haunt us. Maybe we can also recruit @KaiSzuttor to help with this, as he is responsible for the original code, iirc.

KaiSzuttor · 2020-04-02T12:05:16Z

I'm not so sure if I'm able to invest too much time here. I can have a look though.

jngrad · 2020-04-02T12:10:28Z

@KaiSzuttor We can wait for this PR to be merged first. The WIP in question is in c3c9afc, where the Cartesian/Cylindrical/Spherical Profile classes don't depend on Observable anymore.

fweik · 2020-04-14T09:31:48Z

@jngrad I'm not clear on what your plan is here. First merge this and then fix the core stuff separately?

jngrad · 2020-04-14T09:38:19Z

I've already done some refactoring here, but it's minimal. I would like to keep this PR self-contained and not delay it any further. I'll do the ObservableProfile refactor in a subsequent PR, most likely in a WIP since I'm not totally sure which level of abstraction to use in the class design.

fweik · 2020-04-14T09:40:30Z

Alright, then I think we can just move forward with this. Discussing the design for the rest in a subsequent PR is a good idea.

fweik · 2020-04-14T09:41:40Z

@RudolfWeeber I've already reviewed this.

RudolfWeeber · 2020-04-14T09:56:37Z

Please wait. I'd like to take a look.

jngrad · 2020-04-14T10:04:24Z

I vaguely remember we briefly discussed the output of the edges() function, e.g. whether to return the N+1 values between the bins or the N bin centers. Should I add a parameter to the method to decide which tick marks to return?

fweik · 2020-04-14T10:08:47Z

@jngrad the centers can be easily calculated from the edges, so I wouldn't bother

KaiSzuttor · 2020-04-14T10:09:33Z

I think we agreed that the default behavior should be compatible with numpy, which it is now

RudolfWeeber · 2020-04-14T10:17:45Z

I didn't look in detail yet, but a few first impressions:

From reading the code, it is not immediately apparent to me if I can do

value = obs.result[i_x,i_y,i_z]
lower_edge = obs.edges()[i_x,i_y,i_z]

Independently of what Numpy does, this is what should work, IMO. Does it?

Is there documentation on this somewhere?

IMO, rather than edges(), it should be bin_edges() for clarity's sake. I'd also find the bin_centers() method useful, but it can also be done in a subsequent PR.

Do we do something about the LB profile observables? I assume the behavior of the bin edge calculation should be independent of the sampling_offset (as it is now).

jngrad · 2020-04-14T11:19:17Z

@RudolfWeeber are you sure obs.edges()[i_x,i_y,i_z] is what we should be able to use? This requires returning a 4D matrix with the dimensionality of the actual histogram for the first 3 dimensions and a 3-tuple for the fourth dimension. I think this is quite wasteful. At the moment, edges() returns a 2D array. To get the lower corner of a bin, you would do the following:

obs_edges = np.array(density_profile.edges())
obs_edges[(0,1,2),(i_x, i_y, i_z)]

To plot it, you would either discard the last value to get the same dimensions of the histogram, or calculate the bin centers with a one-liner. To get the bin edges as a 4D matrix, there is probably a numpy function that can do the work for us.

If this is ok with you, I'll rename the method as bin_edges(), create bin_centers(), return them as numpy arrays directly, and add documentation.
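
Both access patterns discussed in this comment can be sketched with NumPy. The per-axis edge arrays below are hypothetical stand-ins for what edges() returns. Using `np.meshgrid` with `indexing="ij"` is one possible way to expand them into the 4D layout, at the cost of extra memory; it is not necessarily what the final implementation does.

```python
import numpy as np

# Hypothetical per-axis bin edges for a 2 x 3 x 4 histogram.
edges = [
    np.linspace(0.0, 1.0, 3),
    np.linspace(0.0, 3.0, 4),
    np.linspace(0.0, 2.0, 5),
]

# Drop the last edge of every axis so the result matches the histogram shape.
lower = [e[:-1] for e in edges]

# Expand to a 4D array: lower_edges[i_x, i_y, i_z] is the (x, y, z) lower corner.
lower_edges = np.stack(np.meshgrid(*lower, indexing="ij"), axis=-1)

# Bin centers follow from the edges with a one-liner per axis.
centers = [(e[1:] + e[:-1]) / 2 for e in edges]
```

With this layout, `lower_edges[i_x, i_y, i_z]` has the indexing Rudolf asked for, while the per-axis lists stay available for labelling plot axes.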

jngrad · 2020-04-14T11:38:11Z

OK, I just realized that we cannot do the indexing in obs_edges[(0,1,2),(i_x, i_y, i_z)] for a numpy array where the row sizes are different, which is the most common case: rectangular boxes and cylindrical histograms almost always have different sizes per dimension. The only valid syntax to get the lower corner is then:

obs_edges = density_profile.edges()
[obs_edges[i][j] for i,j in zip((0,1,2),(i_x, i_y, i_z))]
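
A short sketch of that per-axis lookup, using made-up edge arrays of different lengths (the values and index choices are hypothetical):

```python
import numpy as np

# Hypothetical edges for a histogram with 2, 3 and 4 bins per dimension,
# so the three arrays have 3, 4 and 5 entries respectively.
obs_edges = [
    np.linspace(0.0, 1.0, 3),
    np.linspace(0.0, 3.0, 4),
    np.linspace(0.0, 2.0, 5),
]
i_x, i_y, i_z = 1, 0, 2

# Per-axis lookup works regardless of how many bins each dimension has.
lower_corner = [obs_edges[i][j] for i, j in zip((0, 1, 2), (i_x, i_y, i_z))]

# An equivalent spelling that iterates over the axes directly.
lower_corner_alt = [axis[j] for axis, j in zip(obs_edges, (i_x, i_y, i_z))]
```

Both spellings avoid building a rectangular 2D array, so they do not depend on the dimensions having equal bin counts.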

KaiSzuttor · 2020-04-14T11:42:07Z

Maybe we should write down a use-case sample...

jngrad · 2020-04-14T11:59:05Z

My understanding was that bin edges were a needed feature to facilitate plotting of histogram slices. @RudolfWeeber do you have another application in mind?

RudolfWeeber · 2020-04-14T12:58:20Z

I’d say `bin_edges[i_x,i_y,i_z]` is the clearest by a wide margin. The storage is only used if the method is called. The size is comparable to that of the observable data itself. For me, usability and clarity would come first here.

fweik · 2020-04-14T13:05:13Z

@jngrad you don't have to return the matrix, just a python object that behaves like it.

jngrad · 2020-04-14T13:32:02Z

@RudolfWeeber do you have a MWE that could help me understand how you intend to use this 4D object? I thought you would pass the bin centers to matplotlib.pyplot.axis to label your axes, but what you have in mind is rather different.

RudolfWeeber · 2020-04-14T13:51:18Z

> @RudolfWeeber do you have a MWE that could help me understand how you intend to use this 4D object? I thought you would pass the bin centers to matplotlib.pyplot.axis to label your axes, but what you have in mind is rather different.

I wasn’t thinking about a particular matplotlib command, but just about how I would expect the information to be presented on its own. But when it comes to plotting a 1d cut:

```
v_x_of_z = lb_vel_profile.calculate()[0,0,:,0]
z = lb_vel_profile.bin_centers()[0,0,:,2]
plt.plot(z,v_x_of_z)
```

But maybe the syntax for imshow() and other map-like plots would benefit more from the other variant. Whatever we choose in the end, we probably should provide sample fragments for those two cases in the documentation.

Reproduce the core ProfileObservable class in the Python interface. Calculate the bin edges and centers in Python.

Discuss ongoing efforts to convert functionality from the Analysis module to the Observable framework. Rewrite section explaining how to instantiate and use observables. Document the new bin_edges() and bin_centers() methods and provide a matplotlib script to illustrate how they differ.

jngrad · 2020-04-15T17:10:44Z

@RudolfWeeber Profile observables now provide methods bin_centers() and bin_edges() for convenience. Turns out, most matplotlib functions require bin centers, but some require bin edges, in particular pcolormesh() will silently trim the last coordinate of the dataset if you give it bin centers! The Observables documentation now has a matplotlib script to showcase these 2 methods.
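
The difference between the two methods comes down to array lengths, which a sketch with synthetic data makes visible. The 2D slice and the axis limits below are made up, and the plotting calls are left as comments so that the snippet runs without matplotlib.

```python
import numpy as np

# Synthetic 2D slice of a profile with 4 bins along x and 3 bins along y.
data = np.arange(12.0).reshape(4, 3)

x_edges = np.linspace(0.0, 2.0, data.shape[0] + 1)  # 5 edges for 4 bins
y_edges = np.linspace(0.0, 3.0, data.shape[1] + 1)  # 4 edges for 3 bins
x_centers = (x_edges[1:] + x_edges[:-1]) / 2        # 4 centers
y_centers = (y_edges[1:] + y_edges[:-1]) / 2        # 3 centers

# Line plots of a 1D cut pair each value with a bin center:
#     plt.plot(x_centers, data[:, 0])
# Cell-based plots such as pcolormesh() describe each value by the corners
# of its cell, so the coordinate arrays are one element longer than the
# data along each axis:
#     plt.pcolormesh(x_edges, y_edges, data.T)
```

Passing arrays of the wrong length to a cell-based plot is the situation in which the last coordinate gets dropped, as described in the comment above.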

jngrad added 4 commits March 30, 2020 21:50

Refactor Profile observables

72daefa

Make CylinderProfile a subclass of Observable, cleanup include statements, add a Base member in the script interface classes.

Calculate histogram edges in the core

1b0653c

Simplify code and add wrapper for histogramdd

1109b1d

Fix broken sample

9ec9025

jngrad added Core ScriptInterface labels Mar 30, 2020

jngrad added this to the Espresso 4.2 milestone Mar 30, 2020

fweik reviewed Mar 30, 2020

src/core/observables/CylindricalProfileObservable.hpp

fweik reviewed Mar 30, 2020

src/script_interface/observables/CylindricalLBProfileObservable.hpp

jngrad and others added 2 commits March 31, 2020 00:16

Apply code review feedback

d96da5b

Co-authored-by: Florian Weik <[email protected]>

Fix broken python tests

a304ae2

Optional parameter `density=True` is not allowed in np.histogramdd() in numpy 1.11.

jngrad requested a review from RudolfWeeber March 30, 2020 22:45

fweik self-assigned this Mar 31, 2020

Merge branch 'python' into observable-edges-3

8a3ad6a

fweik previously approved these changes Apr 14, 2020

KaiSzuttor added the automerge Merge with kodiak label Apr 14, 2020

KaiSzuttor removed the request for review from RudolfWeeber April 14, 2020 09:44

RudolfWeeber removed the automerge Merge with kodiak label Apr 14, 2020

fweik closed this Apr 14, 2020

fweik reopened this Apr 14, 2020

jngrad added 2 commits April 15, 2020 18:54

ProfileObservable: add bin_edges(), bin_centers()

8ccfe34

Reproduce the core ProfileObservable class in the Python interface. Calculate the bin edges and centers in Python.

jngrad dismissed fweik’s stale review via 2463118 April 15, 2020 17:10

RudolfWeeber approved these changes Apr 17, 2020

fweik added the automerge Merge with kodiak label Apr 17, 2020

Merge branch 'python' into observable-edges-3

3a6ccc9

kodiakhq bot merged commit 18ffb6f into espressomd:python Apr 17, 2020

jngrad deleted the observable-edges-3 branch January 18, 2022 12:02
