Reduce computation time massively in large het_map objects #1024

Bartdoekemeijer · 2024-11-13T08:51:46Z

Reduce computation time for large het_map objects

Currently, a new LinearNDInterpolant is prepared for each findex in a FLORIS timeseries evaluation with heterogeneous_map. The preparation of a LinearNDInterpolant required for the heterogeneous map is computationally intensive (especially when the het_map is defined for many coordinates) due to the Delaunay triangulation. However, this triangulation is identical between each findex, and therefore it makes sense to recycle this information rather than to recalculate it for each findex.

Related issue

I haven't made a separate issue for this. I figured I'd open a PR directly.

Impacted areas of the software

The flow_field.py module.

Additional supporting information

In my usage, it was taking about 45 seconds to load the heterogeneous map interpolants. This is really wasted time and was brought down to 0.4 seconds with this PR by recycling the object as conserving as much information as possible between the findices.

Test results, if applicable

Here's a test script to benchmark this functionality:

import numpy as np
import pandas as pd
from time import perf_counter as timerpc

from floris import (
    FlorisModel,
    TimeSeries,
    HeterogeneousMap
)


if __name__ == "__main__":
    # Create big grid of wind conditions and wind speeds for which we assume to have evaluated het_map
    wd_grid, ws_grid = np.meshgrid(
        np.arange(0.0, 360.0, 3.0),
        np.arange(0.5, 30.51, 1.0)
    )
    df = pd.DataFrame({"wd": wd_grid.flatten(), "ws": ws_grid.flatten()})
    print(f"We have {df.shape[0]} findices.")

    # Create a grid of sensors throughout the farm in x, y, and z
    xg, yg, zg = np.meshgrid(
        np.linspace(-3000.0, 3000.0, 11),
        np.linspace(-3000.0, 3000.0, 11),
        np.arange(0.0, 350.01, 25.0),
    )
    xg = xg.flatten()
    yg = yg.flatten()
    zg = zg.flatten()
    speedups = np.ones((df.shape[0], len(xg)))
    print(f"We have {len(xg)} number of coordinates with het_map information.")

    # Now create FLORIS and a timeseries object with het_map information
    fmodel = FlorisModel("inputs/gch.yaml")
    fmodel.set(wind_shear=0.0)  # Required when working with 3D het_map objects
    het_map = HeterogeneousMap(
        x=xg,
        y=yg,
        z=zg,
        speed_multipliers=speedups,
        wind_directions=wd_grid.flatten(),
        wind_speeds=ws_grid.flatten(),
    )

    print(f"Preparing a timeseries object for 360 findex conditions.")
    ts = TimeSeries(
        wind_directions=np.arange(0.0, 360.0, 1.0),
        wind_speeds=120.0 * np.ones(360),
        turbulence_intensities=0.06 * np.ones(360),
        heterogeneous_map=het_map,
    )
    t0 = timerpc()
    fmodel.set(wind_data=ts)
    print(f"Time spent in 'fmodel.set': {timerpc() - t0:.2f} s")

With the new PR, this takes 0.4 seconds on my system. With the old code, it takes 25 seconds. If you increase the number of findices, the old code scales the computation time linearly. In the new code, there is pretty no penalty for additional findices.

…g the recalculation of the delaunay triangulation for every findex

paulf81 · 2024-11-15T22:34:01Z

hi @Bartdoekemeijer , thank you for this! I made some small formatting changes and will take a deeper dive next week

paulf81 · 2024-12-12T22:42:09Z

floris/core/flow_field.py

-                self.interpolate_multiplier_xy(x, y, multiplier, fill_value=1.0)
-                for multiplier in speed_multipliers
-            ]
+            self.interpolate_multiplier_xy(x, y, multiplier, fill_value=1.0)


Wouldn't this need to be assigned to F as it is above @Bartdoekemeijer
?

The reason for this is that F.values is of shape (N, 1), rather than shape (N). If I don't maintain that shape, the code returns an error.

I think still if you don't assign to F in line 305 you can't reference it in line 308 (it doesn't exist). Similarly I think multiplier doesn't exist yet at line 305 so followed the example of the earlier case and used speed_multipliers[0]

paulf81 · 2025-02-04T20:16:08Z

@Bartdoekemeijer thanks again for this! I added a new test to check this was working as expected (the logic of the change is clear to me!). The test showed I think a need for a few small tweaks, if you could review my changes and let me know match your expectations

paulf81 · 2025-02-05T04:16:24Z

Also added timing tests of het to #1060 so hopefully we can clock the improvement there as well

paulf81 · 2025-02-05T04:17:40Z

Noting though there are some failing examples, probably this needs we still need another fix, and then also we shouldn't have all tests passing if examples are failing so we'll want a test that captures whatever failure mode is in the example

Bartdoekemeijer · 2025-02-05T09:51:27Z

Good catches! I haven't reran this, but the principle of this code is that you don't need to re-do the Delaunay triangulation for every interpolant. They are all identical, and you only need to swap out the interpolant values rather than the interpolant grid. Sorry if this wasn't clear. In principle this should be a tiny modification to the code.

paulf81 · 2025-02-07T21:49:35Z

ok @Bartdoekemeijer , I think I tracked down the issues. Since the failures were happening in examples and not in tests, I added some new tests that failed until I made the correction, as that feels like best practice. The issue came down to not working if lists were passed in for the multipliers (rather than arrays as is typical more recently). @misi9170 , would you mind to review now, I think it's about ready

misi9170 · 2025-02-11T14:45:25Z

@Bartdoekemeijer Thanks for this, and @paulf81 thank you for adding tests!

I've now approved. Since it wasn't initially clear to me what exactly was happening, I've now added a couple of comments to the code block to describe why we are replacing the values for each findex. I've also renamed variables to make it clearer what they are (in particular, the list of interpolants, previously named in_region, which sounded to me like a mask), as well as changing the loop to be explicitly over the findices.

I've also run all examples in examples/examples_heterogeneous/ and see no visual change to outputs.

Thanks, as always, for sharing the running script in the description @Bartdoekemeijer . That helps a lot.

Provided you are both happy with the changes I've made, I'll get this merged in.

paulf81 · 2025-02-11T15:23:44Z

Very happy, please merge, and then we can check https://nrel.github.io/floris/dev/bench/

floris/core/flow_field.py

Bartdoekemeijer added 3 commits November 13, 2024 09:26

Reduce computation time massively in large het_map objects by avoidin…

8122a09

…g the recalculation of the delaunay triangulation for every findex

Import copy library

588f1bd

Enforce appropriate shape for interpolant object

6054a64

Bartdoekemeijer mentioned this pull request Nov 13, 2024

Add Nearest-Neighbor interpolation to heterogeneous wind map #1025

Open

paulf81 requested review from paulf81 and misi9170 November 15, 2024 22:07

ruff formatting

f413c2a

Merge branch 'develop' into pr/Bartdoekemeijer/1024

4776c9a

paulf81 reviewed Dec 12, 2024

View reviewed changes

paulf81 added 3 commits February 4, 2025 13:14

bugfix

3762506

Add a test of applying a het map

b9c355d

Merge branch 'develop' into pr/Bartdoekemeijer/1024

8569786

paulf81 mentioned this pull request Feb 7, 2025

Add automatic benchmarking #1062

Merged

1 task

paulf81 added 6 commits February 7, 2025 14:14

Merge branch 'develop' into pr/Bartdoekemeijer/1024

906ec95

Add convert to array

83afb7b

Add convert to array

93b7eb7

Add to new tests

45a4dab

Merge branch 'develop' into pr/Bartdoekemeijer/1024

449f465

Clean up

11bffef

Add comments and rename variables for clarity.

7e38413

misi9170 approved these changes Feb 11, 2025

View reviewed changes

rafmudaf reviewed Feb 11, 2025

View reviewed changes

floris/core/flow_field.py Outdated Show resolved Hide resolved

rafmudaf reviewed Feb 11, 2025

View reviewed changes

floris/core/flow_field.py Outdated Show resolved Hide resolved

Change FlowField.het_map to be a numpy array rather than a list.

181f78a

misi9170 merged commit f3f42ed into NREL:develop Feb 11, 2025
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce computation time massively in large het_map objects #1024

Reduce computation time massively in large het_map objects #1024

Bartdoekemeijer commented Nov 13, 2024

paulf81 commented Nov 15, 2024

paulf81 Dec 12, 2024

Bartdoekemeijer Jan 6, 2025

paulf81 Feb 4, 2025

paulf81 commented Feb 4, 2025

paulf81 commented Feb 5, 2025

paulf81 commented Feb 5, 2025

Bartdoekemeijer commented Feb 5, 2025

paulf81 commented Feb 7, 2025

misi9170 commented Feb 11, 2025 •

edited

Loading

paulf81 commented Feb 11, 2025

Reduce computation time massively in large het_map objects #1024

Reduce computation time massively in large het_map objects #1024

Conversation

Bartdoekemeijer commented Nov 13, 2024

Reduce computation time for large het_map objects

Related issue

Impacted areas of the software

Additional supporting information

Test results, if applicable

paulf81 commented Nov 15, 2024

paulf81 Dec 12, 2024

Choose a reason for hiding this comment

Bartdoekemeijer Jan 6, 2025

Choose a reason for hiding this comment

paulf81 Feb 4, 2025

Choose a reason for hiding this comment

paulf81 commented Feb 4, 2025

paulf81 commented Feb 5, 2025

paulf81 commented Feb 5, 2025

Bartdoekemeijer commented Feb 5, 2025

paulf81 commented Feb 7, 2025

misi9170 commented Feb 11, 2025 • edited Loading

paulf81 commented Feb 11, 2025

misi9170 commented Feb 11, 2025 •

edited

Loading