-
Notifications
You must be signed in to change notification settings - Fork 830
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add node-specific tolerances to intersection consolidation #1160
Add node-specific tolerances to intersection consolidation #1160
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Given we are in the realm of networkx.MultiDiGraph here, a "column" does not really have a meaning. The conversion to gdf is internal only. Since we are fetching the value from a node attribute, I would suggest using something like tolerance_attribute
instead.
Do you have any idea how would you define local tolerance for each node? One thing is to enable it in the function, the other how to actually use it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since you are touching the function anyway, I would advise using dissolve
and explode
over unary_union and iteration over parts.
Like this (including my comments from above)
def _merge_nodes_geometric(G, tolerance, tolerance_column):
gdf_nodes = convert.graph_to_gdfs(G, edges=False)
if tolerance_column and tolerance_column in gdf_nodes.columns:
buffer_distances = gdf_nodes[tolerance_column].fillna(tolerance)
merged = gdf_nodes['geometry'].buffer(distance=buffer_distances).dissolve()
else:
merged = gdf_nodes['geometry'].buffer(distance=tolerance).dissolve()
return merged.explode(ignore_index=True)
Co-Authored-By: Martin Fleischmann <[email protected]>
- Remove unused numpy import (after refactoring it wasn't necessary anymore). - Add tolerance_attribute docstring to private functions
Thanks for the swift review! I implemented the suggested changes.
I think we can leave that up to users. An example is given in the PR description, but you can basically do any kind of custom mask, loop, function etc.
Sounds like a good idea, but I would like to keep this PR as small as possible (and atomic), so let's do that in another PR. |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## v2 #1160 +/- ##
=======================================
Coverage 98.30% 98.31%
=======================================
Files 24 24
Lines 2365 2370 +5
=======================================
+ Hits 2325 2330 +5
Misses 40 40 ☔ View full report in Codecov by Sentry. |
I'll try to take a look later this week. In the meantime, please base this PR off the |
Right. If that’s the case, could you set |
Yes, soon the |
I think I resolved al the conflicts when targeting the v2 branch correctly. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @EwoutH. If a simple user-defined per-node tolerance like this is useful to the community, then it certainly makes sense to add it. I've requested a change. Additionally, the tests are currently failing. Consider running the pre-commit hooks locally for ease.
This commit updates the consolidate_intersections function to accept the tolerance parameter as either a float or a dictionary mapping node IDs to floats. It removes the previously suggested tolerance_attribute.
f0b3817
to
933a7e3
Compare
Reimplemented I tested it on one of my own notebooks, and it works beautifully! # Create a dict mapping node ids to tolerances: 10 if part of the city network, 50 if part of the surrounding area network
tolerance_dict = {node: 10 if data["network"] == city_name else 50 for node, data in merged_network.nodes(data=True)}
# Consolidate intersections
merged_network = ox.consolidate_intersections(merged_network, tolerance=tolerance_dict, rebuild_graph=True) CC @anastassiavybornova, @martinfleis, @jdmcbr and @jGaboardi, please test and review if you have the chance :) |
Update `_merge_nodes_geometric` to manage absent tolerance values by reverting to original geometries instead of creating POLYGON EMPTY.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @EwoutH. I left a few final comments.
In addition, please 1) update the changelog, and 2) add simple tests to hit the changed lines to maintain coverage. I will then merge.
@martinfleis I missed this comment earlier, but your efficiency suggestion is well taken. Quick question for you: only GeoDataFrame has |
Don't cover the actual behaviour, just check if passing a dictionary (with and without all nodes covered) leads to a runtime error or not.
Thanks for the review. All comments should be addressed. Note that the tests don't cover the actual behaviour, just check if passing a dictionary (with and without all nodes covered) leads to a runtime error or not. While not ideal, this is in line with the other tests currently. |
CI is stuck on a type hinting thing, it doesn't like the int values in the tolerance dict, while is doesn't find it a problem when inputted directly just a few lines above. Maybe we should create a custom "number" type that Preferably in a separate PR though, I'm leaving for a week tonight. |
Correct, I didn't realise that. |
You had to type hint the dict for typeguard to know what data type comes out of |
@martinfleis yes, that is along the lines of what I was saying earlier in the issue: #1150 (comment) It's relatively straightforward to implement a per-node tolerance, but the real challenge is in actually defining it per node. They are only useful in tandem, and the latter is infinitely variable depending on the theoretical and empirical needs of the individual analysis, so I have been reluctant to include this functionality in the past. I believe defining the local tolerance will most likely remain outside of OSMnx's scope accordingly. @EwoutH identified a group of folks who want to implement per-node tolerance, and since his solution hands off the definition of tolerance values to the user, I think this PR satisfies my scope and implementation concerns. The rest is up to the users now. |
Thanks for merging!
Agreed! I think we landed on a quite elegant solution that still offers a lot of flexibility to users. |
The first pre-release OSMnx v2 beta has been released. Testers needed! See #1123 for details. |
This PR enhances the
consolidate_intersections
function in the OSMnx simplification module to support variable node consolidation tolerances. Instead of a single tolerance parameter, this update introduces thetolerance
parameter that can accept either a float or a dictionary. This allows users to specify precise tolerance levels per node directly within their graph, enabling finer control over intersection consolidation.Motivation
Currently
consolidate_intersections
applies a uniform tolerance across a network, which may not suit complex urban models where different areas require different levels of detail. For example, a dense city center often needs a lower tolerance due to more intricate street layouts, while suburban areas might warrant higher tolerances.See #1150.
Key Changes
tolerance
parameter now accepts a dictionary mapping node IDs to specific tolerance values, in addition to the existing option of a single float._merge_nodes_geometric
has been updated to dynamically adjust buffer distances based on node-specific tolerances provided in the dictionary, with a default fallback tolerance if specific values are not provided.Usage Example
In this example, nodes with higher street connectivity are assigned a smaller tolerance value (5), suitable for areas with denser road networks. Other nodes default to a higher tolerance of 10.
Closes #1150.
This branch can be installed with the following command:
@anastassiavybornova, @martinfleis, @jdmcbr and @jGaboardi please test and review if you have the chance :)