Describe the issue:
`_cull_dependencies` in `HashJoinP2PLayer` creates a defaultdict that becomes huge as the number of partitions grows. We have 10476 parts out and 10476 partitions, which creates relatively big sets. The memory footprint of deps is over 10 GB.
Minimal Complete Verifiable Example:
sorry for the non-ideal reproducer
Anything else we need to know?:
Can we reduce the memory footprint here somehow?
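For a sense of scale, here is a rough sketch of the growth pattern. It is not the dask implementation: `build_all_to_all_deps`, `build_shared_deps`, and the key names `"join"` and `"input"` are hypothetical, and it assumes that every output partition key maps to its own set holding one key per input partition, which is how I read the behaviour described above. Under that assumption the number of stored entries grows with the product of parts out and partitions.

```python
from collections import defaultdict


def build_all_to_all_deps(n_parts_out, n_partitions, out_name="join", in_name="input"):
    """Hypothetical stand-in: every output key gets its own set of all input keys."""
    deps = defaultdict(set)
    for part in range(n_parts_out):
        # A fresh set (and fresh tuples) is built for every output partition.
        deps[(out_name, part)] |= {(in_name, i) for i in range(n_partitions)}
    return deps


def build_shared_deps(n_parts_out, n_partitions, out_name="join", in_name="input"):
    """One possible direction (an assumption, not a tested fix):
    build the input key set once and let every output key point at it."""
    shared = frozenset((in_name, i) for i in range(n_partitions))
    return {(out_name, part): shared for part in range(n_parts_out)}


def count_entries(deps):
    """Total number of set members across all output keys."""
    return sum(len(v) for v in deps.values())


# Small case that is cheap to build: 100 outputs x 100 inputs.
small = build_all_to_all_deps(100, 100)
entries_small = count_entries(small)

# Entry count at the size from this report, if the same pattern holds.
entries_reported = 10476 * 10476

# The shared variant has the same logical content but only one set object.
shared_small = build_shared_deps(100, 100)
distinct_sets = len({id(v) for v in shared_small.values()})

print(entries_small, entries_reported, distinct_sets)
```

Sharing one frozenset only works if nothing downstream mutates the per-key sets, so it would need checking against how the culling code consumes deps.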
cc @fjetter
Environment: