[Feature Request]: Enhance Error Handling in FileSystem Imports to Improve Troubleshooting #31218
Closed
What would you like to happen?
I would like to make the imports of the FileSystem implementations, which happen in the top-level code of apache_beam.io.filesystems, easier to troubleshoot.
https://github.com/apache/beam/blob/v2.56.0/sdks/python/apache_beam/io/filesystems.py#L36-L59
AS-IS:
PROPOSAL:
For context, I encountered a problem when launching a Beam job on CentOS 7 with apache-beam[gcp]==2.55.0 installed. The error occurs at the time of job initiation and is not an issue that occurs during job execution.
The error itself occurs on this line and is caused by the failure to load GCSFileSystem at module initialization. This, in turn, is because GCSFileSystem relies on the requests package which, from version 2 onwards, requires OpenSSL 1.1.1 due to OS dependencies. CentOS 7 ships with OpenSSL 1.0.2, so the behavior changed with Beam 2.55.0 and later. (This is not essential, so I have not investigated in detail.)
I was able to resolve this quickly because I happened to know about these circumstances, but looking ahead, it seems better to handle ImportError not by silently suppressing it, but by logging a warning.
I can send a Pull Request. However, since it touches a core area, I've raised an Issue first.
Issue Priority
Priority: 2 (default / most feature requests should be filed as P2)
Issue Components