Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[request]: Enable the running of containers locally through Conveyor CLI #70

Open
din14970 opened this issue Oct 13, 2023 · 0 comments
Open
Labels
dev expierence Capabilities used by data product developer proposed Community submitted issue

Comments

@din14970
Copy link

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

Tell us about your request
Tangentially related to #69, for testing and debugging it would be nice if containers could be run locally by Conveyor with the right credentials associated with a role.

Tell us about the problem you're trying to solve. What are you trying to do, and why is it hard?
Testing and debugging of a pipeline is quite slow and frustrating. conveyor run is faster than conveyor build && conveyor deploy as one avoids UI interaction, but it still involves:

  • building the container image
  • pushing the container image
  • waiting for Airflow to schedule the task

Additionally, conveyor run requires that whatever you run must be associated with a task inside a DAG; it can not be any arbitrary command that could be run from the container.

For running short/simple tasks in our test environment & writing data in the test bucket, it would be perfectly OK to run and iterate on them locally. This only requires building and running the image and cuts out the push, DAG, and Airflow, which is much faster. However, it requires that the container gets the right credentials to access data.

Are you currently working around this issue?
We can request local credentials as detailed in #69 and inject them into the container with docker run -e. A conveyor run --local could make this much cleaner.

Additional context
Of course running locally will not work for all tasks (e.g. tasks which require access to on-prem systems behind a firewall), but it would be helpful for a large subset of tasks. In addition, running locally could be handy if it could be done from within conveyor IDEs.

Attachments

@din14970 din14970 added the proposed Community submitted issue label Oct 13, 2023
@pascal-knapen pascal-knapen added the dev expierence Capabilities used by data product developer label Oct 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dev expierence Capabilities used by data product developer proposed Community submitted issue
Projects
None yet
Development

No branches or pull requests

2 participants