Troubleshooting Installation capabilities #14882

Closed
5 tasks done
slemeur opened this issue Oct 14, 2019 · 4 comments
Labels
area/install Issues related to installation, including offline/air gap and initial setup kind/epic A long-lived, PM-driven feature request. Must include a checklist of items that must be completed.

Comments

@slemeur
Contributor

slemeur commented Oct 14, 2019

Goal

The goal of this epic is to make the installation of Eclipse Che easier to diagnose.

As an admin of Eclipse Che, I'd like to be able to get diagnostic of my current installation so I can troubleshoot it.

Target User: Admin of Eclipse Che

Problem:
When installing Eclipse Che, many things can go wrong:

  • Are resources sufficient?
  • Is access to the container registry working?
  • Are certificates properly configured?

When a deployment fails, it is also hard to retrieve logs showing what happened. The admin performing the deployment has to investigate with ‘oc’ or ‘kubectl’, and in some cases the logs are no longer accessible once the installation has failed.

Solutions

Deployment Pre Checks

Chectl should allow the admin to pre-check the target environment before deployment. It would check the conditions required for Che to be deployed properly there:

  • Resources
  • Access to registry
  • Various configuration options
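
A minimal sketch of how such pre-checks could be structured, written in Python purely for illustration (it is not chectl code). Every name here (`CheckResult`, `check_memory`, `check_registry`, `run_prechecks`, `failed`) is hypothetical, and the memory figures and the registry probe are supplied by the caller rather than read from a real cluster.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class CheckResult:
    name: str     # which condition was checked
    ok: bool      # True when the condition is satisfied
    detail: str   # human-readable explanation for the admin


def check_memory(allocatable_mib: int, required_mib: int) -> CheckResult:
    """Compare allocatable memory with a required amount (both are assumed inputs)."""
    ok = allocatable_mib >= required_mib
    detail = f"{allocatable_mib} MiB allocatable, {required_mib} MiB required"
    return CheckResult("resources", ok, detail)


def check_registry(probe: Callable[[], bool]) -> CheckResult:
    """Run a caller-supplied probe that reports whether the registry is reachable."""
    try:
        ok = probe()
        detail = "registry reachable" if ok else "registry not reachable"
    except OSError as err:
        ok, detail = False, f"registry probe failed: {err}"
    return CheckResult("registry access", ok, detail)


def run_prechecks(checks: List[Callable[[], CheckResult]]) -> List[CheckResult]:
    """Run every check, even after a failure, so the admin sees the full picture."""
    return [check() for check in checks]


def failed(results: List[CheckResult]) -> List[CheckResult]:
    """Return only the checks that did not pass."""
    return [result for result in results if not result.ok]
```

Running all checks instead of stopping at the first failure is a deliberate choice in this sketch: it lets the admin fix several problems before retrying the deployment.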

Installation logs retrieval

Chectl should allow the admin to get the logs of the installation. It would provide the logs in a convenient form, downloadable and shareable.
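
As a rough illustration of "downloadable and shareable", the sketch below (Python, not chectl code) builds the `kubectl` commands an admin would otherwise run by hand and packs collected output into a single archive. The function names, the file names inside the archive, and any namespace or pod names passed in are assumptions made for this example.

```python
import io
import tarfile
from typing import Dict, List


def log_commands(namespace: str, pods: List[str]) -> Dict[str, List[str]]:
    """Map an output file name to the kubectl command that would produce it."""
    commands = {
        "events.txt": ["kubectl", "get", "events", "--namespace", namespace],
    }
    for pod in pods:
        commands[f"{pod}.log"] = [
            "kubectl", "logs", pod,
            "--namespace", namespace,
            "--all-containers=true",
        ]
    return commands


def bundle(outputs: Dict[str, str], archive_path: str) -> None:
    """Write already-collected command output into one gzip-compressed tar file."""
    with tarfile.open(archive_path, "w:gz") as archive:
        for name, text in outputs.items():
            data = text.encode("utf-8")
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            archive.addfile(info, io.BytesIO(data))
```

The commands are returned as argument lists so that a caller could pass each one to `subprocess.run(..., capture_output=True, text=True)` and hand the captured text to `bundle`. Keeping command construction separate from execution also makes the sketch testable without a cluster.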

Sub Tasks

@slemeur slemeur added the kind/epic A long-lived, PM-driven feature request. Must include a checklist of items that must be completed. label Oct 14, 2019
@slemeur slemeur changed the title from "Troubleshooting Installation Diagnosis capabilities" to "Troubleshooting Installation capabilities" Oct 14, 2019
@che-bot che-bot added the status/need-triage An issue that needs to be prioritized by the curator responsible for the triage. See https://github. label Oct 14, 2019
@ibuziuk ibuziuk removed the status/need-triage An issue that needs to be prioritized by the curator responsible for the triage. See https://github. label Oct 14, 2019
@ibuziuk ibuziuk added this to the Backlog - Epics milestone Oct 14, 2019
@nickboldt
Contributor

I would recommend adding a preflight check to chectl/crwctl that verifies the deployment name is not too long for the internet.

Scenario where a too-long project name gets halfway into a deployment but can't open a workspace:
https://issues.jboss.org/browse/CRW-528 (using project = "codeready-workspaces" results in the server being exposed on route "codeready-codeready-workspaces.etc.etc.etc", which results in PVC problems and route failures).
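
A sketch of what such a preflight check might look like, in Python for illustration only. It rests on two assumptions that the comment above does not confirm: that the first label of the route host is formed as `<route name>-<project>`, and that the limit being hit is the 63-character maximum for a single DNS label (RFC 1035). The function names are hypothetical.

```python
from typing import Tuple

MAX_DNS_LABEL = 63  # RFC 1035 limit for one dot-separated DNS label


def route_host_label(route_name: str, project: str) -> str:
    """Assumed naming scheme: the first host label is '<route name>-<project>'."""
    return f"{route_name}-{project}"


def check_name_length(route_name: str, project: str) -> Tuple[bool, str]:
    """Return (ok, message) describing whether the host label fits in a DNS label."""
    label = route_host_label(route_name, project)
    overflow = len(label) - MAX_DNS_LABEL
    if overflow > 0:
        return False, (
            f"host label '{label}' is {overflow} characters over the "
            f"{MAX_DNS_LABEL}-character DNS label limit; use a shorter project name"
        )
    return True, f"host label '{label}' fits within {MAX_DNS_LABEL} characters"
```

The failure message names the offending label and suggests a fix, so the admin does not have to reconstruct the cause from route or PVC errors later in the deployment.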

@tolusha
Contributor

tolusha commented Dec 15, 2019

I personally see several areas where we can improve troubleshooting:

  • Gathering logs. They should be available once the installation has failed. The doc [1] helps to understand which types of logs are available. Besides that, it is necessary to collect information about the environment, the installation tool and its version, etc.

  • Logging.
    Messages should be understandable, contain proposals (if possible), and reference documentation (if needed).

  • Documentation.
    Several issues indicate that the doc is outdated (or absent) for some aspects of the installation process.

  • Pre-flight checks.
    This is a continuous improvement process. It will be helpful to browse the existing chectl issues [2] to find out what we need to check.

[1] https://www.eclipse.org/che/docs/che-7/retrieving-che-logs/
[2] https://github.com/eclipse/che/issues?utf8=%E2%9C%93&q=is%3Aopen+is%3Aissue+label%3Aarea%2Fchectl

@mmorhun @AndrienkoAleksandr

@AndrienkoAleksandr
Contributor

I personally see several areas where we can improve troubleshooting:

+1

@tolusha
Contributor

tolusha commented Feb 27, 2020

I consider this epic mostly complete.

@tolusha tolusha closed this as completed Feb 27, 2020