This document discusses the requirements, current expected behavior, and how to try out what exists so far.
In addition, it covers the installation with the default CNI (OpenShiftSDN), as well as with the Kuryr SDN.
Kuryr is a CNI plug-in that uses Neutron and Octavia to provide networking for pods and services. It is primarily designed for OpenShift clusters that run on OpenStack virtual machines. Kuryr improves the network performance by plugging OpenShift pods into OpenStack SDN. In addition, it provides interconnectivity between OpenShift pods and OpenStack virtual instances.
Kuryr is recommended for OpenShift deployments on encapsulated OpenStack tenant networks in order to avoid double encapsulation, such as running an encapsulated OpenShift SDN over an OpenStack network.
Conversely, using Kuryr does not make sense in the following cases:
- You use provider networks or tenant VLANs.
- The deployment will use many services on a few hypervisors. Each OpenShift service creates an Octavia Amphora virtual machine in OpenStack that hosts a required load balancer.
- UDP services are needed.
In order to run the latest version of the installer in OpenStack, at a bare minimum you need the following quota to run a default cluster. While it is possible to run the cluster with fewer resources than this, it is not recommended. Certain cases, such as deploying without FIPs, or deploying with an external load balancer are documented below, and are not included in the scope of this recommendation. NOTE: The installer has been tested and developed on Red Hat OSP 13.
- Floating IPs: 2
- Security Groups: 3
- Security Group Rules: 60
- Routers: 1
- Subnets: 1
- RAM: 112 GB
- vCPUs: 28
- Volume Storage: 175 GB
- Instances: 7
When using Kuryr SDN, as the pods, services, namespaces, network policies, etc., are using resources from the OpenStack Quota, the minimum requirements are higher:
- Floating IPs: 3 (plus the expected number of services of LoadBalancer type)
- Security Groups: 100 (1 needed per network policy)
- Security Group Rules: 500
- Routers: 1
- Subnets: 100 (1 needed per namespace)
- Networks: 100 (1 needed per namespace)
- Ports: 1000
- RAM: 112 GB
- vCPUs: 28
- Volume Storage: 175 GB
- Instances: 7
The default deployment stands up 3 master nodes, which is the minimum amount required for a cluster. For each master node you stand up, you will need 1 instance, and 1 port available in your quota. They should be assigned a flavor with at least 16 GB RAM, 4 vCPUs, and 25 GB Disk. It is theoretically possible to run with a smaller flavor, but be aware that if it takes too long to stand up services, or certain essential services crash, the installer could time out, leading to a failed install.
The default deployment stands up 3 worker nodes. In our testing we determined that 2 was the minimum number of workers you could have to get a successful install, but we don't recommend running with that few. Worker nodes host the applications you run on OpenShift, so it is in your best interest to have more of them. See here for more information. The flavor assigned to the worker nodes should have at least 2 vCPUs, 8 GB RAM and 25 GB Disk. However, if you are experiencing Out Of Memory
issues, or your installs are timing out, you should increase the size of your flavor to match the masters: 4 vCPUs and 16 GB RAM.
The bootstrap node is a temporary node that is responsible for standing up the control plane on the masters. Only one bootstrap node will be stood up and it will be deprovisioned once the production control plane is ready. To do so, you need 1 instance, and 1 port. We recommend a flavor with a minimum of 16 GB RAM, 4 vCPUs, and 25 GB Disk.
Swift must be enabled. The user must have swiftoperator
permissions and temp-url
support must be enabled. As an OpenStack administrator:
openstack role add --user <user> --project <project> swiftoperator
openstack object store account set --property Temp-URL-Key=superkey
NOTE: Swift is required as the user-data provided by OpenStack is not big enough to store the ignition config files, so they are served by Swift instead.
- You may need to increase the security group related quotas from their default values. For example (as an OpenStack administrator):
openstack quota set --secgroups 8 --secgroup-rules 100 <project>`
You must have a clouds.yaml
file in order to run the installer. The installer will look for a clouds.yaml
file in the following locations in order:
- Value of
OS_CLIENT_CONFIG_FILE
environment variable - Current directory
- unix-specific user config directory (
~/.config/openstack/clouds.yaml
) - unix-specific site config directory (
/etc/openstack/clouds.yaml
)
In many OpenStack distributions, you can generate a clouds.yaml
file through Horizon. Otherwise, you can make a clouds.yaml
file yourself.
Information on this file can be found here and it looks like:
clouds:
shiftstack:
auth:
auth_url: http://10.10.14.42:5000/v3
project_name: shiftstack
username: shiftstack_user
password: XXX
user_domain_name: Default
project_domain_name: Default
dev-evn:
region_name: RegionOne
auth:
username: 'devuser'
password: XXX
project_name: 'devonly'
auth_url: 'https://10.10.14.22:5001/v2.0'
The file can contain information about several clouds. For instance, the example above describes two clouds: shiftstack
and dev-evn
.
In order to determine which cloud to use, the user can either specify it in the install-config.yaml
file under platform.openstack.cloud
or with OS_CLOUD
environment variable. If both are omitted, then the cloud name defaults to openstack
.
Get the latest RHCOS image here. The installer requires a proper RHCOS image in the OpenStack cluster or project:
openstack image create --container-format=bare --disk-format=qcow2 --file rhcos-${RHCOSVERSION}-openstack.qcow2 rhcos
NOTE: Depending on your OpenStack environment you can upload the RHCOS image as raw
or qcow2
. See Disk and container formats for images for more information. At the time of writing, the installer looks for an image named rhcos
.
The public network should be created by the OpenStack administrator. Verify the name/ID of the 'External' network:
openstack network list --long -c ID -c Name -c "Router Type"
+--------------------------------------+----------------+-------------+
| ID | Name | Router Type |
+--------------------------------------+----------------+-------------+
| 148a8023-62a7-4672-b018-003462f8d7dc | public_network | External |
+--------------------------------------+----------------+-------------+
NOTE: If the neutron
trunk
service plug-in is enabled, trunk port will be created by default. For more information, please refer to neutron trunk port.
As highlighted in the minimum quota recommendations, when using Kuryr SDN, there is a need for increasing the quotas as pods, services, namespaces, network policies are using OpenStack resources. So, as an administrator,the next quotas should be increased for the selected project:
openstack quota set --secgroups 100 --secgroup-rules 500 --ports 500 --subnets 100 --networks 100 <project>
Kuryr CNI makes use of the Neutron Trunks extension to plug containers into the OpenStack SDN, so the trunks
extension must be enabled for Kuryr to properly work.
In addition, if the default ML2/OVS Neutron driver is used, the firewall must be set to openvswitch
instead of ovs_hybrid
so that security groups are enforced on trunk subports and Kuryr can properly handle Network Policies.
Kuryr SDN uses OpenStack Octavia LBaaS to implement OpenShift services. Thus the OpenStack environment must have Octavia components installed and configured if Kuryr SDN is used.
NOTE: Depending on your OpenStack environment Octavia may not support UDP listeners, which means there is no support for UDP services if Kuryr SDN is used.
If you would like to set up an isolated development environment, you may use a bare metal host running CentOS 7. The following repository includes some instructions and scripts to help with creating a single-node OpenStack development environment for running the installer. Please refer to this documentation for further details.
Download the latest versions of the OpenShift Client and installer and uncompress the tarballs with the tar
command:
tar -xvf openshift-install-OS-VERSION.tar.gz
You could either use the interactive wizard to configure the installation or provide an install-config.yaml
file for it. See all possible customization via the install-config.yaml
file.
If you choose to create yourself an install-config.yaml
file, we recommend you create a directory for your cluster, and copy the configuration file into it. See the documents on the recommended workflow for more information about why you should do it this way.
mkdir ostest
cp install-config.yaml ostest/install-config.yaml
All the OpenShift nodes get created in an OpenStack tenant network and as such, can't be accessed directly in most OpenStack deployments. We will briefly explain how to set up access to the OpenShift API with and without floating IP addresses.
This method allows you to attach two floating IP (FIP) addresses to endpoints in OpenShift.
You will need to create two floating IP addresses, one to attach to the API load balancer (lb FIP), and one for the OpenShift applications (apps FIP). Note that the LB FIP is the same floating IP as the one you added to your install-config.yaml
. The following command is an example of how to create floating IPs:
openstack floating ip create <external network>
You will also need to add the following records to your DNS:
api.<cluster name>.<base domain> IN A <lb FIP>
If you don't have a DNS server under your control, you should add the records to your /etc/hosts
file.
NOTE: this will make the API accessible only to you. This is fine for your own testing (and it is enough for the installation to succeed), but it is not enough for a production deployment.
At the time of writing, you need to create a second floating IP and attach it to the ingress-port if you want to be able to reach the provisioned workload externally. This can be done after the install completes in three steps:
Get the ID of the ingress port:
openstack port list | grep "ingress-port"
Create and associate a floating IP to the ingress port:
openstack floating ip create --port <ingress port id> <external network>
Add a wildcard A record for *apps.
in your DNS:
*.apps.<cluster name>.<base domain> IN A <apps FIP>
Alternatively, if you don't control the DNS, you can add the hostnames in /etc/hosts
:
<apps FIP> console-openshift-console.apps.<cluster name>.<base domain>
<apps FIP> integrated-oauth-server-openshift-authentication.apps.<cluster name>.<base domain>
<apps FIP> oauth-openshift.apps.<cluster name>.<base domain>
<apps FIP> prometheus-k8s-openshift-monitoring.apps.<cluster name>.<base domain>
<apps FIP> grafana-openshift-monitoring.apps.<cluster name>.<base domain>
<apps FIP> <app name>.apps.<cluster name>.<base domain>
If you cannot or don't want to pre-create a floating IP address, the installation should still succeed, however the installer will fail waiting for the API.
WARNING: The installer will fail if it can't reach the bootstrap OpenShift API in 30 minutes.
Even if the installer times out, the OpenShift cluster should still come up. Once the bootstrapping process is in place, it should all run to completion. So you should be able to deploy OpenShift without any floating IP addresses and DNS records and create everything yourself after the cluster is up.
To deploy with Kuryr SDN instead of the default OpenShift SDN, you simply need to modify the install-config.yaml
file to include Kuryr
as the desired networking.networkType
and proceed with the same steps as with the default OpenShift SDN:
apiVersion: v1
...
networking:
networkType: Kuryr
...
platform:
openstack:
...
trunkSupport: true
octaviaSupport: true
...
NOTE: both trunkSupport
and octaviaSupport
are automatically discovered by the installer, so there is no need to set them. But if your environment doesn't meet both requirements Kuryr SDN will not properly work, as trunks are needed to connect the pods to the OpenStack network and Octavia to create the OpenShift services.
There are known limitations when using Kuryr SDN:
- There is an amphora load balancer VM being deployed per OpenShift service with the default Octavia load balancer driver (amphora driver). If the environment is resource constrained it could be a problem to create a large amount of services.
- Depending on the Octavia version, UDP listeners are not supported. This means that OpenShift UDP services are not supported.
- There is a known limitation of Octavia not supporting listeners on different protocols (e.g., UDP and TCP) on the same port. Thus services exposing the same port for different protocols are not supported.
- Due to the above UDP limitations of Octavia, Kuryr is forcing pods to use TCP for DNS resolution (
use-vc
option atresolv.conf
). This may be a problem for pods running Go applications compiled withCGO_DEBUG
flag disabled as that forces to use thego
resolver that is only using UDP and is not considering theuse-vc
option added by Kuryr to theresolv.conf
. This is a problem also for musl-based containers as it's resolver does not supportuse-vc
option. This would include e.g., images build fromalpine
.
Finally, you can run the installer with the following command:
./openshift-install create cluster --dir ostest
NOTE: At the time of writing, once your ingress-port
comes up, you will need to attach the apps FIP to it. First, find the ingress port like this:
openstack port show <cluster name>-<clusterID>-ingress-port
Then attach the FIP to it:
openstack floating ip set --port <ingress port id> <apps FIP>
Currently:
- Deploys an isolated tenant network
- Deploys a bootstrap instance to bootstrap the OpenShift cluster
- Deploys 3 master nodes
- Once the masters are deployed, the bootstrap instance is destroyed
- Deploys 3 worker nodes
Look for a message like this to verify that your install succeeded:
INFO Install complete!
INFO To access the cluster as the system:admin user when using 'oc', run 'export KUBECONFIG=/home/stack/ostest/auth/kubeconfig'
INFO Access the OpenShift web-console here: https://console-openshift-console.apps.ostest.shiftstack.com
INFO Login to the console with user: kubeadmin, password: xxx
If you want to see the status of the apps and services in your cluster during, or after a deployment, first export your administrator's kubeconfig:
export KUBECONFIG=ostest/auth/kubeconfig
After a finished deployment, there should be a node for each master and worker server created. You can check this with the command:
oc get nodes
To see the version of your OpenShift cluster, do:
oc get cluster version
To see the status of you operators, do:
oc get clusteroperator
Finally, to see all the running pods in your cluster, you can do:
oc get pods -A
Destroying the cluster has been noticed to sometimes fail. We are working on patching this, but in a mean time the workaround is to simply run it again. To do so, point it to your cluster with this command:
./openshift-install --log-level debug destroy cluster --dir ostest
Then, you can delete the folder containing the cluster metadata:
rm -rf ostest/
This documents how to shift from the internal load balancer, which is intended for internal networking needs, to an external load balancer.
The load balancer must serve ports 6443, 443, and 80 to any users of the system. Port 22623 is for serving ignition start-up configurations to the OpenShift nodes and should not be reachable outside of the cluster.
The first step is to add floating IPs to all the master nodes:
openstack floating ip create --port master-port-0 <public network>
openstack floating ip create --port master-port-1 <public network>
openstack floating ip create --port master-port-2 <public network>
Once complete you can see your floating IPs using:
openstack server list
These floating IPs can then be used by the load balancer to access the cluster. An example of HAProxy configuration for port 6443 is below.
listen <cluster name>-api-6443
bind 0.0.0.0:6443
mode tcp
balance roundrobin
server <cluster name>-master-2 <floating ip>:6443 check
server <cluster name>-master-0 <floating ip>:6443 check
server <cluster name>-master-1 <floating ip>:6443 check
The other port configurations are identical.
The next step is to allow network access from the load balancer network to the master nodes:
openstack security group rule create master --remote-ip <load balancer CIDR> --ingress --protocol tcp --dst-port 6443
openstack security group rule create master --remote-ip <load balancer CIDR> --ingress --protocol tcp --dst-port 443
openstack security group rule create master --remote-ip <load balancer CIDR> --ingress --protocol tcp --dst-port 80
You could also specify a specific IP address with /32 if you wish.
You can verify the operation of the load balancer now if you wish, using the curl commands given below.
Now the DNS entry for api.<cluster name>.<base domain>
needs to be updated to point to the new load balancer:
<load balancer ip> api.<cluster-name>.<base domain>
The external load balancer should now be operational along with your own DNS solution. The following curl command is an example of how to check functionality:
curl https://<loadbalancer-ip>:6443/version --insecure
Result:
{
"major": "1",
"minor": "11+",
"gitVersion": "v1.11.0+ad103ed",
"gitCommit": "ad103ed",
"gitTreeState": "clean",
"buildDate": "2019-01-09T06:44:10Z",
"goVersion": "go1.10.3",
"compiler": "gc",
"platform": "linux/amd64"
}
Another useful thing to check is that the ignition configurations are only available from within the deployment. The following command should only succeed from a node in the OpenShift cluster:
curl https://<loadbalancer ip>:22623/config/master --insecure
See the troubleshooting installer issues in OpenStack guide.
Please see the Issue Tracker for current known issues. Please report a new issue if you do not find an issue related to any trouble you’re having.