
Pelias Kubernetes Configuration

This repository contains Kubernetes configuration files to create a production-ready instance of Pelias.

We use Helm to manage the Pelias installation.

This configuration is meant to be run on Kubernetes using real hardware or full-sized virtual machines in the cloud. It could work on a personal computer with minikube, but larger installations will likely benefit from additional RAM.

Setup

First, set up a Kubernetes cluster in whatever way works best for you. A popular choice is to use kops on AWS. The Getting Started on AWS Guide is a good starting point.
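As a hedged sketch (the cluster name, S3 state bucket, and availability zone are placeholders, not values from this repository), the kops flow looks roughly like:

# create and launch a small cluster with kops (all names below are placeholders)
kops create cluster --name pelias.example.com --state s3://example-kops-state --zones us-east-1a --node-count 2
kops update cluster pelias.example.com --state s3://example-kops-state --yes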

Helm Installation

Helm must be installed before continuing. See https://github.com/helm/helm#install for instructions.
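For example, assuming Helm 2 (which matches the --name syntax used later in this README), the client and its in-cluster component Tiller can be installed roughly as follows:

# fetch and run the Helm 2 client installer, then install Tiller into the cluster
curl https://raw.githubusercontent.com/helm/helm/master/scripts/get > get_helm.sh
chmod 700 get_helm.sh
./get_helm.sh
helm init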

Sizing the Kubernetes cluster

A working Pelias cluster contains some or all of the following services:

  • Pelias API (requires about 50MB of RAM)
  • Libpostal Service (requires about 2GB of RAM)
  • Placeholder Service (requires about 256MB of RAM)
  • Point-in-Polygon (PIP) Service (requires up to 6GB of RAM for a full planet build; required for reverse geocoding)
  • Interpolation Service (requires about 2GB of RAM)

See the Pelias Services documentation to determine which services to run.

Additionally, some of the following importers must be run to populate the initial data:

  • Who's on First (requires about 1GB of RAM)
  • OpenStreetMap (requires between 0.25GB and 6GB of RAM, depending on import size)
  • OpenAddresses (requires about 1GB of RAM)
  • Geonames (requires about 0.5GB of RAM)
  • Polylines (requires about 1GB of RAM)

Finally, note that the importers require the PIP service to be running.

Use the data sources documentation to decide which importers to run.

Importers can be run in any order, in parallel or one at a time.

This means around 10GB of RAM is required to bring up all the services, and up to another 15GB of RAM is needed to run all the importers at once. Two instances with 8GB of RAM each are a good starting point for the services alone.

If using kops, note that it defaults to t2.small instances, which are far too small (they only have 2GB of RAM).

You can edit the instance types using kops edit ig nodes before starting your cluster. m4.large is a good choice to start.
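A sketch of that edit (the state bucket is a placeholder; the field names follow the kops InstanceGroup spec):

# open the node instance group in an editor and set a larger machine type
kops edit ig nodes --state s3://example-kops-state
# in the editor, under spec:, set for example:
#   machineType: m4.large
#   minSize: 2
#   maxSize: 2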

Pelias Helm Chart installation

It's recommended to use a .yaml file to configure the Pelias chart. See values.yaml for a starting point.
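As a starting point, a minimal values file might look like the following sketch; the keys come from the parameter table at the end of this README, and the values shown are simply the chart defaults:

# write a minimal values file (a sketch; keys are from the parameter table below)
cat > pelias-values.yaml <<'EOF'
elasticsearch:
  host: elasticsearch-service
  port: 9200
  protocol: http
pip:
  enabled: true
interpolation:
  enabled: false
EOF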

The Pelias Helm chart can then be installed as follows:

helm install --name pelias --namespace pelias ./path/to/pelias/charts -f path/to/pelias-values.yaml
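Afterwards, the release and its pods can be inspected with standard Helm and kubectl commands:

# check the release status and watch the pods come up
helm status pelias
kubectl get pods --namespace pelias --watch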

Running a build

The build directory in this repository contains an additional chart for running a Pelias build. It can be run in a similar way to the Pelias services chart:

helm install --name pelias-build --namespace pelias ./path/to/pelias/build/chart -f path/to/pelias-values.yaml

values.yaml can be reused between the two charts; however, the Pelias services chart must be up and running first.

The build chart provides templates for building a full planet index from the data sources. Before any data source job is started, the following jobs must be completed (their status can be checked with the commands after this list):

  • schema-create-job - Creates the Pelias index in Elasticsearch (connection details are provided by the configmap template in the Pelias chart)
  • efs-presistent-volume and efs-pvc - Create the PV/PVC in the pelias namespace; the importers store downloaded data there before it is loaded into Elasticsearch
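For example:

# confirm the prerequisite jobs completed and the volume claim is bound
kubectl get jobs --namespace pelias
kubectl get pvc --namespace pelias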

To build a smaller extract instead of the full planet (e.g. a single country), the configmap.tpl in the Pelias chart must be modified. For download links and country codes, see the documentation of each individual data source.
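As a purely illustrative sketch (the exact keys rendered by configmap.tpl may differ), a single-country build usually points the OpenStreetMap importer at a country extract and restricts Who's on First to one place:

# illustrative pelias.json fragment of the kind configmap.tpl renders (keys may differ in the actual template):
#   "imports": {
#     "openstreetmap": {
#       "download": [{ "sourceURL": "https://download.geofabrik.de/europe/poland-latest.osm.pbf" }]
#     },
#     "whosonfirst": { "importPlace": <Who's on First id of the country> }
#   }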

Elasticsearch

Elasticsearch is used as the primary datastore for Pelias data. Although it is a powerful database with built-in scalability and replication capabilities, it is not currently well suited to running in Kubernetes.

Instead, it's preferable to create "regular" instances with your cloud provider or on your own hardware. To help with this, the elasticsearch/ directory in this repository contains tools for setting up a production-ready, Pelias-compatible Elasticsearch cluster using Terraform and Packer. See the directory README for more details.

If you decide to launch Elasticsearch on AWS via the AWS Console instead of the provided Terraform/Packer solution, you should change the default port and protocol in values.yaml, as the AWS Elasticsearch service uses HTTPS on port 443 by default.
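These overrides can also be supplied at install time with --set (a hedged sketch; the hostname is hypothetical):

# point the chart at an AWS-hosted Elasticsearch endpoint (placeholder hostname)
helm install --name pelias --namespace pelias ./path/to/pelias/charts \
  -f path/to/pelias-values.yaml \
  --set elasticsearch.protocol=https,elasticsearch.port=443,elasticsearch.host=vpc-pelias-abc123.eu-west-1.es.amazonaws.com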

Handy Kubernetes commands

We find ourselves using these from time to time.

debugging 'init containers'

Sometimes an 'init container' fails to start. You can view its logs as follows:

# kubectl logs {{pod_name}} -c {{init_container_name}}
kubectl logs geonames-import-4vgq3 -c geonames-download
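If the init container names are unknown, they can be read from the pod spec (reusing the example pod above):

# list the init container names of a pod
kubectl get pod geonames-import-4vgq3 -o jsonpath='{.spec.initContainers[*].name}'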

opening a bash prompt in a running container

It can be useful to open a shell inside a running container for debugging:

# kubectl exec -it {{pod_name}} -- {{command}}
kubectl exec -it pelias-pip-3625698757-dtzmd -- /bin/bash
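Not every image ships bash; if the command above fails, /bin/sh is usually available:

# fall back to sh when the image has no bash
kubectl exec -it pelias-pip-3625698757-dtzmd -- /bin/sh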

Helm Chart Configuration

The following table lists common configurable parameters of the chart and their default values. See values.yaml for all available options.

| Parameter | Description | Default |
|-----------|-------------|---------|
| elasticsearch.host | Elasticsearch hostname | elasticsearch-service |
| elasticsearch.port | Elasticsearch access port | 9200 |
| elasticsearch.protocol | Elasticsearch access protocol | http |
| elasticsearch.auth | Elasticsearch authentication (user:pass) | - |
| pip.enabled | Whether to enable the PIP service | true |
| pip.host | PIP service URL | http://pelias-pip-service:3102/ |
| pip.pvc.create | Whether to create a custom PVC | - |
| pip.pvc.name | Name of the PVC | - |
| pip.pvc.storageClass | Storage class to use for the PVC | - |
| pip.pvc.storage | Amount of storage to claim for the PVC | - |
| pip.pvc.annotations | Storage class annotations for the PVC | - |
| pip.pvc.accessModes | Access modes to use for the PVC | - |
| interpolation.enabled | Whether to enable the interpolation service | false |
| interpolation.host | Interpolation service URL | http://pelias-interpolation-service:3000/ |
| interpolation.pvc.create | Whether to create a custom PVC | - |
| interpolation.pvc.name | Name of the PVC | - |
| interpolation.pvc.storageClass | Storage class to use for the PVC | - |
| interpolation.pvc.storage | Amount of storage to claim for the PVC | - |
| interpolation.pvc.annotations | Storage class annotations for the PVC | - |
| interpolation.pvc.accessModes | Access modes to use for the PVC | - |
