Elasticsearch

Security

If you have the AWS Plugin installed you can perform snapshots to S3.

You can watch snapshots in progress: curl $ES_URL:9200/_snapshot/_status

Always use explicit index mappings
discovery.zen.minimum_master_nodes should be (n/2 + 1) where n is the number of nodes in your cluster
use doc_values if you're doing large amounts of aggregation queries

Modifying or upgrading your ES cluster (for anything other than API level config changes) generally involves a rolling restart operation.

The instance type you require is quite dependant on the amount of data you have and the queries and aggregation you perform.

Some rough guidelines:

Greater than 4GB of memory (though 2GB has been known to work).
To protect against data loss a cluster should have at least 3 nodes, preferably distributed accross availability zones.
Assign ~50% of instance memory to ES: This can be done by setting the ES_HEAP_SIZE environment variable. See https://www.elastic.co/guide/en/elasticsearch/reference/current/setup-configuration.html.

The Head Plugin provides a visual overview of cluster health & shard status.
Export statistics to Cloudwatch with https://github.com/guardian/elasticsearch-cloudwatch
Watch recoverery of nodes using:

#!/bin/bash

ES_URL=$1

watch -d "curl -s $ES_URL:9200/_cat/recovery?v | grep -v done | sort"