EMR cluster monitoring configuration: Ganglia + InfluxDb

573 Views Asked by At

i have an EMR cluster. It is set up by terraform script

resource "aws_emr_cluster" "emr-test" {
   name = "emr-test"
   applications = [..., "Ganglia", ...]
   ...
}

I would like to integrate ganglia with influxDb+Grafana. Found an example of cofiguration: example.

That requires to update gmetad.conf file on master node. Is that possible to do that with terraform script? emr step?

1

There are 1 best solutions below

0
On

You can use the bootstrap_action attribute to list actions that should run before Hadoop is started on the cluster nodes. You can also apply filters to only run those actions on the master node:

resource "aws_emr_cluster" "emr-test" {

  ...

  bootstrap_action {
    path = "s3://your-bucket/update-gmetad.sh"
    name = "update-gmetad-on-master-node"
    args = ["instance.isMaster=true"]
  }

}