I am using SignalFX to create an alert via metrics from a Google Cloud GKE cluster. I want to be able to trigger an alert when number-of-nodes-in-node-pool == node-pool-max-size. I am not using GCP autoscaling under the k8s cluster as the cluster is managing the node pool itself. I can use metric "instance_group/size" to get the current number of nodes in the node group but cannot find a metric to get node-pool-max-size (I cannot use an autoscaler metric as autoscaler is not being used). Can anyone point me to a metric or combination of metrics that can be used here?
Google Cloud GKE Node Pool Metrics instance_group/size == <node-pool-max-size>
1.3k Views Asked by AlRal At
1
There are 1 best solutions below
Related Questions in GOOGLE-CLOUD-PLATFORM
- Why do I need to wait to reaccess to Firestore database even though it has already done before?
- Unable to call datastore using GCP service account key json
- Troubleshooting Airflow Task Failures: Slack Notification Timeout
- GoogleCloud Error: Not Found The requested URL was not found on this server
- Kubernetes cluster on GCE connection refused error
- Best way to upload images to Google Cloud Storage?
- Permission 'storage.buckets.get' denied on resource (or it may not exist)
- Google Datastream errors on larger MySQL tables
- Can anyone explain the output of apache-beam streaming pipeline with Fixed Window of 60 seconds?
- Parametrizing backend in terraform on gcp
- Nonsense error using a Python Google Cloud Function
- Unable to deploy to GAE from Github Actions
- Assigned A record for Subdomain in Cloud DNS to Compute Engine VM instance but not propagated/resolved yet
- Task failure in DataprocCreateClusterOperator when i add metadata
- How can I get the long running operation with google.api_core.operations_v1.AbstractOperationsClient
Related Questions in GOOGLE-KUBERNETES-ENGINE
- Golang == Error: OCI runtime create failed: unable to start container process: exec: "./bin": stat ./bin: no such file or directory: unknown
- Kubectl command throws error when executed from python script but manual execution works fine
- Unable to add TLS certificate to GKE from Google Secret Manager
- GKE Clusters no indication within metrics or logs after failure
- Getting connection refused to Private GKE Internal point. Autopilot private cluster
- Can I have the Google managed service range on a standard gke cluster created with Terraform- non auto-pilot
- Configure Lens with GKE cluster - gke_gcloud_auth_plugin issue
- Having issues joining my kubernetes worker noed to a controller node
- How to deploy airflow in kubernetes cluster that uses istio
- GKE cluster shutdown
- Artifactory 404 error from virtual repository where docker pulls fine but crictl does not
- Scraping from Mexico in GCP EKG?
- not able to connect via cloud sql proxy
- SQL connection throws error when adding DistributedSession, SessionMiddleware
- CronJob pod repeats pending forever even after deleting it
Related Questions in MONITORING
- Monitoring Thread pool metrics through promethues
- Filter input metrics in vmagent (prometheus)
- Trying to get net.if.in and net.if.out values with zabbix api python
- Global event monitoring with WPF
- database "telegraf" creation failed: 401 Unauthorized
- Zabbix parsing macros value
- Is it possible for my prometheus container to pull metrics from Azure Monitor?
- APM Open source : Angular + Java Spring + Postgresql
- Poller is not picking up the Queued tasks, the Host and Service checks are getting timed out
- Can I monitor progress of spacy parsing?
- What's the difference between every 1m, group_by in MQL Alert vs rolling window in Google alerting
- Objective tools for monitoring WCF APIs for latency, failures, and breakdowns?
- Retain Metric Values in Prometheus TSDB Across Application Restarts?
- Grafana Base64 Image/Video/Audio/PDF plugins unable to display
- How do I measure pagespeed scores on my pages using datadog? Or rather, is it even possible to keep track of pagespeed scores?
Related Questions in ALARM
- Getting Scheduled Alarms from Default Alarm Clock App (specifically Clock on Android)
- flutter_local_notifications is NOT working - zonedSchedule
- How can I make it so other people (and myself) can open the .py file and not have it crash when using mp3 files in the code?
- When we set the alarm, the alarm will run automatically with javascript. without pressing any buttons
- Alarm rules in device profile with Dynamic source type doesn't work?
- How do I create an alarm with nested alarms from 2 arrays
- Alarm BroadcastReceiver is never called
- How to schedule reminder for daily habits
- Unable to start service Intent u=0 not found: AlarmReceiver
- Creating an alarm application using Expo and React Native
- how to add a screen when the screen is off like the default alarm screen when the alarm ring with flutter, alarm screen is not its not coming
- Issue on Android 13 : Android — Repeat notification daily on specific time
- Notification sound never stop when connected to a bluetooth device - Android
- Infinite local notifications for alarm app
- how to kill winsound task after an if condition
Related Questions in SIGNALFX
- How to search for detectors by tags in splunk sfx?
- How can I access the TraceId generated by splunk-otel-collector within an ASP.NET web application?
- How to exclude certain time-frames from a SignalFx Detector?
- How workflows.duration.ns.p99 is calculated in SignalFX
- How to write query in PromQL and assign it to a variable for alert conditions?
- Signal fx signal flow calculating error rate
- How to set signalfx.accessToken in spring application
- Service, host_name, process_id, runtime information is not available in signalfx when using micrometer for pushing custom metrics
- How to expose SPLUNK_ACCESS_TOKEN in spring boot application
- SignalFX detector data().count() based on condition
- How to mute signalfx detector alerts using cli with the help of bash script
- Monitoring Azure AKS pod logs with SignalFX w/out Log Observer
- Signalfx formula set 0 if no timeseries found
- Unexplained delay in ASP.NET WebAPI in hop from web app directory to action route
- Google Cloud GKE Node Pool Metrics instance_group/size == <node-pool-max-size>
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Popular # Hahtags
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
As per the documentation currently there is no metric to monitor the node pool maximum size. The metric you are using Instance_group/size is the only metric to monitor the current number of nodes and maximum size of node pool.