I have an activity task scheduled but never executed(started) by looking at the history, and then workflow got timeouted. I can confirm that cadence worker is running because other workflow works fine at the same time.
Why the activity is not started/executed in history? How should I investigate issue like this?
My activity timeout is same as workflow timeout.
The question is from by this Github issue.
First of all, an activity with retry will only write started event when activity is completed or finally failed/timeouted.
From the history, the workflow timeout before activity event can be written into history. You need to make sure the workflow timeout is larger than activity timeout so that activity timeout event can be written into history before workflow timeouts.
So how do we make sure if an activity is indeed started or not?
Or through webUI describe workflow view, it shows the pending activities.
Finally, the reason behind activities with retry: history needs to be immutable as invariance, but start event can changed for activity being retried until it finally settled down. This invariance is important for Cadence architecture. But it's indeed confusing in webUI. And here is the issue to improve it.