Skip to content

Opinions - How to recognize "stuck" instances #1770

Answered by olitomlinson
mpaul31 asked this question in Q&A
Discussion options

You must be logged in to vote

(As we discussed on Twitter) I think this comes down to understanding how long your orchestrations typically take to complete.

For example, if you have a use case where your orchestrations typically complete between 5 and 10 minutes, then you can query the for running instances that are older then 10 minutes, and begin raising Azure Monitor events/metrics to let you know that something might not be right.

However, if the typical orchestration lifetime varies wildly (e.g. you have a human-in-the-loop) then it becomes harder to define what is an acceptable amount of time monitor against.

Durable Functions is unsupervised by virtue of its design, so it really is a hard problem to solve at th…

Replies: 8 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by cgillum
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants
Converted from issue

This discussion was converted from issue #1342 on April 01, 2021 21:09.