How to get notified when a node is down #1174
-
Hi, thanks for the project! I would like to know how I can get notified when a node is down, searching the issues shows me the Is there any way I can implement this? Thanks!
Replies: 2 comments 5 replies
-
When assessing the status of a follower node in a Raft cluster, it's important to consider the difference between the leader's To determine this:
It's crucial to note that in a distributed system, you can't definitively determine if a node is offline. However, a large replication lag suggests a higher probability that the follower node is experiencing issues or may be offline. By monitoring this metric, you can make informed decisions about the health of your Raft cluster and take appropriate actions when followers fall too far behind.
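A minimal sketch of that kind of lag check, not tied to any particular library API. It assumes lag is measured as the leader's last log index minus the highest index each follower has acknowledged; `NodeId`, `lagging_followers`, and the `max_lag` threshold are hypothetical names chosen for illustration, and in practice the inputs would come from whatever metrics the leader exposes.

```rust
use std::collections::BTreeMap;

/// Hypothetical node identifier, used only for this illustration.
type NodeId = u64;

/// Returns the followers whose acknowledged log index trails the leader's
/// last log index by more than `max_lag`, together with their lag.
///
/// A follower that has not acknowledged anything yet (`None`) is treated
/// as lagging by the full leader index.
fn lagging_followers(
    leader_last_index: u64,
    matched: &BTreeMap<NodeId, Option<u64>>,
    max_lag: u64,
) -> Vec<(NodeId, u64)> {
    matched
        .iter()
        .filter_map(|(id, m)| {
            // saturating_sub guards against a follower index that is
            // momentarily ahead of a stale leader snapshot.
            let lag = leader_last_index.saturating_sub(m.unwrap_or(0));
            (lag > max_lag).then_some((*id, lag))
        })
        .collect()
}

fn main() {
    // Assumed sample data: follower id -> last acknowledged log index.
    let mut matched = BTreeMap::new();
    matched.insert(2, Some(100));
    matched.insert(3, Some(40));
    matched.insert(4, None);

    for (id, lag) in lagging_followers(100, &matched, 50) {
        println!("follower {id} is {lag} entries behind and may be down");
    }
}
```

A large lag is only a hint: a slow but healthy follower looks the same as an offline one, so the threshold should be tuned to the cluster's normal write rate.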
-
Closed as completed in #1177.
You're correct. To better assess the connectivity status of a follower node, it would be beneficial to add a metric that tracks the timestamp of the last acknowledged heartbeat from each follower. This additional information would provide a more accurate and timely indication of the follower's connectivity status. Here's how we could refine this idea:
RaftMetrics
struct specifically for follower heartbeat information: