-
Notifications
You must be signed in to change notification settings - Fork 745
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
error: "rtnetlink socket closed" #5428
Comments
Try killing Lighthouse with I think this is a bug in libp2p where the listener gets interrupted by a shutdown signal during startup and then goes into an infinite loop. See: Related:
If Lighthouse doesn't restart after the |
The issue happens again on restart. I am using systemd to manage the service which does sigkill if sigint doesn't stop it after long enough.
The disk has 3TB of extra space. I'm going to try to get the logs from when it starts happening next time, but it is challenging because the log-flood eventually cycles those logs. |
If you could DM some debug logs from |
I can't glean much from the debug logs, it just shows Lighthouse eventually shutting down due to the
The debug logs filled up with warnings pretty quickly, so the 5 files only covered:
So there's only ~1 minute of history covered by the logs. @AgeManning It looks like we expect some listener errors: lighthouse/beacon_node/lighthouse_network/src/service/mod.rs Lines 1793 to 1801 in 5ce1619
I guess we can't change that event handler to shutdown immediately because some listener errors are non-fatal? e.g. listening on |
It happened again but I don't have logs from when it started because it produces 1 million lines of errorlogs every 2 minutes and so my logs cycle out before I even notice it went down. |
@wjmelements Do you want to try modifying Lighthouse so it just crashes on the lines I linked above? If you replace the current warning by |
Yeah. So the reason we tolerate the listen errors is that generally libp2p will try and listen on a number of interfaces. The host might not have ip6 support or some tcp port failure. Each individual listen error should not be a catastrophic failure, as we can fail on ip6 but succeed on ip4 (a common case). I'll reach out to Michael and try and have a look at the logs. As of 5.1 we now store libp2p logs in the .lighthouse//beacon_node/logs directory. There might be useful information there. |
Also including @jxs who may be interested in this |
@wjmelements It might be helpful to check your system logs (e.g. The |
I have logs from where it starts, on holesky:
after which the same log repeats, mostly Right with the occasional Left. The Left here is shown between two rights.
|
FYI: I have submitted a PR to if-watch to improve the |
Description
Lighthouse is not connecting to geth (v1.13.12) and floods the logs with error messages about the rtnetlink socket being closed
Version
stable v5.1.1
Present Behaviour
The logs is only this error, repeated
It does not recover. Lighthouse only spams this log repeatedly.
Geth config:
Lighthouse config:
Expected Behaviour
Healthy node
Steps to resolve
Unknown
The text was updated successfully, but these errors were encountered: