Multiple nic (communication) card causes disconnections for gateway type devices #267
Comments
@joepavitt, @knolleary. This is the issue that I raised concerns about in the #dept-engineering channel.
@gdziuba It would be useful to have more context on this - we don't have much to go on here. It feels like this is going to be a very device-specific thing and we haven't had any other reports of this type of behaviour. The EG5120 was mentioned in a support ticket earlier this week regarding IPv6 connectivity. I wonder if that is related at all here.
@knolleary That is a fair statement. The customer listed above was incorrect and will be updated: https://app-eu1.hubspot.com/contacts/26586079/record/0-2/7790198479 Customer notes: Compulab IoT gateway Reproduce the issue:
Resolution for EG5120
We will need to know how the network is configured on both interfaces and which routes are configured with which weights. Also, how else does the network change when the interface changes? E.g. does the Ethernet network need a proxy, whereas the cellular network does not?
I have a device, provided by a partner, that we can troubleshoot the issue with.
The devices we can test are OEM devices. My understanding is that each is a clean installation with only one network card actively in use.
@gdziuba Can you let me know where we are with a device that we can investigate this on please? This is the sort of thing where it will be useful to have access to the device while it changes network, so if it has 2 network interfaces then maybe having a serial console attached would be useful. Give me a shout when you have time for a chat about this.
@hardillb I sent you an email. It is in the thread with Russell.
We were able to replicate the issue with a customer. Here is a video: https://flowforgeworkspace.slack.com/archives/C032Q63FGG1/p1723493785229809 Update on this: there is only one NIC on these devices. He was switching from a local physical connection to a SIM card solution and then back. When he switched back to the physical connection, the connection never recovered and the service had to be restarted. Could we add a self-recovery mechanism, i.e. when the network configuration changes, try to reconnect?
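To make the self-recovery idea concrete, here is a minimal sketch of one way to notice a network configuration change from Node.js. It is not the device agent's implementation: `fingerprint` and `createChangeDetector` are hypothetical names, and polling `os.networkInterfaces()` is just one assumed approach to detecting the switch between a wired connection and a SIM.

```javascript
const os = require('os');

// Build a stable string describing every non-internal address currently assigned.
function fingerprint(interfaces) {
  const entries = [];
  for (const [name, addrs] of Object.entries(interfaces)) {
    for (const addr of addrs || []) {
      if (!addr.internal) {
        entries.push(`${name}/${addr.family}/${addr.address}`);
      }
    }
  }
  return entries.sort().join(',');
}

// Hypothetical helper: returns a function that reports whether the set of
// addresses has changed since the previous call. getInterfaces is injectable
// so the logic can be exercised without touching real hardware.
function createChangeDetector(getInterfaces = os.networkInterfaces) {
  let last = fingerprint(getInterfaces());
  return function hasChanged() {
    const current = fingerprint(getInterfaces());
    const changed = current !== last;
    last = current;
    return changed;
  };
}
```

A caller could run `hasChanged()` on a timer and, when it returns `true`, tear down and re-establish the platform connection. Whether that is safe to do inside the agent, and how often to poll, are open questions for this issue rather than something the sketch settles.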
@gdziuba We need to know how the network failover is triggered, and exactly how the network is configured. e.g.
Also for the |
Can we get the following command run as root:
All three want to be run 3 times:
I suspect the best way to do this will be via a serial console attached to the device, so that an SSH session doesn't need to be maintained to the device for each step.
I may be able to reproduce something similar (not confirmed until I see the network config). Upgrading to the latest mqtt.js library allows connections to drop cleanly (after the keepalive timeout) when the current interface is brought down, and a new connection is then established. The new mqtt.js library requires at least Node.js v16, so it will only be available as part of #263.
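For reference, the keepalive and reconnect behaviour mentioned above is controlled by mqtt.js connection options. The sketch below only shows which options are involved. The values, the `buildMqttOptions` helper name, and the broker URL are assumptions for illustration and are not taken from the device agent.

```javascript
// Hypothetical helper that assembles mqtt.js connection options.
// The numbers are illustrative defaults, not the agent's real settings.
function buildMqttOptions({ keepaliveSeconds = 30, reconnectMs = 5000 } = {}) {
  return {
    keepalive: keepaliveSeconds,  // seconds between keepalive pings
    reconnectPeriod: reconnectMs, // milliseconds between reconnect attempts
    connectTimeout: 30 * 1000     // milliseconds to wait for the broker's CONNACK
  };
}

// Usage (needs the mqtt package; the URL is a placeholder):
// const mqtt = require('mqtt');
// const client = mqtt.connect('mqtts://broker.example.com', buildMqttOptions());
// client.on('close', () => console.log('connection closed'));
// client.on('reconnect', () => console.log('attempting to reconnect'));
```

A shorter keepalive should let a dead interface be noticed sooner at the cost of more ping traffic, which may matter on a metered cellular link.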
Current Behavior
When a device with multiple NICs runs the device agent, it appears to cause network issues on the device, and the device then starts trying to switch between network connections.
Device issues can be elevated when one of the communication cards is disabled.
We have been working with Robustel and have access to a device on which we can reproduce the issue. Please reach out to @gdziuba for details on how to connect to the device.
Expected Behavior
Device agent to maintain connection to the platform.
Steps To Reproduce
In this case we used an EG5120 from Robustel. Running the device agent with a SIM card and Wi-Fi causes the device to start disconnecting and reconnecting.
Environment
Linked Customers
This customer seems to be having similar issues with a different product: