-
-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Devices going offline, extreme slowness and various errors #23329
Comments
I've noted similar traits (on 1.39.0 with same adapter and firmware version on x86 mini PC also); invariably a restart of Z2M (via Settings / Add-ons) will recover all unavailable devices but some stubborn ones need unplugging / replugging etc. As I've been away I've not had chance to investigate and likely like you not wholly sure where I'd start but thought I'd chip in and indicate it's not just you! |
I have the same problem. If I go to 1.39.0 version. Number of participants in Mosquitto broker is reduced from 114 to 66 and everything works very sluggishly. I have Sonoff dongle Plus-P and dongle with 2538 chip |
Strange errors keep occurring. Unexpectedly, the Z2M system crashes. Sometimes I can't even start Z2M, generating the error below.
I feel less tense knowing that there are more people with this problem, however, I don't see many complaints about the topic, leaving the doubt as to whether there is actually someone from Z2M working on this. I really don't know what to do anymore. Does anyone know if it is possible to rollback just Z2M to version 1.38.0, without using a backup? Because it's been several days since I updated, I no longer have backups from that time. Thank you for the community support. |
Same here. Zigbee2MQTT is totally broken and useless since last update with ember. Devices disconnecting, errors here and there, map is totally broken. I can only operate from time to time less than half of all the devices. "Error ZCL" is the most common one. |
I get lots of "Failed to ping"-errors on pretty much all my devices when this happens. Sonoff Plus P (20230507), running in Docker. |
I am in the same or a similar situation. It's driving me nuts. I may actually go back to ZHA if I can't fix this. I have had profound issues with my devices, both routers and otherwise. The best solution I have is winding back to 1.35.1 (I have no older backups). Device = Sonoff USB 3.0 Dongle-P on Raspberry Pi 4 As it stands half my devices are offline but the battery shows it's correct level, so I wonder how the device can detect battery levels offline. Similarly, some of my Tuya smart plugs (routers) show up as offline (how??) I have had this issue for >4 months. |
Where do you get this log? From Z2M or HA? It seems we have the same problem and Sonoff Dongle P device. You mention restarting Z2M works, which it doesn't for me. @Skeletorjus does your system detect devices which you force remove then reset? I cannot get re-adding devices to work ever. Even brand new devices wouldn't add recently. |
@wizardofozzie, the messages are from Z2M. I haven't had any issues pairing devices, but I sometimes have to force them to pair via a certain device to make it work - they won't always find the path themselves. |
Same problems here. Migrated from a ConBeeII setup to a SonOff Dongle E setup, using neweset 7.4.3.0 Ember Firmware an 1.39.0 zigbe2mqtt Version. Deleted old Database and repaired every device from scratch and my network is really unstable. Most of the time these errors accure:
Resulting in either slow response time when for example using switches or resulting in a completly unresponsible state, so i have to restart the whole system. I really need a solution here, because there are importend things that are controlled via Home Assistant. I now ordered the Dongle P, maybe zstack works better. |
Just to test, I completely removed Z2M and the sensors created, installed ZHA, reparing all my 90 devices and the network is working perfectly, without delays or crashes. This leads me to believe that, in fact, it is the Z2M version that has a problem. Unfortunately rtunately my knowledge is limited to be able to deepen my analysis, but I would be grateful if any member with deeper knowledge of Z2M could support the topic. I am willing to do tests and send logs or any activity that is necessary. |
Same error here. Did anyone of you tried to use the dev/edge branch? |
I've been struggling for two weeks, trying to figure out what's been going wrong with my HA installation. I've been using VirtualBox on Windows without issues for over 2 years, and everything has always worked fine. But for the past few weeks, everything's been failing—super slow system, taking ages to restart, and no clear way to identify the problem. After a lot of trial and error, I realized the issue lies with Z2MQTT. I didn't think it could be that because I only have 10 devices through Z2M... all the others are on ZHA. But after several attempts, I found that stopping the Z2MQTT add-on entirely reduces the CPU usage to around 8%. When it's running, CPU usage shoots up to 80%. The virtual machine has 4GB of RAM, 4 CPUs, and 100GB of storage, so it's definitely not the machine. The curious thing is that even restoring a backup to the add-on version 1.36.1-1 and core 2024.6.4, the problem persists... I don't remember it being like this before. The issue is that I don't have any more backups to go back further. Any ideas? |
Yes.. no luck. Issue persists. |
Can you elaborate on this? |
@Koenkk are these issues best kept here or are these new and separate issues we should lodge? |
Since updating yesterday my lights sometimes don't turn off when sending the (group) commands. Docker on pi 4, with conbee 2 stick. |
I've managed to get back on the Stand Core version |
|
Hi @Koenkk how are you? Thank you for supporting the theme! I'm already using this indicated firmware. I'm using all the recommended and most current versions, both Home Assistant and Z2M and the coordinator, and even so the problems continue. To test, I uninstalled Z2M and put ZHA using the same coordinator and the same Home Assistant and I have no problem. For this reason, I believe that in fact the problem is in Z2M. If I can do any test or provide a more detailed log to support the analysis of the problem, tell me what to do. Thank you |
Same issue, same errors for me. Z2M addon even crashing from time to time. |
@santanar00 could you provide the debug logging from starting z2m until it crashes with the See this on how to enable debug logging. |
The "Permit join"-button has an arrow to the right where you can select a device to pair through. This way you can force a route which sometimes help in the pairing process. I still have the crashes, and I think it could be related to sending commands to multiple devices at once - like when you press a button to turn off several lights. I'm not at all sure that this is what triggers the errors, but it seems to happen before going to bed or when I arrive home and many lights have to turn on or off. Haven't updated from 20230507 yet. Will do soon, but it has been running solid for me for about a year until a couple of weeks ago, so not sure that it really is the culprit. |
I commented earlier to advise I was suffering somewhat random devices dropping offline / "crashes" etc; and also noted the response to revise coordinator firmware due to 20230507 being advised as unstable; so have recently upgraded firmware. I've had days when the network is rock solid and stays on-line for days and then periods where things go offline very quickly (hours) and as I'm drafting this I'm watching device after device drop-off over a circa 30 minute period. I'd earlier fathomed out how to pull off the debug logs and also noted from github dialogue Z2M map that the new firmware 20240710 has reduced capacity for devices that will connect directly to the coordinator. There seemed to be no end devices connected to my coordinator either. As I restarted Z2M a few times I'm sure I observed a differing "set" of router devices that would be connected to the coordinator (in the order of 30). My coordinator is Sonoff Dongle Plus P Currently have 119 devices; 61 router (lots of Ikea plugs etc) / 57 end devices (temp sensors and buttons) and 1 unknown (spotted as the system had gone offline) But logs attached as follows: Log2 - begins circa 16.25 hrs Log 1 begins circa 16:45 hrs Log begins circa 17:14 hrs and probably with all the mains devices offline and only end devices not yet updated to reflect being offline. I'll post logs into Koenkk/Z-Stack-firmware#505 also as contingency for this being firmware related and not just interference of other issues. If the logs info isn't correct and/or I can do anything else to monitor and troubleshoot please advise - accepting that I can't make sense of the logs and thus would appreciate any recommendations or advice on corrective changes or things to try. Edit - Z2M map AFTER restart following the above with 32 router devices shown with DIRECT connection to the coordinator and the others somewhat more hidden that do NOT have direct connections |
I provoced the error and crashed my network by accident today, and I'm pretty sure that in my case it boils down to excessive zigbee traffic. I have disabled the automation and use different cards to control the lights now, It will be interesting to see if this keeps the network up and all devices paired and online.
|
After a lot of trial and error... Unistalling Passive BLE monitor integration (HACS) solved the issue... |
have the same problem, upgrato to 1.40.1 does not resolve |
Same problem here with 1.40.1. |
I'm using 1.40.2 for about a day now. No issues till now. |
My old hue bulbs are all almost completely unresponsive at the moment. They have good signal strength, even moved a router next to them to make sure. Running an older version of Ember on my Sonoff E. I was experiencing stability problems with the newer versions. Lights never responded well. Might have to hook up the lights back to the hue hub. |
Today nearly half of all my devices went offline. I don't know what to do at this point. |
@kafisc1 - I too had ended up back on 20221226 albeit not wholly sure what is going on. It seemed more stable but had lost all devices (again) overnight so I'm now trying a Sonoff P ONLY FW variant (so far the day 14 variant only); which I've flashed today to see if I can get through the night for once..... I've been monitoring in Koenkk/Z-Stack-firmware#518 also but there seems such a mixture of experiences across everyone it's hard to see what is able to make a difference. |
What happened?
I have a zigbee mesh with around 80 devices, half of which are Routers. My network has always worked well, however, since the last updates to the HA Core and Z2M my network has become completely unstable.
My devices go more than half offline out of nowhere, router, for example, the connection drops. I also have several timeout errors in Z2M.
Devices that are plugged into Tuya or Sonoff work fine. I using the last version driver (
I've already changed my PC (I currently use a dedicated MINI S beelink, Sonoff plus P coordinator. I don't even know where to start analyzing further, as everything I could do in terms of analysis, repairs I've already done. I followed different websites and topics on the internet, but nothing helped me.
Has anyone here ever been through something like this and can help with something?
What did you expect to happen?
No response
How to reproduce it (minimal and precise)
No response
Zigbee2MQTT version
1.39.0-1
Adapter firmware version
20230507
Adapter
Sonoff Dongle Plus-P
Setup
Add-on on Home Assistant OS on Beelink Intel NUC
Debug log
No response
The text was updated successfully, but these errors were encountered: