Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feedback development firmware 2022/07 #383

Closed
dumpfheimer opened this issue Jul 26, 2022 · 248 comments
Closed

Feedback development firmware 2022/07 #383

dumpfheimer opened this issue Jul 26, 2022 · 248 comments

Comments

@dumpfheimer
Copy link

After seeing in the changelog that the routing table sizes have increased I wanted to test the latest DEVELOPMENT firmware.

I am having issues which I believe are caused by the firmware update.

It seems to me that the firmware crashes after a few hours / an amout of requests.
Unfortunately I cannot provide detailed feedback, but am glad to try with some guidance.

The first time it got stuck I did not pay a lot of attention and simply restarted everything. The second time I un- and replugged the coordinator and things recovered without any issues worth mentioning. The logs were full of messages as shown below (1). Later it changed to other error messages (2).

On the positive side:
I do feel like the larger routing table might have had a positive effect on my environment. I have ~120 zigbee devices of which probably 2/3 are routers. Especially when toggling a bunch of lights at the same time I feel like it has less "hickups"

My environment:
I am using a CC1352P2 launchpad with zigpy/zha/home assistant. The firware in use was https://github.com/Koenkk/Z-Stack-firmware/blob/develop/coordinator/Z-Stack_3.x.0/bin/CC1352P2_CC2652P_launchpad_coordinator_20220724.zip

Error message 1:

2022-07-26 01:06:59 ERROR (MainThread) [homeassistant.helpers.entity] Update for sensor.server_electricity_power fails
Traceback (most recent call last):
  File "/srv/homeassistant/lib/python3.10/site-packages/homeassistant/helpers/entity.py", line 514, in async_update_ha_state
    await self.async_device_update()
  File "/srv/homeassistant/lib/python3.10/site-packages/homeassistant/helpers/entity.py", line 709, in async_device_update
    raise exc
  File "/srv/homeassistant/lib/python3.10/site-packages/homeassistant/components/zha/sensor.py", line 297, in async_update
    await super().async_update()
  File "/srv/homeassistant/lib/python3.10/site-packages/homeassistant/components/zha/entity.py", line 250, in async_update
    await asyncio.gather(*tasks)
  File "/srv/homeassistant/lib/python3.10/site-packages/homeassistant/components/zha/core/channels/homeautomation.py", line 100, in async_update
    result = await self.get_attributes(attrs, from_cache=False, only_cache=False)
  File "/srv/homeassistant/lib/python3.10/site-packages/homeassistant/components/zha/core/channels/base.py", line 460, in _get_attributes
    read, _ = await self.cluster.read_attributes(
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy/zcl/__init__.py", line 441, in read_attributes
    result = await self.read_attributes_raw(to_read, manufacturer=manufacturer)
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy/quirks/__init__.py", line 233, in read_attributes_raw
    results = await super().read_attributes_raw(
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy/device.py", line 291, in request
    radio_result, msg = await self._application.request(
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy_znp/zigbee/application.py", line 302, in request
    return await self._send_request(
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy_znp/zigbee/application.py", line 1161, in _send_request
    response = await self._send_request_raw(
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy_znp/zigbee/application.py", line 1047, in _send_request_raw
    self._znp.request_callback_rsp(
AttributeError: 'NoneType' object has no attribute 'request_callback_rsp'

Error message 2:


2022-07-26 01:10:04 ERROR (MainThread) [zigpy_znp.zigbee.application] Failed to reconnect
Traceback (most recent call last):
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy_znp/api.py", line 652, in _skip_bootloader
    result = await responses.get()
  File "/usr/lib/python3.10/asyncio/queues.py", line 159, in get
    await getter
asyncio.exceptions.CancelledError

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy_znp/zigbee/application.py", line 886, in _reconnect
    await self.connect()
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy_znp/zigbee/application.py", line 111, in connect
    await znp.connect()
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy_znp/api.py", line 694, in connect
    self.capabilities = (await self._skip_bootloader()).Capabilities
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy_znp/api.py", line 651, in _skip_bootloader
    async with async_timeout.timeout(CONNECT_PROBE_TIMEOUT):
  File "/srv/homeassistant/lib/python3.10/site-packages/async_timeout/__init__.py", line 129, in __aexit__
    self._do_exit(exc_type)
  File "/srv/homeassistant/lib/python3.10/site-packages/async_timeout/__init__.py", line 212, in _do_exit
    raise asyncio.TimeoutError
asyncio.exceptions.TimeoutError
2022-07-26 01:10:19 ERROR (MainThread) [zigpy_znp.zigbee.application] Failed to reconnect
Traceback (most recent call last):
  File "/srv/homeassistant/lib/python3.10/site-packages/zigpy_znp/api.py", line 652, in _skip_bootloader
    result = await responses.get()
  File "/usr/lib/python3.10/asyncio/queues.py", line 159, in get
    await getter
asyncio.exceptions.CancelledError
@TheJulianJES
Copy link

TheJulianJES commented Jul 26, 2022

Also running 20220724 with ZHA here. Had a lockup after a few hours of uptime and needed to replug the stick. After the replug, it's been fine for ~18 hours now.
I can try 20220726/704d0e7 later but it doesn't look like it would solve the issue(?)

Edit: Had another lockup after ~22 hours

@io53
Copy link
Contributor

io53 commented Jul 26, 2022

I've been running it for ~12h without issues on both my networks, running z2m with Sonoff cc2652p. Will report if things change.

@TheJulianJES
Copy link

Update: 20220726 crashed after 3 hours for me

@Koenkk
Copy link
Owner

Koenkk commented Jul 27, 2022

Currently running the 20220726 but this runs fine for me with z2m. Lets monitor it further.

@dumpfheimer
Copy link
Author

I am able to reproduce this when I call the light.toggle service with a few seconds transition (12) with 2 areas selected that total in 12 light bulbs. Wild guess: Maybe the flood of incoming messages causes memory issues?

@dumpfheimer
Copy link
Author

Correction: This happens as described above ONLY WHEN I have the Home Assistant Companion App open...? I then get a bunch of these warnings and my network freezes:
2022-07-27 20:00:19 WARNING (MainThread) [asyncio] Executing <Handle _async_track_state_change_event.<locals>._async_state_change_dispatcher(<Event state_...920903+02:00>>) at /srv/homeassistant/lib/python3.10/site-packages/homeassistant/helpers/event.py:272 created at /srv/homeassistant/lib/python3.10/site-packages/homeassistant/core.py:456> took 0.416 seconds

@dmulcahey I believe once said that one of the ZHA issues is that it is limited to the HA event loop. Could it be, that..?:

  1. A bunch of messages are received from lamps reporting their transition (reporting their brightness repetedly)
  2. The messages pile up on the controller/firmware side, because zigpy is not able to read them, because the event loop from HA is blocked

@MattL0
Copy link

MattL0 commented Aug 3, 2022

Running it since 4-5 days without any issue.

I have a 82 node network wich 30 are routers.

@TheJulianJES
Copy link

@MattL0 You're running Zigbee2MQTT, right? (and not Home Assistant's ZHA integration)

@MattL0
Copy link

MattL0 commented Aug 4, 2022

Yes I am on zigbee2mqtt

@dumpfheimer
Copy link
Author

It seems like this issue only causes the controller to lock up with ZHA.

Is there a way to test if the controller locks up when there is some kind of backing up of messages?

@digiblur
Copy link

digiblur commented Aug 4, 2022

No issues on Z2M here since this firmware dropped on a USB CC2652P2 coordinator with a mixture of about 45 devices on mains and battery.

@Koenkk
Copy link
Owner

Koenkk commented Aug 4, 2022

Also still running fine in my case. @dumpfheimer I suggest to open an issue in the ZHA issue tracker

@puddly
Copy link

puddly commented Aug 4, 2022

I believe this may be caused by zigpy-znp registering callbacks for all known ZDO commands at startup, which increases its runtime RAM usage beyond Z2M's: https://github.com/zigpy/zigpy-znp/blob/695ebf50a86117daeec92ac08c256fc1bfd11e60/zigpy_znp/zigbee/application.py#L212-L218

@dumpfheimer
Copy link
Author

I will. But there still seems to be some kind of regression with the firmware. IMHO it should not freeze like this and I have never had this issue before updating the firmware.

@puddly
Copy link

puddly commented Aug 4, 2022

@dumpfheimer if you have a second, try applying this patch to see if it still crashes. Only five of the thirty defined ZDO commands will be registered:

diff --git a/zigpy_znp/zigbee/application.py b/zigpy_znp/zigbee/application.py
index 354edb7..382120e 100644
--- a/zigpy_znp/zigbee/application.py
+++ b/zigpy_znp/zigbee/application.py
@@ -214,6 +214,14 @@ class ControllerApplication(zigpy.application.ControllerApplication):
             # Ignore outgoing ZDO requests, only receive announcements and responses
             if cluster_id.name.endswith(("_req", "_set")):
                 continue
+            elif cluster_id not in (
+                zdo_t.ZDOCmd.Node_Desc_rsp,
+                zdo_t.ZDOCmd.Simple_Desc_rsp,
+                zdo_t.ZDOCmd.Active_EP_rsp,
+                zdo_t.ZDOCmd.Mgmt_Lqi_rsp,
+                zdo_t.ZDOCmd.Mgmt_Permit_Joining_rsp,
+            ):
+                continue

             await self._znp.request(c.ZDO.MsgCallbackRegister.Req(ClusterId=cluster_id))

It won't be possible to bind/unbind things or properly join new devices since those all use additional ZDO commands but this should be enough to "run" ZHA once it's set up.

@dumpfheimer
Copy link
Author

I'll try as soon as possible, thanks!

@dumpfheimer
Copy link
Author

So, a first quick test did not crash the coordinator.

This to me seems promising but not conclusive.
I believe in Hass 2022.8 the state updates while transitioning are ignored which could prevent the loop to be busy. Ill keep you posted.

@TheJulianJES
Copy link

TheJulianJES commented Aug 4, 2022

@dumpfheimer The attribute reports during light transitions are ignored on the HA side (but still sent by the light, so they still travel through the network). (It can be disabled in the ZHA settings too.)

Also, before/after you’ve applied the patch, you could also try to do a topology scan. (puddly mentioned that could cause higher memory usage and possibly the crash.)

@dumpfheimer
Copy link
Author

The amount of messages might only be part of the issue. If memory on the controller is the issue it might be fine as long as zigpy is reading them in the same speed they are received. But my best guess was that something like this is happening:

  1. A bunch of attribute changes (like brightness during transitioning) are received.
  2. Zigpy reads an Attribute change
  3. Tbe state change runs through state change callbacks (eg update via websocket) that block the asyncio event loop preventing zigpy from reading the serial port
  4. The messages on the controller back up and cause memory errors
  5. The controller fails to handle the memory issue correctly and freezes

If the brightness during transition does not cause Hass to update the state the websocket does not block the loop and the messages are read in time for the controller to not block up.

There are a few wild guesses here but the fact that it only happens when my Hass companion is open causing warning messages that the events took long and that it seems to be some kind of memory issue makes me believe this just might be the case.

I am however a complete noob with python/asyncio
@puddly do you have deeper insight into the event loop? Could a "rogue" task in the event loop cause zigpy to hang?

@dumpfheimer
Copy link
Author

dumpfheimer commented Aug 4, 2022

I tried it with a topology scan.
I did get a lot of similar error messages: 2022-08-04 21:20:06.295 WARNING (MainThread) [asyncio] Executing <Handle _async_track_state_change_event.<locals>._async_state_change_dispatcher(<Event state_...058055+02:00>>) at /srv/homeassistant/lib/python3.10/site-packages/homeassistant/helpers/event.py:272 created at /srv/homeassistant/lib/python3.10/site-packages/homeassistant/core.py:456> took 0.116 seconds

But it seems like the interruptions went from .4 down to .1 seconds.
However it did not crash the controller. ( With puddly's test fix)

@TheJulianJES
Copy link

Mhm right, puddly's test-fix also breaks topology scans then. Might be worth to see if a topology scan without the fix triggers the crash.

@dumpfheimer
Copy link
Author

Right now I cannot make it crash deliberately by transitions or topology scans. (Without test fix)

@Wireheadbe
Copy link

Wireheadbe commented Aug 5, 2022

Running 20220726 here on an LAUNCHXL-CC26X2R1 and it's been very stable (81 Zigbee devices, groups,..) Topology scans don't crash,...

edit: still rocksolid. In the past, some of my LED2005r5 would stop reporting/fail to respond to direct commands (group kept working!), so far, this hasn't occured on this firmware

edit2: after a week, still solid, no timeouts from reporting. Everything seems to behave better.

@alexruffell
Copy link

Trying to figure out why I keep having so many zigbee issues I updated my coordinator to this release of the firmware. Something new now started... my devices seem to stop updating. I have to reload ZHA and they work for a bit then issues happen all over again. In the logs I found this:

Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/zigpy_znp/api.py", line 1023, in request_callback_rsp
    await self.request(request, **response_params)
  File "/usr/local/lib/python3.10/site-packages/zigpy_znp/api.py", line 984, in request
    response = await response_future
asyncio.exceptions.CancelledError
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/zigpy_znp/api.py", line 1022, in request_callback_rsp
    async with async_timeout.timeout(timeout):
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 129, in __aexit__
    self._do_exit(exc_type)
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 212, in _do_exit
    raise asyncio.TimeoutError
asyncio.exceptions.TimeoutError
2022-08-07 21:15:20.750 ERROR (MainThread) [homeassistant] Error doing job: Task exception was never retrieved
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/zigpy_znp/api.py", line 1023, in request_callback_rsp
    await self.request(request, **response_params)
  File "/usr/local/lib/python3.10/site-packages/zigpy_znp/api.py", line 960, in request
    async with self._sync_request_lock:
  File "/usr/local/lib/python3.10/asyncio/locks.py", line 14, in __aenter__
    await self.acquire()
  File "/usr/local/lib/python3.10/asyncio/locks.py", line 114, in acquire
    await fut
asyncio.exceptions.CancelledError
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/zigpy_znp/api.py", line 1022, in request_callback_rsp
    async with async_timeout.timeout(timeout):
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 129, in __aexit__
    self._do_exit(exc_type)
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 212, in _do_exit
    raise asyncio.TimeoutError
asyncio.exceptions.TimeoutError
2022-08-07 21:15:20.912 ERROR (MainThread) [homeassistant] Error doing job: Task exception was never retrieved
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/zigpy_znp/api.py", line 1025, in request_callback_rsp
    return await callback_rsp
asyncio.exceptions.CancelledError
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/site-packages/zigpy_znp/api.py", line 1022, in request_callback_rsp
    async with async_timeout.timeout(timeout):
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 129, in __aexit__
    self._do_exit(exc_type)
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 212, in _do_exit
    raise asyncio.TimeoutError
asyncio.exceptions.TimeoutError
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 129, in __aexit__
    self._do_exit(exc_type)
  File "/usr/local/lib/python3.10/site-packages/async_timeout/__init__.py", line 212, in _do_exit
    raise asyncio.TimeoutError
asyncio.exceptions.TimeoutError

I don't know whether this is a ZHA issue, or coordinator issue. I see it is zigpy erroring out but I assume an issue with the coordinator could lead to that too. Before I go there and get blamed for the issue because I am using a development firmware, I'd like to rule this out. Thanks!

@dumpfheimer
Copy link
Author

I am completely uneducatedly convinced that this is primarily a firmware issue (bad memory management?) and probably needs to be fixed by Texas Instruments. But in a similar fashion I believe there could be things done in zigpy/ZHA that could prevent this bug from happening. Lastly, I also think that the zigpy/ZHA developers are aware of the "issue".

I have been playing around with python event loops/threads which might be the reason this is happening with ZHA but not with z2m. I am too inexperienced with python and zigpy to do anything productive myself but I might start a discussion over at zigpy to get some guidance if and how it could be possible to decouple zigpy from the Home Assistant event loop which in the worsed case should decrease delays and in the best case might fix this bug from being triggered.

@salopette
Copy link

I also have error messages with the Dev FW

Coordinator Typ zStack3x0
Coordinator Version 20220726
LAUNCHXL-CC1352P-2

2022-08-15 17:42:18Publish 'set' 'state' to '0x001788010b4b1466' failed: 'Error: Command 0x001788010b4b1466/11 genLevelCtrl.moveToLevelWithOnOff({"level":0,"transtime":10}, {"sendWhen":"immediate","timeout":10000,"disableResponse":false,"disableRecovery":false,"disableDefaultResponse":false,"direction":0,"srcEndpoint":null,"reservedBits":0,"manufacturerCode":null,"transactionSequenceNumber":null,"writeUndiv":false}) failed (SREQ '--> ZDO - extRouteDisc - {"dstAddr":28838,"options":0,"radius":30}' failed with status '(0xc7: NWK_TABLE_FULL)' (expected '(0x00: SUCCESS)'))'

@Shred99
Copy link

Shred99 commented Aug 16, 2022

I'll add my experience to the thread:

  • Network with 26 sensors and three routers
  • Coordinator is a LAUNCHXL-CC1352P-2
  • I'm running Zigbee2MQTT in a docker container on a Raspberry Pi 3

I upgraded three days ago and haven't had any issues at all.

@alexruffell
Copy link

After the issues in my post above 9 days ago, I wiped the stick and reformed a network on ch25. It has been working great since then. My coordinator is now also linking directly to 40ish devices instead of being limited to 20ish as before this latest network reforming.

Using this CC1352P2_CC2652P_launchpad_coordinator_20220726 on my Sonoff ZBDongle-P

@dumpfheimer
Copy link
Author

I finally got zigpy_znp working in a dedicated thread. Will let you know if it has any impact.

@Koenkk
Copy link
Owner

Koenkk commented Aug 16, 2022

Can the people using ZHA check if the issue is fixed with the following fw? fws.zip

Changes in this fw compared to 20220726:

HEAPMGR_SIZE   0xc000 -> 0x6000
SRC_RTG_EXPIRY_TIME 2 -> 255

@pipiche38
Copy link
Contributor

pipiche38 commented Dec 30, 2022

Do you think that version could impact positively that issue zigbeefordomoticz/Domoticz-Zigbee#1341 ?

For memo: the ZIgbee for Domoticz plugin is based on zigpy libraries

@Koenkk
Copy link
Owner

Koenkk commented Dec 31, 2022

@dumpfheimer

I saw that you have some generated files in your patch file that seem to be there only for the LED control. Did you let the IDE re-generate those files when you updated SDK and CLang and other stuff?

Yes I regenerate those files on every SDK update. All my changes to these files are annotated with a // CUSTOM comment, e.g. // CUSTOM: set in preinclude.h

@pipiche38

Do you think that version could impact positively that issue zigbeefordomoticz/Domoticz-Zigbee#1341 ?

I don't expect that.

@mzanetti
Copy link

mzanetti commented Jan 2, 2023

Fwiw, still running 20221220 with nymea without a single issue. So from my point of view that's a good one. I've only seen now that there is a newer one too. Will try to update asap and report back with that too.

@lux73
Copy link

lux73 commented Jan 2, 2023

just for Information:

tried 20221220 too with my CC2652RB - testet out appr. 6 Days. Runs without any Problems in my Environment.

now running 20221226 since 30.12. no Problem so far

i have 14 Router & 33 End Devices in my Network

@boris1000
Copy link

boris1000 commented Jan 2, 2023

Not really dependent on the latest version 20221220, but don´t want to open a separate issue for this crazy finding.
I am just setting up my first Zigbee Network. Completely new with Zigbee.
One thing is sure -> Absolutely impressed by Zigbee2MQTT.
Now to my finding:
I am using the Sonoff2652P stick with the 20221102 firmware. For testing purposes I am just setting up two devices
one Phillips Hue Motion sensor and one Silvercrest (Lidl) Switch button. So far all OK and working.
Next was playing with the transmit_power option. As I understood the default setting is 5db and the device can handle
up to 20db - Sure I would like to get the most for my money :-)
I just moved the two devices one stock and 25m away, and YES still connected with the default setting.

  • Silvercrest button reported 10lqi
  • Hue motion sensor reported 24lqi
    Next was increasing transmit_power to 20 and check the devices again
  • Silvercrest button reported 28lqi
  • Hue motion sensor reported 0lqi it even does not connect on that distance anymore, need to move it half distance
    to get it connected again
    Just to mention My 2.4 Wifi is on channel 1 till 6 and Zigbee on channel 11
    Question:
    Can it be that some devices cannot handle the increased transmit_power, in this case the Hue motion sensor?

@digiblur
Copy link

digiblur commented Jan 2, 2023

@boris1000 probably better to jump to a discussion on this and off this issue. But do remember increasing TX power doesn't increase RX power and Zigbee is 2 way comms. Sometimes increasing power makes stuff worse. Just build a bigger mesh of mains powered devices.

@jerrm
Copy link

jerrm commented Jan 2, 2023

Wifi is on channel 1 till 6 and Zigbee on channel 11

About as bad a conflict as can be had. Change it now before you have a lot of devices.

Agree with @digiblur, increasing tx power at the coordinator gives mixed results at best, often only confusing end devices and causing them to prefer the more remote coordinator when they would be better off connecting to a nearer router.

That hue motion sensor is only transmitting at whatever its normal power level is. It may be able to hear the coordinator shouting at it, but the coordinator probably can't hear the hue if it's only answering at a relative whisper.

@lux73
Copy link

lux73 commented Jan 2, 2023

i agree with @jerrm increasing tx Power brings you nothing in most Cases... most of the zigbee battery devices could not connect to your Coordinator back route

it makes more sense install more Routers to make your Zigbee Mesh stronger & more reliable

@darko-mijic
Copy link

darko-mijic commented Jan 5, 2023

Hello!

I've upgraded from 20220219 (the latest release from the master branch) to 20221226, and after only 20 minutes of usage, I can report drastic performance improvements. I have 108 devices in my Zigbee network, and I could not use availability before. Previously, it constantly reported errors, and my devices were often unresponsive. Also, adding devices to groups failed often, so I needed to retry many times.

After upgrading to 20221226, availability works fine for me, and my devices are very responsive. After quick testing adding devices to groups works from the first attempt.

I will do more testing and report back. For now, this release looks like a major performance improvement compared to the latest release on master for my case with Zigbee2MQTT and over 100 devices. I will not need to split my network to have decent performance, which I was about to do.

Router: 102
End devices: 6

@Jabe
Copy link

Jabe commented Jan 5, 2023

Quick feedback: I ran 20221220 over the holidays with no problems, except for one crash of z2m. But I didn't do much zigbee work as everybody here was sick (kids, yeah!).

Been running 20221226 for a few days and reorganized some lights in the house. I'm having a lot of trouble pairing devices, it sometimes takes many tries to pair a light. After my reorg I've also lost five other devices that dropped from the network that I need to recover. Performance of the network has always been solid for me and seems unchanged.

Router: 51
End devices: 34

@dumpfheimer
Copy link
Author

I seems to me the latest feedback is overwhelmingly positive and the original issue was addressed and seems fixed (or at least mitigated/worked around)

If nobody disagrees I would close this issue as resolved.

To keep any discussion going I was going to propose to open a discussion as is possible in zigpy but I was not able to find it here.
@Koenkk is that a feature that can be enabled by you?

@MattL0
Copy link

MattL0 commented Jan 5, 2023

No issue to report here too, I have been on 20221226 since day one.

78 devices (45 routers)

@Koenkk
Copy link
Owner

Koenkk commented Jan 5, 2023

@dumpfheimer see https://github.com/Koenkk/Z-Stack-firmware/discussions

Great, I'll release this fw with the 1 February z2m release.

@LAD47
Copy link

LAD47 commented Jan 8, 2023

CC1352P2_CC2652P_launchpad_coordinator_20221226 works well.
But router_20221102 gives the following error in Flash Programmer 2

Initiate access to target: COM4 using 2-pin cJTAG.
Reading file: C:/router/CC1352P2_CC2652P_launchpad_router_20221102.hex.
Unknown record type: 3.
Reset target ...
Reset of target successful.

Does anyone have the same experience?

@jerrm
Copy link

jerrm commented Jan 8, 2023

I flashed three router sticks with 20221102 yesterday using cc2538-bsl.

@Char-r
Copy link

Char-r commented Jan 10, 2023

Based on the feedback from @sjorge who got NWK_TABLE_FULL errors with 20221220 I've improved the firmware further.
The 20221226 is now available.
@dumpfheimer since you are the OP, can you confirm this FW is stable? It would be good if some other ZHA users can confirm this fw is stable now.
To all: if you test this firmware and it doesn't work correctly, provide me the following:

* Did the `20220219` work correctly?

* What was the last known working firmware?

* Are you using Z2M or ZHA?

* Herdsman debug log starting from the moment where your start z2m until it fails (only if you are using z2m)

See https://www.zigbee2mqtt.io/guide/usage/debug.html on how to enable the herdsman debug logging. Note that this is only logged to STDOUT and not to log files.

Just updated, the TRV that had stopped taking commands is now accepting them again, let's see how we fair in a week or so.

Been running solid for two weeks now, none of my TRV has dropped (I am having an issue with one Aqara temp sensor but will delete and re-add and see if that solves it)

@LAD47
Copy link

LAD47 commented Jan 11, 2023

@jerrm You are absolutely right. If you use cc2538-bsl, you can flash router_20221102.
TI Flash Programmer 2, can flash all coordinator versions, but not the latest versions of router firmware. We learn something new every day....>;o)

@pannal
Copy link

pannal commented Jan 15, 2023

CC2652R_coordinator_20221226.hex works like a treat. First firmware that doesn't suddenly drop devices or times out, since 20220219. Great work!

@SargonofAssyria
Copy link

SargonofAssyria commented Jan 26, 2023

The CC1352P2_CC2652P_launchpad_coordinator_20221226.hex firmware works without flaws for 3 weeks now with 90+ devices.
I am happy.

@dumpfheimer
Copy link
Author

Fourth attempt on writing this 🙈
This is most certainly fixed / mitigated.
Thanks to everyone involved for taking the time.
Especially @Koenkk 🚀

@Koenkk
Copy link
Owner

Koenkk commented Jan 29, 2023

Thanks for all the feedback! I've just released the 20221226 firmware 😄

@Nik71git
Copy link

Thanks for all the feedback! I've just released the 20221226 firmware 😄

Better later than never 🤭

@boris1000
Copy link

boris1000 commented Feb 6, 2023 via email

@Speedy1496
Copy link

Hi,
Trying also to flash the CC1352P2_CC2652P_launchpad_router_20221102 with the TI Flash programmer 2 and I have the same issue.
The coordinator firmware is successful flashed when the router encounter the same Unknown error type 3.

Initiate access to target: COM5 using 2-pin cJTAG.
Reading file: C:/Temp/CC1352P2_CC2652P_launchpad_router_20221102/CC1352P2_CC2652P_launchpad_router_20221102.hex.
Unknown record type: 3.
Reset target ...
Reset of target successful.

@Nik71git
Copy link

Hi,
Trying also to flash the CC1352P2_CC2652P_launchpad_router_20221102 with the TI Flash programmer 2 and I have the same issue.
The coordinator firmware is successful flashed when the router encounter the same Unknown error type 3.

Initiate access to target: COM5 using 2-pin cJTAG.
Reading file: C:/Temp/CC1352P2_CC2652P_launchpad_router_20221102/CC1352P2_CC2652P_launchpad_router_20221102.hex.
Unknown record type: 3.
Reset target ...
Reset of target successful.

forget TI flash programmer 2, use python script instead, It works

@Speedy1496
Copy link

Indeed, it were a little bit more complicated but it worked
Thanks

@Koenkk
Copy link
Owner

Koenkk commented Apr 1, 2023

Follow up: #439

Repository owner locked as resolved and limited conversation to collaborators Apr 1, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests