Flow deck and Loco deck in TWR mode makes Loco deck hang #368

tobbeanton · 2018-09-14T12:23:43Z

If the Flow deck and the Loco deck are mounted together after a while ~1min the Loco deck hangs and the leds turn off.

Guesses of this behavior is collision on the SPI bus, interrupts not services fast enough, or stat machine bug. Note also that flow deck continue to work as it should so SPI bus is not blocked.

miracatici · 2018-10-04T11:03:52Z

We're currently use same setup. Do you confirm that you're using latest firmware (not released)? You can clone the repo and compile yourself.

krichardsson · 2018-11-07T13:31:01Z

@miracatici This was probably on the codebase of Sep 14 (or there about).

ataffanel · 2018-12-06T10:41:33Z

I think I have verified that this is a timing problem (so a state machine bug) and not an SPI problem: I have had the problem when developing another deck that was only using the UART.

dastoqc · 2019-04-15T19:21:50Z

using the latest firmware on all our units, we also experienced the issue with our Crazyflies 2.0+Flow deck 1.0. We need to reboot 2 to 4 times the Crazyflie in order to get the UWB deck not to crash. The issue is not present with our newer Crazyflies 2.1+ Flow deck 2.0.

Any solution?

krichardsson · 2019-04-16T09:53:02Z

@dastoqc that is interesting. Are you running the same firmware on the CF 2.0 and 2.1?
Have you tried CF 2.0 + Flow 2.0 and CF2.1 + Flow 1.0?

There is a difference in how the IMU is read between CF 2.0 and 2.1 that could change the timing and support what @ataffanel found.

Unfortunately there is no solution available at the moment.

dastoqc · 2019-04-17T20:51:25Z

Hi @krichardsson

We made some more tests today, not sure what is related to this issue:

we have a hard time writing to the anchors using the CF (neither positions, nor change mode), it works well only with CF2.1+Flow2 (or no flow deck)
all anchors and CF are running the latest firmware (pulled this morning), compiled from source
all CF are running on 2Mhz band and the UWB deck crashes happen in both TWR and TDoA2 modes
CF2.1+Flow1: still experience crashes and unable to write position to anchors
CF2.0+Flow2: seems to be all good...

krichardsson · 2019-04-18T08:16:46Z

Thanks @dastoqc for the testing!
We'll digest it and see what we can do

shushuai3 · 2019-06-25T09:42:34Z

By adding a task delay in uwbTask function, the timing problem can be solved such that flow deck and loco deck can be used together.

static void uwbTask(void* parameters)
{
...
while(1) {
vTaskDelay(5); // the line to be added
...

dastoqc · 2019-06-26T16:27:23Z

thanks @shushuai3 it seems to fix the issue indeed! However, now the 'kalman.resetEstimation' always converge to a wrong position.... will look into it.

dastoqc · 2019-06-26T16:59:36Z

looks like a delay of 3 works to solve the LPS+Flow issue and still have the kalman reset working!

krichardsson · 2019-07-26T09:34:58Z

Thanks @dastoqc

shushuai3 · 2019-12-12T16:32:39Z

Wow! The latest firmware solves this issue, the timing problem when using UWB and Flow decks at the same time. Tests are done with CF2.0+UWB+Flow2.0

krichardsson · 2019-12-16T12:31:36Z

@shushuai3 Thanks for the report! I'm happy it works for you :-)

knmcguire · 2020-07-03T09:34:51Z

I guess we need to test this out on a 2.1 and see if this has been completely fixed.

youngbin-song · 2020-07-06T04:36:21Z

I'm working on a Crazyflie 2.1 and it doesn't seem to be working for it. My current setup is CF2.1+Crazyradio+Flow2.0.

While waiting for the fix, where should I implement shushuai3's workaround for this issue?

knmcguire · 2020-07-06T07:54:58Z

Hi @youngbin-song . This Github issue is about the combination of the flow deck and the LPS deck, not only the Flowdeck. Could you please ask your question on forum.bitcraze.io, with an output of the console tab of the cfclient and a more detailed description of your problem (which LEDS on the crazyflie, when it happens etc etc).

youngbin-song · 2020-07-06T07:58:15Z

Apologies for being unclear. Yes, my issue also includes the LPS deck as well.

knmcguire · 2020-07-06T08:00:19Z

ahh oke yes then you are at the right place! Yes go for it, try out Shushuais fix, but it is good to know that apperently there is still thing to be done in this github issue. I will try to investigate this week a well.

youngbin-song · 2020-07-06T08:01:29Z

Sorry to ask but where should I integrate with Shushuai's fix, specifically which code should I add it into?

Edit: I've figured it out. Thanks!

knmcguire · 2020-07-06T08:40:20Z

Great! the fix still seems to work?

knmcguire · 2020-07-07T14:10:39Z

I also checked it out and it seems to work for me as well.

Although I'm not quite sure why this hasn't been implemented yet, I will probably still wait a bit. There are several issues to fix for LPS I think, so it is good to do it when the rest is back at the office as I haven't worked or developed for the LPS system myself.

youngbin-song · 2020-07-08T04:06:27Z

Sorry, I wanted to give my reply after I've tested and made sure that it works.

It seems to work fine for me. There are some occasions that it the flowdeck turns off after a crash/sequence but I assume that it's because of some connection issue.

knmcguire · 2020-07-08T08:37:24Z

thanks for letting us know. I was able to contact my colleagues about this and the reason that this hasn't been implemented is that this fix is just a patch on the symptom of the real issue that needs to be fixed. So we need to investigate it further.

shushuai3 · 2020-07-14T20:56:02Z

Just a clarification: the fix of adding a task delay (vTaskDelay(5)) in the uwbTask function will decrease the ranging frequency roughly from 400Hz to 120Hz. Luckily, the latest firmware has no issue on CF2.0+Flow2.0+LocoDeck such that the ranging frequency is not influenced. But I am not sure if the latest firmware works on CF2.1.

knmcguire · 2020-07-15T06:56:57Z

Yeah indeed figured that it would slow it down. It does work on the CF2.1 but I will need to wait until the others are back to the office in August to go into this again, since it is believed that there is more to it.

I might also test out if the same happens with the zranger, to see if the motion sensor has anything to do with it or not.

krichardsson · 2020-08-17T12:05:21Z

I have put some time into this issue and have found some interesting information:

First some basics about the implementation:
There is a semaphore protecting access to the SPI bus to make sure it is only used to communicate with one chip at a time. While one SPI transaction is ongoing, other tasks have to wait for the semaphore to be given before they can access the SPI bus (tasks are blocked).
The LPS deck is implemented using an interrupt to indicate that there is new data from the DWM1000 module. When the interrupt is triggered, the service routine gives a binary semaphore, which in turn releases the uwb task that will handle the interrupt. The task does all the SPI communication, the interrupt service routine simply gives the semaphore.

It seems as the flow deck hogs the SPI bus for a fair amount of time when reading data from the flow sensor. The current theory is that this blocks the uwb task from accessing the SPI buss (and thus takes longer to complete), and that during this time a second interrupt from the DWM1000 chip is triggered, which is not handled properly.
The LPS TWR algorithm relies heavily on a continuous flow of events from the DWM1000 chip, and since no transmission is scheduled (due to the missed interrupt) the process stops.

If this is actually the problem, the best solution is probably to use task notification with counting functionality instead of the binary semaphore.

There was a mixup of pin definitions which caused the wrong pin to be read. This in turn caused the interrupt handling task to exit the handling function, when the status register was updated during handing.

krichardsson · 2020-08-18T12:22:00Z

The previous comment was not too far off, but not correct.

There was already a mechanism for checking if new events arrived while the interrupt is handled, the problem was that it did not work.
The interrupt mechanism in the DWM1000 is based on a status register and a mask. A pin on the chip is set high whenever there is one or more bits in status register that is set and the corresponding bits are set in the mask. The pin is used to assert an interrupt in the STM CPU and start the interrupt handling routine. In the handling routine various events are detected and the appropriate status bits are cleared. At the end of the handling routine the interrupt pin is read and if it is still set the handler is run once more.
The problem was that the pin that was read at the end of the handler was the wrong pin, always set to 0. There was a mixup of pin definitions: there are definitions GPIO pins in the STM library, and a second definition in the deck pin API (deck_constants.h). A GPIO definition was used together with the deck pin API function which caused the wrong pin to be read.

We should possibly consider a check to make sure the correct definition is used to avoid similar problems in the future?

…el 5 when interrupt handling works correctly

… with pin definitions for the CPU. This change will catch errors at compile time.

delfy22 mentioned this issue Apr 16, 2019

Velocity output doesn't look correct (looks more like the corresponding acceleration curve) #422

Closed

ataffanel mentioned this issue Sep 12, 2019

LPS deck stops working in the system test rig #436

Closed

krichardsson mentioned this issue Oct 30, 2019

Extract kalman filter into separate task #495

Closed

knmcguire added LPS labels Jun 26, 2020

knmcguire mentioned this issue Jul 27, 2020

Error using micro-SD deck with flowdeck #606

Closed

knmcguire mentioned this issue Aug 14, 2020

Problems in kalman estimator using LPS and Flow deck at the same time #523

Closed

knmcguire removed the needs verification label Aug 18, 2020

krichardsson added a commit that referenced this issue Aug 18, 2020

Use digital pin API for the reset pin as well. #368

baf1af9

krichardsson added a commit that referenced this issue Aug 18, 2020

Converted tabs to space #368

5ab9681

krichardsson mentioned this issue Aug 18, 2020

Use of digitalRead on STM Pin instead of Deck Pin #336

Closed

krichardsson added this to the next-release milestone Aug 18, 2020

krichardsson closed this as completed Aug 18, 2020

krichardsson added a commit that referenced this issue Aug 19, 2020

#368 Lowered task priority for the UWB task, no need to run it on lev…

5c6d0f3

…el 5 when interrupt handling works correctly

krichardsson added a commit that referenced this issue Aug 19, 2020

#368 Use task notification instead of semaphore

3263307

krichardsson added a commit that referenced this issue Aug 19, 2020

#368 Modified the deck API to use a type for deck pins to avoid mixup…

b8ba12c

… with pin definitions for the CPU. This change will catch errors at compile time.

krichardsson mentioned this issue Jan 12, 2023

Loco decks on Roadrunners stop in test lab #1186

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flow deck and Loco deck in TWR mode makes Loco deck hang #368

Flow deck and Loco deck in TWR mode makes Loco deck hang #368

tobbeanton commented Sep 14, 2018

miracatici commented Oct 4, 2018

krichardsson commented Nov 7, 2018

ataffanel commented Dec 6, 2018 •

edited

Loading

dastoqc commented Apr 15, 2019

krichardsson commented Apr 16, 2019

dastoqc commented Apr 17, 2019 •

edited

Loading

krichardsson commented Apr 18, 2019

shushuai3 commented Jun 25, 2019

dastoqc commented Jun 26, 2019

dastoqc commented Jun 26, 2019

krichardsson commented Jul 26, 2019

shushuai3 commented Dec 12, 2019

krichardsson commented Dec 16, 2019

knmcguire commented Jul 3, 2020

youngbin-song commented Jul 6, 2020 •

edited

Loading

knmcguire commented Jul 6, 2020

youngbin-song commented Jul 6, 2020

knmcguire commented Jul 6, 2020

youngbin-song commented Jul 6, 2020 •

edited

Loading

knmcguire commented Jul 6, 2020

knmcguire commented Jul 7, 2020

youngbin-song commented Jul 8, 2020

knmcguire commented Jul 8, 2020

shushuai3 commented Jul 14, 2020

knmcguire commented Jul 15, 2020

krichardsson commented Aug 17, 2020

krichardsson commented Aug 18, 2020 •

edited

Loading

Flow deck and Loco deck in TWR mode makes Loco deck hang #368

Flow deck and Loco deck in TWR mode makes Loco deck hang #368

Comments

tobbeanton commented Sep 14, 2018

miracatici commented Oct 4, 2018

krichardsson commented Nov 7, 2018

ataffanel commented Dec 6, 2018 • edited Loading

dastoqc commented Apr 15, 2019

krichardsson commented Apr 16, 2019

dastoqc commented Apr 17, 2019 • edited Loading

krichardsson commented Apr 18, 2019

shushuai3 commented Jun 25, 2019

dastoqc commented Jun 26, 2019

dastoqc commented Jun 26, 2019

krichardsson commented Jul 26, 2019

shushuai3 commented Dec 12, 2019

krichardsson commented Dec 16, 2019

knmcguire commented Jul 3, 2020

youngbin-song commented Jul 6, 2020 • edited Loading

knmcguire commented Jul 6, 2020

youngbin-song commented Jul 6, 2020

knmcguire commented Jul 6, 2020

youngbin-song commented Jul 6, 2020 • edited Loading

knmcguire commented Jul 6, 2020

knmcguire commented Jul 7, 2020

youngbin-song commented Jul 8, 2020

knmcguire commented Jul 8, 2020

shushuai3 commented Jul 14, 2020

knmcguire commented Jul 15, 2020

krichardsson commented Aug 17, 2020

krichardsson commented Aug 18, 2020 • edited Loading

ataffanel commented Dec 6, 2018 •

edited

Loading

dastoqc commented Apr 17, 2019 •

edited

Loading

youngbin-song commented Jul 6, 2020 •

edited

Loading

youngbin-song commented Jul 6, 2020 •

edited

Loading

krichardsson commented Aug 18, 2020 •

edited

Loading