
Optimizing stack setup on SVC calls #230

Closed
wants to merge 1 commit

Conversation

SenRamakri

This is work in progress needing initial feedback on proposed changes.

The optimization we are proposing is to remove SVC_SETUP_PSP from the svc_indirect() calls and instead check in SVC_Handler which stack (MSP or PSP) was active when the SVC call was made, then use that stack pointer accordingly. Doing this saved around 260 bytes of code space when I built a test with mbed-os, and there should be some performance improvement as well, since the calls to intrinsics such as __get_CONTROL, __set_PSP and __get_MSP in SVC_SETUP_PSP are avoided, although I have not measured this quantitatively. Adding @c1728p9 as well.
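To illustrate the idea, here is a minimal sketch of the kind of stack detection we have in mind, assuming ARMv7-M (Cortex-M3/M4/M7) and GCC syntax. The handler body and the SVC_Handler_Main name are illustrative only, not the actual RTX5 sources:

```c
#include <stdint.h>

void SVC_Handler_Main(uint32_t *frame);   /* hypothetical C-level handler */

/* Illustrative sketch only: decide whether the SVC exception frame was
 * stacked on MSP or PSP by testing bit 2 of EXC_RETURN in LR, then pass
 * the frame pointer to the C-level handler in r0.
 */
__attribute__((naked)) void SVC_Handler(void) {
  __asm volatile (
    "tst   lr, #4            \n"   /* EXC_RETURN bit 2: 0 = MSP, 1 = PSP       */
    "ite   eq                \n"
    "mrseq r0, msp           \n"   /* frame on MSP (e.g. before osKernelStart)  */
    "mrsne r0, psp           \n"   /* frame on PSP (thread context)             */
    "b     SVC_Handler_Main  \n"   /* frame pointer handed over in r0           */
  );
}
```

This replaces the per-call-site PSP setup with a fixed test in the common handler path, which is where the few extra cycles per SVC discussed below come from.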

@JonatanAntoni
Member

Hi @SenRamakri,

Many thanks for your contribution.
Your proposal sparked some debate within the team.

The current solution was implemented with the osXxxNew calls in mind. Only those are typically usable before osKernelStart, i.e. while running on MSP. As soon as the kernel runs, we are in thread context using PSP.

The new solution would save some code space per osXxxNew call used (up to 260 bytes in total). On the other hand, it adds a common overhead of about 2 to 3 cycles to each and every SVC call (roughly 1-2% on top of the 100-200 cycles currently needed).

So it is a trade-off between code space and execution speed. What do you think? Which counts more for you: 260 bytes of flash usage, or 1-2% performance per SVC call?

Cheers,
Jonatan

@SenRamakri
Author

Hi @JonatanAntoni,

Thanks very much for your time looking into this and reviewing it.
I agree that the new changes add two more instructions to SVC_Handler. But doesn't every call into the RTX kernel go through one of the SVC0_xx macros (defined in core_cm.h)? Many of those macros called __get_CONTROL, __set_PSP and __get_MSP via the SVC_SETUP_PSP macro, so aren't we saving on those? Overall, the performance impact of the two extra instructions should be negligible, or possibly even advantageous, depending on which SVC_xx calls are used. Is that right? Let me know what you think.
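To make the per-call-site cost concrete, here is a rough sketch of the general shape of such an inlined SVC wrapper, assuming GCC syntax. The macro name, signature and body are simplified illustrations, not the actual SVC0_xx definitions from core_cm.h:

```c
#include <stdint.h>

/* Hypothetical, simplified one-argument SVC wrapper. Because it is inlined
 * at every call site, anything placed inside it (such as a SVC_SETUP_PSP
 * step with its __get_CONTROL/__set_PSP/__get_MSP calls) is duplicated per
 * call, which is where the per-call code-space cost comes from.
 */
#define SVC0_1_EXAMPLE(name, rt, t1)                                        \
__attribute__((always_inline)) static inline rt svc_##name(t1 a1) {        \
  register uint32_t r0 __asm__("r0") = (uint32_t)(a1);                     \
  /* old scheme: SVC_SETUP_PSP would expand here, before the SVC itself */ \
  __asm__ volatile ("svc 0" : "+r" (r0) : : "memory");                     \
  return (rt)r0;                                                            \
}

/* Example instantiation (hypothetical service name): */
SVC0_1_EXAMPLE(example_service, uint32_t, uint32_t)
```

With the setup removed, each expansion reduces to argument marshalling plus the svc instruction itself, while the MSP/PSP decision moves into the single shared handler sketched above.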

Regards,
Senthil

@RobertRostohar
Collaborator

Only the osXxxNew calls (and osKernelInitialize/Start) used the SVC macro with SVC_SETUP_PSP, so only those functions had a few cycles of overhead. More precisely, the osXxxNew functions (which already take quite some time) had an additional 5 cycles of overhead (when called after osKernelStart).

Summary:
2..3 additional cycles for every osXxx function (except osXxxNew), and 2..3 fewer cycles for the osXxxNew functions.

RobertRostohar added a commit that referenced this pull request Oct 25, 2017
@RobertRostohar
Collaborator

RTX5 has been updated as suggested: Stack setup for Cortex-M has been replaced with stack detection in SVC_Handler in order to save code space.

@JonatanAntoni
Member

Hi @SenRamakri,

Robi changed the code according to your suggestions. May I ask you to double-check whether the latest version meets your expectations? Please close this PR if you have no further remarks.

Thanks for contributing,
Jonatan

@SenRamakri
Author

Hi @RobertRostohar and @JonatanAntoni,

Thanks for taking the time to look into this and for adopting the enhancements I suggested for SVC_Handler.
We will now plan on syncing these changes into mbed-os (@sg-).
Closing this PR.

-Senthil

SenRamakri closed this Oct 25, 2017