Intake v2: investigate parallelizing decode and validate #1285

roncohen · 2018-08-15T16:52:35Z

As previously discussed, it would be interesting to investigate parallelizing the decoding and validation of documents. Transformation is already being done in parallel.

At the moment, decoding and validation is happens in parallel at the request level. E.g. incoming data from separate requests are being decoded and validated in parallel. As part of this issue, we should also seek to put a bound on that parallelization

alvarolobato · 2018-10-02T14:41:45Z

We agreed that it's a bit premature to implement this know, not even needed probably. Deferred from 6.5

graphaelli · 2020-04-01T18:30:53Z

@axw do the changes you've proposed and implemented make this obsolete?

axw · 2020-04-02T01:28:15Z

Nope.

#3551 will weave validation and decoding together such that parallelising them wouldn't make sense.

It might make sense to decode concurrently, e.g. have the stream processor send events to a channel to be decoded, and then have multiple goroutines processing those. This would enable parallel decoding within a single stream.

stuartnelson3 · 2022-03-16T09:35:31Z

Closing this for now as we aren't prioritizing time to investigate whether it's worth implementing this

This was referenced Aug 15, 2018

Intake protocol v2 #1260

Merged

Intake v2 support #1237

Closed

alvarolobato added the [zube]: Ready label Aug 20, 2018

alvarolobato added [zube]: Backlog and removed [zube]: Ready labels Oct 2, 2018

alvarolobato modified the milestone: 6.6 Oct 2, 2018

alvarolobato added this to the 6.5 milestone Oct 26, 2018

simitt removed this from the 6.5 milestone Feb 13, 2019

jalvz added enhancement performance labels Aug 26, 2019

axw mentioned this issue Sep 10, 2020

processor/stream: server blocks on stream end when batch size < 10 #3265

Open

simitt removed the [zube]: Backlog label Dec 31, 2021

stuartnelson3 closed this as completed Mar 16, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Intake v2: investigate parallelizing decode and validate #1285

Intake v2: investigate parallelizing decode and validate #1285

roncohen commented Aug 15, 2018 •

edited

Loading

alvarolobato commented Oct 2, 2018

graphaelli commented Apr 1, 2020 •

edited

Loading

axw commented Apr 2, 2020

stuartnelson3 commented Mar 16, 2022

Intake v2: investigate parallelizing decode and validate #1285

Intake v2: investigate parallelizing decode and validate #1285

Comments

roncohen commented Aug 15, 2018 • edited Loading

alvarolobato commented Oct 2, 2018

graphaelli commented Apr 1, 2020 • edited Loading

axw commented Apr 2, 2020

stuartnelson3 commented Mar 16, 2022

roncohen commented Aug 15, 2018 •

edited

Loading

graphaelli commented Apr 1, 2020 •

edited

Loading