Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Make ingesters limit how many events they process at once Currently we limit how many messages are processed at once However a single message may contain thousands of events, so limit on messages is a bit flawed - A limit of 1000, could mean you're processing 1000 or 1000000+ events at once This should make the ingesters less prone to long pauses if some large messages come along Also adjusted event ingester config to be more standard with other config naming Signed-off-by: JamesMurkin <[email protected]> * Comment improvement Signed-off-by: JamesMurkin <[email protected]> * Comment improvement Signed-off-by: JamesMurkin <[email protected]> * gofumpt Signed-off-by: JamesMurkin <[email protected]> * Improve config descriptions Signed-off-by: JamesMurkin <[email protected]> * Limit Pulsar messages to have a configurable max number of events per message Currently we can publish very large messages (100k+ events per message) This can make the time to process messages quite unpredictable, as they can be anywhere between 1 event and 100000+ events Now we restrict how many messages we put into each message (via `maxAllowedEventsPerMessage`), which should make how many changes a given message may contain somewhat more predictable Signed-off-by: JamesMurkin <[email protected]> * Revert "Limit Pulsar messages to have a configurable max number of events per message" This reverts commit 11a8a2a. * Improve logging in ingester pipeline This should help us understand what is happening in our ingestion pipelines - Should log if we are no longer receiving pulsar messages for 2mins - Will log a summary of how many messages and event in each "batch" - Will log a summary of the types of events in each batch - Will log a summary of how long Convert took for each batch This is admittedly quite a "quick" fix and better long term steps would be: - Metrics or spans - Some of these could be at the ingseter pipeline level (generic) - Some would need to be done in each ingester to expose more detailed information such as which query is all the time being spent on Signed-off-by: JamesMurkin <[email protected]> --------- Signed-off-by: JamesMurkin <[email protected]>
- Loading branch information