-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Filebeat] Okta Module Crashing Frequently #18530
Comments
Pinging @elastic/siem (Team:SIEM) |
* Add check for time because time.Until does not correctly handle a time that is earlier than time.Now(). * Update getRateLimit to handle a rate limit time is earlier than now. * Add some additional safe guard to check Ticker * Add descriptions for getRateLimit function and test functions. * Update CHANGELOG.next.asciidoc
* Add check for time because time.Until does not correctly handle a time that is earlier than time.Now(). * Update getRateLimit to handle a rate limit time is earlier than now. * Add some additional safe guard to check Ticker * Add descriptions for getRateLimit function and test functions. * Update CHANGELOG.next.asciidoc (cherry picked from commit db62862)
…w-oss * upstream/master: (27 commits) Disable host fields for "cloud", panw, cef modules (elastic#18223) [docs] Rename monitoring collection from legacy internal collection to legacy collection (elastic#18504) Introduce auto detection of format (elastic#18095) Add additional fields to address issue elastic#18465 for googlecloud audit log (elastic#18472) Fix libbeat import path in seccomp policy template (elastic#18418) Address Okta input issue elastic#18530 (elastic#18534) [Ingest Manager] Avoid Chown on windows (elastic#18512) Fix Cisco ASA/FTD msgs that use a host name as NAT address (elastic#18376) [CI] Optimise stash/unstash performance (elastic#18473) Libbeat: Remove global loggers from libbeat/metric and libbeat/cloudid (elastic#18500) Fix PANW bad mapping of client/source and server/dest packets and bytes (elastic#18525) Add a file lock to the data directory on startup to prevent multiple agents. (elastic#18483) Followup to 12606 (elastic#18316) changed input from syslog to tcp/udp due to unsupported RFC (elastic#18447) Improve ECS field mappings in Sysmon module. (elastic#18381) [Elastic Agent] Cleaner output of inspect command (elastic#18405) [Elastic Agent] Pick up version from libbeat (elastic#18350) Update communitybeats.asciidoc (elastic#18470) [Metricbeat] Change visualization interval from 15m to >=15m (elastic#18466) docs: Fix typo in kerberos docs (elastic#18503) ...
It appears that this issue is still occurring after upgrade to 7.8.0:
|
I'm experiencing the same issue. My configuration is same as provided above. I'm using docker container filebeat:7.8.0.
|
* Add check for time because time.Until does not correctly handle a time that is earlier than time.Now(). * Update getRateLimit to handle a rate limit time is earlier than now. * Add some additional safe guard to check Ticker * Add descriptions for getRateLimit function and test functions. * Update CHANGELOG.next.asciidoc (cherry picked from commit db62862)
@alakahakai it looks like this is being corrected by handling a specific rate-limit condition. Is there any chance of handling this by adding some sort of generic retry mechanism that makes the whole input more resilient to transient errors? Right now the input is prone to crashing when any sort of unexpected behavior occurs. |
@alakahakai after upgrading to 7.8.1 we saw a significant improvement and the module is no longer crashing due to rate limiting constantly, but there appear to still be conditions that cause a crash. As mentioned above, it would be ideal to implement some sort of generic retry logic for retryable error conditions. Our most recent instance, after running for ~12 hours:
|
Two fixes for the Okta module will be released in v7.9.0.
Then looking forward to v7.10.0 we plan to make the module be able to persist the last read event timestamp and be able to resume reading from the point on restart. Part of that work is in happening in #20226. |
(elastic#18549) * Address Okta input issue elastic#18530 (elastic#18534)
After enabling the Okta module in Filebeat 7.7.0 we have been observing frequent crashes with the error message
panic: non-positive interval for NewTicker
Debug logs:
Config:
For confirmed bugs, please report:
The text was updated successfully, but these errors were encountered: