-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Filestream input: Fingerprint for inode race detection #27278
Comments
Pinging @elastic/agent (Team:Agent) |
Pinging @elastic/elastic-agent-data-plane (Team:Elastic-Agent-Data-Plane) |
Hi! We're labeling this issue as |
This issue will be addressed by #34419 |
I will close this one for now and we will reopen it if it's not fixed by #34419 |
Log collection using the filestream or logs input sometimes can treat an old file as a new file if we are seing an inode being reused. The
clean_removed
setting allows us to remove the state from the registry more early (filestream input can even detect removal asynchronously), but especially with autodiscovery in place we might have the input being shutdown before we managed to detect that the file was removed.In order to better detect the inode reuse race condition, we want the harvester/prospector to add a fingerprint to the file metadata in the registry. The fingerprint would be computed from the first 4KB (configurable). The harvester (prospector) would check the fingerprint after opening the file, in order to check that the contents matches the original file.
#19990 Also discusses fingerprinting for identity tracking. But the solution proposed here is supposed to be used in conjunction with any other identity tracking we already have in place.
The text was updated successfully, but these errors were encountered: