-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Module/metricset for Metricbeat: RAID Metrics #5600
Comments
Here an example content of
@andrewkroh We should probably add support for this in gosigar? https://github.com/elastic/gosigar |
@ruflin There's support for this in procfs so we should use that. https://github.com/elastic/procfs/blob/master/mdstat.go The question I'm pondering is where do we add this. linux/mdstat, system/mdstat, or a maybe more general system/raid? |
My thought process here:
This kind of leads to
|
@plinde I put a PR together with the data in. #5642 @andrewkroh The metricset is not tested yet as I'm not sure how test this best with "actual" raid data. Some ideas here? |
@ruflin Looks great! I think it would be beneficial to include the following additional metrics per RAID device. However, I can see this would be more of an enhancement to procfs's mdstat.go
If possible, it would also be good to compare the blocks.synced/total and perhaps include a Boolean for "synced: true". The scenarios for this being false would include during the rebuild (syncing) of a disk. |
The raid metricset will create each time an event for each device. The metrics you described above are more a summary which you probably do best with a query in ES / KB. You can group by the field In case you are mainly interested in the overall stats we could thinkg about either only doing the overview or have something similar to the For the blocks.synced / total are you referring to the recovery lines? See https://raid.wiki.kernel.org/index.php/Mdstat |
Leaving this open because I think we need
|
It'd be also nice to support common hardware RAID controllers like MegaRAID. |
@jsoriano How is data access for these? |
Access to them is usually via commands, for example |
I would prefer if we would not have to execute commands (if possible). So far we stayed away from it for security reasons. |
Oh ok, I understand, probably these commands use It may be complex if their commands are not used because sometimes they are based in propietary solutions. |
@plinde As we did a first part of the implementation but so far didn't get to follow up with the second part but didn't hear back yet, I wonder if there is still need for par 2? |
@ruflin I'd say that
|
@plinde Could you share some details on what exact values you are interested in from the above? |
I think particularly relevant metrics might be:
|
@plinde is this something you still need? |
So, there are 3 sources of this information:
Tools like The data that @plinde mentions above is from GET_ARRAY_INFO Annoyingly, other interesting information, such as recovery/rebuild/resync percentage in |
Addendum: The Array size that |
So, I made a brief PoC using |
@fearful-symmetry In general you think it's worth pursuing this route further? |
@ruflin Yes. I'm currently putting together a PR for the ioctl implementation of this. I'm going to be running around airports most of the day, so it probably won't happen until Monday. |
Enhancement to Metricbeat for collecting RAID-related metrics; specifically for the equivalents of these commands:
cat /proc/mdstat
mdadm
The text was updated successfully, but these errors were encountered: