

I often find myself in need of locating disks in an LSI RAID that are not quite dead yet, but in the process of dying. Google knows how to do that using MegaCli, but I totally hate that tool and want to do the same thing using storcli instead, which is a bit less insane. Here's how.

For a quick glance on which disk might be causing trouble, try:

    storcli /c0/eALL/sALL show all | grep -e 'State :' -e "Predictive Failure Count"

For disks that are okay, this count should be 0.

Notably though, the disk in question is not yet broken enough to be actually thrown out of the array:

    storcli /c0/e8/s2 show
    Description = Show Drive Information Succeeded.

    EID:Slt DID State DG       Size Intf Med SED PI SeSz Model           Sp
    8:2      11 Onln   1 558.406 GB SAS  HDD N   N  512B HUS156060VLS600 U

State is still online and the array is not yet degraded, it's only throwing random IO errors.

MegaCli has a nice way of getting a quick overview, using the MegaCli -AdpAllInfo -aAll command:

    megacli64 -AdpAllInfo -aALL | less

Storcli seems to be able to also output this information:

    grep 'Critical Disks' /usr/local/sbin/storcli
    Binary file /usr/local/sbin/storcli matches

Unfortunately I haven't yet managed to find out the incantation necessary to make it actually spit out that info, and the manual is less than helpful in that regard too.

In general, smartctl from smartmontools is able to see hard disk info through various hardware RAID controllers. For example, for Symbios/LSI/Avago MegaRAIDs and recent Dell PERCs:

    smartctl -i /dev/sda -d megaraid,0
    smartctl 6.6 r4324 x86_64-linux-4.15.

Locating the disk

Fire this command:

    storcli /c0/e8/s2 start locate

In my setup, this will make the activity LED blink really fast. :( This is of course less than helpful if your box is currently under load, because then the activity LED will blink really fast anyway. Probably this is why sane RAID controllers blink the failure LED for locating disks, but hey, this is LSI. So, better make sure your server isn't doing anything other than locating the fucking disk.

A kind soul wrote me that there's a setting to control this behavior:

    storcli /cALL set activityforlocate=off

It seems to be off by default though, and also this behavior only occurs on our old hardware. We bought new boxen in the meantime, those actually blink the failure LED as they should.

After locating it, I just pulled the disk out, added a replacement in, and found the RAID controller had already started rebuilding on its own. Since the replacement disk had been in use before, I'd expected a foreign configuration to show up. But:

    storcli /c0/fALL show
    Description = Couldn't find any foreign Configuration

Alright then I guess, maybe we picked a previous hot spare that had in fact not yet been in use anywhere. So, in conclusion: meh.
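The Predictive Failure Count check can be scripted so that only the suspicious drives get printed. This is a minimal sketch. It assumes the `show all` output has a `Drive /cX/eY/sZ State :` header line before each drive's `Predictive Failure Count = N` line, which is what the grep in this post matches on. The function name `failing_drives` is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: reads `storcli ... show all` output on stdin and
# prints "<drive> <count>" for every drive whose Predictive Failure Count
# is greater than zero.
failing_drives() {
    awk '
        # Remember which drive the following counters belong to.
        /Drive .* State :/ { drive = $2 }
        # The count is the last field of the "... Count = N" line.
        /Predictive Failure Count/ { if ($NF + 0 > 0) print drive, $NF }
    '
}

# Usage (needs a real controller, so it is left commented out):
# storcli /c0/eALL/sALL show all | failing_drives
```

If nothing is printed, no drive reported a nonzero count. As the post shows, that does not rule out a disk that is throwing IO errors.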

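Once the disk is found, the locate LED can be switched off again with `stop locate`, the counterpart to `start locate`. The sketch below wraps both in one function so the blinking does not get forgotten. `locate_for` and the `STORCLI` variable are hypothetical names, and the 60 second default is an arbitrary choice.

```shell
#!/bin/sh
# Allow overriding the binary, e.g. STORCLI=/usr/local/sbin/storcli
STORCLI=${STORCLI:-storcli}

# Hypothetical helper: blink the locate LED of one drive for a while,
# then switch it off again.
#   $1  drive path as storcli expects it, e.g. /c0/e8/s2
#   $2  seconds to keep blinking (default: 60)
locate_for() {
    drive=$1
    seconds=${2:-60}
    "$STORCLI" "$drive" start locate || return 1
    sleep "$seconds"
    "$STORCLI" "$drive" stop locate
}

# Usage (left commented out, it needs a real controller):
# locate_for /c0/e8/s2 120
```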