How to manual replace disk in 3PAR when you get: Proactive Disk Replacement Required: NBD.

We received a new disk from HP because we had proactive failure on disk.

If you ssh to 3par, we can not see any alarms,logs or anything that indicates that a disk have proactive failure.

In short words:

1.servicemag start 0 1                       Failing the drive manually
2. servicemag status -d                      to check the servicemag running in the background , check if the servicemag completes
3.showpd -failed -degraded
4.showpd -c 1                                         Check the OK and Fail column should be 0
5.Replace the drive on the cage postion 0:1:0
6. Wait for the servicemag status -d to finnish
7. If the failed drive is still in the system
8. showpd -failed -degraded
9. dismisspd 1
10. showpd -failed -degraded (Shoul’d be 0 disks now)

 

The long one:

But this is how I did it.

The disk that we need to replace is: 1 0:1:0

To see disks

HP8200-01 cli% showpd

Id CagePos Type RPM State Capacity(GB)
0 0:0:0 FC 10 normal 1800
1 0:1:0 FC 10 normal 1800
2 1:9:0 FC 10 normal 1800
3 0:3:0 FC 10 normal 1800
4 0:4:0 FC 10 normal 1800

As you can see no warning or anything.

Then you need to fail the disk manually

#servicemag start 0 1

(or you can use servicemag start -pdid 1)

where 0 represents the Cage and 1 represent slot and space will be in between
So cage 0 slot 1 serviecmag start will run.

Then you can monitory this or wait some hours in our case (9 hours)

To check the servicemag running in the background , check if the servicemag completes:

servicemag status -d

HP8200-01 cli% servicemag status -d
Cage 0, magazine 1:
The magazine was successfully brought offline by a servicemag start command.
The command completed at Tue Oct 22 21:14:30 2019.
The output of the servicemag start was:
servicemag start 0 1
… servicing disks in mag: 0 1
… normal disks: WWN [] Id [ 1] diskpos [0]
… not normal disks:
… relocating chunklets to spare space…
… Could not spin down disk WWN [] Id [ 1]:
Opcode = SCCMD_DOCDB
Node = 0
Tpd error code = TE_FAIL — Generic failure code
Tpd error info = Scsi failure code 0xb2
… spinning down disk WWN [] Id [ 1]
… bypassing mag 0 1
… bypassed mag 0 1
servicemag start 0 1 — Succeeded
HP8200-01 cli%

After this you can run below command.
HP8200-01 cli% showpd -failed -degraded

Id CagePos Type RPM State Total Free A B Capacity(GB)
1 0:1:0 FC 10 degraded 1715200 0 ----- ----- 1800
--------------------------------------------------------------------
1 total 1715200 0 
HP8200-01 cli%

Check the OK and Fail column should be 0

showpd -c 1

showpd -c 1 
------- Normal Chunklets -------- ---- Spare Chunklets ----
- Used - -------- Unused -------- - Used - ---- Unused ----
Id CagePos Type State Total OK Fail Free Uninit Unavail Fail OK Fail Free Uninit Fail
1 0:1:0 FC degraded 1675 0 0 0 0 1675 0 0 0 0 0 0
------------------------------------------------------------------------------------------
1 total 1675 0 0 0 0 1675 0 0 0 0 0

After this replace the drive

  1. servicemag status -d (Wait for this to finnsih)
  2. showpd -failed -degraded
  3. dismisspd 1 (1 is the number of the ID of the disk)
  4. showpd -failed -degraded (Should be 0)

Done!

Advertisement

3 thoughts on “How to manual replace disk in 3PAR when you get: Proactive Disk Replacement Required: NBD.

Add yours

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Blog at WordPress.com.

Up ↑

%d bloggers like this: