Lustre on ZFS Metadata Failover

by Andrew Wagner — last modified May 19, 2014

 

Important Note: The definitive source for Lustre documentation is the Lustre Operations Manual available at https://wiki.hpdd.intel.com/display/PUB/Documentation.

These documents are copied from internal SSEC working documentation that may be useful to some, but we provide no guarantee of accuracy, correctness, or safety. Use at your own risk.

 

NOTE: Use at your own risk, especially for backup and recovery. Test these procedures yourself and do not simply trust these instructions.

Switching over to a ZFS Snapshot Copy on a Backup Metadata Server

  1. Shut down the primary Lustre MDS. Ensure it is actually powered off, as two live copies of the MDT will be bad news bears.
  2. Shut down the OSTs.
  3. On the backup MDS, change the network settings for em1, ib0, and /etc/sysconfig/network to match the primary MDS. Ensure that /etc/ldev.conf references the right vdev devices.
  4. Reboot the backup MDS to apply the network settings.
  5. Ensure that winbind is working. If not, run "net join -U Administrator" and restart winbind. Verify with "id" that usernames resolve.
  6. After verifying that the network settings are correct, start Lustre on the backup MDS with "service lustre start".
  7. Start Lustre on the OSTs.
  8. Recovery must complete on the OSTs and the MDT. Check both before mounting any clients.
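The "verify network settings" part of step 6 can be scripted as a quick identity check before starting Lustre. This is a hedged sketch: the expected hostname and NID are assumptions taken from this document's examples (server-1, 172.16.23.14@o2ib), so adjust them for your site.

```shell
#!/bin/sh
# Sketch: confirm the backup MDS has assumed the primary's identity before
# "service lustre start". Hostname and NID values are examples from this doc.

check() {  # check <what> <expected> <actual> -> prints PASS or FAIL
    if [ "$2" = "$3" ]; then
        echo "PASS: $1 = $2"
    else
        echo "FAIL: $1 is '$3', expected '$2'"
    fi
}

expected_host=server-1              # identity the backup MDS is assuming
expected_nid="172.16.23.14@o2ib"    # NID from the lctl list_nids example

check hostname "$expected_host" "$(hostname -s)"
# lctl only exists on Lustre servers; skip the NID check elsewhere.
if command -v lctl >/dev/null 2>&1; then
    check nid "$expected_nid" "$(lctl list_nids | head -n1)"
fi
```

Any FAIL line means the server is not yet presenting the primary's identity and Lustre should not be started.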
 

Method for switch-over:

Snapshot transfer example:

zfs send -R lustre-meta@2014051521 | ssh server-2 zfs receive -uv backup/lustre-meta
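After that initial full send, subsequent transfers can be incremental. This is a hedged sketch: the newer snapshot name (2014051600) is hypothetical, while the pool and target names come from the command above. The run() wrapper prints each command instead of executing it, so the sketch is safe to walk through; remove the wrapper to run it for real.

```shell
#!/bin/sh
# Sketch: incremental follow-up to the full "zfs send -R" above.
run() { echo "+ $*"; }   # print instead of executing; drop to run for real

old=lustre-meta@2014051521   # snapshot already received on server-2
new=lustre-meta@2014051600   # hypothetical newer snapshot

run zfs snapshot -r "$new"
# -i sends only blocks changed between $old and $new; -R keeps the
# descendant datasets in sync; -u leaves the received file systems
# unmounted on the backup server.
run "zfs send -R -i $old $new | ssh server-2 zfs receive -uv backup/lustre-meta"
```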
 
 
 

Switch-over Procedure

 
This procedure assumes your snapshots / backups have already been taken and are current. If they have not, review:
 
https://jira.hpdd.intel.com/browse/LUDOC-161
 
  1. Unmount the Lustre clients.
    1. To list clients, run lshowmount -lv on a Lustre server (probably the MDS).
  2. Prepare the primary MDS for shutdown
    1. Unmount the OSTs.
    2. Edit its /etc/sysconfig/network-scripts/ifcfg-ib0, ifcfg-ib1, and ifcfg-em1 so they match the backup server.
    3. Update /etc/sysconfig/network so that the hostname is the secondary hostname (server-2 in our example).
    4. Ensure the Lustre service won't start if the server is powered on: chkconfig lustre off
    5. Review /etc/ldev.conf and the ZFS file system mount point.
    6. Verify that your backup script exists on the server about to become primary before shutting this one down.
    7. Power off the server.
  3. Prepare the secondary to become primary
    1. Edit its /etc/sysconfig/network-scripts/ifcfg-ib0, ifcfg-ib1, and ifcfg-em1 so they match the primary server.
    2. Update /etc/sysconfig/network so that the hostname is the primary hostname (server-1 in our example).
    3. Reboot.
    4. Check winbind (or whatever authentication you use).
    5. Review /etc/ldev.conf and the ZFS file system mount point.
    6. Upon boot completion, start Lustre (service lustre start).
      1. Test/debug with lctl list_nids; you should see 172.16.23.14@o2ib.
    7. Ensure the MDT is mounted.
    8. Start the service / ensure the OST is mounted on each OSS (rocks run host $geoarc_oss command="service lustre start").
    9. Monitor recovery. (Do we really want to allow recovery at all? It will recover the client list as of the time of the snapshot, which probably doesn't make sense; otherwise it is just a wasted five minutes minimum.)
    10. Verify that your backup script is now set to execute on this server once it becomes the primary.
  4. Remount the clients.
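The recovery monitoring in step 9 comes down to reading the recovery_status parameter until it reports COMPLETE. On a live server you would read it with "lctl get_param mdt.*.recovery_status" (MDS) or "obdfilter.*.recovery_status" (OSS); the sketch below parses a sample transcript instead so the logic is runnable anywhere, and the field values are made up.

```shell
#!/bin/sh
# Sketch: decide whether Lustre recovery has finished from recovery_status
# output. The sample below stands in for "lctl get_param ... recovery_status".
sample=$(mktemp)
cat > "$sample" <<'EOF'
status: COMPLETE
recovery_start: 1400500000
recovery_duration: 143
completed_clients: 12/12
EOF

state=$(awk '/^status:/ {print $2}' "$sample")
clients=$(awk '/^completed_clients:/ {print $2}' "$sample")
if [ "$state" = COMPLETE ]; then
    echo "recovery complete ($clients clients)"
else
    echo "still recovering: $state"
fi
rm -f "$sample"
```

Do not mount clients until every target reports COMPLETE.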
 
 

ZFS file system / /etc/ldev.conf consistency

 
    1. Verify the volume is unmounted.
    2. service lustre stop
    3. service lnet stop (LNet cannot stop while Lustre targets are still mounted, so stop Lustre first)
    4. Rename the file system (mount point) with zfs rename so that it matches what is in /etc/ldev.conf. For example:
      1. Change the contents of /etc/ldev.conf to reflect the final ZFS mount points ("device-path").
      2. zfs rename backup/lustre-meta/mgs lustre-meta/mgs
      3. zfs rename backup/lustre-meta/arcdata-meta lustre-meta/arcdata-meta
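After the renames, it is worth checking that every zfs: device listed in /etc/ldev.conf actually exists as a ZFS dataset. This is a hedged sketch using sample data in place of the real files; the hostnames and labels are assumptions modeled on this document's server-1/server-2 example, and ldev.conf columns are local-host, foreign-host, label, device-path.

```shell
#!/bin/sh
# Sketch: cross-check ldev.conf device paths against existing ZFS datasets.
ldev=$(mktemp); datasets=$(mktemp)
cat > "$ldev" <<'EOF'
server-1 server-2 MGS zfs:lustre-meta/mgs
server-1 server-2 arcdata-MDT0000 zfs:lustre-meta/arcdata-meta
EOF
# Stand-in for the real dataset list: zfs list -H -o name
printf '%s\n' lustre-meta/mgs lustre-meta/arcdata-meta > "$datasets"

missing=0
while read -r _local _foreign label dev; do
    ds=${dev#zfs:}                       # strip the zfs: prefix
    grep -qx "$ds" "$datasets" || { echo "MISSING $label: $ds"; missing=1; }
done < "$ldev"
[ "$missing" -eq 0 ] && echo "ldev.conf matches ZFS datasets"
rm -f "$ldev" "$datasets"
```

On a real server, replace the two heredoc/printf stand-ins with /etc/ldev.conf and the output of zfs list -H -o name.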