User Tools

Site Tools


gpfs:gpfs_healthcheck

This is an old revision of the document!


A.	Objectif

Contrôler l’état d’un filesystem GPFS et rectifier les anomalies

B.	Principales entités concernées

Système

C.	Description générale et flux des données



D.	Définition des termes

Node names
Correspondance GPFS-AIX

GPFS	        AIX
p5-gpfs-h	oragpfh
p5-gpfs-r	oragpfr
p5-gpfs-k	oragpfk



E.	Instructions spécifiques

1)	Affichez les informations

mmlsnsd affiche les filesystemes GPFS connues sur la machine. (peut s’exécuter sur les 3 machines)

root@oragpfh:/home/root>mmlsnsd

 File system   Disk name    Primary node             Backup node           
---------------------------------------------------------------------------
 orafs        DiskR        p5-gpfs-r                p5-gpfs-h
 orafs        DiskH        p5-gpfs-r                p5-gpfs-h
 orafs        DiskK        gpfs-k                   
 orafs2       DiskR2       p5-gpfs-r                p5-gpfs-h
 orafs2       DiskH2       p5-gpfs-r                p5-gpfs-h
 orafs2       DiskK2       gpfs-k                   

root@oragpfh:/home/root>mmlsnsd -L

L’option –L affiche les NSD volume ID

 File system   Disk name    NSD volume ID      Primary node             Backup node          
--------------------------------------------------------------------------------------------
 orafs        DiskR        0A040120452A20B4   p5-gpfs-r                p5-gpfs-h
 orafs        DiskH        0A040120452A20B6   p5-gpfs-r                p5-gpfs-h
 orafs        DiskK        AC13131D48008499   gpfs-k                   
 orafs2       DiskR2       0A04012046827A9E   p5-gpfs-r                p5-gpfs-h
 orafs2       DiskH2       0A04012046827AA0   p5-gpfs-r                p5-gpfs-h
 orafs2       DiskK2       AC13131D480084BD   gpfs-k                   
 

root@oragpfh:/home/root>mmlsnsd -M

L’option –L affiche la correspondance disque GPFS – disque OS/machine 
ex : DiskR = hdisk0 sur p5-gpfs-r (primary node)= hdisk3 sur p5-gpfs-h (backup node)

 Disk name    NSD volume ID      Device         Node name                Remarks       
---------------------------------------------------------------------------------------
 DiskR        0A040120452A20B4   /dev/hdisk0    p5-gpfs-r                primary node
 DiskR        0A040120452A20B4   /dev/hdisk3    p5-gpfs-h                backup node
 DiskH        0A040120452A20B6   /dev/hdisk3    p5-gpfs-r                primary node
 DiskH        0A040120452A20B6   /dev/hdisk0    p5-gpfs-h                backup node
 DiskK        AC13131D48008499   /dev/descgpfslv gpfs-k                   primary node
 DiskR2       0A04012046827A9E   /dev/hdisk4    p5-gpfs-r                primary node
 DiskR2       0A04012046827A9E   /dev/hdisk5    p5-gpfs-h                backup node
 DiskH2       0A04012046827AA0   /dev/hdisk5    p5-gpfs-r                primary node
 DiskH2       0A04012046827AA0   /dev/hdisk4    p5-gpfs-h                backup node
 DiskK2       AC13131D480084BD   /dev/descgpfslv2 gpfs-k                   primary node

mmlsdisk permet d’afficher les disques d’un filesysteme GPFS. (peut s’exécuter sur les 3 machines)

root@oragpfh:/home/root>mmlsdisk orafs
disk         driver   sector failure holds    holds
name         type       size   group metadata data  status        availability
------------ -------- ------ ------- -------- ----- ------------- ------------
DiskR        nsd         512       1 yes      yes   ready         up           
DiskH        nsd         512       2 yes      yes   ready         up           
DiskK        nsd         512       3 no       no    ready         up   

Le statu normal des disques est ready,  la disponibilité est up

En cas d’incohérences, un message d’alerte est affiché après les informations disques !

ex : Attention : Due to an earlier configuration change, the file system
may contain data that is at risk of being lost


Autes commandes :
mmlscluster
root@oragpfh:/home/root>mmlscluster

GPFS cluster information
========================
  GPFS cluster name:         p5-gpfs-r
  GPFS cluster id:           12399281700916488274
  GPFS UID domain:           p5-gpfs-r
  Remote shell command:      /usr/bin/ssh
  Remote file copy command:  /usr/bin/scp

GPFS cluster configuration servers:
-----------------------------------
  Primary server:    p5-gpfs-r
  Secondary server:  p5-gpfs-h

 Node number  Node name    IP address       Full node name              Remarks    
-----------------------------------------------------------------------------------
       1      p5-gpfs-h    10.11.10.2      p5-gpfs-h                   quorum node
       2      p5-gpfs-r    10.11.10.3      p5-gpfs-r                   quorum node
       3      gpfs-k       10.11.10.4      gpfs-k                      quorum node

mmlsfs « Fs-name »  			(Fs-name ={orafs|orafs2}
root@oragpfh:/home/root>mmlsfs orafs
flag value          description
---- -------------- -----------------------------------------------------
 -s  roundRobin     Stripe method
 -f  8192           Minimum fragment size in bytes
 -i  512            Inode size in bytes
 -I  16384          Indirect block size in bytes
 -m  2              Default number of metadata replicas
 -M  2              Maximum number of metadata replicas
 -r  2              Default number of data replicas
 -R  2              Maximum number of data replicas
 -j  cluster        Block allocation type
 -D  posix          File locking semantics in effect
 -k  posix          ACL semantics in effect
 -a  1048576        Estimated average file size
 -n  32             Estimated number of nodes that will mount file system
 -B  262144         Block size
 -Q  user;group     Quotas enforced
     none           Default quotas enabled
 -F  185344         Maximum number of inodes
 -V  8.01           File system version. Highest supported version: 8.02
 -u  yes            Support for large LUNs?
 -z  no             Is DMAPI enabled?
 -E  yes            Exact mtime mount option
 -S  no             Suppress atime mount option
 -d  DiskR;DiskH;DiskK  Disks in file system
 -A  yes            Automatic mount option
 -o  none           Additional mount options
 -T  /kora        Default mount point




2)	Réactiver / resynchroniser un disque


Si l’état d’un disque est différent de ready et up, il faut utiliser la commande mmchdisk pour activer le disque et mmrestripefs pour resynchroniser le disque.
Ces commandes mettent un certain temps (jusqu’à 30 minutes). Il est fortement conseiller d’attendre la fin d’une commande avant de lancer la suivante.

Si le disque DiskH est down  mmchdisk orafs start -d DiskH 
Si le disque DiskH est suspended  mmchdisk orafs resume –d DiskH

Si la commande mmlsdisk renseigne des alertes :
Attention : Due to an earlier configuration change, the file system
may contain data that is at risk of being lost
 ==> mmrestripefs “Fs-name” –r –N mount    (Fs-name ={orafs|orafs2})
relancer la commande mmlsdisk “Fs-name” pour conrôler le résultat

Attention : Due to an earlier configuration change, the file system
is no longer properly balanced.
==> mmrestripefs “Fs-name”  -b -N mount    (Fs-name ={orafs|orafs2})
relancer la commande mmlsdisk “Fs-name” pour conrôler le résultat
 
gpfs/gpfs_healthcheck.1611736767.txt.gz · Last modified: 2021/01/27 09:39 by manu