User Tools

Site Tools


gpfs:gpfs_sizing

Spectrum Scale sizing

Filesystem Block size, best practice

Typically, metadata is between 1 and 5% of the filesystem space, but this can vary.

Depending on usage, you can have different block size

IO TypeApplication ExamplesBlocksize
Large Sequential IOScientific Computing, Digital Media,File based Analytics1MB to 16MB
Relational DatabaseDB2, Oracle, Small Files on ESS512KiB
Small I/O SequentialGeneral File Service ,Email, Web Applications256KiB
Special*Special16KB-64KiB
*Since GPFS 3.3 there are very few workloads that benefit from a file system blocksize of 16KiB or 64KiB.
WorkloadConfiguration TypeBlocksize
SAP HANAESS GL16MiB for Data
SAP HANAFPO1MiB for single pool
Or
256KiB for metadataOnly
2MiB for dataOnly
HadoopESS GL1MiB for metadataOnly
8MiB for dataOnly
Hadoop FPO256KiB for metadataOnly
2MiB for dataonly
Spark FPO256KiB for metadataOnly
2MiB for dataOnly
SAP Sybase IQ ESS GL256KiB-1MiB for metadataOnly
16MiB for dataOnly
Healthcare (Medical Imaging)ESS256KiB for metadataOnly
1MiB for dataOnly
Healthcare (Medical Imaging)Other Storage256KiB Metadata and data
ArchiveOther StorageDepends on storage and performance requirements
ECMOther Storage256KiB Unless the content is very large files (Videos for example).
Oracle Other Storage 256KiB
Technical ComputingESS GL1MIB Metadata
4MiB - 16MiB Data depending on importance of peak sequential performance.
SASESS1MiB MetadataOnly
8MiB or 16MiB depending on the SASBUF size (128KiB or 256 KiB)
Enterprise File (Misc Projects, data sharing) Other Storage256KiB metadata and data

Customization

NSD access

During NSD creation alternate node position for access to NSD. If the first node is used as first node in NSD definition, then it 'll be only use and you'll reach performance problems

If only the first node is selected as first into NSD definition means every NSD task ( read/ write … ) has to be handled by NSD server 'gpfs1' as long as he is reachable. Such a configuration could cause a overload situation on the affected server.

The NSD server sequence can be adjusted online via command mmchnsd ( see below ): https://www.ibm.com/docs/en/spectrum-scale/5.0.5?topic=disks-changing-your-nsd-configuration Ex:

# mmchnsd "data_nsd043:gpfs03.gpfsint.labo,gpfs04.gpfsint.labo,gpfs01.gpfsint.labo,gpfs02.gpfsint.labo"

Maybe easier with a description file

# mmlsnsd -X
File system Disk name NSD volume ID NSD servers
------------------------------------------------------------------------------------------------
cases data_nsd043 C0A80017543D01BC gpfs03.gpfsint.labo,gpfs04.gpfsint.labo,gpfs01.gpfsint.labo,gpfs02.gpfsint.labo
cases data_nsd044 C0A80018543CE5A2 gpfs04.gpfsint.labo,gpfs01.gpfsint.labo,gpfs02.gpfsint.labo,gpfs03.gpfsint.labo
cases data_nsd045 C0A80017543D01C3 gpfs01.gpfsint.labo,gpfs02.gpfsint.labo,gpfs03.gpfsint.labo,gpfs04.gpfsint.labo
cases data_nsd046 C0A80018543CE5A8 gpfs02.gpfsint.labo,gpfs03.gpfsint.labo,gpfs04.gpfsint.labo,gpfs01.gpfsint.labo
cases data_nsd047 C0A80017543D01C9 gpfs03.gpfsint.labo,gpfs04.gpfsint.labo,gpfs01.gpfsint.labo,gpfs02.gpfsint.labo
gpfs/gpfs_sizing.txt · Last modified: 2024/06/21 17:05 by manu