HPCG-Benchmark
version=3.1
Release date=March 28, 2019
Machine Summary=
Machine Summary::Distributed Processes=157248
Machine Summary::Threads per processes=1
Global Problem Dimensions=
Global Problem Dimensions::Global nx=7168
Global Problem Dimensions::Global ny=6912
Global Problem Dimensions::Global nz=6656
Processor Dimensions=
Processor Dimensions::npx=56
Processor Dimensions::npy=54
Processor Dimensions::npz=52
Local Domain Dimensions=
Local Domain Dimensions::nx=128
Local Domain Dimensions::ny=128
Local Domain Dimensions::Lower ipz=0
Local Domain Dimensions::Upper ipz=51
Local Domain Dimensions::nz=128
########## Problem Summary  ##########=
Setup Information=
Setup Information::Setup Time=15.079
Linear System Information=
Linear System Information::Number of Equations=329772957696
Linear System Information::Number of Nonzero Terms=8901291396088
Multigrid Information=
Multigrid Information::Number of coarse grid levels=3
Multigrid Information::Coarse Grids=
Multigrid Information::Coarse Grids::Grid Level=1
Multigrid Information::Coarse Grids::Number of Equations=41221619712
Multigrid Information::Coarse Grids::Number of Nonzero Terms=1112339179000
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
Multigrid Information::Coarse Grids::Grid Level=2
Multigrid Information::Coarse Grids::Number of Equations=5152702464
Multigrid Information::Coarse Grids::Number of Nonzero Terms=138961859320
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
Multigrid Information::Coarse Grids::Grid Level=3
Multigrid Information::Coarse Grids::Number of Equations=644087808
Multigrid Information::Coarse Grids::Number of Nonzero Terms=17350109560
Multigrid Information::Coarse Grids::Number of Presmoother Steps=1
Multigrid Information::Coarse Grids::Number of Postsmoother Steps=1
########## Memory Use Summary  ##########=
Memory Use Information=
Memory Use Information::Total memory used for data (Gbytes)=235851
Memory Use Information::Memory used for OptimizeProblem data (Gbytes)=0
Memory Use Information::Bytes per equation (Total memory / Number of Equations)=715.193
Memory Use Information::Memory used for linear system and CG (Gbytes)=207552
Memory Use Information::Coarse Grids=
Memory Use Information::Coarse Grids::Grid Level=1
Memory Use Information::Coarse Grids::Memory used=24805.6
Memory Use Information::Coarse Grids::Grid Level=2
Memory Use Information::Coarse Grids::Memory used=3104.75
Memory Use Information::Coarse Grids::Grid Level=3
Memory Use Information::Coarse Grids::Memory used=389.152
########## V&V Testing Summary  ##########=
Spectral Convergence Tests=
Spectral Convergence Tests::Result=PASSED
Spectral Convergence Tests::Unpreconditioned=
Spectral Convergence Tests::Unpreconditioned::Maximum iteration count=11
Spectral Convergence Tests::Unpreconditioned::Expected iteration count=12
Spectral Convergence Tests::Preconditioned=
Spectral Convergence Tests::Preconditioned::Maximum iteration count=2
Spectral Convergence Tests::Preconditioned::Expected iteration count=2
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon=
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Result=PASSED
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for SpMV=4.03136e-14
Departure from Symmetry |x'Ay-y'Ax|/(2*||x||*||A||*||y||)/epsilon::Departure for MG=3.92814e-13
########## Iterations Summary  ##########=
Iteration Count Information=
Iteration Count Information::Result=PASSED
Iteration Count Information::Reference CG iterations per set=50
Iteration Count Information::Optimized CG iterations per set=50
Iteration Count Information::Total number of reference iterations=1200
Iteration Count Information::Total number of optimized iterations=1200
########## Reproducibility Summary  ##########=
Reproducibility Information=
Reproducibility Information::Result=PASSED
Reproducibility Information::Scaled residual mean=0.00506715
Reproducibility Information::Scaled residual variance=0
########## Performance Summary (times in sec) ##########=
Benchmark Time Summary=
Benchmark Time Summary::Optimization phase=2.84e-07
Benchmark Time Summary::DDOT=223.16
Benchmark Time Summary::WAXPBY=48.8269
Benchmark Time Summary::SpMV=259.301
Benchmark Time Summary::MG=1442.39
Benchmark Time Summary::Total=1973.91
Floating Point Operations Summary=
Floating Point Operations Summary::Raw DDOT=2.39019e+15
Floating Point Operations Summary::Raw WAXPBY=2.39019e+15
Floating Point Operations Summary::Raw SpMV=2.17904e+16
Floating Point Operations Summary::Raw MG=1.21914e+17
Floating Point Operations Summary::Total=1.48485e+17
Floating Point Operations Summary::Total with convergence overhead=1.48485e+17
GB/s Summary=
GB/s Summary::Raw Read B/W=463313
GB/s Summary::Raw Write B/W=107065
GB/s Summary::Raw Total B/W=570378
GB/s Summary::Total with convergence and optimization phase overhead=560109
GFLOP/s Summary=
GFLOP/s Summary::Raw DDOT=10710.7
GFLOP/s Summary::Raw WAXPBY=48952.5
GFLOP/s Summary::Raw SpMV=84035
GFLOP/s Summary::Raw MG=84522.5
GFLOP/s Summary::Raw Total=75223.7
GFLOP/s Summary::Total with convergence overhead=75223.7
GFLOP/s Summary::Total with convergence and optimization phase overhead=73869.4
User Optimization Overheads=
User Optimization Overheads::Optimization phase time (sec)=2.84e-07
User Optimization Overheads::Optimization phase time vs reference SpMV+MG time=1.86672e-07
DDOT Timing Variations=
DDOT Timing Variations::Min DDOT MPI_Allreduce time=102.572
DDOT Timing Variations::Max DDOT MPI_Allreduce time=220.807
DDOT Timing Variations::Avg DDOT MPI_Allreduce time=117.981
Final Summary=
Final Summary::HPCG result is VALID with a GFLOP/s rating of=73869.4
Final Summary::HPCG 2.4 rating for historical reasons is=75223.7
Final Summary::Reference version of ComputeDotProduct used=Performance results are most likely suboptimal
Final Summary::Reference version of ComputeSPMV used=Performance results are most likely suboptimal
Final Summary::Reference version of ComputeMG used=Performance results are most likely suboptimal
Final Summary::Reference version of ComputeWAXPBY used=Performance results are most likely suboptimal
Final Summary::Please upload results from the YAML file contents to=http://hpcg-benchmark.org