How to evaluate the memory needed for a computation?
Posted: Tue Dec 19, 2023 2:35 pm
Dear all,
(To be honest, I have post the same topic two days ago, but I can not find it now.)
When I proferm calculation for a 3D system (38 wannier functions), the process always reports an error about memory (Specifically, not enough memory can be used).
My job is cut into three steps. The first step is wannier interpolation. The second step is used to calculating the matrix of EPC. The third step is used to calculate the gap related properties.
When I'm doing the third step with a default value for max_memlt, the code calls erro information as following:
Size of required memory per pool: ~= 22.8741 Gb
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Error in routine mem_size_eliashberg (1):
Size of required memory exceeds max_memlt
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
So i enhance the max_memlt to 32 GB, and I resubimit the job with 256 cores (32 nodes, 256 GB & 8 cores per node). This seems to solve the problem above. But unfortunately, the process terminates with the error imformation as
Size of allocated memory per pool: ~= 22.8741 Gb
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 30 PID 14207 RUNNING AT cn022
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================
Why is this happening? Can anyone give me some advice and help?
How to evaluate the memory a job needs?
For my job, the nkf and nqf set as 40*40*40 and 20*20*20 can give excellent lambda comparing to the results calculating via QE. In terms of computing resources, I have used almost all the nodes available to me. That means I can not increase the number of node for my job. What parameters can I change to solve my problem?
Any advice and comment are welcomed.
Thanks and Best Regards!
Jianguo Si
-------------------------------------------------
Songshan Lake Materials Laboratory
Building A1, University Innovation Park, Songshan Lake, Dongguan, Guangdong, CHINA
(To be honest, I have post the same topic two days ago, but I can not find it now.)
When I proferm calculation for a 3D system (38 wannier functions), the process always reports an error about memory (Specifically, not enough memory can be used).
My job is cut into three steps. The first step is wannier interpolation. The second step is used to calculating the matrix of EPC. The third step is used to calculate the gap related properties.
When I'm doing the third step with a default value for max_memlt, the code calls erro information as following:
Size of required memory per pool: ~= 22.8741 Gb
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Error in routine mem_size_eliashberg (1):
Size of required memory exceeds max_memlt
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
So i enhance the max_memlt to 32 GB, and I resubimit the job with 256 cores (32 nodes, 256 GB & 8 cores per node). This seems to solve the problem above. But unfortunately, the process terminates with the error imformation as
Size of allocated memory per pool: ~= 22.8741 Gb
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 30 PID 14207 RUNNING AT cn022
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================
Why is this happening? Can anyone give me some advice and help?
How to evaluate the memory a job needs?
For my job, the nkf and nqf set as 40*40*40 and 20*20*20 can give excellent lambda comparing to the results calculating via QE. In terms of computing resources, I have used almost all the nodes available to me. That means I can not increase the number of node for my job. What parameters can I change to solve my problem?
Any advice and comment are welcomed.
Thanks and Best Regards!
Jianguo Si
-------------------------------------------------
Songshan Lake Materials Laboratory
Building A1, University Innovation Park, Songshan Lake, Dongguan, Guangdong, CHINA