BAD TERMINATION using epw.x
Posted: Mon Jun 11, 2018 12:22 am
Dear all,
I am running EPW-4.3 with QE-6.2.1. Once the "amn" files are being calculated , the code stops with the error:
"AMN calculated
MMN
k points = 576 in 512 pools
1 of 2 on ionode
2 of 2 on ionode
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= PID 17975 RUNNING AT kcn464.local
= EXIT CODE: 6
= CLEANING UP REMAINING PROCESSES
= YOU CAN IGNORE THE BELOW CLEANUP MESSAGES"
Checking the error file I get:
"forrtl: error (69): process interrupted (SIGINT)
Image PC Routine Line Source
epw.x 00000000010A1A21 Unknown Unknown Unknown
epw.x 00000000010A0177 Unknown Unknown Unknown
epw.x 0000000000FA3464 Unknown Unknown Unknown
epw.x 0000000000FA3276 Unknown Unknown Unknown
epw.x 0000000000F23B04 Unknown Unknown Unknown
epw.x 0000000000F2B887 Unknown Unknown Unknown
libpthread.so.0 00002ADC89925500 Unknown Unknown Unknown
libc.so.6 00002ADC89E863A7 Unknown Unknown Unknown
libmpi.so.12 00002ADC88E56034 Unknown Unknown Unknown
libmpi.so.12 00002ADC88E503E3 Unknown Unknown Unknown
libmpi.so.12 00002ADC88F61CE9 Unknown Unknown Unknown
libmpi.so.12 00002ADC88F619DF Unknown Unknown Unknown
libmpi.so.12 00002ADC88E2385C Unknown Unknown Unknown
libmpi.so.12 00002ADC88E2773D Unknown Unknown Unknown
libmpi.so.12 00002ADC88E26F6E Unknown Unknown Unknown
libmpifort.so.12 00002ADC889FEC57 Unknown Unknown Unknown
epw.x 0000000000D04526 reduce_base_real_ 303 mp_base.f90
epw.x 0000000000CF6519 mp_mp_mp_sum_c4d_ 1537 mp.f90
epw.x 00000000005059A6 compute_mmn_para_ 1125 pw2wan90epw.f90
epw.x 00000000004F2FED pw2wan90epw_ 78 pw2wan90epw.f90
epw.x 00000000004F100C wann_run_ 69 wannierize.f90
epw.x 0000000000412C6E MAIN__ 137 epw.f90
epw.x 0000000000411EDE Unknown Unknown Unknown
libc.so.6 00002ADC89DD5CDD Unknown Unknown Unknown
epw.x 0000000000411DE9 Unknown Unknown Unknown"
I suspect that this due to the number of processors used. I am using
mpirun -np 512 /......./EPW/bin/epw.x -npool 512 < epw.in > epw.out
I need such a large number to accelerate the calculations. In fact I usually define the nodes in the pbs file as:
select= ;ncpus=;mpiprocs=;ompthreads=;
How is it possible to define the variables above such that the number of processors be equal to the number of pools (by defining more than one node, that is without having to put 512 processors per single node)?
Thanks in advance
Elio
I am running EPW-4.3 with QE-6.2.1. Once the "amn" files are being calculated , the code stops with the error:
"AMN calculated
MMN
k points = 576 in 512 pools
1 of 2 on ionode
2 of 2 on ionode
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= PID 17975 RUNNING AT kcn464.local
= EXIT CODE: 6
= CLEANING UP REMAINING PROCESSES
= YOU CAN IGNORE THE BELOW CLEANUP MESSAGES"
Checking the error file I get:
"forrtl: error (69): process interrupted (SIGINT)
Image PC Routine Line Source
epw.x 00000000010A1A21 Unknown Unknown Unknown
epw.x 00000000010A0177 Unknown Unknown Unknown
epw.x 0000000000FA3464 Unknown Unknown Unknown
epw.x 0000000000FA3276 Unknown Unknown Unknown
epw.x 0000000000F23B04 Unknown Unknown Unknown
epw.x 0000000000F2B887 Unknown Unknown Unknown
libpthread.so.0 00002ADC89925500 Unknown Unknown Unknown
libc.so.6 00002ADC89E863A7 Unknown Unknown Unknown
libmpi.so.12 00002ADC88E56034 Unknown Unknown Unknown
libmpi.so.12 00002ADC88E503E3 Unknown Unknown Unknown
libmpi.so.12 00002ADC88F61CE9 Unknown Unknown Unknown
libmpi.so.12 00002ADC88F619DF Unknown Unknown Unknown
libmpi.so.12 00002ADC88E2385C Unknown Unknown Unknown
libmpi.so.12 00002ADC88E2773D Unknown Unknown Unknown
libmpi.so.12 00002ADC88E26F6E Unknown Unknown Unknown
libmpifort.so.12 00002ADC889FEC57 Unknown Unknown Unknown
epw.x 0000000000D04526 reduce_base_real_ 303 mp_base.f90
epw.x 0000000000CF6519 mp_mp_mp_sum_c4d_ 1537 mp.f90
epw.x 00000000005059A6 compute_mmn_para_ 1125 pw2wan90epw.f90
epw.x 00000000004F2FED pw2wan90epw_ 78 pw2wan90epw.f90
epw.x 00000000004F100C wann_run_ 69 wannierize.f90
epw.x 0000000000412C6E MAIN__ 137 epw.f90
epw.x 0000000000411EDE Unknown Unknown Unknown
libc.so.6 00002ADC89DD5CDD Unknown Unknown Unknown
epw.x 0000000000411DE9 Unknown Unknown Unknown"
I suspect that this due to the number of processors used. I am using
mpirun -np 512 /......./EPW/bin/epw.x -npool 512 < epw.in > epw.out
I need such a large number to accelerate the calculations. In fact I usually define the nodes in the pbs file as:
select= ;ncpus=;mpiprocs=;ompthreads=;
How is it possible to define the variables above such that the number of processors be equal to the number of pools (by defining more than one node, that is without having to put 512 processors per single node)?
Thanks in advance
Elio