How to properly restart calculation?
Posted: Tue May 07, 2024 4:25 pm
Hi EPW users and developers,
I am trying to compute following epw.x calculation but fails due to time limit. I have been trying to restart the calculation but it does everything the same (wannierization steps and all and stops at the same point). I guess I have not been able to properly restart the calculation. Can you please suggest me how can I restart from last results (skipping wannierization and previously computed steps).
I tried with restart=.true. flag in the input but somehow it did not restart calculation. It ends after time limit as below, which was the same case without restart flag. Also please see below my input file.
Inside velocity step 1
Velocity matrix elements calculated
Bloch2wane: 1 / 32
Bloch2wane: 2 / 32
Bloch2wane: 3 / 32
Bloch2wane: 4 / 32
Bloch2wane: 5 / 32
Bloch2wane: 6 / 32
Bloch2wane: 7 / 32
Bloch2wane: 8 / 32
Bloch2wane: 9 / 32
Bloch2wane: 10 / 32
Bloch2wane: 11 / 32
Bloch2wane: 12 / 32
Bloch2wane: 13 / 32
Bloch2wane: 14 / 32
Bloch2wane: 15 / 32
Bloch2wane: 16 / 32
Bloch2wane: 17 / 32
Bloch2wane: 18 / 32
Bloch2wane: 19 / 32
Bloch2wane: 20 / 32
Bloch2wane: 21 / 32
Bloch2wane: 22 / 32
Bloch2wane: 23 / 32
Bloch2wane: 24 / 32
Bloch2wane: 25 / 32
Bloch2wane: 26 / 32
Bloch2wane: 27 / 32
Bloch2wane: 28 / 32
Bloch2wane: 29 / 32
Bloch2wane: 30 / 32
Bloch2wane: 31 / 32
Bloch2wane: 32 / 32
Bloch2wanp: 1 / 1
Writing Hamiltonian, Dynamical matrix and EP vertex in Wann rep to file
===================================================================
Memory usage: VmHWM = 1665Mb
VmPeak = 3380Mb
===================================================================
Using uniform q-mesh: 16 16 8
Size of q point mesh for interpolation: 2048
Using uniform k-mesh: 16 16 8
Size of k point mesh for interpolation: 4096
Max number of k points per pool: 16
Fermi energy coarse grid = 9.928846 eV
Skipping the first 70 bands:
The Fermi level will be determined with 44.00000 electrons
Fermi energy is calculated from the fine k-mesh: Ef = 10.001024 eV
===================================================================
ibndmin = 21 ebndmin = 9.601 eV
ibndmax = 24 ebndmax = 10.401 eV
Number of ep-matrix elements per pool : 4608 ~= 36.00 Kb (@ 8 bytes/ DP)
A selecq.fmt file was found but re-created because selecqread == .FALSE.
Number selected, total 100 100
Number selected, total 200 200
Number selected, total 300 300
Number selected, total 400 400
Number selected, total 500 500
Number selected, total 600 600
Number selected, total 700 700
Number selected, total 800 800
Number selected, total 900 900
Number selected, total 1000 1000
Number selected, total 1100 1100
Number selected, total 1200 1200
Number selected, total 1300 1300
Number selected, total 1400 1400
Number selected, total 1500 1500
Number selected, total 1600 1600
Number selected, total 1700 1700
Number selected, total 1800 1800
Number selected, total 1900 1900
Number selected, total 2000 2000
We only need to compute 2048 q-points
Progression iq (fine) = 100/ 2048
Progression iq (fine) = 200/ 2048
Progression iq (fine) = 300/ 2048
Progression iq (fine) = 400/ 2048
Progression iq (fine) = 500/ 2048
Progression iq (fine) = 600/ 2048
Progression iq (fine) = 700/ 2048
Progression iq (fine) = 800/ 2048
srun: error: nid001888: tasks 0-255: Power failure
epw1.out lines 788-889/890 100%
&inputepw
restart=.true.
prefix = 'hfte5'
outdir = './'
dvscf_dir = './save'
elph = .true.
epwwrite = .true.
epwread = .false.
lpolar = .true.
wannierize = .true.
proj(1) = 'Hf:d'
proj(2) = 'Te:p'
vme='wannier'
nbndsub = 40
bands_skipped='exclude_bands = 1-70, 111-160'
num_iter = 50000
iprint=2
dis_win_max=15
dis_froz_max=13
dis_froz_min = 3
dis_win_min = 3
wdata(1)='bands_plot = .true.'
wdata(2) = 'begin kpoint_path'
wdata(3) = 'X 0.25122543 -0.25122543 0 G 0 0 0'
wdata(4)= 'G 0 0 0 Y 0.24908686 0.24908686 0'
wdata(5)='Y 0.24908686 0.24908686 0 S 0.5003123 -0.00213862 0'
wdata(6)='S 0.5003123 -0.00213862 0 R 0.5003123 -0.00213862 0.49836313'
wdata(7)='R 0.5003123 -0.00213862 0.49836313 Z 0 0 0.49836313'
wdata(8)='Z 0 0 0.49836313 G 0 0 0'
wdata(9)='end kpoint_path'
wdata(10)='bands_plot_format = gnuplot'
wdata(11)='use_ws_distance =T'
fsthick=0.4
band_plot= .false.
!filqf = './XGYSRZ.dat'
!filkf = './XGYSRZ.dat'
nkf1 = 16
nkf2 = 16
nkf3 = 8
nqf1 = 16
nqf2 = 16
nqf3 = 8
nk1 = 8
nk2 = 8
nk3 = 4
nq1 = 4
nq2 = 4
nq3 = 2
/
Best
Rijan
I am trying to compute following epw.x calculation but fails due to time limit. I have been trying to restart the calculation but it does everything the same (wannierization steps and all and stops at the same point). I guess I have not been able to properly restart the calculation. Can you please suggest me how can I restart from last results (skipping wannierization and previously computed steps).
I tried with restart=.true. flag in the input but somehow it did not restart calculation. It ends after time limit as below, which was the same case without restart flag. Also please see below my input file.
Inside velocity step 1
Velocity matrix elements calculated
Bloch2wane: 1 / 32
Bloch2wane: 2 / 32
Bloch2wane: 3 / 32
Bloch2wane: 4 / 32
Bloch2wane: 5 / 32
Bloch2wane: 6 / 32
Bloch2wane: 7 / 32
Bloch2wane: 8 / 32
Bloch2wane: 9 / 32
Bloch2wane: 10 / 32
Bloch2wane: 11 / 32
Bloch2wane: 12 / 32
Bloch2wane: 13 / 32
Bloch2wane: 14 / 32
Bloch2wane: 15 / 32
Bloch2wane: 16 / 32
Bloch2wane: 17 / 32
Bloch2wane: 18 / 32
Bloch2wane: 19 / 32
Bloch2wane: 20 / 32
Bloch2wane: 21 / 32
Bloch2wane: 22 / 32
Bloch2wane: 23 / 32
Bloch2wane: 24 / 32
Bloch2wane: 25 / 32
Bloch2wane: 26 / 32
Bloch2wane: 27 / 32
Bloch2wane: 28 / 32
Bloch2wane: 29 / 32
Bloch2wane: 30 / 32
Bloch2wane: 31 / 32
Bloch2wane: 32 / 32
Bloch2wanp: 1 / 1
Writing Hamiltonian, Dynamical matrix and EP vertex in Wann rep to file
===================================================================
Memory usage: VmHWM = 1665Mb
VmPeak = 3380Mb
===================================================================
Using uniform q-mesh: 16 16 8
Size of q point mesh for interpolation: 2048
Using uniform k-mesh: 16 16 8
Size of k point mesh for interpolation: 4096
Max number of k points per pool: 16
Fermi energy coarse grid = 9.928846 eV
Skipping the first 70 bands:
The Fermi level will be determined with 44.00000 electrons
Fermi energy is calculated from the fine k-mesh: Ef = 10.001024 eV
===================================================================
ibndmin = 21 ebndmin = 9.601 eV
ibndmax = 24 ebndmax = 10.401 eV
Number of ep-matrix elements per pool : 4608 ~= 36.00 Kb (@ 8 bytes/ DP)
A selecq.fmt file was found but re-created because selecqread == .FALSE.
Number selected, total 100 100
Number selected, total 200 200
Number selected, total 300 300
Number selected, total 400 400
Number selected, total 500 500
Number selected, total 600 600
Number selected, total 700 700
Number selected, total 800 800
Number selected, total 900 900
Number selected, total 1000 1000
Number selected, total 1100 1100
Number selected, total 1200 1200
Number selected, total 1300 1300
Number selected, total 1400 1400
Number selected, total 1500 1500
Number selected, total 1600 1600
Number selected, total 1700 1700
Number selected, total 1800 1800
Number selected, total 1900 1900
Number selected, total 2000 2000
We only need to compute 2048 q-points
Progression iq (fine) = 100/ 2048
Progression iq (fine) = 200/ 2048
Progression iq (fine) = 300/ 2048
Progression iq (fine) = 400/ 2048
Progression iq (fine) = 500/ 2048
Progression iq (fine) = 600/ 2048
Progression iq (fine) = 700/ 2048
Progression iq (fine) = 800/ 2048
srun: error: nid001888: tasks 0-255: Power failure
epw1.out lines 788-889/890 100%
&inputepw
restart=.true.
prefix = 'hfte5'
outdir = './'
dvscf_dir = './save'
elph = .true.
epwwrite = .true.
epwread = .false.
lpolar = .true.
wannierize = .true.
proj(1) = 'Hf:d'
proj(2) = 'Te:p'
vme='wannier'
nbndsub = 40
bands_skipped='exclude_bands = 1-70, 111-160'
num_iter = 50000
iprint=2
dis_win_max=15
dis_froz_max=13
dis_froz_min = 3
dis_win_min = 3
wdata(1)='bands_plot = .true.'
wdata(2) = 'begin kpoint_path'
wdata(3) = 'X 0.25122543 -0.25122543 0 G 0 0 0'
wdata(4)= 'G 0 0 0 Y 0.24908686 0.24908686 0'
wdata(5)='Y 0.24908686 0.24908686 0 S 0.5003123 -0.00213862 0'
wdata(6)='S 0.5003123 -0.00213862 0 R 0.5003123 -0.00213862 0.49836313'
wdata(7)='R 0.5003123 -0.00213862 0.49836313 Z 0 0 0.49836313'
wdata(8)='Z 0 0 0.49836313 G 0 0 0'
wdata(9)='end kpoint_path'
wdata(10)='bands_plot_format = gnuplot'
wdata(11)='use_ws_distance =T'
fsthick=0.4
band_plot= .false.
!filqf = './XGYSRZ.dat'
!filkf = './XGYSRZ.dat'
nkf1 = 16
nkf2 = 16
nkf3 = 8
nqf1 = 16
nqf2 = 16
nqf3 = 8
nk1 = 8
nk2 = 8
nk3 = 4
nq1 = 4
nq2 = 4
nq3 = 2
/
Best
Rijan