Polaron self-consistent process fails

Post here questions about issues encountered while running the EPW code

Moderator: stiwari

chifuyu
Posts: 1
Joined: Thu Feb 29, 2024 5:15 pm
Affiliation: Universität Hamburg

Polaron self-consistent process fails

Post by chifuyu »

Hi everyone,

I am trying to work my way through the LiF polaron example. Everything works fine until the code starts the self-consistent process. At that point the code crashes, and the Slurm output repeatedly shows "epw.x:489634 terminated with signal 11 at PC=4bb203 SP=7fffffff0d80". As far as I understand, this points to a segmentation fault?

I attach all output files for reference.

Has anyone ever encountered a similar problem?

Cheers
Michael Winter
Attachments
LiF.zip
(97.23 KiB) Downloaded 280 times
jlb
Posts: 5
Joined: Tue Aug 01, 2023 7:31 am
Affiliation: DIPC

Re: Polaron self-consistent process fails

Post by jlb »

Hi Michael,

At the end of your Slurm file I see that the number of requested CPUs is 192. In this example the k/q-grid is 4x4x4, so you can parallelize over 64 pools at most. Could you try reducing the number of CPUs / pools to 64 or fewer and see if the problem persists?
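
The limit above can be checked with a few lines of arithmetic. This is only a minimal sketch: it assumes the full 4x4x4 grid is used with no symmetry reduction, and the helper names `max_pools` and `suggest_pools` are made up for illustration and are not part of EPW.

```python
from math import prod

def max_pools(grid):
    """Upper bound on the number of pools: at least one grid point per pool.

    Assumption: the full grid is used (no symmetry reduction).
    """
    return prod(grid)

def suggest_pools(grid, requested_cpus):
    """Cap the requested CPU count at the number of grid points."""
    return min(requested_cpus, max_pools(grid))

grid = (4, 4, 4)       # k/q-grid of the LiF example
requested_cpus = 192   # value seen at the end of the Slurm file

limit = max_pools(grid)
pools = suggest_pools(grid, requested_cpus)

if requested_cpus > limit:
    print(f"{requested_cpus} CPUs requested, but only {limit} pools are possible; "
          f"try {pools} or fewer.")
else:
    print(f"{requested_cpus} CPUs is within the limit of {limit} pools.")
```

With 192 requested CPUs the check reports that the request exceeds the 64 available grid points, so a rerun with 64 or fewer MPI processes (and a matching `-npool` value) would be the thing to try.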

Best,
Jon Lafuente-Bartolome