Dear all,
I am running an NPT MD simulation with the latest version, 5.3.3. I am using 4 nodes with 12 cores each, with Gamma-point sampling. The cores are assigned as follows:
------------------------------------------------
distrk: each k-point on 48 cores, 1 groups
distr: one band on 6 cores, 8 groups
------------------------------------------------
The run completed two MD steps, and only two. The energy converged in each main loop, and the temperature, volume, etc. were written to REPORT. So far so good: everything appeared fine, as if the code could run indefinitely.
However, after these two steps the MPI job died. The error messages were:
-----------------------------------------------------------------
forrtl: error (78): process killed (SIGTERM)
........
........
mpirun noticed that process rank 32 with PID 47525 on node n23 exited on signal 11 (Segmentation fault).
-------------------------------------------------------------------
I don't know what is going on. If the parallelization were wrong, why could it run for two steps without any problem? I also tried using only one node, and exactly the same error occurred after two steps.
I am confused at this point. Could someone help me out?
Thank you so much.
Best,
Tian
MPI was killed after several steps in npt MD, running at vasp5.3.3
Moderators: Global Moderator, Moderator
- Newbie
- Posts: 2
- Joined: Wed May 29, 2013 7:22 pm
Last edited by Emerald on Wed May 29, 2013 8:29 pm, edited 1 time in total.
- Administrator
- Posts: 2921
- Joined: Tue Aug 03, 2004 8:18 am
- License Nr.: 458
Have you checked "segmentation fault" issues?
E.g. proper setting in .bashrc file?
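The reply does not say which .bashrc setting is meant. One setting often checked when a program segfaults is the shell's stack size limit, so the lines below are only a sketch under that assumption (bash on every compute node), not a confirmed fix for this case:

```shell
# Assumption: the ".bashrc setting" refers to the stack size limit.
# Show the current soft stack limit (in kB, or "unlimited").
ulimit -s

# Try to raise it; this fails if the hard limit set by the
# cluster administrator is lower than the requested value.
ulimit -s unlimited 2>/dev/null \
  || echo "could not raise stack limit; hard limit is $(ulimit -H -s)"
```

For a multi-node job the limit has to be in effect in the shells that mpirun starts on the remote nodes, which is why it is usually placed in .bashrc rather than typed once on the login node.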
Last edited by admin on Mon Jun 03, 2013 1:23 pm, edited 1 time in total.