mpirun noticed that process rank 0 with PID 0 on node p2-gpu-10 exited on signal 9

Queries about input and output files, running specific calculations, etc.


Moderators: Global Moderator, Moderator

Post Reply
Message
Author
jie_yao2
Newbie
Newbie
Posts: 9
Joined: Sun Jun 26, 2022 6:32 am

mpirun noticed that process rank 0 with PID 0 on node p2-gpu-10 exited on signal 9

#1 Post by jie_yao2 » Wed Nov 16, 2022 10:58 pm

Dear VASP group,

I run VASP on gpus for long ab initio MD run for 20,000 ionic steps. (iron magnesium silicate liquid around 150 atoms)

The VASP report: Some of your processes may have been killed by the cgroup out-of-memory handler.

The memory is not enough, always after 1000 steps.

Is there a way to reduce memory or is it because the gpu hardware setup is not 100% correct ?

In my OUTCAR: total amount of memory used by VASP MPI-rank0 296452. kBytes.

thank you for your advice,

Jie

henrique_miranda
Global Moderator
Global Moderator
Posts: 501
Joined: Mon Nov 04, 2019 12:41 pm
Contact:

Re: mpirun noticed that process rank 0 with PID 0 on node p2-gpu-10 exited on signal 9

#2 Post by henrique_miranda » Fri Nov 18, 2022 8:32 am

Could you share your input files so that we try to reproduce the issue?

The issue you are reporting sounds like a memory leak.
Do you have some monitoring tool that reports the memory usage during your calculation?
If the memory usage is increasing during the MD run is normally a sign of a memory leak.

See this related thread:
https://www.vasp.at/forum/viewtopic.php?f=3&t=18493

Post Reply