[Pw_forum] submits jobs to node already in use

Axel Kohlmeyer akohlmey at cmm.chem.upenn.edu
Wed May 6 05:29:49 CEST 2009


On Tue, 2009-05-05 at 17:30 -0700, Jonas Baltrusaitis wrote:
> Funny story, if I submit any job with mpiexec -n 4 -machinefile

where is the machinefile??

>  /../pw.x -rmpool 0 -nodes 1 -procs 4 via PBS to a 4 core processor I

this must be a peculiarity of how your machine is set up. if you submit
to a PBS queue, you don't run mpiexec directly; you submit a job script
with qsub, and mpiexec (or mpirun) is called from within that job
script.
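
For illustration, a job script of this kind might look roughly like the sketch below. It assumes a Torque-style PBS, which sets PBS_NODEFILE (the machinefile) and PBS_O_WORKDIR inside the job. The job name, the file names pw.in and pw.out, the path /path/to/pw.x, and the helper count_slots are placeholders made up for this example, not taken from the original setup.

```shell
#!/bin/sh
#PBS -N pw_test
#PBS -l nodes=1:ppn=4
#PBS -j oe

# Count the slots PBS granted to this job (assumed: one line per core
# in the node file).
count_slots() {
    grep -c . "$1"
}

# Only launch when running inside a PBS job, where PBS_NODEFILE is set.
if [ -n "${PBS_NODEFILE:-}" ]; then
    # Start in the directory the job was submitted from.
    cd "$PBS_O_WORKDIR" || exit 1
    NPROCS=$(count_slots "$PBS_NODEFILE")
    # /path/to/pw.x, pw.in and pw.out are placeholders.
    mpiexec -n "$NPROCS" -machinefile "$PBS_NODEFILE" /path/to/pw.x < pw.in > pw.out
fi
```

Saved as, say, run_pw.sh, it would be submitted with "qsub run_pw.sh". The resource request line is what tells the scheduler how many cores to reserve; pw.x itself never sees it.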

>  can't see the exact number of cores running, e.g. with qstat -n I see
>  only    compute-0-1/0, instead of usual compute-0-1/0, compute-0-1/1,
>  compute-0-1/2, compute-0-1/4.

again, this has nothing at all to do with Q-E. there is no code in
Q-E that knows anything about any batch system (and there should not be).
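
One way to see what the batch system actually handed out is to print the node file from inside the job script before the MPI run starts. A small sketch, assuming a Torque-style PBS that writes one line per allocated core to PBS_NODEFILE; the function name slots_per_host is made up for illustration.

```shell
#!/bin/sh
# Print "count hostname" for each host listed in a PBS-style node file.
slots_per_host() {
    sort "$1" | uniq -c | awk '{ print $1, $2 }'
}

# Inside a PBS job, report the allocation; outside one, do nothing.
if [ -n "${PBS_NODEFILE:-}" ]; then
    slots_per_host "$PBS_NODEFILE"
fi
```

If a 4-core request shows only one slot on the node, the scheduler reserved one core, and it will consider the other three free for the next job.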

> It gets even worse if I want to submit any other pwscf job to my 16
>  node cluster. It gets submitted to exactly the same node, e.g. I run
>  8 processes on a 4 core node and obviously no performance
> 
> I inquired at Rocks forum, they claim it must be related with how Q-E
>  submits jobs. Has anybody seen that before?

that is wrong advice. 

cheers,
   axel.


> 
> Jonas Baltrusaitis
> University of Iowa

-- 
=======================================================================
Axel Kohlmeyer   akohlmey at cmm.chem.upenn.edu   http://www.cmm.upenn.edu
   Center for Molecular Modeling   --   University of Pennsylvania
Department of Chemistry, 231 S.34th Street, Philadelphia, PA 19104-6323
tel: 1-215-898-1582,  fax: 1-215-573-6233,  office-tel: 1-215-898-5425
=======================================================================
If you make something idiot-proof, the universe creates a better idiot.


