LIGO Support Ticket 14990
Ticket Information
Number: admin 14990
User: anderson@ligo.caltech.edu
Email: espinoza_e__AT__ligo.caltech.edu
Status: open
Assigned To: wenger
Date: Tue, 20 Feb 2007 19:29:15 -0800
From: Stuart Anderson <anderson__AT__ligo.caltech.edu>
To: condor-admin__AT__cs.wisc.edu
CC: Erik Espinoza <espinoza_e__AT__ligo.caltech.edu>
Subject: LIGO: removing jobs on hold
Is it a bug or a feature in condor 6.8.4 that jobs are reported
as transitioning from H to X to H before being removed from the queue?
When removing (condor_rm) jobs that are in the "H" state condor_q
reports from the Quill db that the jobs first transition to the "X" state
and then back in the "H" state before dissapearing from the queue.
Thanks.
-bash-3.00$ condor_q ajith
-- Quill: citquill@ligo : <10.14.0.25:5432> : citquill_db
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
9733029.0 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.1 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.2 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.3 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.4 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.5 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.6 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.7 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.8 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.9 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733042.0 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.1 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.2 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.3 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.4 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.5 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.6 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.7 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.8 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.9 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
20 jobs; 0 idle, 0 running, 20 held
(run "condor_rm ajith" on CONDOR_HOST)
-bash-3.00$ condor_q ajith
-- Quill: citquill@ligo : <10.14.0.25:5432> : citquill_db
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
9733029.0 ajith 12/14 15:04 0+00:00:00 X 0 9.8 hostname
9733029.1 ajith 12/14 15:04 0+00:00:00 X 0 9.8 hostname
9733029.2 ajith 12/14 15:04 0+00:00:00 X 0 9.8 hostname
9733029.3 ajith 12/14 15:04 0+00:00:00 X 0 9.8 hostname
9733029.4 ajith 12/14 15:04 0+00:00:00 X 0 9.8 hostname
9733029.7 ajith 12/14 15:04 0+00:00:00 X 0 9.8 hostname
9733029.9 ajith 12/14 15:04 0+00:00:00 X 0 9.8 hostname
9733042.1 ajith 12/14 15:12 0+00:00:00 X 0 0.0 ls -l
9733042.2 ajith 12/14 15:12 0+00:00:00 X 0 0.0 ls -l
9733042.3 ajith 12/14 15:12 0+00:00:00 X 0 0.0 ls -l
9733042.4 ajith 12/14 15:12 0+00:00:00 X 0 0.0 ls -l
9733042.6 ajith 12/14 15:12 0+00:00:00 X 0 0.0 ls -l
9733042.7 ajith 12/14 15:12 0+00:00:00 X 0 0.0 ls -l
9733042.8 ajith 12/14 15:12 0+00:00:00 X 0 0.0 ls -l
9733042.9 ajith 12/14 15:12 0+00:00:00 X 0 0.0 ls -l
0 jobs; 0 idle, 0 running, 0 held
-bash-3.00$ condor_q ajith
-- Quill: citquill@ligo : <10.14.0.25:5432> : citquill_db
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
9733029.0 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.1 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.2 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.3 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.4 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.7 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733029.9 ajith 12/14 15:04 0+00:00:00 H 0 9.8 hostname
9733042.1 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.2 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.3 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.4 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.6 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.7 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.8 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
9733042.9 ajith 12/14 15:12 0+00:00:00 H 0 0.0 ls -l
15 jobs; 0 idle, 0 running, 15 held
-bash-3.00$ condor_q ajith
-- Quill: citquill@ligo : <10.14.0.25:5432> : citquill_db
ID OWNER SUBMITTED RUN_TIME ST PRI SIZE CMD
0 jobs; 0 idle, 0 running, 0 held
--
Stuart Anderson anderson__AT__ligo.caltech.edu
http://www.ligo.caltech.edu/~anderson
===========================================================================
Date of creation: Tue Feb 20 21:29:35 2007 (1172028577)
Subject: Actions
Assigned to wenger by wenger
===========================================================================
Date of actions: Wed Feb 21 11:50:33 2007 (1172080233)
Date: Wed, 21 Feb 2007 11:52:38 -0600 (CST)
From: "R. Kent Wenger" <wenger__AT__cs.wisc.edu>
To: wenger <condor-admin__AT__cs.wisc.edu>
CC: "R. Kent Wenger" <wenger__AT__cs.wisc.edu>
Subject: Re: [condor-admin #14990] LIGO: removing jobs on hold
Stuart,
> Is it a bug or a feature in condor 6.8.4 that jobs are reported
> as transitioning from H to X to H before being removed from the queue?
>
> When removing (condor_rm) jobs that are in the "H" state condor_q
> reports from the Quill db that the jobs first transition to the "X" state
> and then back in the "H" state before dissapearing from the queue.
I'll have to consult with some of the other Condor folks on this one.
I assume that condor-support 1885 is a higher priority, though?
Kent Wenger
Condor Team
===========================================================================
Date mail was appended: Wed Feb 21 11:52:45 2007 (1172080365)
Date: Wed, 21 Feb 2007 09:55:51 -0800
From: Stuart Anderson <anderson__AT__ligo.caltech.edu>
To: condor-admin response tracking system <condor-admin__AT__cs.wisc.edu>
CC: espinoza_e__AT__ligo.caltech.edu
Subject: Re: [condor-admin #14990] LIGO: removing jobs on hold
On Wed, Feb 21, 2007 at 11:52:45AM -0600, condor-admin response tracking system wrote:
> Stuart,
>
> > Is it a bug or a feature in condor 6.8.4 that jobs are reported
> > as transitioning from H to X to H before being removed from the queue?
> >
> > When removing (condor_rm) jobs that are in the "H" state condor_q
> > reports from the Quill db that the jobs first transition to the "X" state
> > and then back in the "H" state before dissapearing from the queue.
>
> I'll have to consult with some of the other Condor folks on this one.
>
> I assume that condor-support 1885 is a higher priority, though?
Yes, this one is more a curiousity since the jobs where actually removed
from the queue eventually.
Thanks.
--
Stuart Anderson anderson__AT__ligo.caltech.edu
http://www.ligo.caltech.edu/~anderson
===========================================================================
Date mail was appended: Wed Feb 21 11:56:13 2007 (1172080574)