LIGO Support Ticket 14990

Ticket Information
  Number:      admin 14990
  User:        anderson@ligo.caltech.edu
  Email:       espinoza_e__AT__ligo.caltech.edu
  Status:      open
  Assigned To: wenger
Date: Tue, 20 Feb 2007 19:29:15 -0800
From: Stuart Anderson <anderson__AT__ligo.caltech.edu>
To: condor-admin__AT__cs.wisc.edu
CC: Erik Espinoza <espinoza_e__AT__ligo.caltech.edu>
Subject: LIGO: removing jobs on hold

	Is it a bug or a feature in condor 6.8.4 that jobs are reported
as transitioning from H to X to H before being removed from the queue?

	When removing (condor_rm) jobs that are in the "H" state condor_q
reports from the Quill db that the jobs first transition to the "X" state
and then back in the "H" state before dissapearing from the queue.

Thanks.


-bash-3.00$ condor_q ajith


-- Quill: citquill@ligo : <10.14.0.25:5432> : citquill_db
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD               
9733029.0   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.1   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.2   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.3   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.4   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.5   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.6   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.7   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.8   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.9   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733042.0   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.1   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.2   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.3   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.4   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.5   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.6   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.7   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.8   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.9   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l

20 jobs; 0 idle, 0 running, 20 held


(run "condor_rm ajith" on CONDOR_HOST)


-bash-3.00$ condor_q ajith


-- Quill: citquill@ligo : <10.14.0.25:5432> : citquill_db
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD               
9733029.0   ajith          12/14 15:04   0+00:00:00 X  0   9.8  hostname
9733029.1   ajith          12/14 15:04   0+00:00:00 X  0   9.8  hostname
9733029.2   ajith          12/14 15:04   0+00:00:00 X  0   9.8  hostname
9733029.3   ajith          12/14 15:04   0+00:00:00 X  0   9.8  hostname
9733029.4   ajith          12/14 15:04   0+00:00:00 X  0   9.8  hostname
9733029.7   ajith          12/14 15:04   0+00:00:00 X  0   9.8  hostname
9733029.9   ajith          12/14 15:04   0+00:00:00 X  0   9.8  hostname
9733042.1   ajith          12/14 15:12   0+00:00:00 X  0   0.0  ls -l
9733042.2   ajith          12/14 15:12   0+00:00:00 X  0   0.0  ls -l
9733042.3   ajith          12/14 15:12   0+00:00:00 X  0   0.0  ls -l
9733042.4   ajith          12/14 15:12   0+00:00:00 X  0   0.0  ls -l
9733042.6   ajith          12/14 15:12   0+00:00:00 X  0   0.0  ls -l
9733042.7   ajith          12/14 15:12   0+00:00:00 X  0   0.0  ls -l
9733042.8   ajith          12/14 15:12   0+00:00:00 X  0   0.0  ls -l
9733042.9   ajith          12/14 15:12   0+00:00:00 X  0   0.0  ls -l

0 jobs; 0 idle, 0 running, 0 held
-bash-3.00$ condor_q ajith


-- Quill: citquill@ligo : <10.14.0.25:5432> : citquill_db
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD               
9733029.0   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.1   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.2   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.3   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.4   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.7   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733029.9   ajith          12/14 15:04   0+00:00:00 H  0   9.8  hostname
9733042.1   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.2   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.3   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.4   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.6   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.7   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.8   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l
9733042.9   ajith          12/14 15:12   0+00:00:00 H  0   0.0  ls -l

15 jobs; 0 idle, 0 running, 15 held
-bash-3.00$ condor_q ajith


-- Quill: citquill@ligo : <10.14.0.25:5432> : citquill_db
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD               

0 jobs; 0 idle, 0 running, 0 held


-- 
Stuart Anderson  anderson__AT__ligo.caltech.edu
http://www.ligo.caltech.edu/~anderson

===========================================================================
Date of creation: Tue Feb 20 21:29:35 2007 (1172028577)
Subject: Actions

Assigned to wenger by wenger
===========================================================================
Date of actions: Wed Feb 21 11:50:33 2007 (1172080233)
Date: Wed, 21 Feb 2007 11:52:38 -0600 (CST)
From: "R. Kent Wenger" <wenger__AT__cs.wisc.edu>
To: wenger <condor-admin__AT__cs.wisc.edu>
CC: "R. Kent Wenger" <wenger__AT__cs.wisc.edu>
Subject: Re: [condor-admin #14990] LIGO: removing jobs on hold

Stuart,

> 	Is it a bug or a feature in condor 6.8.4 that jobs are reported
> as transitioning from H to X to H before being removed from the queue?
>
> 	When removing (condor_rm) jobs that are in the "H" state condor_q
> reports from the Quill db that the jobs first transition to the "X" state
> and then back in the "H" state before dissapearing from the queue.

I'll have to consult with some of the other Condor folks on this one.

I assume that condor-support 1885 is a higher priority, though?

Kent Wenger
Condor Team

===========================================================================
Date mail was appended: Wed Feb 21 11:52:45 2007 (1172080365)
Date: Wed, 21 Feb 2007 09:55:51 -0800
From: Stuart Anderson <anderson__AT__ligo.caltech.edu>
To: condor-admin response tracking system <condor-admin__AT__cs.wisc.edu>
CC: espinoza_e__AT__ligo.caltech.edu
Subject: Re: [condor-admin #14990] LIGO: removing jobs on hold

On Wed, Feb 21, 2007 at 11:52:45AM -0600, condor-admin response tracking system wrote:
> Stuart,
> 
> > 	Is it a bug or a feature in condor 6.8.4 that jobs are reported
> > as transitioning from H to X to H before being removed from the queue?
> >
> > 	When removing (condor_rm) jobs that are in the "H" state condor_q
> > reports from the Quill db that the jobs first transition to the "X" state
> > and then back in the "H" state before dissapearing from the queue.
> 
> I'll have to consult with some of the other Condor folks on this one.
> 
> I assume that condor-support 1885 is a higher priority, though?

Yes, this one is more a curiousity since the jobs where actually removed
from the queue eventually.

Thanks.

-- 
Stuart Anderson  anderson__AT__ligo.caltech.edu
http://www.ligo.caltech.edu/~anderson

===========================================================================
Date mail was appended: Wed Feb 21 11:56:13 2007 (1172080574)