LIGO Support Ticket 18836
Ticket Information
Number: admin 18836
User: anderson@ligo.caltech.edu
Email:
Status: resolved
Assigned To: psilord
Date: Fri, 12 Dec 2008 12:31:18 -0600 (CST)
From: "R. Kent Wenger" <wenger__AT__cs.wisc.edu>
To: condor-admin__AT__cs.wisc.edu
CC: "R. Kent Wenger" <wenger__AT__cs.wisc.edu>
Subject: [CondorLIGO] Fwd: testing of SPLICE in dagman (fwd)
---------- Forwarded message ----------
Date: Thu, 11 Dec 2008 13:26:58 -0800
From: Stuart Anderson <anderson__AT__ligo.caltech.edu>
To: Condor/LIGO mailing list <condorligo__AT__aei.mpg.de>
Subject: [CondorLIGO] Fwd: testing of SPLICE in dagman
It would be helpful if someone at UW could link this new ticket to the LIGO
Support Tickets web page.
Pete,
Unless you think it is helpful to keep the original DIR+SPLICE ticket
open (18376) I would like to suggest you resolve that ticket in favor of
tracking this new stack dump problem.
Thanks.
Begin forwarded message:
> From: Stephen Fairhurst <stephen.fairhurst__AT__astro.cf.ac.uk>
> Date: December 9, 2008 4:10:43 AM PST
> To: condor-admin__AT__cs.wisc.edu
> Cc: Duncan Brown <dabrown__AT__physics.syr.edu>, Stuart Anderson
> <anderson__AT__ligo.caltech.edu>
> Subject: testing of SPLICE in dagman
>
> Hi,
>
> I have been testing the recent addition of the SPLICE functionality in
> dagman. Thank you for adding this, and apologies for taking a while to do
> the testing.
>
> I took the LIGO inspiral DAG which has been running for a while using sub
> dags and manually edited it to use SPLICE. The DAG started up fine, ran 16
> jobs and then failed. I re-ran the job and it failed at exactly the same
> place. Here are the last few lines of the dagman.out file:
>
>
> 12/8 04:45:27 From submit: Submitting job(s).
> 12/8 04:45:27 From submit: Logging submit event(s).12/8 04:45:27 From submit:
> 1 job(s) submitted to cluster 39334675.
> 12/8 04:45:27 assigned Condor ID (39334675.0)12/8 04:45:27 Sleeping for one
> second for log file consistency
> Stack dump for process 17962 at timestamp 1228740328 (20 frames)
> condor_scheduniv_exec.39334635.0(dprintf_dump_stack+0x9b)[0x532156]condor_scheduniv_exec.39334635.0[0x5323d3]
> /lib64/libc.so.6[0x2b79092db2b0]/lib64/libc.so.6[0x2b79093179bb]
> /lib64/libc.so.6(malloc+0x7b)[0x2b790931958b]
> /usr/lib64/libstdc++.so.5(_Znwm+0x25)[0x2b7908eda425]
> condor_scheduniv_exec.39334635.0(_ZN4ListIcE6AppendEPc+0x1a)[0x5403e6]
> condor_scheduniv_exec.39334635.0(_ZN10StringList20initializeFromStringEPKc+0xec)
> [0x55b8be]
> condor_scheduniv_exec.39334635.0(_ZN10StringListC1EPKcS1_+0x57)[0x55b7a7]
> condor_scheduniv_exec.39334635.0(_ZN13MultiLogFiles22fileNameToLogicalLinesERK8M
> yStringR10StringList+0x131)[0x582091]
> condor_scheduniv_exec.39334635.0(_ZN13MultiLogFiles26loadLogFileNameFromSubFileE
> RK8MyStringS2_+0x1dd)[0x5825f5]
> condor_scheduniv_exec.39334635.0(_ZNK3Job15CheckForLogFileEv+0x4f)[0x4e9385]
> condor_scheduniv_exec.39334635.0(_ZN3Dag13SubmitNodeJobERK6DagmanP3JobR8CondorID+0x136)[0x4e6b6e]
> condor_scheduniv_exec.39334635.0(_ZN3Dag15SubmitReadyJobsERK6Dagman+0x3f8)[0x4e293c]
> condor_scheduniv_exec.39334635.0(_Z18condor_event_timerv+0x7c)[0x4dcc06]
> condor_scheduniv_exec.39334635.0(_ZN12TimerManager7TimeoutEv+0x335)[0x52da19]
> condor_scheduniv_exec.39334635.0(_ZN10DaemonCore6DriverEv+0x716)[0x5141b8]
> condor_scheduniv_exec.39334635.0(main+0x178a)[0x52605a]
> /lib64/libc.so.6(__libc_start_main+0xef)[0x2b79092c840f]
> condor_scheduniv_exec.39334635.0(__strtoll_internal+0x42)[0x4da5ea]
>
> I would be happy to send more details and any other files that would be of
> use. Thanks for your help,
>
> Cheers,
> Steve
>
> ------------------------------------------------------------
> Stephen Fairhurst
> School of Physics & Astronomy
> Cardiff University
> The Parade
> Cardiff, CF24 3AA, UK.
>
> stephen.fairhurst__AT__astro.cf.ac.uk
> Tel: +44 (0) 2920 870166
>
>
>
>
--
Stuart Anderson anderson__AT__ligo.caltech.edu
http://www.ligo.caltech.edu/~anderson
_______________________________________________
Condorligo mailing list
Condorligo__AT__aei.mpg.de
http://lists.aei.mpg.de/cgi-bin/mailman/listinfo/condorligo
===========================================================================
Date of creation: Fri Dec 12 12:31:20 2008 (1229106682)
Subject: Actions
Assigned to psilord by wenger
===========================================================================
Date of actions: Fri Dec 12 12:32:31 2008 (1229106752)
Subject: Actions
Ticket resolved by wenger
===========================================================================
Date of actions: Fri Dec 12 13:11:21 2008 (1229109081)