P130 Reprocessing
status: In Development Complete
last update: 25 January 6 Aug 2012
This page is a record of the configuration for the P130 reprocessing project, recalculating the diffuse response (see here for a description of the change). This task is identical with P120-FT1, v2.1 [please see for details], run in July 2011, except for updating the following details.
- New version of ScienceTools
...
=============== WHAT FOLLOWS IS TO BE UPDATED ====================
The name "P120" derives from the word "processing" and the initial file version to be used for the output data products, e.g., r0123456789_v120_merit.root.
- P120-MERIT - this task reads DIGI+RECON+MERIT and produces reprocessed MERIT + FILTEREDMERIT (photons) + ELECTRONMERIT
- P120-LEO-MERIT - this task reads DIGI+RECON+MERIT and produces reprocessed MERIT + FILTEREDMERIT (photons) + ELECTRONMERIT for 200 runs of earth limb (L&EO) data
Datafile names, versions and locations
Data file version numbers for this reprocessing will begin with v120.
XROOT location and file naming
Location template:
Code Block |
---|
/glast/Data/Flight/Reprocess/<reprocessName>/<dataType>
|
Locations for P120:
Code Block |
---|
/glast/Data/Flight/Reprocess/P120/merit
/glast/Data/Flight/Reprocess/P120/filteredmerit
/glast/Data/Flight/Reprocess/P120/electronmerit
/glast/Data/Flight/Reprocess/P120/ft1
/glast/Data/Flight/Reprocess/P120/extendedft1
/glast/Data/Flight/Reprocess/P120/electronft1
/glast/Data/Flight/Reprocess/P120/ls1
/glast/Data/Flight/Reprocess/P120/extendedls1
|
File naming:
Data Type | aka | Send to FSSC | Naming template |
---|---|---|---|
MERIT |
| No | r<run#>_<version>_<dataType>.root |
FILTEREDMERIT |
| No | r<run#>_<version>_<dataType>.root |
ELECTRONMERIT |
| No | r<run#>_<version>_<dataType>.root |
ELECTRONFT1 |
| No | gll_el_p<procVer>_r<run#>_<version>.fit |
EXTENDEDFT1 |
| No | gll_xp_p<procVer>_r<run#>_<version>.fit |
FT1 | LS-002 | Yes | gll_ph_p<procVer>_r<run#>_<version>.fit |
EXTENDEDLS1 |
| No | gll_xe_p<procVer>_r<run#>_<version>.fit |
LS1 | LS-001 | Yes | gll_ev_p<procVer>_r<run#>_<version>.fit |
Note: 'procVer' is a field added to the file name (and the keyword "PROC_VER" in the primary header) added to the FFD 5/12/2010. Ref: http://fermi.gsfc.nasa.gov/ssc/dev/current_documents/Science_DP_FFD_RevA.pdf
Example:
Code Block |
---|
/glast/Data/Flight/Reprocess/P120/merit/r0239557414_v120_merit.root
/glast/Data/Flight/Reprocess/P120/filteredmerit/r0239557414_v120_filteredmerit.root
/glast/Data/Flight/Reprocess/P120/electronmerit/r0239557414_v120_electronmerit.root
/glast/Data/Flight/Reprocess/P120/extendedft1/gll_xp_p120_r0239559565_v120.fit
/glast/Data/Flight/Reprocess/P120/ft1/gll_ph_p120_r0239559565_v120.fit
/glast/Data/Flight/Reprocess/P120/electronft1/gll_el_p120_r0239559565_v120.fit
/glast/Data/Flight/Reprocess/P120/extendedls1/gll_xe_p120_r0239559565_v120.fit
/glast/Data/Flight/Reprocess/P120/ls1/gll_ev_p120_r0239559565_v120.fit
|
DataCatalog location and naming
Logical directory and group template:
Code Block |
---|
Data/Flight/Reprocess/<reprocessName>:<dataType>
|
Note that the <dataType> field (following the colon) is a DataCatalog 'group' name, and file names are of the form r<run#>.
Naming examples:
Code Block |
---|
Data/Flight/Reprocess/P120:MERIT r0239557414
Data/Flight/Reprocess/P120:FILTEREDMERIT r0239557414
Data/Flight/Reprocess/P120:EXTENDEDFT1 r0239557414
Data/Flight/Reprocess/P120:FT1 r0239557414
Data/Flight/Reprocess/P120:ELECTRONFT1 r0239557414
Data/Flight/Reprocess/P120:EXTENDEDLS1 r0239557414
Data/Flight/Reprocess/P120:LS1 r0239557414
|
Data Sample
- (09-26-02) which includes new integration scheme for gtdiffrsp
- New run list (updated for data processed since the end of P120)
- Name of Pipeline task (P120->P130) as well as in file naming
Data Sample
The data sample for P130 The currently defined data sample for P120 reprocessing includes:
First run | 239557414 (MET), 2008-08-04 15:43:34 (UTC) | beginning of Science |
Last run | 333880535 356429251 (MET), 20112012-0804-01 18 08:3507:33 29 (UTC) | Official Pass7 release |
Total runs | 16 20,459385 |
|
Total input MERIT events | 3544,921398,666334,747636 | all "events" |
Total FILTEREDMERIT/EXTENDEDFT1/LS1 events | 5,035,929,409 | all photon event classes |
Total ELECTRONMERIT/ELECTRONFT1 events | 68,055,849 |
|
Total LS1 (FSSC selection) events | 1,025,359,231 | event classes (bits) 0,2,3,4 (transient, source, clean, ultraclean) |
Total FT1 (FSSC selection) events | 142,042,060 | event classes (bits) 2,3,4 (source, clean, ultraclean) |
Total disk space used | 33.9 TB |
|
Summary from DataCatalog as of 4/17/2012 and updated 8/2/20116/2012 (after new evt selection filter).
Name | Files | Events | Size |
---|---|---|---|
ELECTRONFT1 FT1 | 16459 20385 | 68184,055008,849 254 | 16.6 .4 GB |
ELECTRONMERIT LS1 | 16459 20385 | 681,295,055164,849 924 | 147210.4 5 GB |
EXTENDEDFT1 ELECTRONFT1 | 16459 20385 | 585,035057,929,409 599 | 4418.3 0 GB |
EXTENDEDLS1 EXTENDEDFT1 | 16459 20385 | 511,035240,929,409 | 816.5 GB |
FILTEREDMERIT | 16459 | 5,035,929,409 | 4.0 TB |
FT1 | 16459 | 142,042,060 | 12.9 GB |
LS1 | 16459 | 1,025,359,231 | 166.6 GB |
MERIT | 16459 | 35,921,666,747 | 28.3 TB |
NOTE: One run, 242429468, of type TrigTest was declared 'good for science' but long after this task got started, so it has been intentionally omitted.
8/17/2011 update: Four orphaned runs are being reprocessed, including one TrigTest run and four nadir-pointed runs.
869,507 6,280,448,356 | 984.6 550.3GB | ||
EXTENDEDLS1 | 20385 | 11,240,869,507 6,280,448,356 | 1.8 1.0 TB |
Stream | run | type | |
---|---|---|---|
16459 | 242429468 | (TrigTest) | |
16460 | 333355876 | (nadirOps) | |
16461 | 333358500 | (nadirOps) | |
16462 | 333365716 | (nadirOps) |
Bookkeeping
- (This page): Define ingredients of reprocessing (processing code/configuration changes)
- Processing History database: http://glast-ground.slac.stanford.edu/HistoryProcessing/HProcessingRuns.jsp?processingname=P120P130
- List of all reprocessings
- List of all data runs reprocessed
- Pointers to all input data files (-> dataCatalog)
- Pointers to associated task processes (-> Pipeline II status)
- Data Catalog database: http://glast-ground.slac.stanford.edu/DataCatalog/folder.jsp
- Lists of and pointers to all output data files
- Meta data associated with each output data product
...
P130-FT1
Anchor |
---|
...
|
...
|
This task generates all desired FITS data products. An example of the code processing chain appears on a child page.
Status chronology
Configuration (version 2)
- 1/26/2012 - Initial task configured, 1000 runs reprocessed. On hold for validation.
- 2/1/2012 - Okay to continue. Begin block 1 reprocessing, containing 19133 runs:
| MET(sec) | MJD(days) | UNX(sec) | UTC |
---|---|---|---|---|
1st run | 333886531 | 55774.427442129629 | 1312193731.000 | 2011-08-01 10:15:29 |
last run | 349266293 | 55952.433946759258 | 1327573493.000 | 2012-01-26 10:24:51 |
- 2/3/2012 - Block 1 reprocessing complete
- 4/2/2012 - Begin block 2, raising the total # runs to 20126, through run 354926369 2012-03-31 22:39:29 (UTC)
- 4/18/2012 - Final reprocessing block: additional 259 runs through 356429251, 2012-04-18 08:07:29 (UTC)
- 4/19/2012 - Complete.
- 5/1/2012 - Reconfigure start time - reducing by 1.9 seconds - for run 245403855 due to old leap second problem - to regain 27 events at the start of the run. This is stream 1019.
- 8/3/2012 - A request to update the event selection filter on data in the period 1 Aug 2011 and 19 Apr 2012 was made to bring EXTENDED FITS and FILTEREDMERIT files into synch. See this page for details. The P130-FT1 task has been modified to run a new event filter; mergeClumps job steps for the following runs rolled back: This action will recreate only the EXTENDEDFT1 and EXTENDEDLS1 data products. These will automatically be ingested by the astroserver. Note that the file original-config.py contains the former configuration for this task, while config.py contains the changes discussed above.
Run
Task Stream
Start
333886531
16460
End
356429251
20384
- 8/6/2012 - Re-reprocessing complete
Configuration
Task Location | /nfs/farm/g/glast/u38/Reprocess-tasks/P120P130-FT1 |
Task Status | http://glast-ground.slac.stanford.edu/Pipeline-II/task.jsp?task=6339023674765664 |
Input Data | MERIT ( from P120-MERIT (runs 239557414 - 333880535) and Level 1 (runs 333886531 - present) |
spacecraft data | FT2 from P105 (runs 239557414 - 271844560), then from current Level 1 production |
Input Run List | ftp://ftp-glast.slac.stanford.edu/glast.u38/Reprocess-tasks/P120P130-FT1/config/runFile.txt |
Reprocessing Mode | reFT1 |
meritFilter | FT1EventClass!=0 for P120-reprocessing, and (FT1EventClass& 0x00003EFF)!=0 for subsequent data |
evtClassDefs | 00-19-01 |
eventClassMap | EvtClassDefs_P7V6.xml |
ScienceTools | 09-2426-00 02 |
Code Variants | redhat5-i686-32bit-gcc41 (Optimized) |
Diffuse Model | based on contents of /afs/slac.stanford.edu/g/glast/ground/GLAST_EXT/diffuseModels/v2r0 ) |
Diffuse Response | 'source' using P7SOURCE_V6 IRF |
IRFs | P6V7P7V6, contained within ScienceTools release |
Output Data Products |
...
Data Product | selection | makeFT1 | gtdiffrsp | gtmktime | gtltcube |
---|---|---|---|---|---|
FT1 (for FSSC) | 'source' and above | truetrue | true | false | |
LS1 (for FSSC) | 'transient' and above | truetrue | true | false | |
FT1EXTENDED | FT1EventClass!=0 | truetrue | true | false | |
LS1EXTENDED | FT1EventClass!=0 | truetrue | true | false | |
ELECTRONFT1 | CTBParticleType==1 | true | false | true | false |
Note that diffuse response is calculated for 'source' and 'clean' event classes only.
Note on 'Code Variant': The SLAC batch farm contains a mixture of architectures , both hardware (Intel/AMD 64-bit) and software (RHEL5-64, gcc v4.1, etc.).
Timing and Performance
With a sample of the first 8128 runs, the CPU-intensive part of the task (mergeClumps) is consuming ~42 CPU-min/job (differences in machine class are small). TrickleStream configured to allow 1500 simultaneous mergeClumps jobs, and up to 150 job starts every 200 second cycle.