Blog from August, 2008

Reason for change

At C&A's request: We have a new GR, GR v15r40, which contains the space craft alignment code. We also have two new sets of job options. One set is to enable the space craft alignment. The other is to turn on the TKR alignment again which was accidentally turned off on July 16 (in L1Proc 1.61). ***ADDED May 13, 2010 by anders: The TKR alignment was turned off in L1Proc 1.62, see this***

Note that while C&A has accepted the S/C alignment, there is no explicit SSC JIRA for them. I copied over the set with flavor "align" to "vanilla" which is the default flavor in L1. We need to start tracking them like we track all other calibrations. In particular to make sure any updates (if there are any) get propagated into L1 ("vanilla").

There are also general updates to the monitoring.

Test Procedure

We have processed data runs in the DEV pipeline with this version of L1Proc. C&A has looked at the difference between a reprocessed run (with S/C and TKR alignment) and the original processing (no S/C or TKR alignment). Their results are here.  A direct comparison between the Multiple scattering angle for reprocessed and original processing can be found in the attachments to this JIRA: GDQMQ-252.

Rollback procedure

We can switch back to the previous version of L1Proc in case of problems.

CCB Jira

SSC-130@JIRA

Details

L1Pipeline v1r64:
- Run the SVAC ntuple on the chunk level.

GlastRelease v15r40:
- The main change with this GR is the space craft alignment code. In addition, we have added two sets of job options: one to enable the space craft alignment and another to enable the TKR alignment that was accidentally turned off on July 16. These four lines of JO will be added to Gleam in a subsequent GR. For the moment they are appended to the usual recon JO by the L1 pipeline. (Sorry for the funky format below, but Confluence insists on interpreting the third JO line as a macro and this was the only way I found around it)

FT1Alg.CorrectForAberration = true;
FT1Alg.AlignmentFlavor      = "vanilla";

Calibration.Members       = {"TkrCalibAlg/TkrReconCalib"};
TkrReconCalib.calibFlavor = "vanilla";

- Systemtestsfor GR v15r40with respect to the previous production release, GR v15r39. There are no changes in the system tests.
- RM diffwith respect to GR v15r39.

dataMonitoring/FastMonCfg: v1r6p0
- All histograms at the tower level needs a _Tower added as a suffix. Relevant jira(s):GDQMQ-258
- Add spacecraft_orbit_mode and spacecraft_orbit_inSAA. Relevant jira(s): GDQMQ-195
- Remove cut at 0 for Cal plots CalX_Total_NHit_TH1 and CalX_NHit_TH1 in config.xml. Relevant jira(s): GDQMQ-230
- Remove Tick20MHzDeviation_SuperZoom_TH1 and tune Tick20MHzDeviation_Zoom_TH1 x-axis to (-300, -100) in config.xml configLCI.xml MergeHistos_FastMon.txt. Relevant jira(s): GDQMQ-246
- Change histogram name AcdHitsCounter_CableChannel_TH2 to AcdHitMap_GafeGarc_TH2, and swap axis to match the Digi definition in config.xml configLCI.xml MergeHistos_FastMon.txt. Relevant jira(s): GDQMQ-244

dataMonitoring/DigiReconCalMeritCfg: v1r2p27
- Trend the rate of TEM bugs using the FastMon_error_summary bit 23, type is rate in (monconfig_fastmon_trending.xml). Changes to address GDQMQ-254
- Corrected bug in filter for some of the compressed size rates (monconfig_digi_trending.xml).
-  Add FastMon trending quantities FastMon_spacecraft_orbit_mode and FastMon_spacecraft_orbit_inSAA as ValueChange. in monconfig_fastmon_trending.xml
- Changes to address GDQMQ-195
- Check that signal > 0 in ACD PMT that vetoed before computing Log10() to fill histogram. This only affects recon histos job (monconfig_recon_histos.xml).
- Remove cut at 0 for Cal plots CalX_Total_NHit_TH1 and CalX_NHit_TH1_Tower[16]. Changes to address GDQMQ-230
- Remove Tick20MHzDeviation_SuperZoom_TH1 from merge histo digi list. Changes to address GDQMQ-246
- Remove Tick20MHzDeviation_SuperZoom_TH1 and tune Tick20MHzDeviation_Zoom_TH1 x-axis to (-300, -100). Changes to address GDQMQ-246
- Add histogram to hold pedestals from the database.

dataMonitoring/Common: v4r9p0
- Add function LookupErrorCode to Lookup the symbolic name of an error code in FastMon. Relevant jira(s): GDQMQ-257
- New variable in pGloblals to tell FastMon to dump event with an error to file : FASTMON_DUMP_ERRORS_TO_FILE is set to false, and is used only for debugging. Relevant jira(s): GDQMQ-253
- New attribute "seconds_elapsed" in the FastMon parsed by the pErrorLogger module.
- New algorithm to normalize the number of errors to the run duration added to pErrorLogger.
- Minor bug fix: the algorithm alg__values was crashing if for some reason it was not run (for instance if the plot was missing). Which should never happen, but... good to be prepared.
- Another bug fix: exception handling correctly implemented in alg__empty_bins.
- Added a new class (pErrorLogger) to apply alarms on the xml output files from the fastMon error handler and the verify module. The interface is identical to the one of the alarm handler, with the only difference that  the new class operates on xml files, rather than root files (and the algorithms are obviously different). Not running into the pipeline, yet.
- Implemented the pedestal deviation in the CAL fitting tool, based on the new file from David (with the histogram of the pedestal references).
- Number of degrees of freedom and reduced chisuare added to the details dict for all the algorithms involving fitting.
- Left fitting range changed from 1.5 to 2.5 sigma for LEX8 and HEX8 CAL pedestals in pCalPedsAnalyzer.py.
- Some minor refactoring in the p*Analyzer classes to allow the derived classes to add their own histograms to the ones specified in the base class.
- Added histograms of the mean and rms values wrt the truncated average method in pCalPedsAnalyzer and pAcdPedsAnalyzer.
- Added placeholders for the pedestal deviations in pCalPedsAnalyzer and pAcdPedsAnalyzer.
- New alarm algorithm added: alg__low_high_ratio, measuring the ratio--in terms of number of entries--below and above a given pivot point. Relevant jira(s): GDQMQ-26
- All the alarms parameters with a default value (i.e. those which may not be specified in the xml file) added to the parameters dict of the algorithm so that they make it to the output xml file.
- Methods to adjust the x range in pAlarmBaseAlgorithm modified to use the getParameter() method for consistency (this way the min and max parameters make it to the output xml file).
- All algorithms modified to accept both TH1F and TH1D (and/or TH2F and TH2D where necessary).
- Test functions for all the algorithms modified as to print out the actual parameters dict, as it is shipped out to the xml file. * Relevant jira(s): GDM-133
- Output status promoted to warning (used to be undefined) in the edge finders if the edege(s) is (are) not found. The idea is that a minimu suitable number of entries is required before running an alarm on a plot; if the condition is not met the status is UNDEFINED---no need to bother. This change is intended to distinguish the cases in which the algorithm is supposed to have enough stastistics (and hence to find the edge) but it doesn't.
- Bug fix in pBaseAnalyzer: undefined variable when printing the fit parameters on the screen, was affecting the debug mode only.
- Bug fix in pBaseAnalyzer: the errors bars were not set correctly, there was an offset of -1.
- Default values for the ACDanalyzer quantites (for the unphysical empty histograms) set to 0.0 rather than -1.0

dataMonitoring/FastMon: v4r5p0
- Always do a CopyTree even when there is no cut, it seems more reliable pCustomPlotter.py
- Sort list of plots before creating them and add some logging to know which plot is cbeing done : pFastMonTreeProcessor.py
- Update getExpandedExpression() to add only _ and not a full suffix pXmlOutputList.py
- Needs FastMonCfg v1r6p0. Relevant jira(s): GDQMQ-258
- Update error handling using a new function provided by Ric to automatically log the error type for ALL iterators.
- TEM_BUG are still tagged as such in the ERR contribution but in the TKR contribution iterator  UnphysStrip, BadStripOrder, UnPhysTOT and TooManyHits are not anymore counted as TEM_BUG, the bug should be catched earlier.
- The function LookupErrorCode(obj, code) is defined in Common/python/pGlobals.py v4r9p0.  This means that this version of FastMon (>v4r4p0) REQUIRES  Common >= v4r9p0. Relevant jira(s): GDQMQ-257
- error_summary is filled in __postEvent : pDataProcessor. Relevant jira(s): GDQMQ-25
- When FASTMON_DUMP_ERRORS_TO_FILE is set to true, the events with an error are dumped to a file calling the new method __dumpEventToFile. For debugging purpose only : pDataProcessor. Relevant jira(s): GDQMQ-253
- Add spacecraft_orbit_mode and spacecraft_orbit_inSAA, read them from magic7, pass them to the updated pSCPosition constructor, add get methods, and fill quantities in the pGeomagProcessor in pM7Parser.py pSCPosition.py pGeomagProcessor.py. Relevant jira(s): GDQMQ-195
- Add the seconds_elapsed to the error summary: pXmlErrorMerger. Relevant jira(s): GDQMQ-251
- Add the seconds_elapsed to the error summary: pErrorHandler.py
- Give number of seconds elapsed to the ErrorHandler: pDataProcessor.py
- Add ERR_TrgParityError to the handleError(), to synced with the last LDF developments, pLATcomponentIterator.py
- Bug fix in gtrcPhaseError, pERRcontributionIteratorBase.py. Relevant jira(s): GDQMQ-251
- Update old files to v3r5p6 after the tags were lost. Hopefully, everything is OK now. I hate cvs even more than before.
- Bug fix, event number was not correctly reported in pErrorHandler. Relevant jira(s): GDQMQ-197
- Change histogram name AcdHitsCounter_CableChannel_TH2 to AcdHitMap_GafeGarc_TH2, and swap axis to match the Digi definition, in pCustomPlotter
- Log the LDF version in pDataProcessor. Relevant jira(s): GDQMQ-243 and GDQMQ-244 

svac/Monitor: v1r2p27
- Added call to function facilities::commonUtilities::setupEnvironment() in class runStrip_t.cxx in order to access MySQL with GRv15r40.
- Add object FastMon_error_summary to be able to trend the number of TEM bugs. Changes to address GDQMQ-254
- Add FastMon trending quantities FastMon_spacecraft_orbit_mode and FastMon_spacecraft_orbit_inSAA as ValueChange. in monconfig_fastmon_trending.xml
- Needs DigiReconCalMeritCfg v1r2p23
- Changes to address GDQMQ-195
- Bug corrected in definition of ValChange. This bug prevented using proxies with ValChange values.

dataMonitoring/AlarmsCfg: v4r7p5
- Added exception in FastMon eor for spikes and holes on tower 0, gtfe 13, layer 14.
- Upper warning limit on the recon AcdPhaMipsCorrectedAngle moved from 1.2 to 1.3 (required by the change in the fit subrange made in v4r7p
-  A few parameter changed in recon eor configuration file to make it more roboust in case of low statistics.
- Limits on CalX_NHit_TH1_Tower_* (x_rms) in digi eor changed a little bit.
- Condition on the minimum number of entries added for the alarm on digi eor GPSInLock_TH1.
- Limits loosened a tiny bit on CalX_NHit_TH1_Tower_* (x_average) for FastMon eor, for consistency with the corresponding digi eor plot.
- Limits loosened on some of the calhist quantities, as the details of the algorithms are still under development.

calibTkrUtil v2r7
- Add error values to occupancy related parameters for TKR monitor to address request from data monitoring group. improve calculation of errors for zero entry cases.

Complete set of tags for L1Proc 1.64

Code Versions

GlastRelease (sim/recon) v15r40*

ScienceTools (Level 2) : v9r6p2

Science Ops (task defs, scripts):

Level 1 pipeline code and applications running in L1:

svac/L1Pipeline: v1r64*

calibTkrUtil v2r7*
calibGenTKR v4r5

dataMonitoring/AlarmsCfg: v4r7p5*
dataMonitoring/FastMonCfg: v1r6p0*

dataMonitoring/DigiReconCalMeritCfg: v1r2p27*

dataMonitoring/Common: v4r9p0*
dataMonitoring/FastMon: v4r5p0*
datMonitoring/IGRF: v1r0p1

svac/Monitor: v1r2p27*
svac/EngineeringModelRoot: v4r3
svac/TestReport: v6r16

users/richard/pipelineDatasets: v0r6

ft2Util: v1r2p23

evtClassDefs v0r6

GPLtools: v1r11

Reason for change

The main reason for change is the update of the alarm exception after the upload of the new TKR masks on the LAT. A few other improvements (described in details below) are also included. The proposed new release of dataMonitoring/AlarmsCfg is v4r7p0 (as opposed to v4r3p4).

Test Procedure

We have processed monitoring products from real on-orbit data (LPA) locally with this version of AlarmsCfg.

Rollback procedure

The package can be rolled back to the previous version by flipping a soft link. Also note that the package is completely independent from any other package running in the pipeline and will not cause a version change of L1Proc.

CCB Jira

SSC-129@JIRA

Details

Here is a cut and paste of all the relevant release notes.

v4r7p0
  • Update lower limit on DeltaEventTime edge to 530, because it cannot go under 529, in digi_eor_alarms.xml.
  • Alarm on DeltaEventTime changed from leftmost_edge to x_min_bin.
  • Exceptions for the FastMon eor alarms modified after the new TKR masks have been uploaded on the instrument.
  • Exceptions for the TkrMon trending alarms modified after the new TKR masks have been uploaded on the instrument.
  • Limits loosened on some of the calhist quantities, as the details of the algorithms are still under development.
  • Limits loosened a tiny bit on CalX_NHit_TH1_Tower_* (x_average) for digi eor---we were seeing occasional spurious violations.
  • Relevant jira(s): GDQMQ-255
v4r6p1
  • Added a new alarm on the rate of timetones incomplete errors from the FastMon.
v4r6p0
  • Added exceptions on the TkrHitsCounter_PlaneGTFE_TH2_Tower_0 (empty_bins) in fastmon_eor for the four almost completely disconnected GTFEs in tower 0. NOTE: needs dataMonitoring/Common v4r8p1 or later to work---the change will have no effect otherwise.
v4r5p0
  • Added configuration files for the alarms on the xml output files from the FastMon error handler and the verify module.
v4r4p0
  • Added alarms on pedestal deviations and differences with respect to the truncated average in calpeds_eor_alarms.xml.

Reason for Change

Add code to send source monitoring fluxes to GSSC.

Test Procedure

Tested on data in dev db tables and in dev pipeline on data in /ASP/TestSims2.

Rollback procedure

Revert to ASP v2r8p4

CCB JIRA

ssc-128@jira

Details

  • ASP-19@JIRA Add light curve and GRB FITS files to FastCopy stream sent to GSSC
  • ASP v2r8p5
    • AspHealPix v0r0p1
    • AspLauncher v1r3p3
    • AspPolicy v0r6
    • BayesianBlocks v0r2
    • asp_pgwave v1r7p7
    • drpMonitoring v1r6*
      • updated makeDrpLcTables to dispatch FITS files using fastCopy
    • grbASP v4r5p7*
      • fixed typo in BlindSearch.py
    • pyASP v3r5p6*
      • added fastCopy.py
      • updated date2met.py

Reason for Change

Database table changes are required to support web application interfaces to RSP results for point sources. These changes force substantial modifications of the SQL used by ASP to access point source information from the db tables.

Test Procedure

Tested in dev using data in /ASP/TestSims2.

Rollback procedure

Revert to ASP v2r8p4. POINTSOURCES db table content must be restored to its previous state.

CCB JIRA

ssc-127@jira

Details

  • POINTSOURCES db table changes
    • Roberto modified the dev tables with these SQL queries:
      Roberto's SQL for changing POINTSOURCES table
      insert into POINTSOURCETYPESET  select ptsrc_name,'DRP' from PointSources where source_type = 'DRP'
      update PointSources set source_type = 'Blazar' where ptsrc_name != 'LSI +61 303' and source_type = 'DRP'
      update PointSources set source_type = 'MQ' where ptsrc_name = 'LSI +61 303'
      
    • The SOURCESUBTYPES table was modified using
      insert into SOURCESUBTYPES (sourcesub_type) values ('BLZRGRPSRC')
      insert into SOURCESUBTYPES (sourcesub_type) values ('KNOWNPSR')
      insert into SOURCESUBTYPES (sourcesub_type) values ('PGWAVE')
      
    • The blazar group sources and known pulsars were added to the POINTSOURCETYPESET table using this script. Similar code will need to be implemented to insert the existing pgwave sources into this table.
  • ASP v2r8p4
    • AspHealPix v0r0p1
    • AspLauncher v1r3p3
    • AspPolicy v0r6
    • BayesianBlocks v0r2
    • asp_pgwave v1r7p7*
      • insert new source into pointsourcetypeset table with PGWAVE sourcesub_type
      • set emax=3e5 for gtselect filtering prior to counts map creation
    • drpMonitoring v1r5*
      • set emax=3e5 for gtselect prefiltering
      • read eband_ids from db tables instead of using hard-coded dictionary
      • modify SQL to perform inner join with pointsourcetypeset to find relocated source types
      • updates to template file for light curve export to GSSC
    • grbASP v4r5p6*
      • Add code to clean up png figures of log-probabilities that are sent to balist
      • enable automatic mailing of ASP alerts to balist
    • pyASP v3r5p5

Reason for change

Two main reasons:

  • a fix in the accumulation algorithm to account for data points with zero error
  • fix the import script to prevent adding time bins with corrupted data. This will allow to spot runs with corrupted timing information and prevent the corruption of the db tables.

Test Procedure

This version has been tested on the DEV server.

Rollback procedure

Version 1.0.5 can be easily put back in place.

CCB Jira

SSC-126@JIRA

Details

type key summary assignee reporter priority status resolution created updated due

Unable to locate Jira server for this macro. It may be due to Application Link configuration.

Reason for change

We found a typo in one of the xml configuration files for the alarms handler. The effect was that one of the alarms was applied twice on the PMT A of the ACD tiles and never on the PMT B. The new release of dataMonitoring/AlarmsCfg (v4r3p4 as opposed to v4r3p0) includes a fix for this, along with a few other minor improvements (details in the last section).

Test Procedure

We have processed monitoring products from real on-orbit data (LPA) locally with this version of AlarmsCfg.

Rollback procedure

The package can be rolled back to the previous version by flipping a soft link. Also note that the package is completely independent from any other package running in the pipeline and will not cause a version change of L1Proc.

CCB Jira

SSC-124@JIRA

Details

Here is a cut and paste of all the relevant release notes.

v4r3p4
  • fastmon_trend_alarms_exceptions.xml and trackermon_trend_alarms_exceptions.xml updated to include the exceptions on the known noisy strips.
v4r3p3
  • Lower warning limits on Mean_ToT_con*_Trunc_1_249_TowerPlane in digi_trend changed from 25.0 to 8.0. The idea is that a single noisy strip used to trigger this alarm with the old limits, generating confusion. This alarm remains to notify serious issues with the TOT measurement.
  • High warning limit on AcdPedRMS_PMTA_TH1 in acdpeds_eor.xml changed from 6.0 to 7.0. Not really important as this task is not running in the pipeline, yet.
v4r3p2
  • Added alarms on ReconAcdPhaMipsCorrectedAngle_PMTB_TH1_AcdTile_* for the ribbons in the recon_eor.
  • Relevant jira(s): GDQMQ-219
v4r3p1
  • Wrong plot name in recon_eor_alarms.xml fixed (an alarm was applied on ReconAcdPhaMipsCorrectedAngle_PMTA_Zoom_TH1_AcdTile_* twice and it was not applied on ReconAcdPhaMipsCorrectedAngle_PMTB_Zoom_TH1_AcdTile_*).
  • Fitting subrange for ReconAcdPhaMipsCorrectedAngle_PMT*Zoom_TH1_AcdTile* in recon_eor_alarms.xml changed from 0.0-1.5 to 0.2-1.4.
  • Upper warning limit on ReconAcdPhaMipsCorrectedAngle_PMTA_Zoom_TH1_AcdTile_* in recon_eor_alarms.xml changed from 1.5 to 1.2.
  • Upper warning limit on Lac_Thresholds_Face*TH1_TowerCalLayerCalColumn* in recon_eor_alarms.xml changed from 0.55 to 0.50.
  • recon_eor_alarms_postprocess.xml and recon_eor_alarms_postprocess_debug.xml updated according to the new set of alarms.
  • Relevant jira(s): GDQMQ-248

Reason for change

The main reason for change is that a change in the Recon_ReconAcdPhaMipsCorrectedAngle_PMTA_TH1_AcdTile plot is causing a lot of spurious alarms to fire. Those alarms have been moved to another quantity (Recon_ReconAcdPhaMipsCorrectedAngle_PMTA_Zoom_TH1_AcdTile), which is more suitable. The proposed package version is dataMonitoring/AlarmsCfg v4r3p0, as opposed to the original v4r0p0.
The detailed description of the changes is in the last section.

Test Procedure

We have processed monitoring products from real on-orbit data (LPA) locally with this version of AlarmsCfg.

Rollback procedure

The package can be rolled back to the previous version by flipping a soft link. Also note that the package is completely independent from any other package running in the pipeline and will not cause a version change of L1Proc.

CCB Jira

SSC-123@JIRA

Details

Here is a cut and paste of all the relevant release notes.

v4r3p0
  • Alarms on the MIP peak of the ACD tiles moved to the zoomed plot (Recon_ReconAcdPhaMipsCorrectedAngle_PMTA_Zoom_TH1_AcdTile), as the not-zoomed one (Recon_ReconAcdPhaMipsCorrectedAngle_PMTA_TH1_AcdTile) became log-log and is no more suitable for that.
v4r2p0
  • All the useless <parameter name="max" value="9999"/> removed from the digi_eor_alarms.xml file.
  • All the useless <parameter name="max" value="9999"/> removed from the fastmon_eor_alarms.xml file.
  • All the useless <parameter name="max" value="9999"/> removed from the recon_eor_alarms.xml file.
v4r1p0
  • Added conditions on the number of entries in recon_eor to prevent the alarms from being evaluated when there's not enough statistics.

Reason for change

A bunch of new features:

  • administrator page to edit point sources information
  • tabular view of light curves
  • view of tentative associations for each detected source
  • applet for visualization of fits files

Plus bug fixes.

Test Procedure

This version has been tested on the DEV server.

Rollback procedure

Version 0.2.1 can be easily put back in place.

CCB Jira

SSC-121@JIRA

Details

type key summary assignee reporter priority status resolution created updated due

Unable to locate Jira server for this macro. It may be due to Application Link configuration.

Reason for Change

The current set of energy bands (100-300, 300-1000, 1000-3000, 3000-10000, 10000-300000, 100-300000 MeV) divide up the data too finely, resulting in mostly upper limits in the top two bands, whereas the bottom three bands mostly show redundant information to the overall band. A proposal has been made to change to (300-1000, 1000-300000, 100-300000) MeV.

Test Procedure

Table changes and rollback procedure have been tested using the dev tables. The tasks were tested in dev with the new energy bands using data in /ASP/TestSims2.

Rollback procedure

The table changes will be implemented using the script refactorEnergyBands.py and the rollback can be performed using restoreEnergyBands.py.

CCB JIRA

ssc-119@jira

Details

See my presentation at the 8 Aug 2008 Weekly Analysis meeting and this email exchange.

Reasons for Change

Various items listed as JIRA issues in Details section.

Test Procedure

Tested in dev on data in /ASP/TestSims2

Rollback Procedure

Revert to ASP v2r8p2.

CCB JIRA

ssc-117@jira

Details

  • ASP-40@JIRA PGWave will make duplicate db table entries for the same "flare event" if rolled back
  • ASP-42@JIRA PGWave should register tarballs using nDatasetId
  • ASP-43@JIRA GRB_refinement scriptlet is trying to register error contour plots when they don't exist
  • ASP-45@JIRA save png versions of allsky images in addition to or instead of gif versions
  • ASP v2r8p3:
    • ASP/AspHealPix v0r0p1
    • ASP/AspLauncher v1r3p3
    • ASP/AspPolicy v0r6
    • ASP/BayesianBlocks v0r2
    • ASP/asp_pgwave v1r7p6*
    • ASP/drpMonitoring v1r4p4
    • ASP/grbASP v4r5p5*
    • ASP/pyASP v3r5p5

Reason for change

We have a turbo version of CalRecon that increases the overall recon processing speed by a factor two! This GR also includes the correct job options to enable the offline ACD High Range and Coherent Noise calibrations,  JIRAS SSC-113 and SSC-112, has fixes for G4 propagator crashes and includes the FSW compressed event size and FSW compression level in the digi file. Note that the ACD trapzoids and Toby's space craft alignment are not in this GR. 

FastMon has been upgraded to use the new version of LDF that has been installed in ISOC_PROD. This version has a much improved error parsing, in particular for phasing errors/TEM bug. Note that GR still uses the old version of LDF. Is not affected by this as it gets LDF through GLAST_EXTERNAL. GR will catch up at a later stage.  

This L1Proc also has general updates and bug fixes to the monitoring.

Test Procedure

We have processed data runs in the DEV pipeline with this version of L1Proc.

Rollback procedure

We can switch back to the previous version of L1Proc in case of problems.

CCB Jira

SSC-116@JIRA

Details

L1Pipeline v1r63
- Add alarms on calHist. LONE-107@JIRA
- In test mode only, acqQuery.runTimes returns values supplied by halfpipe if run is not in ACQSUMMARY.
- Always use prod L0 archive to make magic7.
- Automatically determine host groups attached to glastdataq when making host lists.
- Premake run, chunk and crumb directories on buffers to reduce concurrent writes.
- Fix chunk overlap testing, add more tests. LONE-92@JIRA
- Put CMTPATH in environment for all jobs. Might help with broken headers when merging.

GlastRelease v15r39:
- Systemtestsfor GR v15r39with respect to the previous production release, GR v15r33. There are a certain number of small differences in the ACD and CAL. They all seem consistent with random fluctuations and/or the code changes. The one worry is that all the 'random' fluctuations only seem to occur for ACD and CAL quantities, not TKR. However, the differences are small. And it's not obvious how to disentangle these things without making many new releases were we back out of one thing at the time. This is the drawback when we hold of GR for a while and get a lot of new things in all at once.   
- RM diffwith respect to GR v15r33.

svac/EngineeringModelRoot: v4r3
- Added compressed event size and compression level.
- Added an explicit exit(1) in case of problems writing the output file, JIRA SVAC-117.
- Added the new Gleam event flag.

dataMonitoring/AlarmsCfg: v4r0p0
- Format of the exceptions donfig file modified as to make the mechanism more flexible. NOTE: the chahnge is not backward compatible. Need dataMonitoring/Common v4r1p0.
- Upper warning limit for spikes_and_holes on TkrHitsCounter_PlaneGTFE_TH2_Tower_ moved from 400 to 1000 (we were having quite a few of not significant warnings).
- Number of sigma for issuing warnings/errors moved from 1 to 2 on all the tkrTrend alarms, as per Hiro's request.
- Alarm limits on the calhist monitoring products made wider so that they do not cause spurious alarms in the tuning phase.
- Alarms on the ACD asymmetries added.
- Bins not corresponding to unphysical ACD tiles removed from the alarms (uses a new ad-hoc feature of the y_values alarm algorithm).
- Added a few alarms on the ACD pedestals.
- num_sigma set to zero for all the alarms on calhist; This takes care, for the moment, of the wrong error bars on the plots.
- Lower error limits on Mean_ToT_con0_Trunc_1_249_TowerPlane (digi trend) changed from 15 to 1 in such a way that an error is reported only if a controller is dead (and not in case of noisy strips). High error limits moved to 150.
- Upper warning limit on Rate_NTkrHits_TowerPlane moved from 2.2 to 2.5 in digi trend. Relevant jira(s): GDQMQ-237

dataMonitoring/FastMonCfg: v1r4p5
- Add ToT plots per plane in merging list file : MergeHistos_FastMon.txt Relevant jira(s): GDQMQ-148

dataMonitoring/DigiReconCalMeritCfg: v1r2p17
- Add  to address Jira GDQMQ-232.
- Add quantities to address jiras GDQMQ-224 and GDQMQ-239. Change cut FswFilters[0]>0 for FswGammaStatus ==0 in merit configs.
- Corrected bug i monconfig_recon_histos.xml (related to jira GDQMQ-229)
- Corrected quantities addressed by Jiras GDQMQ-229 and GDQMQ-233
- Add quantities addressed by Jira GDQMQ-241.
- Add to address Jira GDQMQ-232
- Changes to address jira GDQMQ-242.

dataMonitoring/Common: v4r1p0
- Changed the format for the xml files for the alarm exceptions. The new format is more flexible and allows to explicitely decide the output status (either WARNING or ERROR) upon exception violation. NOTE: the change is not backward compatible with the old configuration file format. Need datamonitoring/AlarmsCfg v4r4p0.
- Some refactoring in the base class for the alarm algorithm. A new general method called getFitOutput() has been implemented to replace the old getFitParameter() and getFitParametes(). It makes easier to implement new alarms involving fitting.
- Added new base class pGenericFitAlgorithm to deal in a uniform fashion with all the algorithms involving simple fitting.
- alg_gauss_mean, alggauss_rms, alggauss_norm, algpeak_position and alg_peak_width modified as to use the new method and the new base class.
- pBaseAnalyzer class modifies as to use the new method.
- Implemented a mechanism to link to each alarm on a multidimentional trending quantity the actual plot that triggered the alarm. Relevant jira(s): GDQMQ-240
-  Added an option to skip a defined set of histogram bins in alg__y_values.py. Necessary to put alarms on the ACD histograms with the GEM tile id on the x axis, since we do have bins which do not correspond to physical tiles, there. Relevant jira(s): GDQMQ-238
- One more handle implemented for the "values" and the "y_values" algorithms: multiplicative factor on the error bar of each trending point/bin value that can be set from the xml file (corresponds to put an alarm at the level of n sigma). Defaults to one, which was the previous behaviour. This also allows to put a zero factor which is handy for the digiLong histograms, in
which the errors are wrong, at the moment.
- Minimum TrueTimeInterval for the alg_values increased from 5 to 10 s. Relevant jira(s): GDQMQ-226
- Added a utility script to map the alarms on the plots.

svac/Monitor: v1r2p23
- This corrects a bug related to the filling of the histograms Garc:Gafe: Jira GDQMQ-244
- Add quantities to address Jira GDQMQ-232.
- Add quantities to address jiras GDQMQ-224 and GDQMQ-239.
- Add function computeratio_v2 and add new quantities. This addresses Jira GDQMQ-241.
- Upgrade of treemerge.cxx. It now Discard chunks contained in between two chunks. This addresses Jira GDQMQ-223
- Add quantities to address Jira GDQMQ-232.
- Add possibiliity of seeting histogram errors to zero. This addresses Jira GDQMQ-242.

dataMonitoring/FastMon: v4r1p3
-  Add back cmt requirements file. It had disappeared from the latest tag
- pCustomPlotter tagged with the ACD plot in elx space. It had disappeared from the latest tag
- Looks like an old version of pXmlInputList was tagged. I fixed it and tagged again.
- Retag things below after removing a wrong tag. Hopefully, everything is OK now. I hate cvs.
- Work only with LDF > 6.4.0
- Improve handling of errors following the new version of the LDF, all kind of errors now taken into account : pAEMcontributionIteratorBase.py
- Use the logger to dump debug message in case of an error : pCALcontributionIteratorBase.py
- Improve handling of errors following the new version of the LDF, all kind of errors now taken into account : pEBFeventIterator.py
- Improve handling of errors following the new version of the LDF, all kind of errors now taken into account. TEM_BUG is now known and reported as such : pERRcontributionIteratorBase.py
- New error contributions added for each iterators. ACD renamed as AEM : pError.py
- New error bits added. ACD renamed as AEM. LAT_DATAGRAM and EBF_EVENT have the same code : pErrorEvent.py
- Improve handling of errors following the new version of the LDF, all kind of errors now taken into account. Note that the new LDF code checks the status of the iterator and exits whenever a serious error is found. pLATcomponentIterator.py
- Improve handling of errors following the new version of the LDF, all kind of errors now taken into account : pLATcontributionIterator.py
- Use the logger to dump debug message in case of an error : pLATdatagramIterator.py
- Improve handling of errors following the new version of the LDF, all kind of errors now taken into account. EM_BUG is now known and reported as such : pTKRcontributionIteratorBase.py. Relevant jira(s): GDQMQ-197

svac/EngineeringModelRoot: v4r3
- Added EventGleamFlags.
- Added compressed event size and compression level. Needs GR v15r35 or greater. Added 'exit(1)' in case of problems writing the svac.root file.
- Added version of the gamma-filter to the ntuple.

svac/TestReport: v6r16
- Enabling only needed branches in digi takes care of memory leaks

users/richard/pipelineDatasets: v0r6
- Log result of header->Write in pruneTuple::copyHeader.

Complete set of tags for L1Proc 1.63

Code Versions

GlastRelease (sim/recon) v15r39*

ScienceTools (Level 2) : v9r6p2

Science Ops (task defs, scripts):

Level 1 pipeline code and applications running in L1:

svac/L1Pipeline: v1r63*

calibTkrUtil v2r5
calibGenTKR v4r5

dataMonitoring/AlarmsCfg: v4r0p0*
dataMonitoring/FastMonCfg: v1r4p5*

dataMonitoring/DigiReconCalMeritCfg: v1r2p17*

dataMonitoring/Common: v4r1p0*
dataMonitoring/FastMon: v4r1p3*
datMonitoring/IGRF: v1r0p1

svac/Monitor: v1r2p23*
svac/EngineeringModelRoot: v4r3*
svac/TestReport: v6r16*

users/richard/pipelineDatasets: v0r6*

ft2Util: v1r2p23

evtClassDefs v0r6

GPLtools: v1r11

Reason for change

A new tag (v6r16) was just created for the package svac/TestReport. In this tag, only the digi branches needed by the Verify module are enabled. This takes care of a memory leak coming from the branch m_tkrDigiCol (currently under investigation) and allows the verify module to run on bigger digi files.

Since the beginning of data taking, the verify module crashed in 3 runs because of this memory leak (run 236902646 processed with L1Proc 1.58 and runs 239129222 and 239119881 processed with L1Proc 1.61). We would like to upgrade svac/TestReport for these 2 versions of L1Proc and rollback the verify module (instead of reprocessing the entire runs with a newer - and still non existing - L1Proc). This will not touch any of the existing data or monitoring products - only Verify.

Test Procedure

The new tag was tested on one of the previously crashed digi files, while continuously monitoring the memory usage on the host machine. Not only the file was processed correctly, but the memory usage did not show any leak over 4M events.

Rollback procedure

The previous versions of TestReport (v6r12 for L1Proc 1.58 and v6r15 for L1Proc 1.61) will remain in the respective build directory. We can enable back those two very easily.

CCB Jira

SSC-115@JIRA.

Reason for change

The updates in the alarm handler limits respond to changes in the hardware configuration as well as in the ground software. Some minor fine tuning based on the experience with the new data is also included. The proposed package version is dataMonitoring/AlarmsCfg v3r0p1, as opposed to the original v2r4p2.
The detailed description of the changes is in the last section.

Test Procedure

We have processed monitoring products from real on-orbit data (LPA) locally with this version of AlarmsCfg.

Rollback procedure

The package can be rolled back to the previous version by flipping a soft link. Also note that the package is completely independent from any other package running in the pipeline and will not cause a version change of L1Proc.

CCB Jira

SSC-114@JIRA

Details

Here is a cut and paste of all the relevant release notes.

v3r0p1
  • Changed the upper limits on the significances of the empty bins in the FastMon eor.
v3r0p0
  • Configuration files for the alarms on the CAL and ACD calibration quantities added (not yet running in the pipeline).
  • Relevant jira(s): GDQMQ-216
v2r5p3
  • Alarm limits on the Tracker (and in particular on Hiro's quantities) updated.
  • The spikes_and_holes algorithm on the GTFE occupancy in fastmon eor is not allowed to produce errors anymore (only warnings).
  • Alarms on tower hit and trigger efficiency re-enabled. Lower limits are 0.98 and 0.90 respectively. An error is issued upon limit violations.
  • Relevant jira(s): GDQMQ-227
v2r5p2
  • Lower limits on the source event rate (merit eor) set to zero, as we were having alarms occasionally firing.
  • Relevant jira(s): GDQMQ-235
v2r5p1
  • Alarms on the quantity ReconAcdPhaMips_PMT*TH1_AcdTile* in recon eor removed as the corresponding plots are undergoing a revision and redefinition process after some changes in GR.
  • Relevant jira(s): GDQMQ-228
v2r5p0
  • Exception in fastmon eor on the noisy strip of tower 3 (which has been masked) removed.
  • Relevant jira(s): GDQMQ-225