Set up HGCAL GPU vs CPU DQM by fiemmi · Pull Request #50974 · cms-sw/cmssw

fiemmi · 2026-05-19T09:00:57Z

PR description:

This PR implements a new DQMEDAnalyzer to monitor TICL GPU and CPU reconstruction for HGCAL. It further schedules the analyzer through the alpakaValidationHLT procModifier. The output consists of TH1Fs and TH2Fs storing $x_{\textrm{GPU}} - x_{\textrm{CPU}}$ and $x_{\textrm{GPU}}:x_{\textrm{CPU}}$ respectively, where $x$ is a given TICL observable.

This PR is not dependent on any other PR.

PR validation:

The PR has been validated through the following pipeline:

cmsenv
mkdir testMatrix
cd testMatrix
runTheMatrix.py -w upgrade -l 36034.7503 -j 0
cd 36034.7503_TTbar_14TeV+Run4D125_HLTHeterogeneousValid
cmsRun TTbar_14TeV_TuneCP5_cfi_GEN_SIM.py
cmsRun step2_DIGI_L1TrackTrigger_L1_L1P2GT_DIGI2RAW_HLT_VALIDATION.py
cmsRun step3_HARVESTING.py

After running it, the aforementioned histograms can be found by opening the ROOT file DQM_V0001_R000000001__Global__CMSSW_X_Y_Z__RECO.root and inspecting the directory DQMData/Run 1/HGCAL/Run summmary.

If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for:

This PR is not a backport.

cmsbuild · 2026-05-19T09:01:31Z

cms-bot internal usage

cmsbuild · 2026-05-19T09:32:26Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-50974/49380

There are other open Pull requests which might conflict with changes you have proposed:
- File Configuration/EventContent/python/EventContent_cff.py modified in PR(s): L1S(Nano): add subpackage L1TriggerScouting/NanoAOD #50941
- File HLTrigger/Configuration/python/HLT_75e33_cff.py modified in PR(s): Phase2 Single_Tau_Trigger Path Added #49637, LST: add LSTGeometry package and associated ESProducer #50679

cmsbuild · 2026-05-19T09:32:52Z

A new Pull Request was created by @fiemmi for master.

It involves the following packages:

Configuration/EventContent (operations)
DQM/HGCAL (****)
HLTrigger/Configuration (hlt)

The following packages do not have a category, yet:

DQM/HGCAL
Please create a PR for https://github.com/cms-sw/cms-bot/blob/master/categories_map.py to assign category

@Martin-Grunewald, @cmsbuild, @davidlange6, @fabiocos, @ftenchini, @mandrenguyen, @mmusich can you please review it and eventually sign? Thanks.
@Martin-Grunewald, @SohamBhattacharya, @VourMa, @fabiocos, @missirol, @mmusich, @rovere this is something you requested to watch as well.
@ftenchini, @mandrenguyen, @sextonkennedy you are the release manager for this.

cms-bot commands are listed here

mmusich · 2026-05-19T09:35:06Z

+#include "FWCore/Framework/interface/MakerMacros.h"
+#include "DataFormats/Candidate/interface/Candidate.h"
+#include "DataFormats/CaloRecHit/interface/CaloClusterCollection.h"
+#include "DataFormats/CaloRecHit/interface/CaloCluster.h"


picky but can you alpha-order?

mmusich · 2026-05-19T09:35:26Z

+  edm::EDGetTokenT<reco::CaloClusterCollection> tokenMonitoredLayerClusters_;
+  edm::EDGetTokenT<reco::CaloClusterCollection> tokenReferenceLayerClusters_;


Suggested change

edm::EDGetTokenT<reco::CaloClusterCollection> tokenMonitoredLayerClusters_;

edm::EDGetTokenT<reco::CaloClusterCollection> tokenReferenceLayerClusters_;

const edm::EDGetTokenT<reco::CaloClusterCollection> tokenMonitoredLayerClusters_;

const edm::EDGetTokenT<reco::CaloClusterCollection> tokenReferenceLayerClusters_;

mmusich · 2026-05-19T09:35:52Z

+      tokenReferenceLayerClusters_(
+          consumes<reco::CaloClusterCollection>(iConfig.getParameter<edm::InputTag>("referenceLayerClusters"))) {}
+
+HGCALGPUvsCPUComparisonHists::~HGCALGPUvsCPUComparisonHists() {}


Suggested change

HGCALGPUvsCPUComparisonHists::~HGCALGPUvsCPUComparisonHists() {}

provided the method declaration is declared override

mmusich · 2026-05-19T09:36:31Z

+void HGCALGPUvsCPUComparisonHists::beginJob(const edm::EventSetup& iSetup) {}
+
+void HGCALGPUvsCPUComparisonHists::bookHistograms(DQMStore::IBooker& iBooker, edm::Run const&, edm::EventSetup const&) {
+  iBooker.setCurrentFolder("HGCAL");


shouldn't this be configurable ?
At the very least I'd like the new plot to appear under HLT...

mmusich · 2026-05-19T09:38:01Z

+  const std::vector<reco::CaloCluster>& monitoredLayerClusters = iEvent.get(tokenMonitoredLayerClusters_);
+  const std::vector<reco::CaloCluster>& referenceLayerClusters = iEvent.get(tokenReferenceLayerClusters_);


what if the product is not available?
We don't want to crash processing because of missing input in DQM.

mmusich · 2026-05-19T09:41:47Z

@fiemmi in addition to the review above, this relatively simple PR has 15 commits with sometimes not very useful comments, please consider squashing to a minimum. Also

DQM/HGCAL
Please create a PR for https://github.com/cms-sw/cms-bot/blob/master/categories_map.py to assign category

mmusich · 2026-05-19T09:46:28Z

+protected:
+  void beginJob(const edm::EventSetup& iSetup);
+  void analyze(const edm::Event& iEvent, const edm::EventSetup& iSetup) override;
+  void bookHistograms(DQMStore::IBooker& iBooker, edm::Run const& iRun, edm::EventSetup const& iSetup) override;


if this module has to run in the HLT, it must have a fillDescriptions. Please provide one.

cmsbuild · 2026-05-20T13:56:28Z

-code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-50974/49421

There are other open Pull requests which might conflict with changes you have proposed:
- File Configuration/EventContent/python/EventContent_cff.py modified in PR(s): L1S(Nano): add subpackage L1TriggerScouting/NanoAOD #50941
- File HLTrigger/Configuration/python/HLT_75e33_cff.py modified in PR(s): Phase2 Single_Tau_Trigger Path Added #49637, LST: add LSTGeometry package and associated ESProducer #50679

Code check has found code style and quality issues which could be resolved by applying following patch(s)

code-format:
https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-50974/49421/code-format.patch
e.g. curl -k https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-50974/49421/code-format.patch | patch -p1
You can also run scram build code-format to apply code format directly

cmsbuild · 2026-05-20T14:09:59Z

-code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-50974/49423

There are other open Pull requests which might conflict with changes you have proposed:
- File Configuration/EventContent/python/EventContent_cff.py modified in PR(s): L1S(Nano): add subpackage L1TriggerScouting/NanoAOD #50941
- File HLTrigger/Configuration/python/HLT_75e33_cff.py modified in PR(s): Phase2 Single_Tau_Trigger Path Added #49637, LST: add LSTGeometry package and associated ESProducer #50679

Code check has found code style and quality issues which could be resolved by applying following patch(s)

code-format:
https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-50974/49423/code-format.patch
e.g. curl -k https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-50974/49423/code-format.patch | patch -p1
You can also run scram build code-format to apply code format directly

cmsbuild · 2026-05-20T14:17:16Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-50974/49424

There are other open Pull requests which might conflict with changes you have proposed:
- File Configuration/EventContent/python/EventContent_cff.py modified in PR(s): L1S(Nano): add subpackage L1TriggerScouting/NanoAOD #50941
- File HLTrigger/Configuration/python/HLT_75e33_cff.py modified in PR(s): Phase2 Single_Tau_Trigger Path Added #49637, LST: add LSTGeometry package and associated ESProducer #50679

cmsbuild · 2026-05-20T18:40:08Z

-1

Failed Tests: UnitTests
Size: This PR adds an extra 48KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-f5699a/53390/summary.html
COMMIT: dadb719
CMSSW: CMSSW_17_0_X_2026-05-20-1100/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/50974/53390/install.sh to create a dev area with all the needed externals and cmssw changes.

Failed Unit Tests

I found 1 errors in the following unit tests:

---> test test_check_phase2_hlt_duplicates had ERRORS

Comparison Summary

Summary:

You potentially removed 3 lines from the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 14 differences found in the comparisons
DQMHistoTests: Total files compared: 66
DQMHistoTests: Total histograms compared: 4596347
DQMHistoTests: Total failures: 68
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 4596259
DQMHistoTests: Total skipped: 20
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 65 files compared)
DQMHistoSizes: changed ( 34434.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 276 log files, 236 edm output root files, 66 DQM output files
TriggerResults: found differences in 1 / 64 workflows

AMD_MI300X Comparison Summary

There are some workflows for which there are errors in the baseline:
34634.402 step 2
The results for the comparisons for these workflows could be incomplete
This means most likely that the IB is having errors in the relvals.The error does NOT come from this pull request

Summary:

You potentially removed 80 lines from the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 299 differences found in the comparisons
DQMHistoTests: Total files compared: 12
DQMHistoTests: Total histograms compared: 203010
DQMHistoTests: Total failures: 32404
DQMHistoTests: Total nulls: 38
DQMHistoTests: Total successes: 170568
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 11 files compared)
DQMHistoSizes: changed ( 34634.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 47 log files, 48 edm output root files, 12 DQM output files
TriggerResults: found differences in 2 / 11 workflows

AMD_W7900 Comparison Summary

Summary:

You potentially removed 34 lines from the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 364 differences found in the comparisons
DQMHistoTests: Total files compared: 13
DQMHistoTests: Total histograms compared: 218719
DQMHistoTests: Total failures: 40799
DQMHistoTests: Total nulls: 31
DQMHistoTests: Total successes: 177889
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 12 files compared)
DQMHistoSizes: changed ( 34634.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 49 log files, 50 edm output root files, 13 DQM output files
TriggerResults: found differences in 4 / 12 workflows

NVIDIA_H100 Comparison Summary

Summary:

You potentially removed 13 lines from the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 349 differences found in the comparisons
DQMHistoTests: Total files compared: 13
DQMHistoTests: Total histograms compared: 218719
DQMHistoTests: Total failures: 28892
DQMHistoTests: Total nulls: 32
DQMHistoTests: Total successes: 189795
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 12 files compared)
DQMHistoSizes: changed ( 34634.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 49 log files, 50 edm output root files, 13 DQM output files
TriggerResults: found differences in 2 / 12 workflows

NVIDIA_L40S Comparison Summary

Summary:

You potentially removed 13 lines from the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 363 differences found in the comparisons
DQMHistoTests: Total files compared: 13
DQMHistoTests: Total histograms compared: 218719
DQMHistoTests: Total failures: 27816
DQMHistoTests: Total nulls: 37
DQMHistoTests: Total successes: 190866
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 12 files compared)
DQMHistoSizes: changed ( 34634.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 49 log files, 50 edm output root files, 13 DQM output files
TriggerResults: found differences in 2 / 12 workflows

Max Memory Comparisons exceeding threshold NVIDIA_H100

@cms-sw/core-l2 , I found 1 workflow step(s) with memory usage exceeding the error threshold:

Expand to see workflows ...

Error: Workflow 34634.7503_TTbar_14TeV+Run4D121PU_HLTHeterogeneousValid step2 max memory diff 157.4 exceeds +/- 90.0 MiB

Max Memory Comparisons exceeding threshold NVIDIA_L40S

@cms-sw/core-l2 , I found 1 workflow step(s) with memory usage exceeding the error threshold:

Expand to see workflows ...

Error: Workflow 34634.7503_TTbar_14TeV+Run4D121PU_HLTHeterogeneousValid step2 max memory diff 150.3 exceeds +/- 90.0 MiB

cmsbuild · 2026-05-21T04:00:59Z

Pull request #50974 was updated. @Martin-Grunewald, @ctarricone, @davidlange6, @fabiocos, @ftenchini, @gabrielmscampos, @mandrenguyen, @mmusich, @rseidita can you please check and sign again.

waredjeb · 2026-05-21T09:17:57Z

+    layerClusters = cms.VInputTag("hltHgcalLayerClustersEE", *ceh_layerClusters),
+    time_layerclusters = cms.VInputTag("hltHgcalLayerClustersEE:timeLayerCluster", *ceh_time_layerClusters),


I think we should define hltMergeLayerClustersSerialSync directly with the SerialSync collections. Otherwise, at definition time it is identical to hltMergeLayerClusters, and the Phase-2 HLT duplicate-check unit test fails.

Suggested change

layerClusters = cms.VInputTag("hltHgcalLayerClustersEE", *ceh_layerClusters),

time_layerclusters = cms.VInputTag("hltHgcalLayerClustersEE:timeLayerCluster", *ceh_time_layerClusters),

layerClusters = cms.VInputTag("hltHgCalLayerClustersFromSoAProducerSerialSync", *ceh_layerClusters),

time_layerclusters = cms.VInputTag("hltHgCalLayerClustersFromSoAProducerSerialSync:timeLayerCluster", *ceh_time_layerClusters)

waredjeb · 2026-05-21T09:19:19Z

+alpakaValidationHLT.toModify(hltMergeLayerClustersSerialSync,
+   layerClusters = ["hltHgCalLayerClustersFromSoAProducerSerialSync", *ceh_layerClusters],
+   time_layerclusters = ["hltHgCalLayerClustersFromSoAProducerSerialSync:timeLayerCluster", *ceh_time_layerClusters]
+)


Given the comment above, this should no longer be needed

Suggested change

alpakaValidationHLT.toModify(hltMergeLayerClustersSerialSync,

layerClusters = ["hltHgCalLayerClustersFromSoAProducerSerialSync", *ceh_layerClusters],

time_layerclusters = ["hltHgCalLayerClustersFromSoAProducerSerialSync:timeLayerCluster", *ceh_time_layerClusters]

)

cmsbuild · 2026-05-21T11:42:01Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-50974/49431

There are other open Pull requests which might conflict with changes you have proposed:
- File HLTrigger/Configuration/python/HLT_75e33_cff.py modified in PR(s): Phase2 Single_Tau_Trigger Path Added #49637, LST: add LSTGeometry package and associated ESProducer #50679

cmsbuild · 2026-05-21T11:42:25Z

Pull request #50974 was updated. @Martin-Grunewald, @cmsbuild, @ctarricone, @davidlange6, @fabiocos, @ftenchini, @gabrielmscampos, @mandrenguyen, @mmusich, @rseidita can you please check and sign again.

mmusich · 2026-05-21T12:05:42Z

@cmsbuild, please test

mmusich · 2026-05-21T12:06:49Z

 )

+hltHgcalSoARecHitsProducerSerialSync = makeSerialClone(hltHgcalSoARecHitsProducer
+)


can you avoid going into the next line?

mmusich · 2026-05-21T12:07:10Z

 # Process modifiers: ticl_barrel and alpaka
 from Configuration.ProcessModifiers.alpaka_cff import alpaka
 from Configuration.ProcessModifiers.ticl_barrel_cff import ticl_barrel
+from Configuration.ProcessModifiers.alpakaValidationHLT_cff import alpakaValidationHLT


what do you need this for here?

Good catch, this is just a leftover from the previous update, where we removed the invocation of alpakaValidationHLT.toModify(). It is not needed in the current version of the code. Will be removed in the next update.

mmusich · 2026-05-21T12:07:22Z

@@ -1,4 +1,5 @@
 import FWCore.ParameterSet.Config as cms
+from HeterogeneousCore.AlpakaCore.functions import makeSerialClone


what do you need this for here?

Along the same lines of the comment above, this is a leftover from a previous version of the code. Will be removed in the next update. Thanks for spotting it.

mmusich · 2026-05-21T12:10:14Z

+        hltHgcalLayerClustersHSci+
+        hltHgcalLayerClustersHSi+
+        hltMergeLayerClustersSerialSync)
+alpakaValidationHLT.toReplaceWith(HLTTICLLocalRecoSequence, _HLTTICLLocalRecoSequence_heterogeneousGPUCPU)


the modifier should be explicitly imported in this file.

mmusich · 2026-05-21T12:17:20Z

@@ -1,4 +1,5 @@
 import FWCore.ParameterSet.Config as cms
+from HeterogeneousCore.AlpakaCore.functions import makeSerialClone


what is this needed for here?

cmsbuild · 2026-05-21T14:33:23Z

+1

Size: This PR adds an extra 56KB to repository
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-f5699a/53407/summary.html
COMMIT: cd5ccdb
CMSSW: CMSSW_17_0_X_2026-05-20-2300/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/50974/53407/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

No significant changes to the logs found
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 16 differences found in the comparisons
DQMHistoTests: Total files compared: 66
DQMHistoTests: Total histograms compared: 4596347
DQMHistoTests: Total failures: 6
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 4596321
DQMHistoTests: Total skipped: 20
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 65 files compared)
DQMHistoSizes: changed ( 34434.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 276 log files, 236 edm output root files, 66 DQM output files
TriggerResults: found differences in 1 / 64 workflows

AMD_MI300X Comparison Summary

Summary:

You potentially removed 3 lines from the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 375 differences found in the comparisons
DQMHistoTests: Total files compared: 13
DQMHistoTests: Total histograms compared: 218719
DQMHistoTests: Total failures: 32349
DQMHistoTests: Total nulls: 38
DQMHistoTests: Total successes: 186332
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 12 files compared)
DQMHistoSizes: changed ( 34634.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 49 log files, 50 edm output root files, 13 DQM output files
TriggerResults: found differences in 3 / 12 workflows

AMD_W7900 Comparison Summary

Summary:

You potentially added 7 lines to the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 378 differences found in the comparisons
DQMHistoTests: Total files compared: 13
DQMHistoTests: Total histograms compared: 218719
DQMHistoTests: Total failures: 31368
DQMHistoTests: Total nulls: 33
DQMHistoTests: Total successes: 187318
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 12 files compared)
DQMHistoSizes: changed ( 34634.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 49 log files, 50 edm output root files, 13 DQM output files
TriggerResults: found differences in 3 / 12 workflows

NVIDIA_H100 Comparison Summary

Summary:

You potentially removed 7 lines from the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 362 differences found in the comparisons
DQMHistoTests: Total files compared: 13
DQMHistoTests: Total histograms compared: 218719
DQMHistoTests: Total failures: 31661
DQMHistoTests: Total nulls: 32
DQMHistoTests: Total successes: 187026
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 12 files compared)
DQMHistoSizes: changed ( 34634.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 49 log files, 50 edm output root files, 13 DQM output files
TriggerResults: found differences in 2 / 12 workflows

NVIDIA_L40S Comparison Summary

Summary:

You potentially removed 5 lines from the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 377 differences found in the comparisons
DQMHistoTests: Total files compared: 13
DQMHistoTests: Total histograms compared: 218719
DQMHistoTests: Total failures: 28591
DQMHistoTests: Total nulls: 27
DQMHistoTests: Total successes: 190101
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 808.695 KiB( 12 files compared)
DQMHistoSizes: changed ( 34634.7503 ): 808.695 KiB HLT/HeterogeneousComparisons
Checked 49 log files, 50 edm output root files, 13 DQM output files
TriggerResults: found differences in 2 / 12 workflows

Max Memory Comparisons exceeding threshold NVIDIA_H100

@cms-sw/core-l2 , I found 1 workflow step(s) with memory usage exceeding the error threshold:

Expand to see workflows ...

Error: Workflow 34634.7503_TTbar_14TeV+Run4D121PU_HLTHeterogeneousValid step2 max memory diff 157.1 exceeds +/- 90.0 MiB

Max Memory Comparisons exceeding threshold NVIDIA_L40S

@cms-sw/core-l2 , I found 1 workflow step(s) with memory usage exceeding the error threshold:

Expand to see workflows ...

Error: Workflow 34634.7503_TTbar_14TeV+Run4D121PU_HLTHeterogeneousValid step2 max memory diff 164.5 exceeds +/- 90.0 MiB

mmusich

few other comments.

mmusich · 2026-05-21T20:09:13Z

+private:
+  const edm::EDGetTokenT<reco::CaloClusterCollection> tokenMonitoredLayerClusters_;
+  const edm::EDGetTokenT<reco::CaloClusterCollection> tokenReferenceLayerClusters_;
+  const std::string topFolderName;


Suggested change

const std::string topFolderName;

const std::string topFolderName_;

to be consistent.

mmusich · 2026-05-21T20:13:03Z

+  //2D
+  hLayerCluster2D_x = iBooker.book2D("hLayerCluster2D_x", "hLayerCluster2D_x", 200, -50, 50, 200, -50, 50);
+  hLayerCluster2D_y = iBooker.book2D("hLayerCluster2D_y", "hLayerCluster2D_y", 200, -50, 50, 200, -50, 50);
+  hLayerCluster2D_z = iBooker.book2D("hLayerCluster2D_z", "hLayerCluster2D_z", 250, -500, 500, 250, -500, 500);


do we really need this amount of bins in the 2D histograms?
The memory footprint of this PR in terms of DQM memory is on the high-ish side. See #50974 (comment)

DQMHistoSizes: Histogram memory added: 808.695 KiB( 65 files compared) DQMHistoSizes: changed ( 34434.7503 ): 808.695 KiB HLT/HeterogeneousComparisons

Will halve the number of bins on both axes in the next update.

mmusich · 2026-05-21T20:18:07Z

+    auto seed = (*referenceLayerClusters)[idx].seed();
+    if (seedToIdx.find(seed) != seedToIdx.end()) {
+      edm::LogWarning("HGCALGPUvsCPUComparisonHists") << "Duplicate seed in reference collection.";
+      return;


do you really want to return here, or just continue in the loop?

mmusich · 2026-05-21T20:21:18Z

+  edm::Handle<reco::CaloClusterCollection> monitoredLayerClusters_, referenceLayerClusters_;
+  iEvent.getByToken(tokenMonitoredLayerClusters_, monitoredLayerClusters_);
+  iEvent.getByToken(tokenReferenceLayerClusters_, referenceLayerClusters_);
+  if (!(monitoredLayerClusters_.isValid()) || !(referenceLayerClusters_.isValid())) {
+    edm::LogWarning("HGCALGPUvsCPUComparisonHists") << "Monitored or reference collection is invalid.";
+    return;
+  }
+  const std::vector<reco::CaloCluster>* monitoredLayerClusters = monitoredLayerClusters_.product();
+  const std::vector<reco::CaloCluster>* referenceLayerClusters = referenceLayerClusters_.product();


Suggested change

edm::Handle<reco::CaloClusterCollection> monitoredLayerClusters_, referenceLayerClusters_;

iEvent.getByToken(tokenMonitoredLayerClusters_, monitoredLayerClusters_);

iEvent.getByToken(tokenReferenceLayerClusters_, referenceLayerClusters_);

if (!(monitoredLayerClusters_.isValid()) || !(referenceLayerClusters_.isValid())) {

edm::LogWarning("HGCALGPUvsCPUComparisonHists") << "Monitored or reference collection is invalid.";

return;

}

const std::vector<reco::CaloCluster>* monitoredLayerClusters = monitoredLayerClusters_.product();

const std::vector<reco::CaloCluster>* referenceLayerClusters = referenceLayerClusters_.product();

const auto& monitoredHandle = iEvent.getHandle(tokenMonitoredLayerClusters_);

const auto& referenceHandle = iEvent.getHandle(tokenReferenceLayerClusters_);

if (!monitoredHandle.isValid() || !referenceHandle.isValid()) {

edm::LogWarning("HGCALGPUvsCPUComparisonHists")

<< "Monitored or reference LayerCluster collection is invalid.";

return;

}

const reco::CaloClusterCollection& monitoredLayerClusters = *monitoredHandle;

const reco::CaloClusterCollection& referenceLayerClusters = *referenceHandle;

Use edm::Handle with auto, and getHandle() instead of getByToken()

Validity check -> same logic, cleaner syntax

Prefer a const reference over a raw pointer

do not use trailing underscores in locals.

mmusich · 2026-05-21T20:21:44Z

+
+  //look for GPU and CPU LayerClusters whose seeds match
+  //map LC seeds to LC indices for the reference collection
+  std::unordered_map<uint32_t, std::pair<unsigned, bool>>


I think the corresponding header file is missing for this.

mmusich · 2026-05-21T20:28:35Z

+    auto it = seedToIdx.find(monitored.seed());
+    if (it != seedToIdx.end() && it->second.second == false) {
+      it->second.second = true;  //establish a match
+      const auto& reference = (*referenceLayerClusters)[it->second.first];


I think the whole code block L110 to L127 can be rewritten slightly more efficiently:

std::unordered_map<uint32_t, unsigned> seedToIdx; seedToIdx.reserve(referenceLayerClusters->size()); for (unsigned idx = 0; idx < referenceLayerClusters->size(); idx++) { auto [it, inserted] = seedToIdx.try_emplace((*referenceLayerClusters)[idx].seed(), idx); if (!inserted) { edm::LogWarning("HGCALGPUvsCPUComparisonHists") << "Duplicate seed in reference collection."; return; // continue? } } std::unordered_set<uint32_t> matched; matched.reserve(referenceLayerClusters->size()); for (const auto& monitored : *monitoredLayerClusters) { auto it = seedToIdx.find(monitored.seed()); if (it != seedToIdx.end() && !it->second.second) { it->second.second = true; const auto& reference = (*referenceLayerClusters)[it->second.first]; // fill histograms... } }

I really like the idea of using try_emplace. The current version of the code performs two hash-table operations (find(seed) and operator[](seed)) while try_emplace merges them to just one.
I propose to change

for (unsigned idx = 0; idx < referenceLayerClusters->size(); idx++) { auto seed = (*referenceLayerClusters)[idx].seed(); if (seedToIdx.find(seed) != seedToIdx.end()) { edm::LogWarning("HGCALGPUvsCPUComparisonHists") << "Duplicate seed in reference collection."; return; } seedToIdx[seed] = {idx, false}; //initialze all reference LCs as unmatched }

to

for (unsigned idx = 0; idx < referenceLayerClusters.size(); idx++) { auto [it, inserted] = seedToIdx.try_emplace(referenceLayerClusters[idx].seed(), idx, false); //initialze all reference LCs as unmatched if (!inserted) { edm::LogWarning("HGCALGPUvsCPUComparisonHists") << "Duplicate seed in reference collection."; continue; } }

At the same time, this proposal still uses one container (a std::unordered_map<uint32_t, std::pair<unsigned, bool>>) instead of two. Let me know if you would find this acceptable.

mmusich · 2026-05-21T20:29:47Z

+      hLayerCluster2D_nRecHits->Fill(reference.size(), monitored.size());
+    } else {
+      edm::LogWarning("HGCALGPUvsCPUComparisonHists") << "No match or duplicate match to reference collection found.";
+      return;


again do you really want to return here?

mmusich · 2026-05-21T20:39:12Z

+        hltHgcalLayerClustersEE+
+        hltHgcalLayerClustersHSci+
+        hltHgcalLayerClustersHSi+
+        hltMergeLayerClustersSerialSync)


overall I am a bit confused by how this sequence is written. Why are the producer instances hltHGCalUncalibRecHit , hltHGCalRecHit and the bloc hltHgcalLayerClustersEE+hltHgcalLayerClustersHSci+hltHgcalLayerClustersHSi repeated twice? Even if the framework elides the duplication is confusing to see them two times in the same sequence.

cmsbuild added this to the CMSSW_17_0_X milestone May 19, 2026

cmsbuild added hlt-pending operations-pending pending-signatures tests-pending orp-pending new-package-pending code-checks-pending labels May 19, 2026

cmsbuild added code-checks-approved and removed code-checks-pending labels May 19, 2026

mmusich reviewed May 19, 2026

View reviewed changes

cmsbuild added code-checks-pending and removed code-checks-approved labels May 20, 2026

cmsbuild added code-checks-rejected and removed code-checks-pending labels May 20, 2026

fiemmi force-pushed the ticl_dqm_GPUvsCPU_CMSSW_17_0_0_pre1 branch from 58d87d8 to ad87c6a Compare May 20, 2026 14:07

cmsbuild added code-checks-pending and removed code-checks-rejected labels May 20, 2026

cmsbuild added code-checks-rejected code-checks-pending and removed code-checks-pending code-checks-rejected labels May 20, 2026

cmsbuild added tests-rejected dqm-pending and removed tests-started new-package-pending labels May 20, 2026

waredjeb reviewed May 21, 2026

View reviewed changes

ingredients for HGCAL GPU vs CPU DQM

cd5ccdb

fiemmi force-pushed the ticl_dqm_GPUvsCPU_CMSSW_17_0_0_pre1 branch from dadb719 to cd5ccdb Compare May 21, 2026 11:39

cmsbuild added tests-pending code-checks-pending and removed tests-rejected code-checks-approved labels May 21, 2026

cmsbuild added code-checks-approved and removed code-checks-pending labels May 21, 2026

cmsbuild added tests-started and removed tests-pending labels May 21, 2026

mmusich reviewed May 21, 2026

View reviewed changes

cmsbuild added operations-approved tests-approved and removed operations-pending tests-started labels May 21, 2026

mmusich reviewed May 22, 2026

View reviewed changes

cmsbuild mentioned this pull request May 22, 2026

LST: add LSTGeometry package and associated ESProducer #50679

Open

		edm::EDGetTokenT<reco::CaloClusterCollection> tokenMonitoredLayerClusters_;
		edm::EDGetTokenT<reco::CaloClusterCollection> tokenReferenceLayerClusters_;

		const std::vector<reco::CaloCluster>& monitoredLayerClusters = iEvent.get(tokenMonitoredLayerClusters_);
		const std::vector<reco::CaloCluster>& referenceLayerClusters = iEvent.get(tokenReferenceLayerClusters_);

		layerClusters = cms.VInputTag("hltHgcalLayerClustersEE", *ceh_layerClusters),
		time_layerclusters = cms.VInputTag("hltHgcalLayerClustersEE:timeLayerCluster", *ceh_time_layerClusters),

		@@ -1,4 +1,5 @@
		import FWCore.ParameterSet.Config as cms
		from HeterogeneousCore.AlpakaCore.functions import makeSerialClone

	const std::string topFolderName;
	const std::string topFolderName_;

Conversation

fiemmi commented May 19, 2026

PR description:

PR validation:

If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for:

Uh oh!

cmsbuild commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cmsbuild commented May 19, 2026

Uh oh!

cmsbuild commented May 19, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mmusich commented May 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cmsbuild commented May 20, 2026

Uh oh!

cmsbuild commented May 20, 2026

Uh oh!

cmsbuild commented May 20, 2026

Uh oh!

cmsbuild commented May 20, 2026

Failed Unit Tests

Comparison Summary

AMD_MI300X Comparison Summary

AMD_W7900 Comparison Summary

NVIDIA_H100 Comparison Summary

NVIDIA_L40S Comparison Summary

Max Memory Comparisons exceeding threshold NVIDIA_H100

Max Memory Comparisons exceeding threshold NVIDIA_L40S

Uh oh!

cmsbuild commented May 21, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cmsbuild commented May 21, 2026

Uh oh!

cmsbuild commented May 21, 2026

Uh oh!

mmusich commented May 21, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cmsbuild commented May 21, 2026

Comparison Summary

AMD_MI300X Comparison Summary

AMD_W7900 Comparison Summary

NVIDIA_H100 Comparison Summary

NVIDIA_L40S Comparison Summary

Max Memory Comparisons exceeding threshold NVIDIA_H100

Max Memory Comparisons exceeding threshold NVIDIA_L40S

Uh oh!

mmusich left a comment

cmsbuild commented May 19, 2026 •

edited

Loading

mmusich commented May 19, 2026 •

edited

Loading

mmusich May 21, 2026 •

edited

Loading