RoI feedback and HLT in DESY-TB R.Itoh, KEK Trigger Belle II DAQ - - PowerPoint PPT Presentation

roi feedback and hlt in desy tb
SMART_READER_LITE
LIVE PREVIEW

RoI feedback and HLT in DESY-TB R.Itoh, KEK Trigger Belle II DAQ - - PowerPoint PPT Presentation

RoI feedback and HLT in DESY-TB R.Itoh, KEK Trigger Belle II DAQ System FTSW FTSW FTSW FE PXD readout box PXD dig (ONSEN) reco DATCON rder HLT ~250 COPPERs distributor ~40 R/O PCs reco FE tx SVD Event Builder 2 rder dig


slide-1
SLIDE 1

RoI feedback and HLT in DESY-TB

R.Itoh, KEK

slide-2
SLIDE 2

FE dig FE dig FE dig FE dig FE dig tx tx tx tx tx rx rx rx rx R/O PC R/O PC R/O PC R/O PC Event Builder 1 ~250 COPPERs HLT (>6400 cores) ~40 R/O PCs Rocket IO

  • ver fiber

CDC SVD PID ECL KLM Event Builder 2 PXD FE dig PXD readout box (ONSEN) HLT distributor reco rder reco rder reco rder reco rder ...... ..... DATCON

Belle2link

Express Reco FTSW Trigger FTSW FTSW

Belle II DAQ System

HLT (10-20 units) (>= 2 units)

slide-3
SLIDE 3
  • 1. Stable operation of tracking code (VXDTF and VXDTF2).

* In the 2nd beam test, we had a lot of HLT crashes because of the segmentation fault in the tracking code. * Self-recovery mechanism was implemented and tested.

  • 2. Real time RoI (Region of Interest on PXD surface) generation

and feedback to PXD readout subsystem * In addition to RoI obtained from tracks, “dummy RoI”s are added for the monitoring purpose.

  • 3. Real time DQM histogram accumulation and live histogram transfer

to the monitor node. * DQM codes for SVD hit-map, tracking quality check, etc. are implemented.

  • 4. Test of Software Trigger scheme

* Just a test of putting selection tag in output objects. * No selection at all.

Test of HLT operation in 3rd DESY test beam (Feb-Mar)

Goals:

slide-4
SLIDE 4

DAQ for DESY beam test

COPPER COPPER FTB

FTSW

ttdcpu

DHP DHE DHC

Onsen R/O PC 1

hltin hltctl hltwn1 hltwn2 hltwn3 hltout

R/O PC 2

ExpReco

Event display FTB

FADC FADC

TLU

swi tch switch

SVD 4 layers PXD OB 2 scinti.

coincidence POCKET DAQ HLT

HLT RoI

PXD R/O

DATCON RoI

EVB2 Storage ExpressReco Belle2link “Pocket TTD”

“Full functionality of Belle II DAQ”

DATCON DHP DHE DHC PXD IB

DQM server

Histo browserr

DQM histos

slide-5
SLIDE 5

HLT architecture in DESY DAQ

basf2 -p 8 basf2 -p 8 basf2 -p 8 basf2 Socket Receiver RingBuffer Socket Transmitter EventBuilder1(R/O PC1) EventBuilder2(R/O PC2)

Raw data formatter module DataStore streamer module DataStore destreamer module DataStore streamer+ encapsulater

RoI to PXD R/O

RoI sender module

8 cores (3GHz) x 3 PC servers 12 cores(2.4GHz) 12 cores(2.4GHz)

belle-hltin belle-hltwn{1,2,3}

slide-6
SLIDE 6

T.Konno

slide-7
SLIDE 7

Why did the HLT performance degrade as a function of time?

  • The performance degradation first observed when operated

with VXDTF + SVDDQM.

  • At the time, the SVDDQM was considered to be the cause

because of a large CPU consumption, and moved to ExpReco. The observed performance degradation was gone by the fix.

  • Still the degradation was observed and “tuning” was done.

* Various delay parameters in RingBuffer queuing were adjusted.

  • > The degradation was gone and the stable operation up to 16

hours was confirmed. No HLT crash occurred.

  • The problem occurred again when switching tracking software to

VXDTF2. Observations:

slide-8
SLIDE 8
  • The same DESY software environment was ported to KEK test

bench (real HLT : HLT unit 3) to reproduce the degradation.

  • The same HLT configuration (8 cores x 3 servers with input/output)

was built.

  • The recorded data were fed into the same HLT processing chain at

a Poisson distributed rate (3kHz input).

  • The degradation was reproduced on the test bench.
  • After detailed investigations, the cause was pinned down.

1) The duration of DQM histogram transfer over the network was too frequent. The socket buffer became full gradually and the event processing was blocked until the buffer becomes available. 2) The other DQM histograms(TTrees, Tuples) were dumped to a file at the same interval. It fully occupied the NFS bandwidth and slowed down the network speed.

  • By reducing the frequency, the degradation was gone.
slide-9
SLIDE 9

Input : EventMetaData + RawSVDs + RawFTSWs Input rate : Poisson 3kHz HLT processing : VXDTF2 + TrackDQM + RoI extraction Histogram dump rate : once / 1000 events once / 10000 events HLT input buffer is always empty. HLT input buffer is full * Histogram dump was performed for every 1000/8 for each core...

slide-10
SLIDE 10

RoI Generation in HLT at DESY

  • Tracking of beam was done by VXDTF(2) on each of 24 cores

in HLT.

  • From the reconstructed track information, RoIs are extracted

by “PXDDataReduction” module on 24 cores.

  • In addition, “dummy RoIs” are added for debugging purpose.

* Generated by multiple “ROIGenerator” module * “full frame” RoIs for the debugging purpose for every 1000 events. * The RoIs are packed by “ROIPayloadAssembler” module

  • On the HLT output node, ROIs are extracted by “ROISender”

module and put them in message queues.

  • The ROIs are then sent to ONSEN through network socket.
slide-11
SLIDE 11

RoI transport to ONSEN hltout2merger

Destreamer RoI Sender

basf2

Streamer

rb2mrb 1,4,7...

2,5,8...

3,6,9...

1,4,7...

2,5,8...

3,6,9...

RoI: 1, 2, 3, 4, ..... HLTOUT: 1, 2, 3, 4, ..... mrb2rb

3 basf2's run in different processes * rb2mrb, mrb2rb, and hltout2merger distribute/pick up records in turn to/from ringbuffers/mqueues in the same order. mqueue mergermerge

ONSEN

hltwn1-3

hltout

rpc2

slide-12
SLIDE 12

Management of HLT script

  • In the previous beam test at DESY, the management of the HLT

script was chaotic and it caused a problem in taken data. * Wrong dummy RoIs were sent to PXD readout box and the taken data were sometimes useless.

  • Lesson : the HLT script should be modified and checked by

experts (not DAQ operators) before implemented in HLT.

  • We introduced “git” based management of HLT scripts.

* Ask experts to check and update the script in git. * The latest script is “pulled” to HLT and used. * The modification history is tracked by git. * The script is managed as a part of Belle2 software library

  • The scheme worked well in the test beam.

* RoI generation : Klemens, Giulia.... * Tracking : Thomas Lueck, Eugenio, Tobias.... * I/O and DQM management : me

slide-13
SLIDE 13

Multi-layered Live Histogram Collection for DQM in DESY-TB DAQ

input mod. Dqm Histo Manager mod1 mod2 modn

x 8 cores hserver basf2/core belle-hltwn1 to 3 HLT worker nodes TMemFile hrelay belle-hltin hserver TMapFile hrelay

histogram collection from 3 servers

belle-dqm hserver TMemFile Express Reco DQM Browser

socket connection

slide-14
SLIDE 14

Real time DQM browsing in DESY TB

PXD histograms accumulated in ExpressReco SVD histograms accumulated in HLT

slide-15
SLIDE 15

Express Reco and Event Display

storage

basf2 Storage node (“belle-rpc2”)

trans mitter free running ringbuffer

* with scaling (1/10) <- can be removed (as much as possible basis)

receiver

PXD unpk

input mod. Dqm Histo Manager

PXD DQM Samp ler

eventserver

basf2 Express Reco node (“belle-reco”)

recv. evdisp

“belle-dqm” hserver@belle-dqm

  • Event processing at Express Reco

* ~100Hz * Simple PXD monitoring only

  • > more complex monitoring with PXD+SVD tracking was possible, but

no time remained to make it work.

slide-16
SLIDE 16
  • 1 秒間に2000回のデータを実時間処理して粒子の飛跡を求め、内側のセンサーの

どこを通ったかを求める。

  • 飛跡が通ったまわりだけのデータを後段に送る。
slide-17
SLIDE 17

Summary

  • HLT and ExpReco framework were confirmed to work in the

DESY test beam runs.

  • A real time RoI feedback to ONSEN was proven to work.
  • The optimization of operation parameters have to be done carefully

in coming phase 2 operation.

  • The management of HLT software and scripts with “git” was

tested in DESY-TB and confirmed to be effective. The imple- mentation in the on-going cosmic ray run is in progress.