Compressing Coldbox Data
Ivan K. Furic, Remington Gerras University of Florida
ProtoDUNE-SP TDR: lossless compression factor = 4
Implies reduction from 12 bits per ADC readout to 3 bits per ADC readout
In the rest of this talk: not compression factors, only average bits / ADC readout
[effective compression factor 2.1, half of the TDR spec]
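RMS of a flat distribution of width $w$: $\sigma = w / \sqrt{12}$;
$w = \sigma \sqrt{12}$ is the width of a flat distribution needed for a given $\sigma$
$\log_2(w) = \log_2(\sigma) + \tfrac{1}{2} \log_2 12 = \log_2(\sigma) + 1.8$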
To compress into 3 bits per ADC readout, the RMS can’t be more than 2.3 ADC counts ($2^{3 - 1.8} \approx 2.3$)
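The flat-distribution estimate can be checked numerically. This is a minimal sketch, assuming only the relation width = RMS × √12 used above; the function names `flat_bits` and `max_rms` are illustrative and not from the talk.

```python
import math

def flat_bits(rms):
    """Bits needed to index a flat distribution with the given RMS: log2(rms * sqrt(12))."""
    return math.log2(rms * math.sqrt(12.0))

def max_rms(bits):
    """Largest RMS (in ADC counts) for which the flat-distribution estimate fits in `bits`."""
    return 2.0 ** bits / math.sqrt(12.0)

# Largest RMS that still fits a given bit budget under this estimate.
for bits in (3, 4, 5):
    print(bits, round(max_rms(bits), 2))
```

With the unrounded constant (½ log2 12 ≈ 1.79) the 3-bit limit comes out at about 2.31 ADC counts, consistent with the 2.3 quoted above.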
For symbols with frequencies $p_i$, the minimum average number of bits per symbol is the (Shannon) entropy: $H = -\sum_i p_i \log_2 p_i$
C. E. Shannon, "A Mathematical Theory of Communication", Bell System Technical Journal (1948)
Huffman coding achieves (close to) Shannon entropy level of performance
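As an illustration, the entropy of an empirical symbol distribution (minus the sum of p_i log2 p_i over the observed symbols) can be computed directly from a list of samples. This is a minimal sketch; the function name and the toy sample are assumptions, not data from the talk.

```python
import math
from collections import Counter

def shannon_entropy_bits(samples):
    """Entropy in bits per symbol of the empirical distribution of `samples`."""
    counts = Counter(samples)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

# Toy sample of ADC differences (made-up values, for illustration only).
toy_differences = [0, 0, 0, 0, 1, 1, -1, -1]
print(shannon_entropy_bits(toy_differences))
```

The narrower the distribution of the encoded variable, the lower this number, which is why the talk looks at variables with a smaller RMS than the raw ADC counts.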
compress into 3 bits
compress into 4 bits
compress into 4.6-5.0 bits
Raw ADC counts: $X_n$
Difference wrt previous count: $X_n - X_{n-1}$
Difference wrt linear prediction (based on previous two counts): $X_n - 2X_{n-1} + X_{n-2}$
[Figure panels: Raw ADC Counts, Difference, Linear prediction]
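The three candidate variables can be sketched as follows. This assumes the linear prediction is the straight-line extrapolation 2·X(n-1) − X(n-2) from the previous two counts; how the first one or two samples of a waveform are handled is not stated in the talk, so they are simply skipped here. The waveform values are made up.

```python
def differences(x):
    """X_n - X_{n-1}, defined from the second sample on."""
    return [x[n] - x[n - 1] for n in range(1, len(x))]

def linear_prediction_residuals(x):
    """X_n - (2*X_{n-1} - X_{n-2}), defined from the third sample on."""
    return [x[n] - 2 * x[n - 1] + x[n - 2] for n in range(2, len(x))]

waveform = [2048, 2050, 2052, 2055, 2054]  # made-up ADC counts
print(differences(waveform))                  # [2, 2, 3, -1]
print(linear_prediction_residuals(waveform))  # [0, 1, -4]
```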
Values outside the encoded range: send a “value outside range” symbol, followed by the full 12-bit value
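One way to picture the escape mechanism is a Huffman code over a limited symbol range plus one extra escape symbol. This is a sketch under stated assumptions, not the authors' implementation: the `ESC` marker, the ±7 range, and the function names are hypothetical; only the "escape symbol followed by the full 12-bit value" idea comes from the slide.

```python
import heapq
from collections import Counter

ESC = "ESC"  # hypothetical marker for "value outside range"

def huffman_code_lengths(freqs):
    """Return {symbol: code length in bits} for a Huffman code built from `freqs`."""
    if len(freqs) == 1:
        return {sym: 1 for sym in freqs}
    # Heap entries: (weight, unique tie-breaker, {symbol: depth so far}).
    heap = [(w, i, {sym: 0}) for i, (sym, w) in enumerate(freqs.items())]
    heapq.heapify(heap)
    next_id = len(heap)
    while len(heap) > 1:
        w1, _, d1 = heapq.heappop(heap)
        w2, _, d2 = heapq.heappop(heap)
        # Merging two subtrees pushes every symbol in them one level deeper.
        merged = {s: d + 1 for s, d in {**d1, **d2}.items()}
        heapq.heappush(heap, (w1 + w2, next_id, merged))
        next_id += 1
    return heap[0][2]

def average_bits(residuals, max_abs=7, raw_bits=12):
    """Average bits per ADC word when |residual| > max_abs is sent as ESC + raw value."""
    symbols = [r if abs(r) <= max_abs else ESC for r in residuals]
    lengths = huffman_code_lengths(Counter(symbols))
    total = sum(lengths[s] + (raw_bits if s == ESC else 0) for s in symbols)
    return total / len(symbols)

# Made-up residuals: mostly small values plus one out-of-range spike.
print(average_bits([0, 0, 0, 0, 1, 1, -1, 100]))  # 3.25
```

Rare out-of-range values cost the escape code plus 12 bits each, so the average stays low as long as such values are infrequent.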
entropy
specific Huffman Trees
(random) Huffman Tree for all data
Distributions of avg bits per ADC word
(avg less than 4 bits per ADC word)
[Figure panels: Encode Raw Values, Encode Differences, Encode wrt Linear Prediction]
[Figures: Raw ADC Channel-Channel Correlation Factor, Run #1287 and Run #973]
Run Number     Difference,    Difference,   Linear Prediction,  Linear Prediction,  Size wrt
               Custom Trees   Single Tree   Custom Trees        Single Tree         TDR Spec
1287           72.5 MB        73.4 MB       71.5 MB             72.2 MB             +25%
0973 (scaled)  70.3 MB        71.1 MB       70.3 MB             70.4 MB             +22%
events (simplifies firmware implementation)
(further reduce avg RMS by 10%, i.e. 5% better compression)
shrink target variable RMS
(e.g. entire run 1287)