processing real time lofar processing real time lofar
play

Processing Real-Time LOFAR Processing Real-Time LOFAR Telescope - PowerPoint PPT Presentation

Processing Real-Time LOFAR Processing Real-Time LOFAR Telescope Data on a Blue Gene/P Telescope Data on a Blue Gene/P John W. Romein John W. Romein Stichting ASTRON (Netherlands Institute for Radio Astronomy) Dwingeloo, the Netherlands


  1. Processing Real-Time LOFAR Processing Real-Time LOFAR Telescope Data on a Blue Gene/P Telescope Data on a Blue Gene/P John W. Romein John W. Romein Stichting ASTRON (Netherlands Institute for Radio Astronomy) Dwingeloo, the Netherlands ScicomP/SP-XXL'09 May 20, 2009 1 ScicomP/SP-XXL'09 May 20, 2009

  2. LOw w F Frequency requency AR ARray ray LO  radio telescope  10–240 MHz  unexplored  dishes infeasible  ionospheric disturbance  new design ScicomP/SP-XXL'09 May 20, 2009 2 ScicomP/SP-XXL'09 May 20, 2009

  3. A New Design A New Design  distributed sensor network  no dishes  O (10,000) antennas  omni-directional  concurrent observations  software telescope  flexible  requires supercomputer ScicomP/SP-XXL'09 May 20, 2009 3 ScicomP/SP-XXL'09 May 20, 2009

  4. LOFAR Structure LOFAR Structure  hierarchical  receiver  (tile)  station  telescope  central core  Exloo  central processing  Groningen  real time  off-line ScicomP/SP-XXL'09 May 20, 2009 4 ScicomP/SP-XXL'09 May 20, 2009

  5. LOFAR Science LOFAR Science  Epoch of Re-ionization  cosmic rays  extragalactic surveys  transients  pulsars ScicomP/SP-XXL'09 May 20, 2009 5 ScicomP/SP-XXL'09 May 20, 2009

  6. Outline Outline  from wave to image  basics  receivers  stations  real-time Blue Gene/P processing  performance  off-line processing  image ScicomP/SP-XXL'09 May 20, 2009 6 ScicomP/SP-XXL'09 May 20, 2009

  7. Reflectors vs. Phased Arrays Reflectors vs. Phased Arrays Physical delay Receiver Receiving array Combiner Artificial delay Output ScicomP/SP-XXL'09 May 20, 2009 7 ScicomP/SP-XXL'09 May 20, 2009

  8. Beam Forming Beam Forming Physical  delay determines delay observation direction  beam forming = delayed addition Receiving array  diameter determines FoV  use earth rotation Combiner Artificial delay Output ScicomP/SP-XXL'09 May 20, 2009 8 ScicomP/SP-XXL'09 May 20, 2009

  9. LOFAR Antennas LOFAR Antennas  two antenna types  Low-Band Antenna (10–80 MHz)  High-Band Antenna (110–240 MHz)  FM radio range not covered ScicomP/SP-XXL'09 May 20, 2009 9 ScicomP/SP-XXL'09 May 20, 2009

  10. Low-Band Antennas Low-Band Antennas  10–80 MHz  dual polarized ScicomP/SP-XXL'09 May 20, 2009 10 ScicomP/SP-XXL'09 May 20, 2009

  11. LBA Field LBA Field ScicomP/SP-XXL'09 May 20, 2009 11 ScicomP/SP-XXL'09 May 20, 2009

  12. HBA Tiles HBA Tiles  110–240 MHz  dual polarized  4x4 receivers = 1 tile  analogue beam forming ScicomP/SP-XXL'09 May 20, 2009 12 ScicomP/SP-XXL'09 May 20, 2009

  13. A Station A Station  48–96 LBAs  48–96 HBA tiles ScicomP/SP-XXL'09 May 20, 2009 13 ScicomP/SP-XXL'09 May 20, 2009

  14. Station Cabinet Station Cabinet  station processing ScicomP/SP-XXL'09 May 20, 2009 14 ScicomP/SP-XXL'09 May 20, 2009

  15. Remote Control Unit Remote Control Unit  2 LBAs + 1 HBA tile  filter  200 (or 160) MHz A → D conversion ScicomP/SP-XXL'09 May 20, 2009 15 ScicomP/SP-XXL'09 May 20, 2009

  16. Remote Station Processing Boards Remote Station Processing Boards  FPGAs  PPF: creates 512 * 195 KHz subbands  select up to 164 subbands  beam form LBAs/tiles  UDP packets over WAN to correlator ScicomP/SP-XXL'09 May 20, 2009 16 ScicomP/SP-XXL'09 May 20, 2009

  17. Transient Buffer Boards Transient Buffer Boards  4 sec. raw antenna data stored in TBB  trigger → freeze → dump → post analysis  not possible with dishes! ScicomP/SP-XXL'09 May 20, 2009 17 ScicomP/SP-XXL'09 May 20, 2009

  18. Stations Stations  ≤ 2009: prototypes  building real stations now  18–25 core  18–25 remote  8–20 European  dedicated fibers to correlator ScicomP/SP-XXL'09 May 20, 2009 18 ScicomP/SP-XXL'09 May 20, 2009

  19. Observation Characteristics Observation Characteristics  2 polarizations  32 MHz bandwidth from 1 mode  select 164 * 195 KHz subbands  up to 8 concurrent observations  trade bandwidth for beams ScicomP/SP-XXL'09 May 20, 2009 19 ScicomP/SP-XXL'09 May 20, 2009

  20. LOFAR Processing LOFAR Processing ScicomP/SP-XXL'09 May 20, 2009 20 ScicomP/SP-XXL'09 May 20, 2009

  21. Central Processing Pipelines Central Processing Pipelines  standard imaging mode  pulsar survey mode  known pulsar mode  transients mode  very/ultra high-energy modes  ... ScicomP/SP-XXL'09 May 20, 2009 21 ScicomP/SP-XXL'09 May 20, 2009

  22. Blue Gene History Blue Gene History  6 racks Blue Gene/L (2005–2008)  2½ rack Blue Gene/P (2008–) ScicomP/SP-XXL'09 May 20, 2009 22 ScicomP/SP-XXL'09 May 20, 2009

  23. The Blue Gene/P The Blue Gene/P  850 MHz PPC  4 cores * 2 FPUs * 1 FMA/cycle  complex numbers  3-D torus, collective, barrier, 10 GbE, JTAG networks  2½ racks = 10,880 cores = 37 TFLOP/s + 160*10 Gb/s ScicomP/SP-XXL'09 May 20, 2009 23 ScicomP/SP-XXL'09 May 20, 2009

  24. BG/P Pset BG/P Pset  I/O Nodes (ION) & Compute Nodes (CN)  ION handles I/O requests of CN  transparent  ION:CN = 1:16  64 IONs/rack ScicomP/SP-XXL'09 May 20, 2009 24 ScicomP/SP-XXL'09 May 20, 2009

  25. The BG/P Correlator The BG/P Correlator  three distributed applications/platforms  BG/P I/O nodes (ION)  BG/P compute nodes (CN)  external storage nodes ScicomP/SP-XXL'09 May 20, 2009 25 ScicomP/SP-XXL'09 May 20, 2009

  26. Application Software on I/O Node Application Software on I/O Node  unorthodox  more efficient & flexible  BG/L: saved costs; for input cluster  BG/L: major system software changes (ZOID) (thanks ANL!) [PPoPP'08]  BG/P: better support ScicomP/SP-XXL'09 May 20, 2009 26 ScicomP/SP-XXL'09 May 20, 2009

  27. I/O Node Processing I/O Node Processing  two sections  input  output  multi threaded ScicomP/SP-XXL'09 May 20, 2009 27 ScicomP/SP-XXL'09 May 20, 2009

  28. I/O Node Input Section I/O Node Input Section  ION receives from 1 station  48,828 pkt/s  handles missing packets ScicomP/SP-XXL'09 May 20, 2009 28 ScicomP/SP-XXL'09 May 20, 2009

  29. Circular Buffer Circular Buffer  circular buffer (~2.5 s)  WAN delays  delay stream  handle hiccups Δt = 22μs ≈ 4 * 5.12 μs samples ScicomP/SP-XXL'09 May 20, 2009 29 ScicomP/SP-XXL'09 May 20, 2009

  30. → Compute Node I/O Node → Compute Node I/O Node  ION sends data to CN  wall-clock time trigger  chunk  = 196,608 samples (1.007 s), 1 subband, 2 pols, 1 station ScicomP/SP-XXL'09 May 20, 2009 30 ScicomP/SP-XXL'09 May 20, 2009

  31. Compute Node Processing Compute Node Processing ScicomP/SP-XXL'09 May 20, 2009 31 ScicomP/SP-XXL'09 May 20, 2009

  32. Exchange Exchange  hundreds of Gb/s  asynchronous ScicomP/SP-XXL'09 May 20, 2009 32 ScicomP/SP-XXL'09 May 20, 2009

  33. PolyPhase Filter PolyPhase Filter  splits subband into channels  time vs. frequency resolution  FIR filter + FFT  allows narrow-band RFI removal ScicomP/SP-XXL'09 May 20, 2009 33 ScicomP/SP-XXL'09 May 20, 2009

  34. Phase Correction Phase Correction  correct observation direction  already shifted samples — correct rest  interpolate Δt = 22μs = 4 * 5.12 μs samples + e -2iπ f *1.52 ScicomP/SP-XXL'09 May 20, 2009 34 ScicomP/SP-XXL'09 May 20, 2009

  35. Band Pass Correction Band Pass Correction  channel powers unequal  caused by station PPF power  correct channel ScicomP/SP-XXL'09 May 20, 2009 35 ScicomP/SP-XXL'09 May 20, 2009

  36. Beam Forming Beam Forming  add group of stations to form “superstation”  optional ScicomP/SP-XXL'09 May 20, 2009 36 ScicomP/SP-XXL'09 May 20, 2009

  37. Correlate Correlate  filters noise  multiply samples of all pairs of stations  integrate over time ScicomP/SP-XXL'09 May 20, 2009 37 ScicomP/SP-XXL'09 May 20, 2009

  38. Correlator Output Correlator Output 60.1 frequency (MHz) 59.9 0 time (h) 9  correlations between two stations  color = phase, intensity = power  combined contribution of (strong) sources  earth rotation changes phase ScicomP/SP-XXL'09 May 20, 2009 38 ScicomP/SP-XXL'09 May 20, 2009

  39. Work Distribution Work Distribution  process subbands independently  stations must be combined  chunk needs > 1 second processing time  round-robin distribution  receive, process, send, idle  OVERLY SIMPLIFIED! ScicomP/SP-XXL'09 May 20, 2009 39 ScicomP/SP-XXL'09 May 20, 2009

  40. I/O Node Output Section I/O Node Output Section  (adds correlations)  best-effort queue  ensures real-time continuation of correlator ScicomP/SP-XXL'09 May 20, 2009 40 ScicomP/SP-XXL'09 May 20, 2009

  41. I/O Node Real-Time Scheduling I/O Node Real-Time Scheduling  use Linux RT scheduler ScicomP/SP-XXL'09 May 20, 2009 41 ScicomP/SP-XXL'09 May 20, 2009

  42. I/O Node Memory I/O Node Memory PPC 450: software TLB-miss handler [P2S2'09]  Linux: slows down applications by 40%−300%  modified kernel to provide 6 * 256 MiB “fast” pages (thanks ANL!)  ScicomP/SP-XXL'09 May 20, 2009 42 ScicomP/SP-XXL'09 May 20, 2009

  43. Storage Storage  correlations saved on disk  external cluster  ~1 PB  post-processed within week ScicomP/SP-XXL'09 May 20, 2009 43 ScicomP/SP-XXL'09 May 20, 2009

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend