parsa andrew siemion dan werthimer mel wright outline
play

Parsa, Andrew Siemion, Dan Werthimer, Mel Wright Outline What is a - PowerPoint PPT Presentation

Jason Manley, Aaron Parsons, Don Backer, Henry Chen, Terry Filiba, David MacMahon, Peter McMahon, Arash Parsa, Andrew Siemion, Dan Werthimer, Mel Wright Outline What is a correlator? Scalable packetized correlators: The architecture


  1. Jason Manley, Aaron Parsons, Don Backer, Henry Chen, Terry Filiba, David MacMahon, Peter McMahon, Arash Parsa, Andrew Siemion, Dan Werthimer, Mel Wright

  2. Outline • What is a correlator? • Scalable packetized correlators: – The architecture – The hardware – The software – The cost • Closing thoughts • Walk through actual design • Questions and comments

  3. Interferometry …

  4. Basic idea V k ∑  V ik Z -n ∑ 90 °  V ii Amplitude Amplitude ∑  V jk Time Time ∑  V ik V i V i ∑ ∑  V ij  V ij Z -n ∑  V jk 90 ° 90 ° ∑  V jj ∑  V kk V j V j ∑ ∑ Z -n  V ij  V ij

  5. “Actual” FX Correlator ∑ Z -n FFT ∑ Z -n FFT ∑ Z -n FFT

  6. CASPER DSP backend concept Reconfigurable Polyphase Compute Cluster Filter Banks FPGA DSP PFB ADC Module FPGA DSP Module PFB . ADC Correlator . . FPGA DSP Module FPGA DSP Commercial off-the-shelf Module . Beamformers/ Multicast 10 Gbps (10GE Spectrometers or InfiniBand) Switch . FPGA DSP Module . FPGA DSP Pulsar timer Module . . . . . . PFB General-purpose CPUs ADC

  7. Design Philosophy • Standardized processing hardware • Commercial interconnect • Asynchronous compute engines • Synchronization using common 1PPS • UDP output delivery over ethernet network • Correlator scales with your array

  8. CASPER FX Architecture F Engine 0 X Engine 0 F Engine 1 X Engine 1 . . . . . . . . . 10GbE Switch F Engine N-1 X Engine N-1

  9. Implementation F Engine 0 X Engine 0 F Engine 1 X Engine 1 . . . . . . . . . 10GbE Switch F Engine N-1 X Engine N-1

  10. Architecture to hardware mapping Example 8 Antenna system iBOB BEE2 iBOB BEE2 user FPGA BEE2 user FPGA F Eng F Eng X Eng X Eng X Eng X Eng F Eng F Eng iBOB iBOB BEE2 user FPGA BEE2 user FPGA F Eng F Eng X Eng X Eng X Eng X Eng F Eng F Eng 10GbE Switch

  11. F Engine Operations DDC Quantize Channelize Reformat X Engine ADC • Two F engines per iBOB • Dual polarization design • Currently uses ASTRO library • Currently processes data at native clock rate (<200MHz IBOB or < 400MHz ROACH)

  12. Setup and Control Clocks: • X engines each run off independent clock – Sampling synchronized at F engines, but clock not distributed to X engines – Synchronized using global 1pps signal at ADCs • Propagated to X engines using out-of-band signaling on XAUI links – Headers labeling 10GbE Ethernet packet data – System control: separate 100Mbps Ethernet network on BEE2 • F engines configured from BEEs through XAUI links • Control packets: CASPER UDP framework on BEE2 control FPGA • Execute Python scripts for configuration, control and debugging •

  13. F engine development • 2008: – Coarse delays (cable length compensation) – Fringe-stopping & fine delays – Walsh code generation and phase switching – Real sampling (low bandwidth) – Parallel streams (high bandwidth) • Future: – Ability to output subset of band – Spectral zoom modes

  14. X Engine Operations 10GbE Buffer X Eng Accum F Engine • Using CASPER library • Scales with 2^N antennas • Fit as many X engines on an FPGA as possible (2x 16 ant on BEE2 usr)

  15. Backend Software • UDP packets received • Currently received, parsed and saved in MIRIAD file format by single computer. • Computing requirements dependant on experiment; • Usually single computer ok: 128 antennas, 1 sec integrations, 2k chan = 512MB/s

  16. Pending systems • Bench sys: 8ant, DP, 200MHz, 2k ch • PAPER: 128ant, DP, 100MHz, 2k ch • KAT-7: 8ant, DP, 256MHz, 2k ch • meerKAT: 80ant, DP, 1GHz, 16k ch • Bologna: 32ant, SP, 32MHz, 1k ch • GMRT: 32ant, DP, 400MHz, 4k-8k ch

  17. How does it scale 1000000 100000 10000 1000 100 F Engines X Engines 10 1

  18. FPGA Roadmap Xilinx Virtex Family 400 Logic Cells Thousands 300 330 200 200 100 100 43 0 2000 2002 2004 2006 • Processing power doubling every two years • V4 = ½ power requirements of V2Pro* * Manufacturers claim - Xilinx Inc.

  19. Coming soon… • 10Gbps output optionally gives integrations ~10ms • More efficient use of hardware DSP slices • High speed, scalable, distributed data capture software • Walsh codes and phase switching • Phase rotation • 64 antenna design • Upgrade to 4096 channels • ROACH hardware: – <400MHz bandwidth – 16 384 channels – 128 antennas – no architectural changes

  20. Questions and Comments Visit the CASPER correlator page: http://casper.berkeley.edu/wiki/index.php?title=Correlator Add your own requirements: http://casper.berkeley.edu/wiki/index.php?title=International_ Correlator_Collaboration Email me: jason_manley@hotmail.com

  21. PFB-FFT response

  22. Current uses Pocket Spectrometer • Using ATMEL ADC’s at 2 Gsamples/sec • Performing 4 real FFT’s in 1 (complex) biplex pipelined FFT module. • 2048 channels • Uses just 1 ADC, 1 IBOB, and your laptop.

  23. ROACH block diagram

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend