Energy Efficient Adaptive Beamforming
- n Sensor Networks
Viktor K. Prasanna Bhargava Gundala, Mitali Singh
- Dept. of EE-Systems
University of Southern California email: prasanna@usc.edu http://ceng.usc.edu/~prasanna http://pacman.usc.edu
Energy Efficient Adaptive Beamforming on Sensor Networks Viktor K. - - PowerPoint PPT Presentation
Energy Efficient Adaptive Beamforming on Sensor Networks Viktor K. Prasanna Bhargava Gundala, Mitali Singh Dept. of EE-Systems University of Southern California email: prasanna@usc.edu http://ceng.usc.edu/~prasanna http://pacman.usc.edu
University of Southern California email: prasanna@usc.edu http://ceng.usc.edu/~prasanna http://pacman.usc.edu
1
Passive Active
2
3
Each CPI (Coherent Processing Interval) 1 2 L Range gates 1 N Elements Pulse Repetition Interval N L M P R I s
4
N (22) M (64) (1920) L
.. .. . .
5
6
FFT Adaptive Sampling Rate Conventional Beamforming Beam- FFT Adaptive Beam- 100 ~5000 Beams Frequency Domain Time Domain per Output Output Rate =1 Hz~100 Hz Element forming forming Space Beam Space =10 Hz~25 KHz
7
Frequency Bins s
FFT Corner Linear Solver
N F N F F N F
Factorization
Covariance Steering
B F N B N N
& Beamformer Turn
Channel Beams per Bin
8
S1 S2 S3 S4 D A T A D A T A D A T A D A T A
9
adaptation apply
10
Target detection
11
87 ~ 253 nsec
1~2 sec (SHMEM)
~ 9 nsec/byte/node transfer rate
µ µ µ µ
12
S 1 S2 S3
Data Access Pattern
P0 P 0 P0 P 3 P3 P3
Remap?
13
Our Results Results reported in IPPS ‘95
14
15
Energy Constrained
Network
Sensors
16
Power Total = Power Processor +Power Data bus + Power Memory Power unit = Power Dynamic + Power Static = 0.5f(n)CV2fActive + VI Leakage Fmax ∝ (V-Vt)/V
Processor Processor
17
4.30 MOV [BX] DX 3.53 MOV DX [BX] 2.49 MOV DX BX Energy (10-8 Joules) Instruction (Intel 486DX2)
18
Do i = 0 ; Do j = 0 ; A[i,j]
Do k = 0 ;
A[i, j]
A B
i j i k
C
k j
19
20
Time = Data =
2 3
2
21
Block Layout Row major Layout a 0,0 a 1,0 a 2,0 a 3,0 a 0,1 a 1,1 a 2,1 a 3,1 a 0,2 a 1,2 a 2,2 a 3,2 a 0,3 a 1,3 a 2,3 a 3,3 Page 0 Page 1 Page 2 Page 3 a 0,0 a 1,0 a 2,0 a 3,0 a 0,1 a 2,1 a 3,1 a 0,2 a 1,2 a 2,2 a 3,2 a 0,3 a 1,3 a 2,3 a 3,3 Page 0 Page 1 Page 2 Page 3 a 1,0 22
23
Step1: compute N2 DFTs of size N1 Step2: multiply twiddle factors Step3: compute N1 DFTs of size N2
Determine optimal factorization Perform low level optimizations for kernels Construct larger size FFTs from kernels
All DFTs of same size have same execution time
24
Sun Ultra 1: 167MHz, L2 Cache = 512 KB = 32 K points
N = 32
10 20 30 40 50 60 70 5 10 15 20 Stride (2^s) Execution Time (usec)
25
N1-point FFTs N2 Data Reorganization N2-point FFTs N1
26
Decomposition trees for a 1024*1024 point FFT
27
3.26 MUL 3.26 OR Energy (10-8 Joules) Instruction (Fujitsu Sparc‘934)
28
31
c c B11 B12 B1N A11 A12 A1N
= computation for result (i,j) c = cache size
33
34
Computation cost is much lower than communication cost Radio interface consumes a large amount of power Energy to transfer 32 bits over 100m in WINS sensor node =( (600 +300)mw ÷ 100kbits/s) x 32 = 288 x 10 –6 Joules Energy to execute a 32 bit instruction using SA1100 processor = 1 ÷ 250 MIPS/watt = 0.004 x 10 –6 Joules Additional overhead for bits added for error correction Retransmissions are frequent due to unreliable links(e.g.wireless)
300mw Reception 250MIPS/watt Processor (SA1100) Transmission(100m) POWER Consumed 600mw (at 100kbits/sec) WINS sensor Node
29
30
32