[PPT] - Lecture 7: MIMO Capacity and Multiplexing Architectures PowerPoint Presentation

SLIDE 1

Lecture ¡7: ¡MIMO ¡Capacity ¡and ¡ Multiplexing ¡Architectures

I-Hsiang Wang ihwang@ntu.edu.tw 5/15, 2014

SLIDE 2

Design ¡of ¡MIMO ¡Systems

Regarding MIMO, what we have done so far:
Established solid foundation on the statistical channel modeling
Analyzed AWGN (no fading) MIMO capacity
Indeed, MIMO is capable of the following:
Multiplex multiple data streams simultaneously
Provide spatial diversity
Increase power gain
What’s next:
Derive MIMO capacity under fading
Design transceiver architectures to extract multiplexing gain,

diversity gain, and power gain

2

SLIDE 3

Plot

Derive capacity of fading MIMO channel
Fast fading: CSIR only and full CSI
Slow fading: outage probability
Discuss the nature of performance gains
Introduce transceiver architectures for fast fading
The V-BLAST family
Introduce a transceiver architecture for slow fading
D-BLAST

3

SLIDE 4

Outline

Capacity of fast fading MIMO
V-BLAST
Receiver architectures:
Linear filters: decorrelator, matched filter, MMSE
Successive interference cancellation (SIC)
Outage probability of slow fading MIMO
D-BLAST

4

SLIDE 5

5

Fast ¡Fading ¡MIMO ¡Channel

SLIDE 6

Fast ¡Fading ¡MIMO ¡with ¡Full ¡CSI

Channel model: y[m] = H[m]x[m] + w[m]
{H[m]}: random fading process which is stationary and ergodic
With full CSI, Tx and Rx can perform pre- and post-

processing based on the SVD of H[m] at each time:

H[m] = U[m]Λ[m]V[m]*
Convert the fading MIMO into a fading parallel channel:

(nmin := min{nt,nr})

Water-filling to find the optimal power allocation

6

e yi[m] = λi[m]e xi[m] + e wi[m], i = 1, 2, . . . , nmin

SLIDE 7

Ergidic ¡Capacity ¡with ¡Full ¡CSI

Capacity via water-filling:

7

V*[m]

...

U[m] V[m] U*[m] λ1[m] e w1[m] λnmin[m] e wnmin[m] x[m] y[m] e x[m] e y[m] CMIMO =

nmin

X

i=1

E  log ✓ 1 + λ2

i P ∗(λi)

σ2 ◆ P ∗(λ) = ✓ ν − σ2 λ2 ◆+ , ν satisfies

nmin

X

i=1

E "✓ ν − σ2 λ2

i

◆+# = P

SLIDE 8

Transceiver ¡Architecture ¡with ¡Full ¡CSI

8 + AWGN coder AWGN coder {x1[m]} ~ {y1 [m]} ~ {xnmin[m]} ~ {ynmin[m]} ~ . . . . . . . . . n min information streams {0} {0} {w[m]} U* H V Decoder Decoder

pre-processing post-processing

SLIDE 9

Receiver ¡CSI ¡Only: ¡V-‑BLAST

Tx cannot apply the pre-processing matrix V
V-BLAST architecture:
Tx prepares nt data streams, each encoded with a rate Ri coder
Generate x by multiplying them with a unitary matrix Q
Rx carries out joint decoding of the streams (eg., ML)
We will discuss Rx architecture later

9

+ Pnt P1 Q x[m] H[m] w[m] y[m] Joint decoder AWGN coder rate R1 AWGN coder rate Rnt

· · · · · · · · · · · ·

SLIDE 10

Capacity ¡of ¡Fast ¡Fading ¡MIMO ¡with ¡CSIR

Using information theoretic arguments (or a sphere

packing argument), one can show that V-BLAST can achieve the capacity, which is given by the following:

Kx := the covariance matrix of transmit signal vector x
With V-BLAST,
The issue boils down to finding the optimizing Kx for a

given stationary distribution of H

10

Kx = Q diag (P1, . . . , Pnt) Q∗ C = max

Kx:Tr(Kx)≤P E

 log det ✓ Inr + HKxH∗ σ2 ◆

SLIDE 11

Multiplexing ¡in ¡the ¡Angular ¡Domain

In V-BLAST, Q can be thought of as the coordinate

system onto which Tx modulates its data streams

The question is, which coordinate system Q should be used?
The choice of Q (and the power allocation {P1, … ,Pn}) depends on

the statistical property of H, so let’s focus on the angular domain representation: Ha = Ur*HUt

Under rich scattering, entries of Ha are statistically

independent and zero-mean ⟹ it is reasonable to multiplex data on the coordinate system (indeed, optimal. HW.)

Choose Q = Ut and hence Kx = Ut ΛpUt*.
Still need to determine the power allocation Λp

11

SLIDE 12

Uniform ¡Power ¡Allocation

If further symmetry is present the random Ha, uniform

power allocation (

) turns out to be optimal
Sufficient condition: the nt column vectors of Ha are i.i.d.
For example, i.i.d. Rayleigh faded Ha.
Hence the covariance matrix Kx = Ut ΛpUt* =
This gives us the capacity formula: (SNR := P/σ2)
In this case, Q can be any unitary matrix; in particular, it

suffices to choose Q = Int

12

Λp = P

nt Int P nt Int

C = E  log det ✓ Inr + SNR nt HH∗ ◆

SLIDE 13

V-‑BLAST ¡under ¡i.i.d. ¡Rayleigh

13

+ Pnt P1 Q x[m] H[m] w[m] y[m] Joint decoder AWGN coder rate R1 AWGN coder rate Rnt

· · · · · · · · · · · ·

I

Effectively, each of the Tx antennas, say, antenna i,

transmits an independent data stream of rate Ri

How to determine Ri?
For joint ML, it does not matter as long as ΣRi = the total capacity
For other Rx architectures (later), the individual rate depends on

the effective channel it faces with, after the MIMO detector

P1 = P2 = · · · = Pnt = P

nt

SLIDE 14

Receiver ¡CSI ¡vs. ¡Full ¡CSI

Capacity formula can be rewritten using singular values
f the random matrix H:
No CSIT ⟹ water-filling is not possible
Recall the capacity with full CSI:

14

λ1 ≥ λ2 ≥ · · · ≥ λnmin ≥ 0 CCSIR =

nmin

P

i=1

E h log ⇣ 1 + SNR

nt λ2 i

⌘i CFull CSI =

nmin

P

i=1

E ⇥ log

1 + SNR∗(λi)λ2

i

⇤

SNR∗(λ) =

ν −

1 λ2

+, ν satisfies

nmin

P

i=1

E ⇣ ν −

1 λ2

i

⌘+ = SNR

SLIDE 15

DoF ¡Gain ¡at ¡High ¡SNR

15

nt = nr = 1 nt = nr = 4 nt = 1 nr = 4 C (bits /s / Hz) 35 30 25 20 15 10 5 –10 10 20 30 SNR (dB) nt = nr = 1 nt = nr = 8 nt = 1 nr = 8 C (bits /s / Hz) 70 60 50 40 30 20 10 –10 10 20 30 SNR (dB)

High SNR regime: nmin-fold DoF gain
nmin := min{nt,nr} determines the high-SNR slope

CCSIR ≈ nmin log ⇣

SNR nt

⌘ +

nmin

P

i=1

E ⇥ log

λ2

i

⇤ CFull CSI ≈ nmin log ⇣

SNR nmin

⌘ +

nmin

P

i=1

E ⇥ log

λ2

i

⇤

SLIDE 16

Power ¡Gain ¡at ¡Low ¡SNR

16

nr determines the power gain under CSIR
If CSIT is available, power gain is larger due to both

beamforming and dynamic power allocation

C C1,1 (bits / s / Hz) 4 3.5 2.5 3 10 –10 –20 –30 nt = 1 nr = 4 nt = nr = 4 SNR (dB) C C1,1 (bits / s / Hz) 8 7 6 5 4 3 SNR (dB) 10 –10 –20 –30 nt = 1 nr = 8 nt = nr = 8

= nrSNR log2 e CCSIR ≈

nmin

P

i=1 SNR nt E

⇥ λ2

i

⇤ log2 e = SNR

nt E [Tr (HH∗)] log2 e

SLIDE 17

Nature ¡of ¡Peformance ¡Gain ¡(CSIR ¡only)

High SNR (DoF-limited):
min{nt,nr}-fold DoF gain
capacity scaling linearly with nmin := min{nt,nr}
MIMO is crucial
Low SNR (Power-limited):
nr-fold power gain
capacity scaling linearly with nr
Only need multiple Rx antennas
At moderate SNR
min{nt,nr}-fold gain
Due to a combination of both effects

17

SLIDE 18

18

Receiver ¡Architectures

SLIDE 19

Decoding ¡at ¡the ¡Receiver

In the previous V-BLAST architecture, Rx uses ML:
ML is optimal
But the complexity grows exponentially with the # of data streams
A natural approach:
First separate the signal of each data stream from others with

certain linear operations

Then decode each data stream using single-user decoder
In the following we focus on Rx architectures that use

linear operations in the first step

Assuming V-BLAST with Q = Int , that is, each Tx antenna sends

an independent data stream

If not, we can just lump Q into the channel matrix H

19

SLIDE 20

Decorrelator: ¡Interference ¡Nulling

Rewrite the received signal vector y as follows:
xi denotes the signal sent from the i-th Tx antenna
hi denotes the i-th column of channel matrix H, representing the

signal direction of xi .

To decode xi , a simple idea is to use a decorrelator:
First null out interference by projecting y onto the null space of

the directions of all interfering vectors {hj | j≠i}

Then apply matched filter to the projected signal
The Rx architecture consists of a bank of decorrelators.
Also called interference nulling, zero forcing, etc.

20

y = hixi + X

j6=i

hjxj + w

SLIDE 21

Bank ¡of ¡Decorrelators

21

Decorrelator for stream nt Decorrelator for stream 2 Decorrelator for stream 1 y[m]

e y1 := (Q1h1)∗ (Q1y) e y2 := (Q2h2)∗ (Q2y) Rows of Qk form a

rthonormal basis
f the null space
f {hj | j≠k}.

nulling matched filter after projection Note: for successful decorrelation, nt ≤ nr and hence nmin = nt Rk = E ⇥ log

1 + Pk

σ2 ||Qkhk||2⇤

SLIDE 22

Performance ¡in ¡i.i.d. ¡Rayleigh

Uniform power allocation ⟹ the overall achievable rate:
At high SNR:
At low SNR: lose the power gain (homework)
The decorrelator fully extracts the spatial multiplexing

gain of V-BLAST, but not the power gain

22

Rdecorr =

nmin

P

k=1

E h log ⇣ 1 + SNR

nt ||Qkhk||2⌘i

Rdecorr ≈ nmin log ⇣

SNR nt

⌘ +

nmin

P

k=1

E ⇥ log

||Qkhk||2⇤

CCSIR ≈ nmin log ⇣

SNR nt

⌘ +

nmin

P

i=1

E ⇥ log

λ2

i

⇤

SLIDE 23

Performance ¡Gap ¡from ¡Capacity

23

Decorrelator Capacity 70 60 50 40 30 20 10 –10 –5 5 10 15 20 25 30 bits / s / Hz SNR (dB)

Constant gap due to loss of power gain; quite substantial

SLIDE 24

Removing ¡the ¡Gap: ¡SIC

Recall in Lecture 5, to achieve the uplink capacity, we

use successive interference cancellation (SIC) at Rx

Similar idea can be applied here to remove this gap
The only difference from the previous derivation
The nulling projection Qk is replaced by
The rows of

now form an orthonormal basis of the null space

f {hj | j > k}.
Indeed this removes the gap at high SNR

24

e Qk e Qk

SLIDE 25

Successive ¡Interference ¡Cancellation

25

Decode stream nt Decode stream 3 Decode stream 2 Decode stream 1 Decorrelator 2 Decorrelator 3 Decorrelator nt Decorrelator 1 Stream nt Stream 1 Subtract stream 1, 2, ..., nt –1 y[m] Stream 3 Subtract stream1, 2 Subtract stream1 Stream 2

SLIDE 26

Zero ¡Forcing ¡vs. ¡Matched ¡Filtering

Decorrelator (zero forcing): remove all interference at the

expense of reducing the received SNR

Matched filter: projecting onto hi to maximize the SNR

but SINR may be bad

26

y = hixi + X

j6=i

hjxj + w

SLIDE 27

Zero ¡Forcing ¡vs. ¡Matched ¡Filtering

27

Decorrelator Matched fillter SNR (dB) 20 30 0.1 0.8 0.9 0.7 0.6 0.5 0.4 0.3 0.2 –30 –20 –10 10 1

Low SNR: power-limited, interference not significant
High SNR: interference-limited, power not significant

R C

SLIDE 28

Optimal ¡Linear ¡Filter: ¡MMSE

28

y = hixi + X

j6=i

hjxj + w

The optimal linear filter should maximize SINR at all SNR
It offers the optimal compromise between ZF and MF
Procedure:
First whiten the aggregate interference + noise
The apply MF since matched filter is optimal for white noise
MMSE filter:

zi := P

j6=i

hjxj + w e y1 := ⇣ K

− 1

2

z1 h1

⌘∗ ⇣ K

− 1

2

z1 y

⌘ whitening matched filter after whitening =

K−1

z1 h

∗ y

SLIDE 29

Geometric ¡Picture

29 Interference subspace Decorrelator Optimal filter Signal direction (matched filter)

Low SNR: MMSE → ZF
High SNR: MMSE → MF

SLIDE 30

Linear ¡MMSE: ¡Performance

For the k-th stream, its data rate is
Since
, we have
Hence the achievable rate can be computed
We can further improve the performance of the MMSE

receiver by using SIC

30

Rk = E h log ⇣ 1 + Pk||K

− 1

2

zk hk||2⌘i

= E ⇥ log

1 + Pkh∗

kK−1 zk hk

⇤ zk := P

j6=k

hjxj + w Kzk := σ2Inr + P

j6=k

Pjhjh⇤

j

SLIDE 31

Linear ¡MMSE: ¡Performance

31

Decorrelator 20 10 –10 –20 –30 30 SNR (dB) MMSE Matched filter 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1

R C

Low SNR: MMSE → ZF
High SNR: MMSE → MF

SLIDE 32

MMSE-‑SIC

32

Subtract stream 1, 2, ... , nt –1 Stream 2 Decode stream nt Stream nt Subtract stream 1 Stream 1 Decode stream 1 Decode stream 2 Decode stream 3 Subtract stream 1, 2 MMSE receiver 1 MMSE receiver nt MMSE receiver 3 MMSE receiver 2 y[m] Stream 3

SINRk,MMSE = Pkh∗

kK−1 zk hk

Kzk := σ2Inr + P

j>k

Pjhjh∗

j

SLIDE 33

MMSE-‑SIC ¡Achieves ¡MIMO ¡Capacity

33

R C

–30 10 –10 –20 20 Decorrelator 30 SNR (dB) MMSE–SIC MMSE Matched filter 1.1 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1

In fact for any channel matrix H and Kx, we have

nt

P

k=1

log (1 + SINRk,MMSE) = log det (Inr + HKxH∗)

SLIDE 34

Optimality ¡of ¡MMSE-‑SIC

Sending independent streams at individual Tx antennas

⟹ can be viewed as a multi-user uplink channel

Hence, the optimality of SIC is straightforward
At each stage, MMSE filter gives a sufficient statistics for

decoding the data from the vector signal

This is because the capacity achieving code for each data stream

looks like Gaussian

Hence the aggregate noise + interference is still Gaussian
Hence, MMSE is information lossless at each stage

34

Lecture ¡7: ¡MIMO ¡Capacity ¡and ¡ Multiplexing ¡Architectures

I-Hsiang Wang ihwang@ntu.edu.tw 5/15, 2014

Design ¡of ¡MIMO ¡Systems

Plot

Outline

Fast ¡Fading ¡MIMO ¡Channel

Fast ¡Fading ¡MIMO ¡with ¡Full ¡CSI

processing based on the SVD of H[m] at each time:

(nmin := min{nt,nr})

e yi[m] = λi[m]e xi[m] + e wi[m], i = 1, 2, . . . , nmin

Ergidic ¡Capacity ¡with ¡Full ¡CSI

...

Transceiver ¡Architecture ¡with ¡Full ¡CSI

pre-processing post-processing

Receiver ¡CSI ¡Only: ¡V-­‑BLAST

Capacity ¡of ¡Fast ¡Fading ¡MIMO ¡with ¡CSIR

packing argument), one can show that V-BLAST can achieve the capacity, which is given by the following:

given stationary distribution of H

Kx = Q diag (P1, . . . , Pnt) Q∗ C = max

 log det ✓ Inr + HKxH∗ σ2 ◆

Multiplexing ¡in ¡the ¡Angular ¡Domain

system onto which Tx modulates its data streams

the statistical property of H, so let’s focus on the angular domain representation: Ha = Ur*HUt

independent and zero-mean ⟹ it is reasonable to multiplex data on the coordinate system (indeed, optimal. HW.)

Uniform ¡Power ¡Allocation

power allocation (

suffices to choose Q = Int

Λp = P

C = E  log det ✓ Inr + SNR nt HH∗ ◆

V-­‑BLAST ¡under ¡i.i.d. ¡Rayleigh

transmits an independent data stream of rate Ri

Receiver ¡CSI ¡vs. ¡Full ¡CSI

λ1 ≥ λ2 ≥ · · · ≥ λnmin ≥ 0 CCSIR =

P

E h log ⇣ 1 + SNR

⌘i CFull CSI =

P

E ⇥ log

⇤

DoF ¡Gain ¡at ¡High ¡SNR

Power ¡Gain ¡at ¡Low ¡SNR

beamforming and dynamic power allocation

Nature ¡of ¡Peformance ¡Gain ¡(CSIR ¡only)

Receiver ¡Architectures

Decoding ¡at ¡the ¡Receiver

linear operations in the first step

Decorrelator: ¡Interference ¡Nulling

Bank ¡of ¡Decorrelators

Performance ¡in ¡i.i.d. ¡Rayleigh

gain of V-BLAST, but not the power gain

Performance ¡Gap ¡from ¡Capacity

Removing ¡the ¡Gap: ¡SIC

use successive interference cancellation (SIC) at Rx

Successive ¡Interference ¡Cancellation

Zero ¡Forcing ¡vs. ¡Matched ¡Filtering

expense of reducing the received SNR

but SINR may be bad

Zero ¡Forcing ¡vs. ¡Matched ¡Filtering

Optimal ¡Linear ¡Filter: ¡MMSE

Geometric ¡Picture

Linear ¡MMSE: ¡Performance

receiver by using SIC

Linear ¡MMSE: ¡Performance

MMSE-­‑SIC

MMSE-­‑SIC ¡Achieves ¡MIMO ¡Capacity

Optimality ¡of ¡MMSE-­‑SIC

⟹ can be viewed as a multi-user uplink channel

decoding the data from the vector signal

Receiver ¡CSI ¡Only: ¡V-‑BLAST

V-‑BLAST ¡under ¡i.i.d. ¡Rayleigh

MMSE-‑SIC

MMSE-‑SIC ¡Achieves ¡MIMO ¡Capacity

Optimality ¡of ¡MMSE-‑SIC