- Designing Autonomic Wireless Multi-hop
Networks for Delay-Sensitive Applications
Peter Hsien-Po Shiang Advisor : Prof. Mihaela van der Schaar Electrical Engineering, UCLA
Designing Autonomic Wireless Multi-hop Networks for Delay-Sensitive - - PowerPoint PPT Presentation
Designing Autonomic Wireless Multi-hop Networks for Delay-Sensitive Applications Peter Hsien-Po Shiang Advisor : Prof. Mihaela van der Schaar Electrical Engineering, UCLA Delay-sensitive applications are booming! Examples of
Peter Hsien-Po Shiang Advisor : Prof. Mihaela van der Schaar Electrical Engineering, UCLA
Video telephony Surveillance Live audio Live video Vehicular communications Battlefield sensing Games Video conferencing
Channel condition
Autonomic node = Agent
Information gathering phase Control info. Data Decision making phase
Learning phase
Application layer traffic requirements
Network environment
SINR Interference coupling among transmitter- receiver pairs Power control over ad hoc mobile networks Primary users’ loading, other secondary users’ actions Resource availability (spectrum holes) Distributed resource management over cognitive radio networks Source rate, transmission rate, packet error rate Source traffic, channel condition Multimedia transmission over wireless mesh network Local information Dynamics Network scenarios
……… ………
D1 r4 r5 r1 r2 r3 S1 D2 S2 S2 D1 D1 r4 r5 r1 r1 r2 r3 S1 D2
1. Information gathering phase Control info. Data packets
phase
Multimedia characteristics
V1 V2
Function of resource, e.g. throughput
Low High Complexity Distributed Centralized Decision maker Fast Slow Adaptation ability Low Proposed sol. High Traditional sol. Information
Online adaptation Predetermined Resource allocation Packet-based Flow-based Application model Explicit Proposed sol. Implicit Traditional sol. Delay constraint
GOP Transmission Time Multimedia Packets
: : : :
Classes
Relay Nodes
Application scheduling Network relay selection Data Link retransmission limit Physical MCS selection
Classes
–
No predetermined rate allocation
–
Low complexity
–
Fully distributed solution
–
Fast adaptation to network changes
Delay constraints
" # "
Information to the previous hop relay selecting parameter
"
! "
Evaluate
Get information feedback
selecting prob.
analysis for
" # "
Advantages:
Video streams statistics Relay selection
(MAC layer) Retransmission, TXOP (PHY) Modulation
Delay/Packet loss
"# " "
&
" " " #
& & &
& &
"#
Simulation Analytical
35.59 35.10 34.29 32.26 35.61 35.34 33.93 32.49 PSNR(dB) Coastguard 33.05 32.00 31.41 29.34 33.12 31.74 30.34 30.15 PSNR(dB) Mobile 0.6 0.5 0.4 0.3 0.6 0.5 0.4 0.3 Tm(Mbps)
v1 v2 v2 v1 m1 m2 m3 m4 5Tm 4Tm 5Tm 3Tm 5Tm 4Tm 5Tm 3Tm 3Tm 4Tm 5Tm 5Tm 4Tm 5Tm 3Tm 5Tm
Analytical Result
Simulation Analytical
35.58 33.88 33.56 31.86 35.61 33.93 33.92 32.48 PSNR(dB) Coastguard 32.85 31.35 30.21 28.39 33.12 31.74 30.34 28.20 PSNR(dB) Mobile 0.6 0.5 0.4 0.3 0.6 0.5 0.4 0.3 Tm(Mbps)
Analytical Result
S2 D1 r4 r5 r1 r2 r3 S1 D2
: Physical connections : Overlay connections
S2 D1 D1 r4 r5 r1 r1 r2 r3 S1 D2
: Physical connections : Overlay connections
33.10 33.27 30.42
Self-learning policy
35.58 32.85 31.86 28.39
MDTMR [Wei, Zakhor 2004]
34.32 31.37 30.67 24.98
AODV [Perkins 1999]
“Coastguard” Y-PSNR (dB) “Mobile” Y-PSNR(dB) “Coastguard” Y-PSNR (dB) “Mobile” Y-PSNR(dB) Tm = 0.6 (Mbps) Tm = 0.3 (Mbps)
Simulated method
The need of information feedback
Similar concept can be found in distance vector routing protocols, e.g. AODV [Perkins 1999], DSDV [Perkins 1994]
Information horizon n1 n2 n3 n4 n5 n6 n7 hop1 hop2 hop3
Video data (With TX strategies) Information feedback
TX strategies
n1 n2 n3 n4 n5 n6 n7 hop1 hop2 hop3
TX strategies
!'!
!'!
"
“Dropped” packets “Almost dropped” packets “Seldom dropped” packets
) #
)
" " "
" "# "
Class should be sent before class during , since it is more “risky”
User 1: Mobile Deadline: 500 ms User 2: Coastguard Deadline: 300 ms
" "
%
!'!
!
%
S1 S2 D1 D2 10Tm 10Tm 5Tm 5Tm 5Tm 5Tm 4Tm 3Tm 3Tm 4Tm 5Tm 5Tm 5Tm 5Tm 4Tm 3Tm 5Tm 4Tm 5Tm 5Tm Video: Mobile Deadline = 500ms Video: Coastguard Deadline = 300ms Hop1 Hop2 Hop3 Hop4
n1 n3 n4 n5 n6 n7 n8 n2
5Tm 5Tm Hop5
n9
n10 n11 n12 n13
Hop6 5Tm 10Tm 5Tm 5Tm 4Tm 3Tm 3Tm 4Tm 3Tm 5Tm 5Tm 10Tm 10Tm 10Tm S1 S2 D1 D2 10Tm 10Tm 5Tm 5Tm 5Tm 5Tm 4Tm 3Tm 3Tm 4Tm 5Tm 5Tm 5Tm 5Tm 4Tm 3Tm 5Tm 4Tm 5Tm 5Tm Video: Mobile Deadline = 500ms Video: Coastguard Deadline = 300ms Hop1 Hop2 Hop3 Hop4
n1 n3 n4 n5 n6 n7 n8 n2
5Tm 5Tm Hop5
n9
n10 n11 n12 n13
Hop6 5Tm 10Tm 5Tm 5Tm 4Tm 3Tm 3Tm 4Tm 3Tm 5Tm 5Tm 10Tm 10Tm 10Tm
31.75 30.85 29.59 Risk h=4 32.0 31.1 29.63 Risk h=3 31.55 30.80 30.1 Risk h=2 29.61 300Kbps 30.75 400Kbps 31.50 500Kbps Priority h=1
Tm
Analytical average PSNR (dB) for various information horizon
Remarks:
local information
Agent e.g. input rate, SINR, etc.
Utility evaluation Determine transmission action Gather local information
Wireless networks (other agents)
future influence
Key ideas:
Future utility evaluation Gather local Information State Determine transmission action
Agent Priority queuing model
input rate, SINR, etc. Wireless networks (other agents)
future influence
State transition prob.
Immediate Reward Discounted Expected Future Reward
.
.
Cannot take advantage of the queuing model Converges slowly
Priority queuing model (M/G/1 preemptive-repeat model) Maximum likelihood state transition probability
.
.
Information feedforward
"# "
Information feedback
%
% $
Future delay
Future utility evaluation Distributed MDP Local Information State Determine transmission action
process
transition
%
utility evaluation Distributed MDP Local Information State Determine transmission action
process
transition
converge
Proposition 2: The transmission policy of the distributed MDP will converge if and only if the priority class is not dropped in the networks
utility evaluation Distributed MDP Local Information State Determine transmission action
process
%
transition
converge
Existing routing solutions
–
Throughput optimal [Tassiulas 1996]
–
Flow-based optimized routing using queue size backpressure [Neely and Modiano 2006]
–
Throughput and delay optimized opportunistic routing [Gupta and Javidi 2007]
–
Low complexity distributed joint scheduling-routing algorithms [Gupta, Lin, Srikant 2007]
–
Selfish routing based on congestion information [Roughgarden 2002]
–
Network utility maximization framework (NUM) [Kelly 1998][Xu 2008] Required knowledge Decision making Online learning based on local information A priori known environment (e.g. given capacity region) Foresighted decision making Myopic decision making Proposed autonomic multi-hop routing Traditional routing
40 60 80 100 120 20 40 E[Delay1] 20 40 60 80 100 120 20 40 E[Delay2] 20 40 60 80 100 120 20 40 E[Delay3] 20 40 60 80 100 120 20 40 E[Delay4] 40 Model-based learning Self-learning Q-learning
Delay deadline: 1sec
Packet loss time (sec)
Delay deadline: 1sec Delay deadline: 1sec
– Q-learning [Watkins 1992] – TD-learning [Sutton 1988] – Reinforcement learning
– Fictitious play
– Model-based
Information
Reinforcement learning Model-based reinforcement learning
about the agent itself
Fictitious play
about all the other agents
Reinforcement learning SINR Interference coupling among agents Power control over ad hoc mobile networks Fictitious play Primary users’ loading, other secondary users’ actions Resource availability (spectrum holes) Distributed resource management over cognitive radio networks Model-based reinforcement learning Source rate, transmission rate, packet error rate Source traffic, channel condition Multimedia transmission over wireless mesh network Suitable learning Local information Dynamics Network scenarios
Cross-layer design for multimedia streaming
Multi-user video streaming over multi-hop wireless networks [JSAC 2007, Asilomar 2006, IIH-MSP 2006] Risk-aware scheduling [TMM 2007, VCIP 2008]
Dynamic resource management in cognitive radio networks
Queuing-based channel selection for multimedia transmission [TMM 2008, ICIP 2008] Joint route/channel selection in multi-hop cognitive radio networks [TVT 2008, DySPAN 2008]
Learning in games
Adaptive learning in power control game [TVT 2009] Predictive channel selection [ICC 2008] Learning in conjecture-based channel selection game [TNet submitted, Gamenets 2009] Model-based reinforcement learning for distributed MDP [under preparation]
Other
Routing decision for surveillance network under information constraint [TCSVT Submitted]
Accepted
Networks: A Distributed, Cross-layer Approach Based on Priority Queuing,” IEEE Journal of Selected Areas in Communications, vol. 25, no. 4, pp. 770-785, May 2007.
Multi-hop Wireless Networks,” IEEE Transactions on Multimedia, vol. 9, no. 6, pp. 1299-1313, Oct 2007.
Heterogeneous Multimedia Applications over Cognitive Radio Networks,” IEEE Transactions on Multimedia, vol. 10, no. 5, pp. 896-909, Aug. 2008.
Cognitive Radio Networks for Delay Sensitive Transmission,” IEEE Transactions on Vehicular Technology, vol. 52, no.2, pp. 941-953, Feb 2009.
Wireless Resource Management for Delay Sensitive Users,” IEEE Transactions on Vehicular Technology, accepted, to appear. Submitted
Autonomous Delay-Sensitive Users in Multi-Channel Wireless Networks,” submitted to IEEE Transactions on Networking.
Multi-Camera Wireless Surveillance Networks,” submitted to IEEE Transactions on Circuits and Systems for Video Technology.
Cognitive Radio Networks" in IEEE Dynamic Spectrum Access Networks (DySPAN 2008), Oct. 2008.
Streaming over Cognitive Radio Networks," in Proc. Int. Conf. On Image Processing. (ICIP 2008)
Sensitive Users Using Interactive Learning over Multi-carrier Networks," in Proc. Int. Conf.
(VCIP 2008), San Jose, Jan 2008.
Networks: A Cross-layer Priority Queuing Approach,” in IEEE Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2006), pp. 255-258, Dec 2006.
Streaming over Wireless Mesh Networks with Optimal Dynamic Routing and Time Allocation,” in Proceedings of the 40th Asilomar Conference on Signals, Systems, and Computers, Oct 2006.
Cross-Layer Cross-Overlay Architecture for Proactive Adaptive Processing in Mesh Networks,” in 2nd IEEE Workshop on Wireless Mesh Networks (WiMesh 2006), Sep 2006.
[WCZ05] Y. Wu, P. A. Chou, Q. Zhang, K. Jain, W. Zhu, S.Y. Kung, "Network Planning in Wireless Ad Hoc Networks: A Cross-Layer Approach", IEEE Journal on Selected Areas in Communications,
[SYZ05] E. Setton, T. Yoo, X. Zhu, A. Goldsmith, and B. Girod, “Cross-layer design of Ad hoc Networks for real-time video streaming,” IEEE Wireless Communications Mag., pp. 59-65, Aug 2005. [JF07] D. Jurca, P. Frossard, “Packet Selection and Scheduling for Multipath video streaming,” IEEE Transactions on Multimedia, vol. 9, no. 2, Apr. 2007. [AMV06] Y. Andreopoulos, N. Mastronarde, and M. van der Schaar, “Cross-layer Optimized video Streaming over wireless multi-hop Mesh Networks,” IEEE Journal on Selected Areas in Communications, vol. 24, no. 11, Nov 2006, pp. 2104-2115. [PR99] C. E. Perkins, E. M. Royer, “Ad hoc on-demand distance vector routing,” in Proceedings of the 2nd IEEE Workshop on Mobile Computing Systems and Applications, pp. 90-100, Feb 1999. [PB94] C. E. Perkins, P. Bhagwat, “Highly Dynamic Destination-Sequenced Distance-Vector Routing (DSDV) for Mobile Computers,” ACM SIGCOMM Computer Communication Review, vol. 24, no. 4, pp. 234-244, Oct. 1994. [WZ02] W. Wei, and A. Zakhor, “Multipath unicast and multicast video communication over wireless ad hoc networks,” Proc. Int. Conf. Broadband Networks, Broadnets, pp. 496-505, 2002. [DPZ04] R. Draves, J. Padhye, and B. Zill, “Routing in multi-radio, multi-hop wireless mesh networks,” in Proc. ACM Internat. Conf. on Mob. Computing and Networking (MOBICOM), 2004, pp. 114- 128. [AL94] B. Awerbuch and T. Leighton, “Improved Approximation Algorithms for the Multi-commodity Flow Problem and Local Competitive Routing in Dynamic Networks,” Proc. 26th ACM Symposium
[NMR05] M. J. Neely, E. Modiano, and C. E. Rohrs, “Dynamic Power Allocation and Routing for Time- Varying Wireless Networks”, IEEE Journal on Selected Areas in Communications, vol. 23. no1, Jan 2005. pp. 89-103. [GJ07] P. Gupta and T. Javidi, "Towards Throughput and Delay-Optimal Routing for Wireless Ad-Hoc Networks,'' Asilomar Conference on Signals, Systems and Computers, Nov. 2007.
[WD92] C. J. C. H. Watkins, P. Dayan, “Q-learning”, Machine Learning, vol. 8, no. 3-4, pp. 279-292, May 1992. [Sut88] R. S. Sutton, ”Learning to predict by the method of temporal differences,” Machine Learning,
[TO98] P. Tadepalli and D. Ok, "Model-based average reward reinforcement learning", Artificial Intelligence, Volume 100, Issues 1-2, January 1998, Pages 177-224. [BBS95] A. G. Barto, S. J. Bradtke and S. P. Singh, "Learning to act using real-time dynamic programming", Artificial Intelligence, Volume 72, Issues 1-2, January 1995, Pages 81-138.
Local information Transmission action Delay evaluation
–
Delay deadlines
–
Time-varying complexity
1 2 3 4 5 6 7 8 9 1000 2000 3000 4000 5000 6000 7000 8000 Complexity profile over time for decoding four layers -- Silent.CIF at 1.5 Mb/s Time (sec) Normalized Processor Ticks 1 2 3 4 5 6 7 8 9 1000 2000 3000 4000 5000 6000 7000 8000 Complexity profile over time for decoding four layers -- Silent.CIF at 1.5 Mb/s Time (sec) Normalized Processor Ticks
Decoding complexity (Silent sequence) Time (seconds) Normalized Complexity
(a) Sequential Dependencies (a) Typical Hybrid Coder Dependencies (MPEG-2, H.264/AVC) (a) Scalable Coding Dependencies
[Chou, 2006]
S2 D1 r4 r5 r1 r2 r3 S1 D2 S2 S2 D1 D1 r4 r5 r1 r1 r2 r3 S1 D2
20 40 60 80 100 120 140 20 40 60 80 100 120 140 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Users: 15 secondary users (nodes) Actions: channel/route selection 2 frequency channels Utilities: reduce packet loss rate of delay-sensitive applications Transmission range: 40 meters Primary user around node 11,12 Adopt fictitious play
– Goal: learn the other agents’ policies – Count the empirical frequency of the other agents’ actions – Probabilistic behaviors
+
+ + + + +
information Evaluate and maximize
User
network environment
Should an agent monitor all the other agents??
Build more accurate belief Avoid “information mismatch problem”
Packet transmission
(
Packet transmission
(
Primary users
route/channel selection
resource Adaptive Fictitious play
in horizon Node n
2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 Average Transmission Rate T(e,f) (Mbps) Packet Loss Rate 2 3 4 5 6 7 8 9 10 0.2 0.4 0.6 0.8 1 Average Transmission Rate T(e,f) (Mbps) Packet Loss Rate AODV V2 AODV/LB V2 DCS V2 AFP horizon 2 V2 AFP horizon 1 V2 AODV V1 AODV/LB V1 DCS V1 AFP horizon 2 V1 AFP horizon 1 V1
Myopic channel selection Learn from less neighbors Learn from more neighbors Random channel selection (Primary users loading ~ 0)
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.4 0.5 0.6 0.7 0.8 Primary user time fraction Packet loss rate AFP horizon 3 V1 AFP horizon 2 V1 AFP horizon 1 V1 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.1 0.15 0.2 0.25 0.3 0.35 Primary user time fraction Packet loss rate AFP horizon 3 V2 AFP horizon 2 V2 AFP horizon 1 V2
(Primary users around nodes 11, 12, T=5Mbps) Information cost