informationally efficient multi user communication
play

Informationally Efficient Multi user communication Yi Su Advisor: - PowerPoint PPT Presentation

Informationally Efficient Multi user communication Yi Su Advisor: Professor Mihaela van der Schaar Electrical Engineering, UCLA 1 Outline Motivation and existing approaches Informationally efficient multi user communication


  1. Additively Coupled Sum Constrained Games • Definition – A multi ‐ user interaction in which A2: The utility function satisfies states cost in which is an increasing and strictly concave function. Both and are twice differentiable. 33

  2. Additively Coupled Sum Constrained Games • Definition – A multi ‐ user interaction in which Structure of the utility: additive coupling between action and state A2: The utility function satisfies states cost in which is an increasing and strictly concave function. Both and are twice differentiable. 34

  3. Additively Coupled Sum Constrained Games • Definition – A multi ‐ user interaction in which Structure of the utility: additive coupling between action and state A2: The utility function satisfies states diminishing return per invested action cost in which is an increasing and strictly concave function. Both and are twice differentiable. 35

  4. Examples of ACSCG • Power control in interference channels 36

  5. Examples of ACSCG • Power control in interference channels 37

  6. Examples of ACSCG • Power control in interference channels 38

  7. Examples of ACSCG (cont’d) • Delay minimization in Jackson networks m k r im j i k r ij k ψ k i r i 0 39

  8. Examples of ACSCG (cont’d) • Delay minimization in Jackson networks m k r im j i k r ij k ψ k i r i 0 40

  9. Examples of ACSCG (cont’d) • Delay minimization in Jackson networks m k r im j i k r ij k ψ k i r i 0 41

  10. Nash equilibrium in ACSCG • Existence of pure NE – A subclass of concave games • When is the NE unique? When does best response converges to such a NE? – Existing literatures are not immediately applicable • Diagonal strict convexity condition [Rosen] • Use gradient play and stepsizes need to be carefully chosen • Super ‐ modular games [Topkis] • Action space is not a lattice • Sufficient conditions for specific and [Yu] 42

  11. Best response dynamics • Best response iteration 43

  12. Best response dynamics • Best response iteration in which is chosen such that • When does it converges? – By intuition, the weaker the mutual coupling is, the more likely it converges – How to measure and quantify this coupling strength? 44

  13. Best response dynamics sum constraint additive coupling • Best response iteration in which is chosen such that state • When does it converges? A competition scenario in which every user aggressively uses up all his resources – By intuition, the weaker the mutual coupling is, the more likely it converges – How to measure and quantify this coupling strength? 45

  14. Best response dynamics sum constraint additive coupling • Best response iteration in which is chosen such that state • When does it converges? – By intuition, the weaker the mutual coupling is, the more likely it converges – How to measure and quantify this coupling strength? 46

  15. A measure of the mutual coupling Define represents the maximum impact that user m’s action can make over user n’s state 47

  16. Convergence conditions Theorem 1 : If then best response dynamics converges linearly to a unique pure NE for any set of initial conditions. • The contraction factor is a measure of the overall coupling strength • If is affine, the condition in Theorem 1 is not impacted by ; otherwise it may depend on . 48

  17. Convergence conditions Theorem 1 : If then best response dynamics converges linearly to a unique pure NE for any set of initial conditions. Contraction mapping • The contraction factor is a measure of the overall coupling strength • If is affine, the condition in Theorem 1 is not impacted by ; otherwise it may depend on . 49

  18. Convergence conditions Theorem 1 : If then best response dynamics converges linearly to a unique pure NE for any set of initial conditions. Contraction mapping • The contraction factor is a measure of the overall coupling strength • If is affine, the condition in Theorem 1 is not impacted by ; otherwise it may depend on . is a constant for affine 50

  19. Convergence conditions • If have the same sign, the condition in Theorem 1 can be relaxed to • This is true in many communication scenarios – Increasing power causes stronger interference – Increasing input rate congests the server 51

  20. Convergence conditions • If have the same sign, the condition in Theorem 1 can be relaxed to • This is true in many communication scenarios – Increasing power causes stronger interference – Increasing input rate congests the server Strategic complements (or strategic substitutes) 52

  21. A special class of For , define [Walrand] 53

  22. A special class of For , define [Walrand] Define A measure of the similarity between users’ parameters 54

  23. Convergence conditions Theorem 2 : If then best response dynamics converges linearly to a unique pure NE for any set of initial conditions. 55

  24. Convergence conditions Theorem 2 : If then best response dynamics converges linearly to a unique pure NE for any set of initial conditions. Contraction mapping Theorem 1 Theorem 1 Theorem 2 Theorem 2 56

  25. Conclusion so far… If Information is constrained and no message passing is available… Pareto boundary u 2 When will it c o nverge Concave games ? And ho w fast ? to a NE ACSCG Power control, Nash equilibrium Flow control u 1 57

  26. Conclusion so far… If Information is constrained and no message passing is available… Pareto boundary u 2 When will it c o nverge Concave games ? And ho w fast ? to a NE Suffic ient c o nditio ns that ACSCG guarantee linear c o nvergenc e Power control, Nash equilibrium Flow control u 1 58

  27. Power control as an ACSCG • Power control in interference channels 59

  28. Performance comparison • Solutions without information exchange – Iterative water ‐ filling algorithm [Yu] k P n ∑ ∑ k k k k H H P P mn mn m m ≠ ≠ m m n n k k σ σ n n k k user n ’s spectrum • Solutions with information exchange ∑ ω max R k k k 60

  29. Performance comparison • Solutions without information exchange – Iterative water ‐ filling algorithm [Yu] k P n OSB = Optimal Spectrum ∑ ∑ k k k k H H P P Balancing mn mn m m ≠ ≠ m m n n k k σ σ n n ASB = Autonomous k k user n ’s spectrum Spectrum Balancing • Solutions with information exchange ∑ ω max R k k k 61

  30. Outline • Motivation and existing approaches • Informationally efficient multi ‐ user communication – Vector cases • Convergence conditions with decentralized information • Improve efficiency with decentralized information – Scalar cases • Achieve Pareto efficiency with decentralized information • Conclusions 62

  31. How to model the mutual coupling • A reformulation of the coupling =× – State space S S ∈ n n N × → u : S A R – Utility function n n n → s : – State determination function A S − n n n → � s : A S – Belief function n n n – Conjectural Equilibrium (CE) : a configuration of ∗ ∗ ∗ ∗ ∗ � � = belief functions and joint action ( s , � , s ) a ( a , � , a ) 1 N 1 N satisfying ( ) ∗ ( ∗ ) ∗ ∗ ∗ ( ) � = s a s ( a ) and = � a arg max u s a , a − n n n n n n n n n ∈ a A n n 63

  32. How to model the mutual coupling • A reformulation of the coupling it captures the =× – State space S S aggregate effect of ∈ n n N the other users’ actions × → u : S A R – Utility function n n n → s : – State determination function A S − n n n it models the aggregate effect → � s : A S – Belief function n n n of the other users’ actions – Conjectural Equilibrium (CE) : a configuration of ∗ ∗ ∗ ∗ ∗ � � = belief functions and joint action ( s , � , s ) a ( a , � , a ) 1 N 1 N satisfying ( ) ∗ ( ∗ ) ∗ ∗ ∗ ( ) � = s a s ( a ) and = � a arg max u s a , a − n n n n n n n n n ∈ a A n n 64

  33. How to model the mutual coupling • A reformulation of the coupling it captures the =× – State space S S aggregate effect of ∈ n n N the other users’ actions × → u : S A R – Utility function n n n → s : – State determination function A S − n n n it models the aggregate effect → � s : A S – Belief function n n n of the other users’ actions – Conjectural Equilibrium (CE) : a configuration of ∗ ∗ ∗ ∗ ∗ � � = belief functions and joint action ( s , � , s ) a ( a , � , a ) 1 N 1 N satisfying ( ) ∗ ( ∗ ) ∗ ∗ ∗ ( ) � = s a s ( a ) and = � a arg max u s a , a − n n n n n n n n n ∈ a A n n each user behaves optimally beliefs are realized according to its expectation 65

  34. CE in power control games [SuTSP’09] • One leader and multiple followers • State space k : the interference caused to user n in channel k I – n • Utility function ⎛ ⎞ k K P ⎟ ⎜ ∑ ⎟ n = ⎜ + R log 1 ⎟ ⎜ n 2 ⎟ ⎜ k k σ + ⎝ I ⎠ = n n k 1 • State determination function actual play = ∑ N k k k α I P n in i i = i ≠ n 1, • Belief function (linear form) conceived play � k k k k = β − γ I P 1 1 66

  35. Why Linear belief? k k ∂ ∂ I I 1 1 is piece ‐ wise linear; , if the = ≠ 0, j k j k ∂ ∂ P P 1 1 number of frequency bins is sufficiently large. � Linear belief is sufficient to capture the interference coupling! 67

  36. Why Linear belief? k k ∂ ∂ I I 1 1 is piece ‐ wise linear; , if the = ≠ 0, j k j k ∂ ∂ P P 1 1 number of frequency bins is sufficiently large. � Linear belief is sufficient to capture the interference coupling! f P 2 f f H P 12 1 f σ 2 f 68

  37. Why Linear belief? k k ∂ ∂ I I 1 1 is piece ‐ wise linear; , if the = ≠ 0, j k j k ∂ ∂ P P 1 1 number of frequency bins is sufficiently large. � Linear belief is sufficient to capture the interference coupling! f f P P 2 2 f f f f H P H P 12 1 12 1 f f σ σ 2 2 f f 69

  38. Why Linear belief? k k ∂ ∂ I I 1 1 is piece ‐ wise linear; , if the = ≠ 0, j k j k ∂ ∂ P P 1 1 number of frequency bins is sufficiently large. � Linear belief is sufficient to capture the interference coupling! f f f P P P 2 2 2 f f f f f f H P H P H P 12 1 12 1 12 1 f f f σ σ σ 2 2 2 f f f 70

  39. Why Linear belief? k k ∂ ∂ I I 1 1 is piece ‐ wise linear; , if the = ≠ 0, j k j k ∂ ∂ P P 1 1 number of frequency bins is sufficiently large. � Linear belief is sufficient to capture the interference coupling! f f f P P P 2 2 2 f f f f f f H P H P H P 12 1 12 1 12 1 f f f σ σ σ 2 2 2 f f f 71

  40. Main results • Stackelberg equilibrium ( ) ( ) * * – Strategy profile that satisfies a , NE a 1 1 ( ) ( ) ( ) * * ( ) ≥ ∀ ∈ A u a , NE a u a NE a , , a 1 1 1 1 1 1 1 1 • NE and SE are special CE SE R • 1 R 1 NE N R ∑ 1 k k k k • NE: β = α γ = P , 0 i 1 i = i 2 k k ∂ ∂ I I k k k k 1 1 β = − ⋅ γ = − SE: I P , . 1 1 k k ∂ ∂ P P β 1 1 • • Infinite set of CE Open sets of CE that contain • γ NE and SE may exist 72

  41. Achieving the desired CE • Conjecture ‐ based rate maximization (CRM) leader followers solvable using dual method 73

  42. Discussion about CRM • Essence of CRM – local approximation of the computation of SE • Advantages – the structure of the utility function is explored – only local information is required – it can be applied in the cases where N>2 – if it converges, the outcome is a CE 74

  43. Simulation results 1 1 0.9 0.9 0.8 0.8 NE R 1 /R 1 0.7 0.7 NE R 2 /R 2 NE R 1 /R 1 0.6 0.6 NE R 2 /R 2 NE R 3 /R 3 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 0.5 1 1.5 2 2.5 3 0.8 1 1.2 1.4 1.6 1.8 2 Average rate improvements: ( ) 2 ∑ k α = ≠ 0.5, i j 2 ‐ user case: 24.4% for user 1; 33.6% for user 2 ij k ( ) 2 ∑ k 3 ‐ user case: 26.3% for user 1; 9.7% for user 2&3 α = ≠ 0.33, i j ij k 75

  44. Conclusions so far… If Information is constrained and no message passing is allowed Pareto boundary u 2 Concave games H o w to impro ve an ineffic ient NE witho ut message passing ? ACSCG Power control Nash equilibrium u 1 76

  45. Conclusions so far… If Information is constrained and no message passing is allowed Pareto boundary u 2 Concave games H o w to impro ve an ineffic ient NE witho ut message passing ? Overall effic ienc y may be impro ve d! ACSCG Build belief, learn, and adapt Power control Nash equilibrium u 1 77

  46. Outline • Motivation and existing approaches • Informationally efficient multi ‐ user communication – Vector cases • Convergence conditions with decentralized information • Improve efficiency with decentralized information – Scalar cases • Achieve Pareto efficiency with decentralized information • Conclusions 78

  47. Linearly coupled games • A non ‐ cooperative game model • Users’ states are linearly impacted by their competitor’s actions • Contributions – Characterize the structures of the utility functions – Explicitly compute Nash equilibrium and Pareto boundary – A conjectural equilibrium approach to achieve Pareto boundary without real ‐ time information exchange 79

  48. Definition A multi ‐ user interaction is considered a linearly coupled game if the action set is convex and the utility function satisfies in which . In particular, the basic assumptions about include: States are linearly impacted by actions A1: is non ‐ negative; A2: is strictly linearly decreasing in ; is non ‐ increasing and linear in . 80

  49. Definition (cont’d) Denote . A3: is an affine function, A4: Actions are linearly coupled at NE and PB 81

  50. Two basic types • For the games satisfying A1 ‐ A4, the utility functions can take two types of form: – Type I [SuJSAC’10] • e.g. random access – Type II [SuTR’09] • e.g. rate control 82

  51. Two basic types • For the games satisfying A1 ‐ A4, the utility functions can take two types of form: – Type I [SuJSAC’10] • e.g. random access – Type II [SuTR’09] • e.g. rate control 83

  52. Type I games: wireless random access • Player set: Rx 2 Tx 1 – nodes in a single cell • Action set: Rx K – transmission probability Tx K • Payoff: Rx 1 Tx 2 – throughput • Key issues – stability, convergence, throughput, and fairness 84

  53. Conjecture ‐ based Random Access • Individual conjectures actual play – state: conceived play – linear belief: • Two update mechanisms – Best response – Gradient play 85

  54. Main results Protocol design: how to achieve efficient outcomes? • Existence of CE – all operating points in action space are CE • Stability and convergence – sufficient conditions • Throughput performance – the entire throughput region can be achieved with stable CE • Fairness issue – conjecture ‐ based approaches attain weighted fairness 86

  55. How to select suitable a k ? • Adaptively alter a k when the network size changes • Adopt aggregated throughput or “idle interval” as the indicator of the system efficiency • Advantages – No need of a centralized solver – Throughput efficient with fairness guarantee – Stable equilibrium – Autonomously adapt to traffic fluctuation 87

  56. Engineering interpretation • DCF vs. the best response update – re ‐ design the random access protocol 88

  57. Engineering interpretation • DCF vs. the best response update – re ‐ design the random access protocol similar different 89

  58. Engineering interpretation • DCF vs. the best response update – re ‐ design the random access protocol similar different CBRA makes use of 4-bit information, while DCF only uses 2 bits 90

  59. Simulation results • Throughput • Stability and convergence Accumulative throughput (Mbps) 36 36 P-MAC 35 Best response 35.5 Gradient play 34 35 Accumulative throughput (Mbps) Optimal throughput 33 P-MAC 34.5 Conjecture-based algorithms 32 IEEE 802.11 DCF 34 31 33.5 30 33 29 32.5 28 32 27 31.5 26 25 31 5 10 15 20 25 30 35 40 45 50 0 100 200 300 400 500 600 Number of nodes DCF: low throughput; P ‐ MAC: instability due to the online estimation P ‐ MAC: needs to know the number of nodes 91

  60. Conventional solutions in Type II games • Utility function • Nash equilibrium • Pareto boundary • Efficiency loss 92

  61. Best response dynamics in Type II games Observed state Linear belief • At stage t , • Theorem 5 : A necessary and sufficient condition for the best response dynamics to converge is Determine the eigenvalues of the Jacobian matrix 93

  62. Stability of the Pareto boundary • Theorem 6 : All the operating points on the Pareto boundary are globally convergent CE under the best response dynamics. The belief configurations lead to Pareto ‐ optimal operating points if and only if : the ratio between the immediate – performance degradation and the conjectured long ‐ term effect Theorem 5 and expressions of Pareto boundary and CE 94

  63. Pricing vs. conjectural equilibrium • Pricing mechanism in communication networks [Kelly][Chiang] – Users repeatedly exchange coordination signals • Conjectural equilibrium for linearly coupled games – Coordination is implicitly implemented when the participating users initialize their belief parameters – Pareto ‐ optimality can be achieved solely based on local observations on the states – No message passing is needed during the convergence process – The key problem is how to design belief functions 95

  64. Conclusions so far… Can we still ac hieve Pareto o ptimality ? u 2 Pareto boundary Global (exchanged) information Concave games If Information is constrained and no message passing is available… LCG Decentralized (insufficient) information Random Access, Nash equilibrium Rate control Decentralized (limited) information u 1 The optimal way of designing the beliefs and updating the actions based on conjectural equilibrium is addressed 96

  65. Conclusions so far… Can we still ac hieve Pareto o ptimality ? u 2 Pareto boundary Global (exchanged) information Concave games If Information is constrained and Pareto o ptimality no message passing is available… c an be ac hieved! Conjectural equilibrium LCG Decentralized (insufficient) information Random Access, Nash equilibrium Rate control Decentralized (limited) information u 1 The optimal way of designing the beliefs and updating the actions based on conjectural equilibrium is addressed 97

  66. Conclusions • We define new classes of games emerging in multi ‐ user communication networks and investigate the information and efficiency trade ‐ off – Provide sufficient convergence conditions to NE – Suggest a conjectural equilibrium based approach to improve efficiency – Quantify the performance improvement 98

  67. References • J. Rosen, “Existence and uniqueness of equilibrium points for concave n ‐ person games,” Econometrica , vol. 33, no. 3, pp. 520 ‐ 534, Jul. 1965. • D. Monderer and L. S. Shapley, “Potential games,” Games Econ. Behav. , vol. 14, no. 1, pp. 124 ‐ 143, May 1996. • D. Topkis, Supermodularity and Complementarity . Princeton University Press, Princeton, 1998. • F. Kelly, A. K. Maulloo, and D. K. H. Tan, “Rate control in communication networks: shadow prices, proportional fairness and stability,” Journal of the Operational Research Society , vol. 49, pp. 237 ‐ 252, 1998. • M. Chiang, S. H. Low, A. R. Calderbank, and J. C. Doyle, “Layering as optimization decomposition: A mathematical theory of network architectures,” Proc. of the IEEE , vol. 95, no. 1, pp. 255 ‐ 312, January 2007. 99

  68. References (cont’d) • W. Yu, G. Ginis, and J. Cioffi, “Distributed multiuser power control for digital subscriber lines,” IEEE J. Sel. Areas Commun. , vol. 20, no. 5, pp. 1105 ‐ 1115, June 2002. • J. Mo and J. Walrand, “Fair end ‐ to ‐ end window ‐ based congestion control,” IEEE Trans. on Networking , vol. 8, no. 5, pp. 556 ‐ 567, Oct. 2000. 100

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend