an online learning based task offloading framework for 5g
play

An Online Learning-Based Task Offloading Framework for 5G Small Cell - PowerPoint PPT Presentation

An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks ICPP2020 1 Background & Motivation System Model Outline Algorithm Design Analysis & Performance Conclusion ICPP2020 2 Background &


  1. An Online Learning-Based Task Offloading Framework for 5G Small Cell Networks ICPP2020 1

  2.  Background & Motivation  System Model Outline  Algorithm Design  Analysis & Performance  Conclusion ICPP2020 2

  3. Background & Motivation - 5G 5G technology: the fifth-generation mobile communication technology Higher transmission rate • Faster speed • Larger number of connections • ICPP2020 3

  4. Background & Motivation - small cell node Small cell node (SCN): fundamental element of 5G network Low-power, short-range • Cover small geographical areas • Millimeter-Wave • Closer to wireless devices • Process larger amount of data at faster speeds ICPP2020 4

  5. Background & Motivation - small cell network  SCN Improve coverage and capacity • Better and faster connectivity •  Macrocell Wide coverage • Further away from the users • Communicate with SCNs • Fig. An illustration of small cell network ICPP2020 5

  6. Background & Motivation  5G boosts demand for new services  Characteristic of services Security surveillance Huge amount of data • • Virtual reality Delay sensitive • • Automatic driving High computing requirements • • ……. • Small cells equipped with edge servers represent a competitive solution for mobile task offloading ICPP2020 6

  7. Background & Motivation  Features of offloading tasks from wireless devices (WDs) in small cell network Limited connections • Finite computation resources • Significant uncertainties in the task offloading process, such as blockage •  A fundamental problem Given limited computation and communication resources, how to select computing tasks to maximize effective reward? ICPP2020 7

  8.  Background & Motivation  System Model Outline  Algorithm Design  Analysis & Performance  Conclusion ICPP2020 8

  9. System Model - overview MBS: Controls and prioritizes offloading • task to SCNs SCN : 𝑁 SCNs • Limited coverage • Equipped with computing server • Task : Provide task context information • May be covered by multiple small • cells ICPP2020 9

  10. System Model - goal  Random process in task offloading: • Reward for SCN 𝑛 to complete task 𝑗 at time 𝑢 𝑉 𝑛, 𝜚 𝑗 , 𝑢 Likelihood for SCN 𝑛 to complete 𝑗 at time 𝑢 • 𝑊(𝑛, 𝜚 𝑗 , 𝑢) Resource consumption 𝑅(𝑛, 𝜚 𝑗 , 𝑢) •  Our goal: Compound reward: the effective reward per unit resource for SCN 𝑛 to • complete the task with context 𝜚 at time 𝑢 𝐻 𝑛, 𝜚 𝑗 , 𝑢 = 𝑉 ∗ 𝑊/𝑅 Maximize the total compound reward under system constraints • ICPP2020 10

  11. System Model - constraints The number of tasks accepted by each SCN does not exceed its • communication capacity Each task is not repetitively offloaded by multiple SCNs • Number of successfully processed tasks by each SCN at a time slot • Resource capacity • ICPP2020 11

  12.  Background & Motivation  System Model Outline  Algorithm Design  Analysis & Performance  Conclusion ICPP2020 12

  13. Algorithm Design - difficulties 1. Uncertain and stochastic environment: reward, likelihood, resource consumption 2. The balance between maximizing the total compound reward and satisfying the system constraints 3. Enumerating all possible sets and selecting the optimal one leads to large search space. How to avoid combinatorial explosion? 4. How to guarantee tasks are not repeatedly offloaded? ICPP2020 13

  14. Algorithm Design - technique 1  Multi-armed bandit (MAB) A model for sequential decision problems with • exploration-exploitation tradeoff Some bandit machines ( arms ), and drawing an • arm yields a reward. ICPP2020 14

  15. Algorithm Design - technique 2  Lagrangian multipliers General MAB framework does not consider any constraints • Introduce adjustable Lagrangian multipliers, to balance reward and constraint • violation Construct a new regret function • 1 𝑛, 𝑢 2 + 𝜇 2 𝑛, 𝑢 ∗ 𝑊 2 𝑛, 𝑢 2 𝑍 = 𝑆 𝑛, 𝑢 + 𝜇 1 𝑛, 𝑢 ∗ 𝑊 Oracle total compound reward - our algorithm total compound reward ICPP2020 15

  16. Algorithm Design - technique 3  Tailored Contextual MAB Each task comes with its context • Massive contexts to be learned • Divide task context space into small hypercubes of similar contexts • Maintain a weight for each hypercube, based on the regret 𝑍 • For each SCN, estimate the selection probability of each task • ICPP2020 16

  17. Algorithm Design - technique 4 k1  Weighted bipartite graph 0.7 A task may be covered by multiple SCNs • S1 0.9 k2 Coordinate multiple SCNs for task offloading • 0.6 Construct weighted bipartite graph, based on all • 0.4 tasks’selection probabilities k3 0.7 S2 Greedy manner • 0.8 k4 ICPP2020 17

  18. Algorithm Design - framework ICPP2020 18

  19.  Background & Motivation  System Model Outline  Algorithm Design  Analysis & Performance  Conclusion ICPP2020 19

  20. Analysis & Performance - theoretical analysis Regret = total reward of oracle – total reward of our algorithm • Violations of system constraints • Regret and violations are all sub-linear with respect to timespan 𝑈 • Converges to the optimal task offloading decisions over time • Asymptotically optimal performance when 𝑈 is sufficiently large • ICPP2020 20

  21. Analysis & Performance - performance  Simulation setup 10000 time slots • 30 SCNs connected to a MBS • 35-100 tasks in the coverage area of each SCN • each SCN can simultaneously support up to 20 wireless device • Reward and likelihood are normalized and uniformly distributed in [0, 1] • ICPP2020 21

  22. Analysis & Performance - performance  Performance Metrics Cumulative compound reward (violation) • Per-time-slot compound reward (violation) • Performance ratio • = cumulative compound reward / (cumulative violation1 + cumulative violation2) ICPP2020 22

  23. Analysis & Performance - performance Fig1 Fig2 Fig3 Fig4 ICPP2020 23

  24.  Background & Motivation  System Model Outline  Algorithm Design  Analysis & Performance  Conclusion ICPP2020 24

  25. Conclusion Study task offloading in 5G small cell networks • Propose an online learning-based solution framework • Leverage MAB technique to learn the best selection strategy, while considering • resource capacity constraints and QoS requirement The efficiency is verified by both theoretical analysis and simulation studies • Achieve sub-linear bounds for both regret and violations • ICPP2020 25

  26. Thanks for listening ! ICPP2020 26

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend