Ca Cart rtel: A System for Collaborative Transfer Learning at the Edge
Harshit Daga* | Patrick K. Nicholson+ | Ada Gavrilovska* | Diego Lugones+
*Georgia Institute of Technology, +Nokia Bell Labs
rtel : A System for Collaborative Transfer Learning at the Edge - - PowerPoint PPT Presentation
Ca Cart rtel : A System for Collaborative Transfer Learning at the Edge Harshit Daga * | Patrick K. Nicholson + | Ada Gavrilovska * | Diego Lugones + * Georgia Institute of Technology, + Nokia Bell Labs Multi-access Edge Computing (MEC)
Harshit Daga* | Patrick K. Nicholson+ | Ada Gavrilovska* | Diego Lugones+
*Georgia Institute of Technology, +Nokia Bell Labs
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 2
Nokia
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 3
context of MEC:
Microsoft
generated at the edge from end-user devices and IoT.
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 4
(a) Data Edge Cloud
uses a lot of backhaul network bandwidth.
data can slow down the execution up to 53X[1].
[1] Kevin et al. Gaia: Geo-Distributed Machine Learning Approaching LAN Speeds.
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 5
changing workload.
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 6
Can we achieve a balance between centralized and isolated system? Leverage the resource-constrained edge nodes to train customized (smaller) machine learning models in a manner that reduces training time and backhaul data transfer while keeping the performance closer to a centralized system?
variety of data may not be required at an edge node.
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 7
E node E node E node E node E node
Centralized Isolated Cartel Light Weight Models Data Transfer Online Training Time High Model accuracy
↓ ↓ ↓ ↓ ↑ ↑ x x
start to adapt to these changes by transferring knowledge from other edge(s) where similar patterns have been observed.
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 8
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 9
§ Statistics about the network § Software configuration § Active user distribution by segments § Estimates of class priors (probability of certain classes), etc.
Metadata Server (MdS)
E1 node
Cartel maintains and aggregates metadata locally and in the metadata server (MdS).
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 10
collaborate with edge nodes for a model transfer.
detection mechanism.
Edge Node (E) Eis register and send metadata
E1 node
Metadata Server (MdS)
E2 node
Request Batch
E4 node E3 node
1 2
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 11
the target node.
be from anywhere in the network.
some shift, the empirical distributions from the target node is compared with those from the
subset of edge nodes are logical neighbors of the target node.
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 12
1. Partitioning 2. Merging
Logical Neighbor Target Node Help Me (SOS)
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 13
Existing ML Library*
Data
Edge Node
Collaborative Component
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 14
Existing ML Library*
Data
Edge Node
Collaborative Component
Register Predict Train Merge Transfer Partition ML Model Accuracy Trend
Data
Distribution Drift
Edge Node
Collaborative Learning
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 16
workload?
costs, while providing lightweight and accurate models?
and the design choices?
Introduction Workload Fluctuation Workload
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 15
workload?
costs, while providing lightweight and accurate models?
and the design choices?
Introduction Workload Fluctuation Workload
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 17
Online Random Forest (ORF) Introduction Workload Number of Requests
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 18
Fluctuation Workload Online Support Vector Machine (OSVM)
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 19
Fluctuation Workload Online Support Vector Machine (OSVM)
Cartel provides a jump start by transferring knowledge from other edge(s) where similar patterns have been observed.
predictive performance compared to a centralized system.
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 20
cost includes the transfer of raw data or metadata updates.
to the edge (periodically in case
centralized system or partial model request from a logical neighbor in Cartel).
up to 1500x when compared to a centralized system.
Cartel: A System for Collaborative Transfer Learning at the Edge | SoCC ’19, November 20–23, 2019, Santa Cruz, CA, USA | 21
machine learning models between edge nodes.
compared to an isolated system).
↓ compared to a centralized system).
leading to faster training (5.7x ↓) when compared to a centralized system.
Edge Node (E)
Request for nodes with similar model Subset of helpful neighbors (E3, E4) Insights
Eis register and send metadata
E1 node (t)
3
Metadata Service (MdS)
E2 node
Request Batch
E4 node E3 node Insights
1 2 4
Harshit Daga* | Patrick K. Nicholson+ | Ada Gavrilovska* | Diego Lugones+
*Georgia Institute of Technology, +Nokia Bell Labs