GEEM : An algorithm for Active Learning on Attributed Graphs
Florence Regol* Soumyasundar Pal*, Yingxue Zhang**, Mark Coates*
* McGill University Compnet Lab **Huawei Noah’s Ark Lab, Montreal Research Center
July 14th 2020
1 / 22
GEEM : An algorithm for Active Learning on Attributed Graphs - - PowerPoint PPT Presentation
GEEM : An algorithm for Active Learning on Attributed Graphs Florence Regol* Soumyasundar Pal*, Yingxue Zhang**, Mark Coates* * McGill University Compnet Lab **Huawei Noahs Ark Lab, Montreal Research Center July 14th 2020 1 / 22 Active
Florence Regol* Soumyasundar Pal*, Yingxue Zhang**, Mark Coates*
* McGill University Compnet Lab **Huawei Noah’s Ark Lab, Montreal Research Center
July 14th 2020
1 / 22
2 / 22
2 / 22
2 / 22
2 / 22
2 / 22
? ?
Goal: Choose optimal queries to maximize performance.
2 / 22
? ?
3 / 22
4 / 22
1
PREDICT : Infer ˆ Y = ft(X).
4 / 22
1
PREDICT : Infer ˆ Y = ft(X). Trained on (X, YLt) current labelled set Lt.
4 / 22
1
PREDICT : Infer ˆ Y = ft(X). Trained on (X, YLt) current labelled set Lt.
2
QUERY : Select q from the unlabelled set Ut.
4 / 22
1
PREDICT : Infer ˆ Y = ft(X). Trained on (X, YLt) current labelled set Lt.
2
QUERY : Select q from the unlabelled set Ut. Update Lt+1 = Lt ∪ {qt} and Ut+1 = Ut \ {qt}.
4 / 22
1
PREDICT : Infer ˆ Y = ft(X). Trained on (X, YLt) current labelled set Lt.
2
QUERY : Select q from the unlabelled set Ut. Update Lt+1 = Lt ∪ {qt} and Ut+1 = Ut \ {qt}. Repeat until the query budget B has been reached.
4 / 22
5 / 22
SOTA Active leaning strategies based on GCN output. (AGE [1] and ANRMAB [2])
5 / 22
SOTA Active leaning strategies based on GCN output. (AGE [1] and ANRMAB [2])
1
PREDICT : Infer ˆ Y = ft(X).
5 / 22
SOTA Active leaning strategies based on GCN output. (AGE [1] and ANRMAB [2])
1
PREDICT : Infer ˆ Y = ft(X). → Run one epoch of GCN.
5 / 22
SOTA Active leaning strategies based on GCN output. (AGE [1] and ANRMAB [2])
1
PREDICT : Infer ˆ Y = ft(X). → Run one epoch of GCN. → Save the node embeddings output from the GCN.
5 / 22
SOTA Active leaning strategies based on GCN output. (AGE [1] and ANRMAB [2])
1
PREDICT : Infer ˆ Y = ft(X). → Run one epoch of GCN. → Save the node embeddings output from the GCN.
2
QUERY Select q ∈ Ut.
5 / 22
SOTA Active leaning strategies based on GCN output. (AGE [1] and ANRMAB [2])
1
PREDICT : Infer ˆ Y = ft(X). → Run one epoch of GCN. → Save the node embeddings output from the GCN.
2
QUERY Select q ∈ Ut. → Select q based on metrics derived from GCN output.
5 / 22
SOTA Active leaning strategies based on GCN output. (AGE [1] and ANRMAB [2])
1
PREDICT : Infer ˆ Y = ft(X). → Run one epoch of GCN. → Save the node embeddings output from the GCN.
2
QUERY Select q ∈ Ut. → Select q based on metrics derived from GCN output.
[1] Cai et al. ”Active learning for graph embedding” arXiv 2017 [2] Gao et al. ”Active discriminative network representation learning” IJCAI 2018
5 / 22
GCN-based algorithms on Cora. 20 40 60 Number of nodes in labeled set 50 60 70 80 Accuracy
Accuracy of GCN without active learning with x = 120 nodes in labeled set
AGE ANRMAB
6 / 22
7 / 22
7 / 22
Cora with non-optimized version of AGE. 20 40 60 Number of nodes in labeled set 40 50 60 70 80 Accuracy AGE AGE non optimized ANRMAB
8 / 22
Amazon-photo. Hyperparameters not fine-tuned to the dataset. 10 20 30 40 50 Number of nodes in labeled set 40 60 80 Accuracy
Accuracy of GCN without active learning with x = 160 nodes in labeled set
AGE AGE non optimized ANRMAB
9 / 22
10 / 22
11 / 22
Risk of q : The expected 0/1 error once added to Lt.
11 / 22
Risk of q : The expected 0/1 error once added to Lt. Denoted by R+q
|YLt .
11 / 22
Risk of q : The expected 0/1 error once added to Lt. Denoted by R+q
|YLt .
EEM selects the query q that minimizes this risk.
11 / 22
Risk of q : The expected 0/1 error once added to Lt. Denoted by R+q
|YLt .
EEM selects the query q that minimizes this risk. q∗ = arg min
q∈Ut
R+q
|YLt
11 / 22
Risk of q : The expected 0/1 error once added to Lt. Denoted by R+q
|YLt .
EEM selects the query q that minimizes this risk. q∗ = arg min
q∈Ut
R+q
|YLt
R+q
|YLt =
1 |Uq
t |
t
k′∈K p(yi = k′|YLt, yq = k)
12 / 22
Risk of q : The expected 0/1 error once added to Lt. Denoted by R+q
|YLt .
EEM selects the query q that minimizes this risk. q∗ = arg min
q∈Ut
R+q
|YLt
R+q
|YLt =
1 |Uq
t |
t
k′∈K p(yi = k′|YLt, yq = k)
13 / 22
Risk of q : The expected 0/1 error once added to Lt. Denoted by R+q
|YLt .
EEM selects the query q that minimizes this risk. q∗ = arg min
q∈Ut
R+q
|YLt
R+q
|YLt =
1 |Uq
t |
t
k′∈K p(yi = k′|YLt, yq = k)
14 / 22
15 / 22
Simplified GCN [3] : Removes non-linearities of GCNs to
15 / 22
Simplified GCN [3] : Removes non-linearities of GCNs to
Set p(yj = k|YL) = σ(˜ xjWYL)(k)
15 / 22
Simplified GCN [3] : Removes non-linearities of GCNs to
Set p(yj = k|YL) = σ(˜ xjWYL)(k) GEEM : R+q
|YLt =
1 |U−q
t
|
(1−max
k′∈K σ(˜
xiWLt,+q,yk)(k′))σ(˜ xqWYLt )(k)
16 / 22
Simplified GCN [3] : Removes non-linearities of GCNs to
Set p(yj = k|YL) = σ(˜ xjWYL)(k) GEEM : R+q
|YLt =
1 |U−q
t
|
(1−max
k′∈K σ(˜
xiWLt,+q,yk)(k′))σ(˜ xqWYLt )(k)
17 / 22
18 / 22
hyperparameters are fine-tuned. 20 40 60 Number of nodes in labeled set 40 50 60 70 80 Accuracy AGE AGE non optimized ANRMAB GEEM* Random
19 / 22
Amazon-photo. GEEM significantly outperforms GCN-based methods. 10 20 30 40 50 Number of nodes in labeled set 40 60 80 Accuracy
20 / 22
The proposed GEEM algorithm:
21 / 22
The proposed GEEM algorithm: Offers SOTA performance.
21 / 22
The proposed GEEM algorithm: Offers SOTA performance. Does not rely on validation set → More realistic scenario.
21 / 22
The proposed GEEM algorithm: Offers SOTA performance. Does not rely on validation set → More realistic scenario. Additional contributions :
21 / 22
The proposed GEEM algorithm: Offers SOTA performance. Does not rely on validation set → More realistic scenario. Additional contributions : Combined GEEM : Hybrid mixed with LP covers more cases.
21 / 22
The proposed GEEM algorithm: Offers SOTA performance. Does not rely on validation set → More realistic scenario. Additional contributions : Combined GEEM : Hybrid mixed with LP covers more cases. Preemptive GEEM (PreGEEM) : Take advantage of oracle delay with approximations.
21 / 22
The proposed GEEM algorithm: Offers SOTA performance. Does not rely on validation set → More realistic scenario. Additional contributions : Combined GEEM : Hybrid mixed with LP covers more cases. Preemptive GEEM (PreGEEM) : Take advantage of oracle delay with approximations. → Provide bounds on the approximation error.
21 / 22
[1] H. Cai, V. W. Zheng, and K. C. Chang, “Active learning for graph embedding,” arXiv preprint arXiv:1705.05085, 2017. [2] L. Gao, H. Yang, C. Zhou, J. Wu, S. Pan, and Y. Hu, “Active discriminative network representation learning,” in Proc. Int. Joint Conf. Artificial Intell., 2018, pp. 2142–2148. [3] F. Wu, A. Souza, T. Zhang, C. Fifty, T. Yu, and K. Weinberger, “Simplifying graph convolutional networks,” in Proc. Int. Conf. Machine Learning, Long Beach, California, USA, Jun. 2019, pp. 6861–6871.
22 / 22