Authority and Co-cite, Hub and Co-reference Given the adjacency - - PowerPoint PPT Presentation

authority and co cite hub and co reference
SMART_READER_LITE
LIVE PREVIEW

Authority and Co-cite, Hub and Co-reference Given the adjacency - - PowerPoint PPT Presentation

Authority and Co-cite, Hub and Co-reference Given the adjacency matrix A (with entries 0 or 1) a i = A T Aa i 1 a i = ( A T A ) i a 0 h i = AA T h i 1 h i = ( AA T ) i h 0 Co-citation: the number of pages co-cite P i and P j Co-reference:


slide-1
SLIDE 1

Authority and Co-cite, Hub and Co-reference

Given the adjacency matrix A (with entries 0 or 1) ai = ATAai−1 ai = (ATA)ia0 hi = AAThi−1 hi = (AAT)ih0 Co-citation: the number of pages co-cite Pi and Pj Co-reference: the number of pages co-referenced by Pi and Pj. ATA = D + C where C is the matrix of co-citation and D = diag(d1, d2, · · · , dj) Cij =

  • k

AkiAkj = (ATA)ij Cii = di =

  • k

Aki =

  • k

AkiAki = (ATA)ii ATA = D + C, AAT = O + R

slide-2
SLIDE 2

Authority and Co-cite, Hub and Co-reference

Given the adjacency matrix A (with entries 0 or 1) ai = ATAai−1 ai = (ATA)ia0 hi = AAThi−1 hi = (AAT)ih0 Co-citation: the number of pages co-cite Pi and Pj Co-reference: the number of pages co-referenced by Pi and Pj. ATA = D + C where C is the matrix of co-citation and D = diag(d1, d2, · · · , dj) Cij =

  • k

AkiAkj = (ATA)ij Cii = di =

  • k

Aki =

  • k

AkiAki = (ATA)ii ATA = D + C, AAT = O + R

slide-3
SLIDE 3

Authority and Co-cite, Hub and Co-reference

Given the adjacency matrix A (with entries 0 or 1) ai = ATAai−1 ai = (ATA)ia0 hi = AAThi−1 hi = (AAT)ih0 Co-citation: the number of pages co-cite Pi and Pj Co-reference: the number of pages co-referenced by Pi and Pj. ATA = D + C where C is the matrix of co-citation and D = diag(d1, d2, · · · , dj) Cij =

  • k

AkiAkj = (ATA)ij Cii = di =

  • k

Aki =

  • k

AkiAki = (ATA)ii ATA = D + C, AAT = O + R

slide-4
SLIDE 4

Authority and Co-cite, Hub and Co-reference

Given the adjacency matrix A (with entries 0 or 1) ai = ATAai−1 ai = (ATA)ia0 hi = AAThi−1 hi = (AAT)ih0 Co-citation: the number of pages co-cite Pi and Pj Co-reference: the number of pages co-referenced by Pi and Pj. ATA = D + C where C is the matrix of co-citation and D = diag(d1, d2, · · · , dj) Cij =

  • k

AkiAkj = (ATA)ij Cii = di =

  • k

Aki =

  • k

AkiAki = (ATA)ii ATA = D + C, AAT = O + R

slide-5
SLIDE 5

Probabilistic analysis

Expected value of co-citation/co-reference For a fixed degree sequence random graphs E(Cik) = didk n − 1 E(Rik) =

  • iok

n − 1 The node with large indegree di tend to have large co-citations with other nodes. E(ATA) = E(D) + E(C) = diag(h1, h2, · · · , hn) + ddT/n − 1 where hi ≡ di − d2

i /(n − 1) and d = (d1, d2, · · · , dn)T.

slide-6
SLIDE 6

Probabilistic analysis

Expected value of co-citation/co-reference For a fixed degree sequence random graphs E(Cik) = didk n − 1 E(Rik) =

  • iok

n − 1 The node with large indegree di tend to have large co-citations with other nodes. E(ATA) = E(D) + E(C) = diag(h1, h2, · · · , hn) + ddT/n − 1 where hi ≡ di − d2

i /(n − 1) and d = (d1, d2, · · · , dn)T.

slide-7
SLIDE 7

Spectral Decomposition of Diagonal Plus Rank-1 matrices

Let M = D + ccT, D is a diagonal n × n matrix of the block form: D = diag(τ1I1, τ2I2, · · · , τlIl) where Ik is the identity matrix of size nk, τ1 > τ2 > · · · > τl Then, the eigenvalues of M are given by ˆ τ1 > τ1 = · · · = τ1

  • > ˆ

τ2 > τ2 = · · · = τ2

  • > · · · > ˆ

τl > τl = · · · = τl

  • and the eigenvector of A corresponds to the eigenvalue ˆ

τk is ( cT

1

ˆ τ1 − τ1 , cT

2

ˆ τ2 − τ2 , · · · , cT

l

ˆ τl − τl )T. The eigenvector corresponds to τk is of the form (0 · · · 0, uT

k , 0 · · · 0)T

where uk is a vector of nk satisfying cT

k uk = 0.

slide-8
SLIDE 8

Average Analysis of HITS

E(ATA) = E(D) + E(C) = diag(h1, h2, · · · , hn) + ddT/n − 1 where hi ≡ di − d2

i /(n − 1) and d = (d1, d2, · · · , dn)T.

If h1 > h2 > · · · > hm ≥ hm+1 ≥ · · · ≥ hn, Then, the m largest eigenvalues λi satisfying λ1 > h1 > λ2 > h2 > · · · > λm > hm the corresponding eigenvectors are uk = ( d1 λk − h1 , d2 λk − h2 , · · · , dn λk − hn ) Prerequisite hi − hj = (di − dj)[1 − (di + dj)/(n − 1)] > 0 as long as d1 > · · · > dm > dm+1 ≥ dm+1 ≥ dm+2 · · · ≥ dn and di + dj < n − 1 for ∀i, j

slide-9
SLIDE 9

Average Analysis of HITS

E(ATA) = E(D) + E(C) = diag(h1, h2, · · · , hn) + ddT/n − 1 where hi ≡ di − d2

i /(n − 1) and d = (d1, d2, · · · , dn)T.

If h1 > h2 > · · · > hm ≥ hm+1 ≥ · · · ≥ hn, Then, the m largest eigenvalues λi satisfying λ1 > h1 > λ2 > h2 > · · · > λm > hm the corresponding eigenvectors are uk = ( d1 λk − h1 , d2 λk − h2 , · · · , dn λk − hn ) Prerequisite hi − hj = (di − dj)[1 − (di + dj)/(n − 1)] > 0 as long as d1 > · · · > dm > dm+1 ≥ dm+1 ≥ dm+2 · · · ≥ dn and di + dj < n − 1 for ∀i, j

slide-10
SLIDE 10

Average Analysis of HITS

E(ATA) = E(D) + E(C) = diag(h1, h2, · · · , hn) + ddT/n − 1 where hi ≡ di − d2

i /(n − 1) and d = (d1, d2, · · · , dn)T.

If h1 > h2 > · · · > hm ≥ hm+1 ≥ · · · ≥ hn, Then, the m largest eigenvalues λi satisfying λ1 > h1 > λ2 > h2 > · · · > λm > hm the corresponding eigenvectors are uk = ( d1 λk − h1 , d2 λk − h2 , · · · , dn λk − hn ) Prerequisite hi − hj = (di − dj)[1 − (di + dj)/(n − 1)] > 0 as long as d1 > · · · > dm > dm+1 ≥ dm+1 ≥ dm+2 · · · ≥ dn and di + dj < n − 1 for ∀i, j

slide-11
SLIDE 11

Eigenvectors

slide-12
SLIDE 12

HITS = ranking according to indegrees??

For any i < j u1(i) − u1(j) = di λ1 − hi − dj lambda1 − hj = (di − dj)[λ1 − didj/(n − 1)] (λ1 − hi)(λ − hj) ≥ as λ1 − didj/(n − 1) > hi − didj/(n − 1) = di(1 − di+dj

n−1 ) > 0

What’s the nature of AVERAGE? The authority ranking is, ON AVERAGE, identical to the ranking according to web page indegrees.

slide-13
SLIDE 13

HITS = ranking according to indegrees??

For any i < j u1(i) − u1(j) = di λ1 − hi − dj lambda1 − hj = (di − dj)[λ1 − didj/(n − 1)] (λ1 − hi)(λ − hj) ≥ as λ1 − didj/(n − 1) > hi − didj/(n − 1) = di(1 − di+dj

n−1 ) > 0

What’s the nature of AVERAGE? The authority ranking is, ON AVERAGE, identical to the ranking according to web page indegrees.