Collaboration insights from data access analytics "Follow the data"
Ravi Krishnaswamy Autodesk Inc.
How Valuable is a Network?
Reed: the utility of large networks, particularly social networks, can scale exponentially with the size of the network
[Diagram, built up over several slides: Bob, Scott, and Mary (Desktop users), Yan (a Mobile user), and Joe and John (web users) are connected through the files they save/export and reference. The chain of related files linking Scott, Mary, Yan, Bob, Joe, and John is the "Lineage".]
[Diagram: a save takes the file fingerprint from EF8A09D to D9A22B.]
Log item: (anonymized-user-id, platform, file-operation, hash-before, hash-after, time)
(u88, 'desktop-win', 'save', 'EF8A09D', 'D9A22B', 9320031)
(u89, 'mobile-ios', 'open', 'D9A22B', 'D9A22B', 10311299)
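The log item layout above can be turned into graph edges in a few lines. The following is a minimal Python sketch, not the author's code: the identifiers `LogItem` and `to_edges` are hypothetical, and linking a user to the fingerprint the operation ends on is an assumption made for illustration.

```python
from collections import namedtuple

# Field order follows the log item layout on the slide.
# The Python names themselves are assumptions made for this sketch.
LogItem = namedtuple(
    "LogItem", "user platform operation hash_before hash_after time"
)

log = [
    LogItem("u88", "desktop-win", "save", "EF8A09D", "D9A22B", 9320031),
    LogItem("u89", "mobile-ios", "open", "D9A22B", "D9A22B", 10311299),
]


def to_edges(items):
    """Return (user_edges, version_edges) derived from log items.

    user_edges:    (user, platform, fingerprint) for each access
    version_edges: (hash_before, hash_after) whenever the content changed
    """
    user_edges = set()
    version_edges = set()
    for item in items:
        # Assumption: a user is linked to the fingerprint the operation ends on.
        user_edges.add((item.user, item.platform, item.hash_after))
        if item.hash_before != item.hash_after:
            version_edges.add((item.hash_before, item.hash_after))
    return user_edges, version_edges


user_edges, version_edges = to_edges(log)
```

With the two sample items, the save contributes one version edge (EF8A09D to D9A22B), and both users end up linked to fingerprint D9A22B, which is what ties User88 and User89 together.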
[Diagram: User88 saves EF8A09D as D9A22B; User89 opens D9A22B.]
Distinct users and platforms that share data
User    Platform     Operation  Hash before  Hash after
User88  Desktop-win  Save       EF8A09D      D9A22B
User89  iOS          Open       D9A22B       n/a
Pipeline components:
Mixpanel
Query / Extract to CSV
Spark / Qubole
Bulk Import to Neo4j
Layout and Visualization Tool (Gephi)
Query / Output Query Results to GraphML / CSV
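As an illustration of the extract-to-CSV and bulk-import steps, here is a hedged Python sketch that writes node and relationship CSVs from log items. The slides do not say which import mechanism was used; this sketch assumes the header convention of Neo4j's bulk import tool (`:ID`, `:LABEL`, `:START_ID`, `:END_ID`, `:TYPE`). The labels `User` and `Fingerprint`, the function name `write_import_csvs`, and the choice to link each user to the hash-after value are all hypothetical.

```python
import csv
import io


def write_import_csvs(log_items, nodes_out, rels_out):
    """Write node and relationship CSVs for a bulk import (assumed layout)."""
    nodes = csv.writer(nodes_out)
    rels = csv.writer(rels_out)
    nodes.writerow(["id:ID", ":LABEL"])
    rels.writerow([":START_ID", ":END_ID", ":TYPE", "time:long"])
    seen = set()
    for user, platform, operation, hash_before, hash_after, time in log_items:
        # Emit each user and fingerprint node only once.
        for node_id, label in ((user, "User"), (hash_after, "Fingerprint")):
            if node_id not in seen:
                seen.add(node_id)
                nodes.writerow([node_id, label])
        # One relationship per log item, typed by the file operation.
        rels.writerow([user, hash_after, operation.upper(), time])


log_items = [
    ("u88", "desktop-win", "save", "EF8A09D", "D9A22B", 9320031),
    ("u89", "mobile-ios", "open", "D9A22B", "D9A22B", 10311299),
]
nodes_out, rels_out = io.StringIO(), io.StringIO()
write_import_csvs(log_items, nodes_out, rels_out)
node_lines = nodes_out.getvalue().splitlines()
rel_lines = rels_out.getvalue().splitlines()
```

In-memory buffers are used here so the sketch runs without touching the file system; real output would go to files handed to the import step.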
Identify lineages with algo.unionFind()
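The slide relies on the graph database's union-find procedure. To show the idea behind it, here is a plain-Python stand-in, not the Neo4j implementation: fingerprints connected by a (hash-before, hash-after) pair are merged into one set, and each resulting set is a lineage. The function names are hypothetical, and every fingerprint in the usage example other than EF8A09D and D9A22B is made up for illustration.

```python
def find(parent, x):
    """Return the root of x, shortening the path on the way up."""
    parent.setdefault(x, x)
    while parent[x] != x:
        parent[x] = parent[parent[x]]  # path halving
        x = parent[x]
    return x


def union(parent, a, b):
    """Merge the sets containing a and b."""
    root_a, root_b = find(parent, a), find(parent, b)
    if root_a != root_b:
        parent[root_b] = root_a


def lineages(version_pairs):
    """Group fingerprints into lineages from (hash_before, hash_after) pairs."""
    parent = {}
    for before, after in version_pairs:
        union(parent, before, after)
    groups = {}
    for fingerprint in parent:
        groups.setdefault(find(parent, fingerprint), set()).add(fingerprint)
    return list(groups.values())


# Hypothetical pairs: the first comes from the sample log, the rest are invented.
pairs = [("EF8A09D", "D9A22B"), ("D9A22B", "77C1F0"), ("A1B2C3", "D4E5F6")]
result = sorted(sorted(group) for group in lineages(pairs))
```

Here the first two pairs share D9A22B and collapse into one three-version lineage, while the last pair forms a separate lineage of two versions.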
Purple: Fingerprint of specific file version
Chain of purple nodes: Lineages
Size of arrow: Number of accesses to specific fingerprint version
Green: Desktop; Red: Web; Blue: Mobile
[Chart: (%) lineages accessed by more than 1 device, plotted against minimum number of file versions per lineage.]
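The metric in the chart can be sketched as a small function: among lineages with at least a given number of file versions, what percentage were accessed from more than one device? This is a hedged illustration only. Treating the logged platform as the "device" is an assumption, the function and variable names are hypothetical, and the sample data is invented, so the numbers it produces are not results from the talk.

```python
def pct_multi_device(lineage_versions, lineage_platforms, min_versions):
    """Percentage of lineages with >= min_versions versions seen on >1 platform."""
    eligible = [
        lineage_id
        for lineage_id, versions in lineage_versions.items()
        if len(versions) >= min_versions
    ]
    if not eligible:
        return 0.0
    shared = sum(1 for lid in eligible if len(lineage_platforms[lid]) > 1)
    return 100.0 * shared / len(eligible)


# Invented sample data: lineage id -> fingerprints, and lineage id -> platforms.
lineage_versions = {
    "L1": {"EF8A09D", "D9A22B"},
    "L2": {"A1"},
    "L3": {"B1", "B2", "B3"},
}
lineage_platforms = {
    "L1": {"desktop-win", "mobile-ios"},
    "L2": {"web"},
    "L3": {"desktop-win"},
}

at_least_two = pct_multi_device(lineage_versions, lineage_platforms, 2)
at_least_three = pct_multi_device(lineage_versions, lineage_platforms, 3)
```

Sweeping `min_versions` over a range of values would give one point per threshold, which is the shape of the chart described on the slide.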
(a) MERGE (b) SET (c) LOAD CSV
(a) O(n) (b) O(n^2) (c) O(2^n)