Networks & Operating Systems SRG, Computer Laboratory
On the Edge of Human-Data Interaction with the
Richard Mortier
DATAB X
01000100 01100001 01110100 01100001 01100010 01111000
DATAB X 01000100 01100001 01110100 01100001 01100010 01111000 - - PowerPoint PPT Presentation
On the Edge of Human-Data Interaction with the DATAB X 01000100 01100001 01110100 01100001 01100010 01111000 Richard Mortier Networks & Operating Systems SRG, Computer Laboratory Living in a Big Data World Challenges and
Networks & Operating Systems SRG, Computer Laboratory
Richard Mortier
01000100 01100001 01110100 01100001 01100010 01111000
http://weputachipinit.tumblr.com/ “It was just a dumb thing. Then we put a chip in it. Now it's a smart thing.” http://bigdatapix.tumblr.com/ “Big Data is visualized in so many ways... all of them blue and with numbers and lens flare.”
Key Challenge: How do we enable data subjects to control collection and exploitation of both their data and data about them?
2
3
your data your data you processors your data data data your data
4
https://www.stickermule.com/marketplace/3442-there-is-no-cloud
by default, we move data to the cloud for processing
your data your data you processors your data data data your data
5
❶ request
permission
❷
processing
❸
6
sources
processors
results
❹
subjects
❺ interac(ons
permission, audit
7
8
9
10
https://flic.kr/p/6thmfN
Capacity to act
collection,
analysis,
enough to want to employ them
11
http://appadvice.com/appnn/2012/04/facebooks-acquisition-of-instagram-just-another-question- mark-for-internet-privacy
Support for dynamics of interaction
analysed about us, and understand how to enact choices over these
and services
12
13
Databox moves code to the data, minimising data release and retaining control over processing
you
your data your data your data your data data your data data
processors
14
15
subject
databox
data app data driver data driver data driver data driver data app driver
processors
app store
Docker containers
platform independence, isolation, and management
16
Arbiter GitHub App Driver Container Manager Proxy
Dashboard User
Core Network CoreUI AppStore
17
subject
data app data driver data driver data driver data driver data app driver
processors
app store
container- manager arbiter
app-netif app-netif driver-netif
core- network
system-netif
lifecycle
middleware layer via provided data stores
by core-network interconnecting separate virtual interfaces
18
controlled by macaroons
data
HTTP solution
19
Temperature driver Solar generation driver Power consumption driver Real-time dashboard app Historical analysis app
store
route
store
route
store
route ZEST (OBSERVE) ZEST (GET)
CoAP/TCP: https://tools.ietf.org/html/draft-ietf-core-coap-tcp-tls-09 0MQ: http://api.zeromq.org/
widely understood
Databox and in my partner’s Databox, or when the green tag is in one Databox and we’re both in the house”
20
21
datastores processors
hue bulbs mobile sensors smart plugs map, reduce filter convert actuate display write to store
22
svg image image parts data x y z
rotate x degrees scale by y/2 fill with colour z
(i,j)
translate to (i,j)
transform
23
1110100 10100011 10101000 01001100 00101010 01001101 01111101 010010 0100011 00101011 11011100 00101010 01110110 01000001 10110001 000101 0011000 00001010 10101111 11011000 11100111 01000111 00111000 000010 1101110 00101110 01000010 01101010 11111100 01101110 11100001 000100 0001001 01111100 01011111 01111110 00010111 11010101 10010000 101011 0110101 11100111 01111101 01100001 00100011 11100101 00010111 111001 0101001 11110011 10100111 11000101 01110011 11100101 00011110 000011 1010011 10000011 00111000 10001111 11101100 11011110 01000100 010101 1001110 00001100 00011000 11011110 11010101 01010101 01001101 001101 1000110 11001111 01000011 01001101 00001111 10010010 00010100 111000 1001010 10010100 00010000 00101110 01100100 00010111 11101011 011100 0110111 11011111 01000100 01100011 01000100 11110101 11110100 001110 1101111 01010000 11000100 11110001 01000010 10010010 11110000 000100 1111011 00000100 10011110 10100010 00100001 01010001 01111101 111100 1110100 00010011 10011011 01001001 00110110 00111100 00010101 001100 1111011 10100110 01001010 11001010 00101110 10011011 01100011 100001 0010000 01000011 01000010 11110100 10011001 10100000 00100110 001001 1110000 11000111 11100011 01100110 10000101 01111110 11011010 101100
24
Big Data Big Data Analytics Small Data aggregate public private traditional centralised cloud
25
Big Data Big Data Analytics Small Data Small Data Analytics aggregate public private traditional centralised cloud exploratory decentralised computation aggregate
26
Online learning Cooperative learning
u1
MS Batch learning
Training data
ML1i+1Inference
d1 di+1
Inference
ML1i+1MP
i+1u1
ML1i+1Inference
d1 di+1
Inference
ML1i+1MP
i+1u3
ML1i+1Inference
d1 di+1
Inference
ML1i+1MP
i+1u2
Goal? Fully distributed inference and learning at scale
ASP SSP BSP pBSP pSSP ASP
Strong consistency Slow iteration rate Fully centralised Weak consistency Fast iteration rate Fully distributed BSP SSP ASP Consistency Completeness PSP
data data subject processors data data data data
27
28
parties implicated in the act of sharing
29
cooperative ensemble
30
31
User Driven Discovery
processors
32
https://flic.kr/p/4o1wLv
Legibility of Data Sources
processors — Recipient design
33
https://flic.kr/p/9AwFd3 https://flic.kr/p/c3jJAY
From My Data to Our Data
and control
34
https://flic.kr/p/drV8zY
Salient Dimensions of Collaboration
purpose — Transitivity
processes, e.g., current status reports
processing or transfer
35
https://flic.kr/p/e57ySb
36
37
https://bit.ly/encyclopedia-hdi http://hdiresearch.org/ https://databoxproject.uk/ https://ocaml.xyz/ https://mort.io/ richard.mortier@cl.cam.ac.uk