Rahul Simha department of computer science George Washington - - PowerPoint PPT Presentation

rahul simha
SMART_READER_LITE
LIVE PREVIEW

Rahul Simha department of computer science George Washington - - PowerPoint PPT Presentation

Big Dating: Computer Science and Relationships Rahul Simha department of computer science George Washington University Office hours: 1pm Tuesdays, 5pm Wednesdays some stats Usage of online da-ng websites : (total registra=ons) OKCupid: 5


slide-1
SLIDE 1

department of computer science

George Washington University Office hours: 1pm Tuesdays, 5pm Wednesdays

Big Dating:

Computer Science and Relationships

Rahul Simha

slide-2
SLIDE 2

some stats Usage of online da-ng websites: (total registra=ons)

  • OKCupid: 5 million
  • Chemistry: 11 million
  • E-harmony: 33 million
  • POF: 40 million
  • Tinder: 50 million
  • Match: 96 million
  • Badoo: 200 million
slide-3
SLIDE 3

your data Exercise 1 Part 1: On the worksheet, write three things about

yourself (without iden=fying yourself) that will help you stand out in an online da=ng site

Part 2: Write down your height in inches but add a fudge

factor of +10 or -10 using the following rule: if you were born in an even-numbered month, add 10. Otherwise subtract 10. Thus, if your height is 67 inches and you were born in March, you would write 57

Follow remaining “snowball” instruc=ons

slide-4
SLIDE 4

more stats Percep-on:

  • 60% of US adults: “online da=ng is a good way

to meet people”

  • 2/3 of online daters have gone on a date with

someone they met online

  • 27% between ages 18-24 have used online

da=ng But … 5% of those in commi[ed rela=onships say they met online

slide-5
SLIDE 5

why the gap? Exercise 2: At your table, come up with three reasons why

  • nline matchups may not lead to commi[ed

rela=onships

slide-6
SLIDE 6

Algorithmic matching:

  • OKCupid
  • EHarmony

Basic ideas:

  • Ask lots of ques=ons
  • Perform some kind of scoring and matching

computer science to the rescue

slide-7
SLIDE 7

High dimensional data Large data size Privacy, security (height example) Algorithm design

computer science issues

slide-8
SLIDE 8

Exercise 3: draw the points (1,2), (2,1), (2,2), (7,6), (8,7), (9,10), (1,9) on paper. How many clusters do they fall into? Exercise 4: how many dimensions are present in the survey data you filled?

high dimensional data

slide-9
SLIDE 9

Exercise 3: draw the points (1,2), (2,1), (2,2), (7,6), (8,7), (9,10), (1,9) on paper. How many clusters do they fall into? Exercise 4: how many dimensions are present in the survey data you filled? Algorithmic challenge: effec=ve clustering of high dimensional data E-Harmony: 29 dimensions

high dimensional data

slide-10
SLIDE 10

Distance measure:

  • Given any two users, compute “how

compa=ble they are” Sort:

  • Sort all users by compa=bility

For every user we now have a sorted list of other users, in order of preference

the scoring problem

slide-11
SLIDE 11

Suppose we need to match people, e.g., H1 H2 H3 with R1 R2 R3 Example matching: H1 H2 H3 R1 R2 R3

the matching problem

slide-12
SLIDE 12

Suppose we know “spousal” preferences:

H1 `s preferences: R2 R1 R3 H2 `s preferences: R2 R3 R1 H3 `s preferences: R2 R1 R3 R1 `s preferences: H1 H2 H3 R2 `s preferences: H3 H1 H2 R3 `s preferences: H2 H1 H3

So H1 would prefer R2 as spouse to R1 and R1 over R3

the matching problem

slide-13
SLIDE 13

Consider

H1 `s preferences: R2 R1 R3 H2 `s preferences: R2 R3 R1 H3 `s preferences: R2 R1 R3 R1 `s preferences: H1 H2 H3 R2 `s preferences: H3 H1 H2 R3 `s preferences: H2 H1 H3

the matching problem

Example matching: H1 H2 H3 R1 R2 R3 Exercise 5: what is the problem with this matching?

slide-14
SLIDE 14

Consider

H1 `s preferences: R2 R1 R3 H2 `s preferences: R2 R3 R1 H3 `s preferences: R2 R1 R3 R1 `s preferences: H1 H2 H3 R2 `s preferences: H3 H1 H2 R3 `s preferences: H2 H1 H3

the matching problem

Example matching: H1 H2 H3 R1 R2 R3 H3 and R2 will elope! Algorithmic challenge: devise an algorithm to create a stable matching

slide-15
SLIDE 15
  • 1. Ini=ally place all H’s in unmarried-list
  • 2. while unmarried-list is not empty
  • 3. Hi = lowest numbered from list
  • 4. try R’s in order of Hi’s preference
  • 5. if Rj is not matched, match Hi and Rj
  • 6. else if Rj prefers Hi to current match then
  • 7. match Rj with Hi
  • 8. return current match to unmarried list

Can prove: provides a stable matching Exercise 6: do the H’s or R’s get the best deal?

the proposal algorithm

slide-16
SLIDE 16

Demo Prac-cal applica-ons?

the proposal algorithm

slide-17
SLIDE 17

Demo Prac-cal applica-ons:

  • Med school internships
  • Clerkships with judges

the proposal algorithm

slide-18
SLIDE 18

Exercise 7: Go to h[ps://oracleomacon.org/ and enter actors in two movies YOU have seen. Try to find two actors with a distance of 4.

the (network) structure of rela-onships

slide-19
SLIDE 19

The Milgram experiment Demo

the (network) structure of rela-onships

slide-20
SLIDE 20

The Milgram experiment Demo Facebook: 4.57 (among 1.5b users) Da=ng app based on “who knows who”: Hinge

the (network) structure of rela-onships

slide-21
SLIDE 21

Exercise 8: Choose between

  • 1. Go to h[p://www.masswerk.at/elizabot/ and

converse with Eliza.

  • 2. Talk to Siri and record the exchange on paper.
  • 3. Volunteer as judge.

Compe-tors: your conversa=on must be short (two back-and-forths) Judges: pick the best conversa=on. Remember Robert Epstein?

talk to a bot

slide-22
SLIDE 22

Exercise 9: Choose between

  • 1. Yes, it’s fine for humans to marry robots in

the future.

  • 2. No, that should never be allowed.

Write down your reasons on the worksheet

  • ur future with robots
slide-23
SLIDE 23

summary

Computer science under the hood:

  • Programming of websites (for online da=ng)
  • Servers, networks, clouds, large data:

Ø Example: 25 TB of data at E-Harmony, incl. 200+ million images

  • Algorithms for matching, graph structure
  • Algorithms for clustering, machine learning,

natural languages

  • Robo=cs / AI