Making patterns show themselves Calin Constantinov Honest Agenda - - PowerPoint PPT Presentation

making patterns show themselves
SMART_READER_LITE
LIVE PREVIEW

Making patterns show themselves Calin Constantinov Honest Agenda - - PowerPoint PPT Presentation

Making patterns show themselves Calin Constantinov Honest Agenda 1. Shameless insufferable bragging 2. Needlessly confusing theoretical stuff 3. Disappointingly simple scenario for Facebook check-ins 4. Disappointingly simple scenario, but


slide-1
SLIDE 1

Making patterns show themselves

Calin Constantinov

slide-2
SLIDE 2

Honest Agenda

  • 1. Shameless insufferable bragging
  • 2. Needlessly confusing theoretical stuff
  • 3. Disappointingly simple scenario for Facebook check-ins
  • 4. Disappointingly simple scenario, but over LinkedIn skills,

endorsements and companies

  • 5. Reputation system with no real-world applicability
  • 6. Marginally interesting SocialRank and RelationshipStrength

algorithm for Facebook interactions

slide-3
SLIDE 3

About me

slide-4
SLIDE 4

MY USE-CASES

slide-5
SLIDE 5

Mathematical background Can you find the the mistake?

slide-6
SLIDE 6

Intermission …remember that this is all purely theoretical…

slide-7
SLIDE 7

PLACE-TO-BE

slide-8
SLIDE 8

Check-ins in Craiova Wanna go out tonight?

Back in 2013, Facebook data from 10 users and their friends was mined. The final dataset consisted of 21981 users, 48051 check-ins, 549 places and 76 categories, all linked by 392607 relationships. (7% of all check-ins ever placed in Craiova were captured!) Yes, this was before Cambridge Analytica.

slide-9
SLIDE 9

Popular places Pub crawl!

Most popular places, by number of visitors.

slide-10
SLIDE 10

Places where people return They keep coming back for more!

Most popular places, by the percentage of visitors that have returned at least once.

slide-11
SLIDE 11

Places visited by friends We're social people (at least on Facebook)

Places a given user hasn’t visited but are most commonly visited by users that are most commonly visiting places with the given user.

slide-12
SLIDE 12

Similar places Bear with me…

Similar places with a given place based on the number of common categories and largest number

  • f users commonly visiting both places.
slide-13
SLIDE 13

Highly-Available Neo4j Heterogeneous Load Balanced Cluster tl;dr

All read times reasonably fall within a “real-time” constraint. …a long time ago in a galaxy far, far away, when Neo4j v2.0.x was the crème de la crème…

slide-14
SLIDE 14

LOCALTALENT

slide-15
SLIDE 15

The graph model

The dataset: 206 complete profiles (2044 total), 275 active jobs (775 total), 361 companies 991 skills, 19421 endorsements, 89 educational institutions.

This is so META!

slide-16
SLIDE 16

Biggest companies

Top 15 companies by number of active jobs.

Size Matters!

slide-17
SLIDE 17

Loyal employees #relationshipgoals

Top 15 companies by average time an employee has a position in the company (in months).

slide-18
SLIDE 18

Employee leaves Time for breakup songs!

Top 10 leaves from one company to another.

slide-19
SLIDE 19

Active jobs Soooo many n00bs!

A view on the distribution of the active jobs.

slide-20
SLIDE 20

Showcased skills

Number of profiles displaying one of the top 20 displayed skills.

Who doesn’t like a show-off?

slide-21
SLIDE 21

Endorsements She didn’t endorse me back :(

Percentage distribution for top 20 endorsed skills.

slide-22
SLIDE 22

Wide-range and niche companies Finding the perfect job for your hipster-esque coding needs

Percentage distribution for top 3 endorsed skills for selected companies.

slide-23
SLIDE 23

MOVIERATE

slide-24
SLIDE 24

A Custom Social Reputation Engine for Troll Detection …again with the boring math stuff…

slide-25
SLIDE 25

Behavior Modelling Who's feeling naughty?

slide-26
SLIDE 26

Penalizing Malicious Behavior Imagine “Never Gonna Give You Up” playing in the background

9% of all users were trolls.

slide-27
SLIDE 27

System Scalability //TODO: find super awesome pun

This increase is rather steep. However:

  • Execution times were still very low.
  • A single commodity machine was used.
  • Computations were always done in full.
slide-28
SLIDE 28

INFLUENCERS

slide-29
SLIDE 29

When to post #WokeUpLikeThis

slide-30
SLIDE 30

Identifying holiday seasons Social-validation seeking, attention-wh**ing

slide-31
SLIDE 31

My community Who’s got swag?

slide-32
SLIDE 32

My closest friends We go way back, like spinal cords and car seats!

slide-33
SLIDE 33

(calin:IncredibleGraphExp xpert)-[: [:AN ANSWERS] S]->( >(an anyQu Ques estio ion)

slide-34
SLIDE 34

Hunger Games Questions for “Making Patterns Show Themselves”

  • 1. Easy: Which day is Graph Celebration Day?
  • a. Oct 31
  • b. Sep 13
  • c. Apr 15
  • 2. Medium: What is the name of the format used to style the

Neo4j Browser Visualization?

  • 3. Hard: When was load-balancing included in Bolt?
  • a. Neo4j v3.1
  • b. Neo4j v3.0
  • c. Neo4j v3.3

Answer here: r.neo4j.com/hunger-games