Webs of Trust in Distributed Environments Bringing Trust to Email - - PowerPoint PPT Presentation

webs of trust in distributed environments
SMART_READER_LITE
LIVE PREVIEW

Webs of Trust in Distributed Environments Bringing Trust to Email - - PowerPoint PPT Presentation

Webs of Trust in Distributed Environments Bringing Trust to Email Communication BSc. Presentation - Info-Lunch, 03.11.2004 Fighting Spam Hmmm, tasty!! Spamassassin Program for filtering unwanted Email messages Classifies Emails with


slide-1
SLIDE 1

Webs of Trust in Distributed Environments

Bringing Trust to Email Communication

  • BSc. Presentation - Info-Lunch, 03.11.2004
slide-2
SLIDE 2

Fighting Spam

slide-3
SLIDE 3

Hmmm, tasty!!

slide-4
SLIDE 4

Spamassassin

  • Program for filtering unwanted Email

messages

  • Classifies Emails with scores as Spam or

non-Spam

  • Written in Perl and extensible
slide-5
SLIDE 5

Email

Content Tests Online Tests AutoWhitelist

Score

slide-6
SLIDE 6

Tests

  • Header and text analysis: scanning for invalid

headers, bad words (”Porn”) etc.

  • Bayesian filtering: words or short sentences that
  • ften appear - filter “learns”
  • DNS Blocklists: connections from a listed server

are rejected

  • Collaborative filtering databases: DCC, Razor
slide-7
SLIDE 7

AutoWhitelist (AWL)

  • Computes a score based on the history of

a sender

  • Consists of:
  • 1. The sender of an Email
  • 2. The IP of the Email server
  • 3. Number of Emails received from sender
  • 4. Total score for that sender
slide-8
SLIDE 8

Scores in the AWL

MEAN = TOTAL COUNT FINALSCORE = SCORE + (MEAN −SCORE) ∗FACTOR

Example:

controller@club4x4.net|ip=82.49 2 37.628

New Email scores 20 Mean=18.814

Finalscore = 20 + (18.814-20) * 0.5 = 19.407

Factor=0.5 (default)

slide-9
SLIDE 9

Email

Content Tests Online Tests AutoWhitelist

FinalScore Score

slide-10
SLIDE 10

That’s it?

slide-11
SLIDE 11

Need for Mailrank

False positives in SpamAssassin: an Email is tagged as spam, but it’s actually not Example: Emails from friend’s friends

slide-12
SLIDE 12

Emails from friend’s friends

Albert Berta Charlotte

slide-13
SLIDE 13

The Idea of Mailrank

slide-14
SLIDE 14

Information from AWL

controller@club4x4.net|ip=82.49 2 37.628

Send Email address, IP , Count, Score to a central server

slide-15
SLIDE 15

From PageRank...

  • informal: “a page has a high rank if the sum of

the ranks of it’s backlinks is high”

  • exact:

R′(u) = c !

v∈Bu

R′(v) Nv + cE(u)

slide-16
SLIDE 16

... to Mailrank

  • Given a set of users , that “points” to a

spam address Spam

  • The Mailrank is given as:

MR(Spam) = c!

U

MR(U) NU

NU

Preliminary Version

slide-17
SLIDE 17

Using Mailrank

  • If Mailrank is in the top 20% of all non-

Spam Email addresses, add -5 to the Spam score

  • If Mailrank is in the last 20% of all non-

Spam Email addresses, add +10 to the Spam score

Examples

slide-18
SLIDE 18

Ziegler/Lausen AppleSeed: Spreading Activation

  • Propagation of energy in a network
  • Nodes are connected by edges
  • Directed graph
  • “Trust Decay”: keep some trust in nodes
  • Trust sinks: Backward propagation
  • This is PageRank? No, Edges are weighted
slide-19
SLIDE 19

A B C

10 0.25 0.75 7.5 2.5

D E F G

0.75 0.75 0.25 0.25 1.875 5.625 1.875 0.625

Weights Trustvalues

slide-20
SLIDE 20

Guha: Trust/Distrust

A B C

Direct Propagation

A B C

Co-Citation

D A B C

Transpose Trust

A B D

Trust Coupling

C

slide-21
SLIDE 21

The Implementation

slide-22
SLIDE 22

Design Goals

  • Flexibility
  • Abstraction
  • Simplicity
slide-23
SLIDE 23

Overview

MRServerChannel MRMail MRSocketthread MRSocket MRDataParser MRMySQLDatabase

MRConnectionHandler MRDatabaseHandler

MRData

slide-24
SLIDE 24

Abstraction: MRData

Fields in MRData

Command Email address of user Email address of AWL Entry Score of AWL Entry Count of AWL Entry

slide-25
SLIDE 25

Mail

MRMail MRServerChannel

MRConnectionHandler MRDatabaseHandler

MRMySQLDatabase MRDataParser MRData

slide-26
SLIDE 26

Socket

MRSocket MRSocketthread

MRConnectionHandler MRDatabaseHandler

MRServerChannel MRMySQLDatabase MRDataParser MRData

slide-27
SLIDE 27

Demo

slide-28
SLIDE 28

What’s next?

slide-29
SLIDE 29

Further Work

  • Develop the algorithm in detail
  • Get the implementation done
  • Provide a plug-in for SpamAssassin
  • Paper (?)
slide-30
SLIDE 30

Thanks! Questions?