webs of trust in distributed environments
play

Webs of Trust in Distributed Environments Bringing Trust to Email - PowerPoint PPT Presentation

Webs of Trust in Distributed Environments Bringing Trust to Email Communication BSc. Presentation - Info-Lunch, 03.11.2004 Fighting Spam Hmmm, tasty!! Spamassassin Program for filtering unwanted Email messages Classifies Emails with


  1. Webs of Trust in Distributed Environments Bringing Trust to Email Communication BSc. Presentation - Info-Lunch, 03.11.2004

  2. Fighting Spam

  3. Hmmm, tasty!!

  4. Spamassassin • Program for filtering unwanted Email messages • Classifies Emails with scores as Spam or non-Spam • Written in Perl and extensible

  5. Email Online Tests AutoWhitelist Content Tests Score

  6. Tests • Header and text analysis: scanning for invalid headers, bad words (”Porn”) etc. • Bayesian filtering: words or short sentences that often appear - filter “learns” • DNS Blocklists: connections from a listed server are rejected • Collaborative filtering databases: DCC, Razor

  7. AutoWhitelist (AWL) • Computes a score based on the history of a sender • Consists of: 1. The sender of an Email 2. The IP of the Email server 3. Number of Emails received from sender 4. Total score for that sender

  8. Scores in the AWL MEAN = TOTAL COUNT FINALSCORE = SCORE + ( MEAN − SCORE ) ∗ FACTOR Example: controller@club4x4.net|ip=82.49 2 37.628 Mean=18.814 Factor=0.5 (default) New Email scores 20 Finalscore = 20 + (18.814-20) * 0.5 = 19.407

  9. Email Score Online Tests AutoWhitelist Content Tests FinalScore

  10. That’s it?

  11. Need for Mailrank False positives in SpamAssassin: an Email is tagged as spam, but it’s actually not Example: Emails from friend’s friends

  12. Emails from friend’s friends Berta Charlotte Albert

  13. The Idea of Mailrank

  14. Information from AWL controller@club4x4.net|ip=82.49 2 37.628 Send Email address, IP , Count, Score to a central server

  15. From PageRank... • informal: “ a page has a high rank if the sum of the ranks of it’s backlinks is high ” • exact: R ′ ( v ) R ′ ( u ) = c ! + cE ( u ) N v v ∈ B u

  16. ... to Mailrank • Given a set of users , that “points” to a N U spam address Spam • The Mailrank is given as: MR ( U ) MR ( Spam ) = c ! N U U Preliminary Version

  17. Using Mailrank Examples • If Mailrank is in the top 20% of all non- Spam Email addresses, add -5 to the Spam score • If Mailrank is in the last 20% of all non- Spam Email addresses, add +10 to the Spam score

  18. Ziegler/Lausen AppleSeed: Spreading Activation • Propagation of energy in a network • Nodes are connected by edges • Directed graph • “Trust Decay”: keep some trust in nodes • Trust sinks: Backward propagation • This is PageRank? No, Edges are weighted

  19. 10 A 0.75 0.25 7.5 2.5 B C 0.75 0.25 0.75 0.25 1.875 0.625 5.625 1.875 D E F G Weights Trustvalues

  20. Guha: Trust/Distrust C B B C B B D C A C A A D A Direct Propagation Co-Citation Transpose Trust Trust Coupling

  21. The Implementation

  22. Design Goals • Flexibility • Abstraction • Simplicity

  23. Overview MRMail MRDataParser MRServerChannel MRMySQLDatabase MRSocketthread MRData MRSocket MRConnectionHandler MRDatabaseHandler

  24. Abstraction: MRData Fields in MRData Command Email address of user Email address of AWL Entry Score of AWL Entry Count of AWL Entry

  25. Mail MRMail MRDataParser MRServerChannel MRMySQLDatabase MRData MRConnectionHandler MRDatabaseHandler

  26. Socket MRDataParser MRServerChannel MRMySQLDatabase MRSocketthread MRData MRSocket MRConnectionHandler MRDatabaseHandler

  27. Demo

  28. What’s next?

  29. Further Work • Develop the algorithm in detail • Get the implementation done • Provide a plug-in for SpamAssassin • Paper (?)

  30. Thanks! Questions?

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend