Phishing Detection Using Semi-Supervised Methods with New Features - - PowerPoint PPT Presentation

phishing detection using semi supervised methods with new
SMART_READER_LITE
LIVE PREVIEW

Phishing Detection Using Semi-Supervised Methods with New Features - - PowerPoint PPT Presentation

ReDAS Lab Phishing Detection Using Semi-Supervised Methods with New Features Victor Zeng Advisor: Rakesh M. Verma COMPUTER SCIENCE Motivation Phishing is the act of sending fake emails to trick a user into doing something.


slide-1
SLIDE 1

Phishing Detection Using Semi-Supervised Methods with New Features

Victor Zeng Advisor: Rakesh M. Verma

COMPUTER SCIENCE ReDAS Lab

slide-2
SLIDE 2

Motivation

  • Phishing is the act of sending fake emails

to trick a user into doing something.

  • Beachhead for 95% of attacks on

enterprise networks

  • Average cost: $1.6 Million
  • Cannot depend on user to identify

phishing emails

  • Creating labeled training data is expensive

Source: Eitan Katz. Phishing statistics: What every business needs to know, May 2019

slide-3
SLIDE 3

Goal

  • Improve upon the current state-of-the-art

THEMIS model

  • Publish a paper based on my results
slide-4
SLIDE 4

Objectives

  • Identify new features which can be used

for phishing detection

  • Use semi-supervised methods to detect

phishing emails

slide-5
SLIDE 5

Expected Impact

  • Improve performance of phishing

detection methods

  • Decrease the amount of labeled data

required to train phishing detection models

slide-6
SLIDE 6

Deliverables

  • Code + Documentation
  • Poster
  • Report
  • Paper
  • Final Presentation
slide-7
SLIDE 7

Methods: Objective 1

Implement feature in PhishBench 2.0 Perform exploratory analysis on proposed feature Evaluate multi-feature performance with PhishBench 2.0 Evaluate single-feature performance with PhishBench 2.0

slide-8
SLIDE 8

Results: Objective 1

  • Spellcheck ratio feature
  • Statistically different between phish and

legit emails (p-value: 1.512e-22)

  • Random Forest identifies 54% of phish

emails in single feature test

slide-9
SLIDE 9

Conclusions

  • Spellcheck ratio is a promising feature for

phishing detection

slide-10
SLIDE 10

Methods: Objective 2

Extend PhishBench 2.0 to support semi-supervised methods Implement semi-supervised methods in PhishBench 2.0 Evaluate performance of semi-supervised methods against pre-existing supervised methods

slide-11
SLIDE 11

Remaining Work

  • Evaluate features from Statement Analysis
  • Acquire additional datasets
  • Work for Objective 2
slide-12
SLIDE 12

Acknowledgements

The REU project is sponsored by NSF under award NSF-1659755. Special thanks to the following UH

  • ffices for providing financial support to the

project: Department of Computer Science; College

  • f Natural Sciences and Mathematics; Dean of

Graduate and Professional Studies; VP for Research; and the Provost's Office. The views and conclusions contained in this presentation are those of the author and should not be interpreted as necessarily representing the official policies, either expressed or implied, of the sponsors.