SLIDE 1
Identifying Intention Posts in Discussion Forums Meichun Hsu - - PowerPoint PPT Presentation
Identifying Intention Posts in Discussion Forums Meichun Hsu - - PowerPoint PPT Presentation
Identifying Intention Posts in Discussion Forums Meichun Hsu Zhiyuan (Brett) Chen Malu Castellanos Bing Liu Riddhiman Ghosh What is Intention? Example Hello, I am going to buy a new high-end gaming laptop my budget is below than 1500$ ram
SLIDE 2
SLIDE 3
Example
Hello, I am going to buy a new high-end gaming laptop my budget is below than 1500$ ram should be more than 6gb,graphics card must be more than 2gb and the processor should be intel core i7 3rd generations or better.
SLIDE 4
Explicit Intention
I plan to buy this book. I am looking for a new car. I am going to travel to Atlanta.
SLIDE 5
Implicit Intention
Anyone knows the battery life of iPhone? What are the components in this laptop?
SLIDE 6
Identifying Intention
A totally NEW problem Explicit intention Applications like advertisement
SLIDE 7
Problem Definition
SLIDE 8
Two-Class Post Classification Transfer Learning: Use labeled
data from other domains (source domains) to classify target domain
SLIDE 9
SLIDE 10
The ways to express an intention are similar in different domains.
SLIDE 11
Motivation Examples
I want to buy a car. I want to buy a camera. I want to buy the tickets.
SLIDE 12
Special Difficulties
SLIDE 13
Noise
I read many reviews of two Canon models which I'm considering for purchase, the Canon PowerShot S2 IS and the Canon PowerShot S3 IS. This is my second digital camera, my first being a Kodak EasyShare from about 4 years ago. …
SLIDE 14
Imbalanced Shared Features
SLIDE 15
EM Algorithm with NB
(Nigam et al., 2000)
E-step M-step Naïve Bayes
SLIDE 16
Proposed Models
FS-EM Co-Class
SLIDE 17
FS-EM
Incorporates feature selection into EM iterations Selects features from both labeled source (domain) data and unlabeled target (domain) data
SLIDE 18
SLIDE 19
Co-Class
Builds two classifiers based on FS-EM Solves the imbalanced shared feature problem
SLIDE 20
SLIDE 21
Features & Feature Selection
N-grams
SLIDE 22
Evaluation
SLIDE 23
Datasets
4 Domains from different forums (http://www.cs.uic.edu/~zchen/) Human annotation Cross-validation
SLIDE 24
Evaluation Measures
SLIDE 25
Supervised Learning (One Domain)
SLIDE 26
Model Comparisons
3TR-1TE (Aue & Gamon, 2005) EM (Nigam, et al., 2000) ANB (Tan et al., 2009) FS-EM1 FS-EM2 Co-Class
SLIDE 27
Model Comparisons
SLIDE 28
Effects of Source Domains
SLIDE 29
Conclusions
Novel problem of identifying intention Suitable for transfer learning Two special difficulties Effectiveness of Co-Class
SLIDE 30
Future Directions
Sentence-level classification Extract intention components
SLIDE 31
Q & A
SLIDE 32