slide-1
SLIDE 1

A Semi-Supervised Bayesian Network Model for Microblog Topic Classification

Yan Chen 1,2  Zhoujun Li 1  Liqiang Nie 2  Xia Hu 3  Xiangyu Wang 2  Tat-seng Chua 2  Xiaoming Zhang 1

1 State Key Laboratory of Software Development Environment, Beihang University, China
2 School of Computing, National University of Singapore, Singapore
3 Arizona State University, United States

11-12-2012

Yan Chen (Beihang University) COLING 2012 11-12-2012 1 / 32

slide-2
SLIDE 2

Outline

1 Background and Motivation
2 Related Work
3 Semi-Supervised Graphical Model
    The General Framework
    Probabilistic Graph Model Construction
    Parameter Inference
4 Experiments
    Experimental Settings
    Analysis
    Parameter Analysis
5 Conclusion and Future Work


slide-3
SLIDE 3

Background and Motivation



slide-7
SLIDE 7

Background and Motivation

Background

Microblogging services are becoming immensely popular for disseminating breaking news, sharing information, and participating in events. The best-known service is Twitter, which as of March 2012 had more than 140 million active users posting 1 billion tweets every 3 days. In China, Weibo (www.weibo.com) has accumulated more than 300 million users in less than three years, and more than 1,000 Chinese tweets are posted on Weibo every second. Given this large volume of multi-aspect messages, how do users locate the specific messages they are interested in?


slide-8
SLIDE 8

Background and Motivation

Motivation

Example 1: Query


slide-9
SLIDE 9

Background and Motivation

Motivation

Example 1:

DBS Bank

slide-11
SLIDE 11

Background and Motivation

Motivation

Example 1:

DBS Car

slide-12
SLIDE 12

Background and Motivation

Motivation

Example 2:


slide-16
SLIDE 16

Background and Motivation

Motivation

Example 2: How do we provide users with an overview of search results based on meaningful, structured categories?


slide-17
SLIDE 17

Background and Motivation

Motivation

Example 2: Topic Classification!


slide-18
SLIDE 18

Related Work


slide-19
SLIDE 19

Related Work

Related Work

1 Topic Model based Methods

[Hong and Davison, 2010] employ latent Dirichlet allocation (LDA) [Blei et al., 2003] and the author-topic model [Rosen-Zvi et al., 2010] to automatically discover hidden topic structures on Twitter. Several variants of LDA that incorporate supervision have been proposed [Ramage et al., 2009, Ramage et al., 2010] and shown to be competitive with strong baselines in the microblogging environment.

2 Traditional Classification Methods

[Lee et al., 2011] classified tweets into pre-defined categories such as sports, technology, and politics. They constructed word vectors with tf-idf weights and used a Naive Bayes Multinomial classifier. [Sriram et al., 2010] proposed representing short messages with a small set of domain-specific features extracted from the author's profile and text; their method requires extensive pre-processing to conduct effective feature analysis.


slide-21
SLIDE 21

Related Work

Challenges and Contribution

1 Challenges

Sparseness: short messages lack sufficient word co-occurrence or shared context for effective similarity measures [Hu et al., 2009].
Informality: messages do not conform to the standard structure of documents.
Lack of label information: labeling the huge volume of messages is time- and labor-consuming.

2 Contribution

To handle the data sparseness problem, we employ query-related external resources from the Google search engine to enrich the short messages.
To alleviate the negative effect of informal words, we utilize linguistic corpora to detect informal words and correct them.
To reduce the need for labeled data, we adopt a semi-supervised learning approach for the microblog categorization task.


slide-22
SLIDE 22

Semi-Supervised Graphical Model


slide-23
SLIDE 23

Semi-Supervised Graphical Model The General Framework

The General Framework

Figure: The General Framework.


slide-24
SLIDE 24

Semi-Supervised Graphical Model Probabilistic Graph Model Construction

Semi-Supervised Bayesian Network Graph Model

Figure: Probabilistic graphical representation of semi-supervised Bayesian network model.


slide-28
SLIDE 28

Semi-Supervised Graphical Model Parameter Inference

Parameter Inference

The maximum likelihood category label for a given message mi is,

$$y_i = \arg\max_{c_j} P(c_j \mid m_i, \hat\theta, \hat\phi, \hat\theta', \hat\phi') = \frac{P(c_j \mid \hat\theta, \hat\phi, \hat\theta', \hat\phi')\, P(m_i \mid c_j, \hat\theta, \hat\phi, \hat\theta', \hat\phi')}{P(m_i \mid \hat\theta, \hat\phi, \hat\theta', \hat\phi')}$$

where

$$P(c_j \mid \hat\theta, \hat\phi, \hat\theta', \hat\phi') = P(c_j \mid \hat\theta, \hat\phi) = \hat\alpha\, P(c_j \mid \hat\theta) + (1 - \hat\alpha)\, P(c_j \mid \hat\phi)$$

$$P(m_i \mid \hat\theta, \hat\phi, \hat\theta', \hat\phi') = \sum_{c_j} P(c_j \mid \hat\theta, \hat\phi, \hat\theta', \hat\phi')\, P(m_i \mid c_j, \hat\theta, \hat\phi, \hat\theta', \hat\phi')$$

$$P(m_i \mid c_j, \hat\theta, \hat\phi, \hat\theta', \hat\phi') = P(m_i \mid c_j, \hat\theta', \hat\phi') = \prod_{k=1}^{|m_i|} P(w_k \mid c_j, \hat\theta', \hat\phi') = \prod_{k=1}^{|m_i|} \left\{ \beta\, P(w_k \mid c_j, \hat\theta') + (1 - \beta)\, P(w_k \mid c_j, \hat\phi') \right\}$$
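This decision rule can be sketched in Python as follows. This is a minimal illustration, not the paper's implementation: all function and variable names are ours, the score is computed in log space to avoid underflow, and the denominator is dropped since it is constant over categories.

```python
import math

def classify(message_words, categories, p_theta, p_phi,
             p_theta_w, p_phi_w, alpha, beta):
    """Maximum-likelihood category for a tokenized message.

    p_theta / p_phi: category priors estimated from the microblog
    collection and from external resources; p_theta_w / p_phi_w:
    per-category word distributions from the data collection and
    from search results. All names are illustrative.
    """
    best, best_score = None, float("-inf")
    for c in categories:
        # interpolated prior: alpha*P(c|theta) + (1-alpha)*P(c|phi)
        score = math.log(alpha * p_theta[c] + (1 - alpha) * p_phi[c])
        for w in message_words:
            # interpolated word likelihood:
            # beta*P(w|c,theta') + (1-beta)*P(w|c,phi')
            score += math.log(beta * p_theta_w[c].get(w, 1e-12)
                              + (1 - beta) * p_phi_w[c].get(w, 1e-12))
        if score > best_score:
            best, best_score = c, score
    return best
```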


slide-31
SLIDE 31

Semi-Supervised Graphical Model Parameter Inference

Estimating

1 Estimating θ:

$$\hat\theta_{c_j} \equiv P(c_j \mid \hat\theta) = \frac{1 + \sum_{i=1}^{|M|} \Lambda(i)\, P(y_i = c_j \mid m_i)}{|C| + |M_l| + \lambda |M_u|} \qquad (1)$$

2 Estimating φ:

$$\hat\phi_{c_j} \equiv P(c_j \mid \hat\phi) = \frac{\frac{1}{NGD(t, c_j)} + \mu}{\sum_{j=1}^{|C|} \frac{1}{NGD(t, c_j)} + |C|\,\mu} \qquad (2)$$

3 Estimating θ′ and φ′:

$$\hat\theta'^{\,w_k}_{c_j} \equiv P(w_k \mid c_j, \hat\theta') = \frac{n_d^{w_k c_j} + \eta_d}{\sum_{p'=1}^{|N|} n_d^{w_{p'} c_j} + |N|\,\eta_d} \qquad (3)$$

$$\hat\phi'^{\,w_k}_{c_j} \equiv P(w_k \mid c_j, \hat\phi') = \frac{n_g^{w_k c_j} + \eta_g}{\sum_{q'=1}^{|N|} n_g^{w_{q'} c_j} + |N|\,\eta_g} \qquad (4)$$
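Equations (3) and (4) are additive-smoothing estimates over per-category word counts. A minimal sketch (names are illustrative; counts come from the data collection for θ′ or from search results for φ′):

```python
def estimate_word_dist(counts, vocab_size, eta):
    """Smoothed category-word distribution, as in Eqs. (3)-(4):
    P(w|c) = (n_c^w + eta) / (sum_w' n_c^w' + |N| * eta),
    where counts maps each seen word to its count for category c
    and vocab_size is |N|. Returns the distribution over seen
    words plus the probability mass assigned to any unseen word.
    """
    denom = sum(counts.values()) + vocab_size * eta
    return {w: (n + eta) / denom for w, n in counts.items()}, eta / denom
```

The pseudocount η keeps every word probability strictly positive, so the product over message words in the classification rule never collapses to zero.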


slide-32
SLIDE 32

Experiments


slide-33
SLIDE 33

Experiments Experimental Settings

Datasets and Evaluation Metrics

Twitter                     Sina Weibo
Total           16935       Total           15811
Sports           2720       Sports           2602
Entertainment    2816       Movies           2694
Business         2912       Games            2605
Science&Tech     2827       Science&Tech     2647
Politics         2937       Politics         2654
Education        2723       Music            2609

Table: The distribution of different categories over two datasets.


slide-34
SLIDE 34

Experiments Experimental Settings

Datasets and Evaluation Metrics


Example messages:

1. Apple, stock → business
   iBenApple, Mon Jan 24 13:50:42 +0000 2011: "#IHateItWhen Apple's stock continue to fall!"

2. Apple, ipad → science
   Kericox3, Tue Feb 01 12:34:55 +0000 2011: "Apple iphone 4g 32gb and blackberry bold 9700 Unlocked. - Anything ...: Apple Tablet iPad 64GB (Wi-Fi + 3G) ....... http://bit.ly/gbbW1J"


slide-35
SLIDE 35

Experiments Experimental Settings

Datasets and Evaluation Metrics


Evaluation metrics: accuracy, precision, recall, and F1.
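The micro- and macro-averaged variants of these metrics can be computed as in the sketch below (our illustration for single-label classification; note that with single-label data, micro-precision, micro-recall, and micro-F1 all coincide with accuracy, which is why the Micro columns in the result tables repeat one value):

```python
from collections import Counter

def micro_macro(y_true, y_pred, categories):
    """Micro- and macro-averaged (precision, recall, F1) for
    single-label multi-class predictions."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1   # predicted category gets a false positive
            fn[t] += 1   # true category gets a false negative

    def prf(tp_, fp_, fn_):
        prec = tp_ / (tp_ + fp_) if tp_ + fp_ else 0.0
        rec = tp_ / (tp_ + fn_) if tp_ + fn_ else 0.0
        f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        return prec, rec, f1

    # micro: pool counts over all categories; macro: average per-category scores
    micro = prf(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    per_cat = [prf(tp[c], fp[c], fn[c]) for c in categories]
    macro = tuple(sum(v) / len(categories) for v in zip(*per_cat))
    return micro, macro
```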


slide-36
SLIDE 36

Experiments Analysis

SSBN Model Performance

Twitter                                   Sina Weibo
Category      Precision Recall  F1        Category      Precision Recall  F1
Sports        0.9322    0.9483  0.9402    Sports        0.9318    0.8747  0.9023
Entertainment 0.9000    0.5625  0.6923    Movies        0.8848    0.8207  0.8515
Business      0.8043    0.5323  0.6382    Games         0.8090    0.9283  0.8646
Science&Tech  0.6937    0.9801  0.8124    Science&Tech  0.8688    0.8323  0.8502
Politics      0.9096    0.9640  0.9360    Politics      0.8661    0.9324  0.8980
Education     0.5000    0.5519  0.5165    Music         0.8819    0.8699  0.8759
Micro-average 0.7979    0.7979  0.7979    Micro-average 0.8798    0.8798  0.8798
Macro-average 0.7934    0.6043  0.6128    Macro-average 0.8737    0.8764  0.8738

Table: Performance of the SSBN model on the two datasets with 5% training data and 95% testing data.


slide-37
SLIDE 37

Experiments Analysis

Baselines

SVM
Naive Bayesian
K Nearest Neighbors
Rocchio
Labeled LDA
Transductive SVM
Semi-Naive Bayesian classifier


slide-38
SLIDE 38

Experiments Analysis

Comparison Performance

Classifier Accuracy MicroP MicroR MicroF1 MacroP MacroR MacroF1
SSBN       0.8875   0.8875 0.8875 0.8875  0.8282 0.7627 0.7845
SVM        0.8670   0.8670 0.8670 0.8670  0.8768 0.7611 0.7860
NB         0.8722   0.8696 0.8722 0.8722  0.8879 0.7329 0.7587
KNN        0.7268   0.7268 0.7268 0.7268  0.6721 0.6471 0.6516
Rocchio    0.8180   0.8204 0.8180 0.8192  0.7361 0.8384 0.7605
L-LDA      0.8605   0.8605 0.8605 0.8605  0.8467 0.7223 0.7532

Table: Performance comparison among SSBN and other supervised baseline methods on twitter with 90% training data.

Classifier Accuracy MicroP MicroR MicroF1 MacroP MacroR MacroF1
SSBN       0.7979   0.7979 0.7979 0.7979  0.7934 0.6043 0.6128
Trans-SVM  0.6707   0.6707 0.6707 0.6707  0.6602 0.5108 0.4491
Semi-NB    0.7156   0.7156 0.7156 0.7156  0.7308 0.5653 0.549

Table: Performance comparison among SSBN and other semi-supervised baseline methods on Twitter with 5% training data.


slide-39
SLIDE 39

Experiments Analysis

Comparison Performance

Classifier Accuracy MicroP MicroR MicroF1 MacroP MacroR MacroF1
SSBN       0.9020   0.9020 0.9020 0.9020  0.8976 0.9045 0.9004
SVM        0.8991   0.8991 0.8991 0.8991  0.9017 0.8971 0.8991
NB         0.9015   0.9015 0.9015 0.9015  0.8990 0.9024 0.9003
KNN        0.8565   0.8565 0.8565 0.8565  0.8589 0.8486 0.8526
Rocchio    0.8802   0.8803 0.8802 0.8802  0.8769 0.8832 0.8781
L-LDA      0.8905   0.8905 0.8905 0.8905  0.8876 0.8989 0.8932

Table: Performance comparison among SSBN and other supervised baseline methods on Sina Weibo with 90% training data.

Classifier Accuracy MicroP MicroR MicroF1 MacroP MacroR MacroF1
SSBN       0.8798   0.8798 0.8798 0.8798  0.8737 0.8764 0.8738
Trans-SVM  0.8084   0.8084 0.8084 0.8084  0.8049 0.8085 0.8052
Semi-NB    0.8198   0.8198 0.8198 0.8198  0.8225 0.8217 0.8204

Table: Performance comparison among SSBN and other semi-supervised baseline methods on Sina Weibo with 5% training data.


slide-40
SLIDE 40

Experiments Analysis

On the Sensitivity of Training Data Size


Figure: Performance sensitivity of training set size on Twitter and Sina Weibo


slide-41
SLIDE 41

Experiments Parameter Analysis

Effect of α

The trade-off parameter α balances the effects of the two kinds of category-level prior knowledge: the microblog data collection and the external resources.

Figure: The Performance with varying α and training data size when other parameters are fixed.
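The α-interpolation itself is a one-line convex combination; a minimal illustration (helper name is ours, not the paper's):

```python
def interpolated_prior(alpha, p_data, p_ext):
    """Blend the two category-level priors:
    P(c) = alpha * P(c|theta) + (1 - alpha) * P(c|phi)."""
    return {c: alpha * p_data[c] + (1 - alpha) * p_ext[c] for c in p_data}
```

Since both inputs sum to 1, the blend remains a proper distribution for any α in [0, 1]; in practice α can be tuned on a held-out split, as in the sensitivity analysis shown here.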


slide-42
SLIDE 42

Experiments Parameter Analysis

Effect of β

There are two category-word distributions, θ′ and φ′, generated respectively from our data collection and from Google search results; the parameter β adjusts the contribution of these two resources at the category-word level.

Figure: The Performance with varying β and training data size when other parameters are fixed.


slide-43
SLIDE 43

Experiments Parameter Analysis

Effect of λ

The parameter λ, ranging between 0 and 1, controls the contribution of the unlabeled data points.

Figure: The Performance with varying λ and training data size when other parameters are fixed.
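The role of λ in the prior estimate of Eq. (1) can be illustrated as follows: labeled messages contribute with full weight (Λ(i) = 1), while unlabeled messages contribute their posterior down-weighted by λ (Λ(i) = λ). This is our sketch under assumed data structures, not the paper's code:

```python
def estimate_category_prior(labeled, unlabeled_posteriors, categories, lam):
    """Eq. (1)-style smoothed category prior: add-one smoothing in the
    numerator, |C| + |M_l| + lambda*|M_u| in the denominator."""
    num = {c: 1.0 for c in categories}            # add-one smoothing
    for c in labeled:                             # hard labels, weight 1
        num[c] += 1.0
    for post in unlabeled_posteriors:             # soft posteriors, weight lam
        for c, p in post.items():
            num[c] += lam * p
    denom = len(categories) + len(labeled) + lam * len(unlabeled_posteriors)
    return {c: n / denom for c, n in num.items()}
```

Setting λ = 0 recovers the purely supervised estimate; λ = 1 treats soft posteriors on unlabeled data as full observations.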


slide-44
SLIDE 44

Conclusion and Future Work



slide-46
SLIDE 46

Conclusion and Future Work

1 Conclusion

Incorporating external resources to supplement the short microblogs compensates well for the data sparseness issue.
The semi-supervised classifier seamlessly fuses the labeled data structure and external resources into the training process, reducing the need for manual labeling to a certain degree.
We model the category probability of a given message based on the category-word distribution, which avoids the difficulty brought about by the spelling errors common in microblog messages.

2 Future Work

Incorporating social network structure may improve the performance of microblog classification.
External resources such as Wikipedia and WordNet might be valuable for understanding microblog messages.
Providing category summarization can help organize microblog messages.


slide-47
SLIDE 47

Conclusion and Future Work

References I

Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent Dirichlet allocation. Journal of Machine Learning Research, 3:993–1022.

Hong, L. and Davison, B. D. (2010). Empirical study of topic modeling in Twitter. In Proceedings of the KDD Workshop on Social Media Analytics.

Hu, X., Sun, N., Zhang, C., and Chua, T.-S. (2009). Exploiting internal and external semantics for the clustering of short texts using world knowledge. In Proceedings of the ACM Conference on Information and Knowledge Management.

Lee, K., Palsetia, D., Narayanan, R., Patwary, M. M. A., Agrawal, A., and Choudhary, A. (2011). Twitter trending topic classification. In Proceedings of the ICDM Workshop on Optimization Based Methods for Emerging Data Mining Problems.

Ramage, D., Dumais, S., and Liebling, D. (2010). Characterizing microblogs with topic models. In Proceedings of the International AAAI Conference on Weblogs and Social Media.


slide-48
SLIDE 48

Conclusion and Future Work

References II

Ramage, D., Hall, D., Nallapati, R., and Manning, C. D. (2009). Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In Proceedings of the Conference on Empirical Methods in Natural Language Processing.

Rosen-Zvi, M., Chemudugunta, C., Griffiths, T., Smyth, P., and Steyvers, M. (2010). Learning author-topic models from text corpora. ACM Transactions on Information Systems, 28:1–38.

Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., and Demirbas, M. (2010). Short text classification in Twitter to improve information filtering. In Proceedings of the Annual ACM Conference on Research and Development in Information Retrieval.


slide-49
SLIDE 49

Conclusion and Future Work

Thank you!
