CERI 2016, Granada, Spain Injecting Multiple Psychological Features - PowerPoint PPT Presentation

CERI 2016, Granada, Spain Injecting Multiple Psychological Features into Standard Text Summarisers David Losada and Javier Parapar @davidelosada @jparapar IRLab and CITIUS @IRLab_UDC @citiususc Univ. Coruña, Univ. Santiago Spain

Outline 1. Automatic Text Summarisation 2. Psycholinguistics 3. PsySum 4. Experiments 5. Conclusions and Future Work 1/24

Automatic Text Summarisation

A utomatic Text Summarisation Automatic Text Summarisation ( ATS ) is indispensable for dealing with the rapid growth of online content: � Quickly digest and skim large quantities of textual documents. � Numerous application domains : news media, scientific literature, intelligence gathering, web snippets, etc. � Extractive vs Generative summaries. 3/24

Extractive Summarisation Methods that extract salient parts of the source text and arrange them in some effective manner. � Different features have been exploited: cue words, position within the text, or centrality for locating those parts. � We will be centred on the most popular extractive summaries ( sentence-based ). Three steps: 1. feature-based representation of every sentence, 2. sentence scoring, 3. summary creation by sentence selection. 4/24

Psycholinguistics

Psychology of the Language Language provides a full range of powerful indicators about emotions , cognition , social context , personality , and other psychological states. � In the Social Sciences, the relationship between word use and many social and psychological processes has been actively studied. � Psychometric properties of word use are informative about differences among individuals, about mental and physical health, and even about deception and honesty. � Quantitative analysis of text supplies a great deal of information about situational and social fluctuations. 6/24

Psychological Word Count In human writing the occurrence of certain psychological dimensions might be noteworthy. Content words that relate to psychological processes, linguistic style markers are also known to yield unexpected insights. “ Pronouns, prepositions and other common words are as distinctive as fingerprints; and analysing them is fruitful for James W. Pennebaker – The Secret Life of Pronouns ” a wide variety of applications Linguistic Inquiry and Word Count (LIWC) computes the degree to which people use different categories of words. 7/24

PsySum

Our Proposal Research Hypothesis The most salient or informative sentences in a document may exhibit singular patterns of usage of psychological, social or linguistic elements � Communication is not only about content. It is also about style and feelings. � We employ LIWC for computing sentence features that reflect such axes to be taken into account for summarisation. 9/24

Pyscological Features Based Summaries 1. We use 70 categories from LIWC and define 70 new features . 2. We also consider standard signals (e.g. the position of a sentence in a text, or the similarity between the sentence and the document’s centroid). 3. We linearly combine all feature weights for each sentence. 4. The combined score is employed for ranking sentences . 5. We incorporate this new sentence weighting method into the MEAD summarisation system . 10/24

Particle Swarm Optimisation A full exploration of the parameter space is not feasible (up to 80 feature weights) Particle Swarm Optimisation is a class of swarm intelligence techniques inspired by the social behaviour of bird flocking that runs a restricted search within the parameter space PSO has been previously used on other IR problems, in this work we optimised with the standard PSO alg. the ROUGE-2 metric with a population of 100 particles. 11/24

Experiments

T ask and Metrics Tasks from the Document Understanding Conferences (DUC) � single-document summarisation (fully automatic summarisation of a single news article) (Training 2001T Test: 2001, 2002) � multi-document summarisation (fully automatic summarisation of multiple news articles on a single subject) (Training 2001MT Test: 2001M, 2002M, 2003M, 2004M) ROUGE-2 and ROUGE-SU4 have shown to be correlated with human’s judgements, they count the number of overlapping units between the automatic summary and the manual summary 13/24

Experiments We compared the following summarisation algorithms: � Baselines ◦ Default MEAD ◦ Lead-Based ◦ Random � MEAD optimised (MEAD c+p tuned) � MEAD c+p+liwc ◦ All LIWC features (all) ◦ Linguistic ProcessesLIWC Features (ling) ◦ Psychological Processes LIWC Features (psyc.) ◦ Personal Concerns LIWC Features (pers.) 14/24

Results: Single Document Results in DUC2001 ROUGE-2 ROUGE-SU4 default MEAD .1793 (.1660,.1941) .1813 (.1698,.1926) random .1277 (.1167,.1401) .1420 (.1336,.1517) lead-based .1931 (.1796,.2071) .1825 (.1726,.1934) MEAD c+p tuned .1928 (.1792,.2067) .1820 (.1721,.1927) MEAD c+p+liwc(all) .1918 (.1787,.2055) .1848 (.1741,.1954) MEAD c+p+liwc(ling.) .1953 (.1820,.2091) .1882 (.1777,.1992) MEAD c+p+liwc(psyc.) .1913 (.1775,.2054) .18550 (.1744,.1969) MEAD c+p+liwc(pers.) .1919 (.1783,.2051) .1865 (.1756,.1972) 15/24

Results: Multi-Document Results in DUC2002M ROUGE-2 ROUGE-SU4 default MEAD .0684 (.0610,.0769) .0950 (.0870,.1032) random .0355 (.0301,.0413) .0710 (.0659,.0764) lead-based .0433 (.0369,.0504) .0659 (.0601,.0716) MEAD c+p tuned .0610 (.0550,.0678) .0963 (.0898,.1030) MEAD c+p+liwc(all) .0720 (.0643,.0810) .1006 (.09371,.1083) MEAD c+p+liwc(ling.) .0711 (.0637,.0789) .1047 (.0974,.1124) MEAD c+p+liwc(psyc.) .0626 (.0568,.0686) .0931 (.0866,.0996) MEAD c+p+liwc(pers.) .0665 (.0594,.0736) .0991 (.0911,.1069) 16/24

Analysis The MEAD c+p+liwc(ling.) summariser is consistently better than the baseline summarisers, however it does not achieve stat. sig. improvements, i.e., we think that it is working well for some cases but is degrading the performance for certain individual summarisation cases For each summary we took ROUGE-2 score of a baseline summariser (MEAD c+p tuned) as an estimator of the difficulty to summarise the document or cluster We computed the difference between the ROUGE-2 score of the MEAD c+p+liwc(ling.) summariser and the ROUGE-2 score of the baseline summariser. 17/24

Analysis: Results (i) DUC2001 DUC2002 DUC2001M 0.3 Regr. line: 0.03 -0.14 x Regr. line: 0.037 -0.165 x 0.15 Regr. line: 0.027 -0.317 x p-value (slope not 0): 2.5e-06 p-value (slope not 0): 3e-11 p-value (slope not 0): 0.037 0.2 0.2 diff ROUGE-2 diff ROUGE-2 diff ROUGE-2 0.10 0.1 0.1 0.05 0.0 0.0 -0.1 0.00 -0.1 -0.2 -0.05 -0.2 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.0 0.1 0.2 0.3 0.4 0.5 0.00 0.05 0.10 0.15 ROUGE-2 (baseline) ROUGE-2 (baseline) ROUGE-2 (baseline) DUC2002M DUC2003M DUC2004M 0.15 0.15 Regr. line: 0.028 -0.286 x Regr. line: 0.036 -0.329 x Regr. line: 0.03 -0.316 x 0.04 p-value (slope not 0): 0.00066 p-value (slope not 0): 0.024 p-value (slope not 0): 0.0014 0.10 diff ROUGE-2 diff ROUGE-2 0.10 diff ROUGE-2 0.05 0.00 0.05 0.00 -0.04 0.00 -0.05 -0.08 0.00 0.05 0.10 0.15 0.20 0.05 0.10 0.15 0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14 ROUGE-2 (baseline) ROUGE-2 (baseline) ROUGE-2 (baseline) Our method has a tendency to work well for difficult summarisation cases (low ROUGE-2) and to be harming for easier summarisation cases. 18/24

Analysis: Results (ii) Percentage of improvement of the MEAD c+p+liwc(ling.) with summaries binned on the baseline performance. performance 19/24

Analysis: Results (iii) Having the weight of the different features we can conclude on the importance of each one in the summarisation process. In general, the summariser gives preferences to sentences that � have quantifiers, prepositions, conjunctions, impersonal pronouns, � lack personal pronouns, 1st person plural, and adverbs. This fits well with some findings in the area of Psychology, related with the use of the language by people writing about real experiences Our analysis suggests that driving the summarisers with LIWC features has implicitly fomented analytical extracts and extracts about real experiences. 20/24

Conclusions and Future Work

Conclusions � We have provided preliminary empirical evidence on the effect of psycholinguistic features in Automatic Text Summarisation. � We defined a novel set of features –related to psychological dimensions– and injected them into a state-of-the-art summarisation system. � We found that the summariser that includes linguistic LIWC dimensions is the best performing summariser. There are interesting connections between the occurrence of certain linguistic dimensions and types of writing and thinking. � Our novel summarisation approaches are better suited for hard summarisation cases. 22/24

Future Work � We believe that there is room for further enhancement. For example, by applying feature selection to individually extract LIWC features from every subset of LIWC dimensions. � We hope that our results serve as a basis to foster the discussion on how linguistic and psychological dimensions relate to sentence salience. � Selective feature injection for summarisation: estimate the difficulty of summarising a given document or cluster and then decide whether or not to add the advanced features. 23/24

Thank you! @jparapar http://www.dc.fi.udc.es/ ~ parapar

CERI 2016, Granada, Spain Injecting Multiple Psychological Features - PowerPoint PPT Presentation

CERI 2016, Granada, Spain Injecting Multiple Psychological Features into Standard Text Summarisers David Losada and Javier Parapar @davidelosada @jparapar IRLab and CITIUS @IRLab_UDC @citiususc Univ. Corua, Univ. Santiago Spain Outline

Granada University Hospitals: Our experience Dr Juan Antonio Retamero Pathologist Granada

GRANADA (SPAIN) 12-15 September 2012 30th European Conference on Acoustic Emission Testing &

Chapter 4 SQL McGraw-Hill and Atzeni, Ceri, Paraboschi, Torlone 1999 1 Database Systems

PRESENTATION OF THE COUNTY COUNCIL OF GRANADA AND ITS PROVINCE Project CHRISTA - INTERREG

Presentation on the Experience of Staff Mobility in University of Granada, Spain (CHINLONE WP3

ELECTRICITY STORAGE: APPLICATIONS AND BUSINESS CASES CERI Breakfast Overview Allan Fogwill,

in ARIADNEplus Ceri Binding and Douglas Tudhope Hypermedia Research Group University of South

Social Media in Rulemaking an Example: The RegulationRoom partnership between CFPB and CeRI

APPLICATION TO JOIN DUNE Bruno Zamorano on behalf of the Granada group 29 th January 2019 The

ON THE WEIGHT OF NUMERICAL SEMIGROUPS IBERIAN MEETING ON NUMERICAL SEMIGROUPS GRANADA 2010

Serve-Learn-Sustain in Spain SPRING SEMESTER 2022, JANUARY 10 MAY 3 A FACULTY-LED GEORGIA TECH

The Bayesian Chow-Liu Algorithm Joe Suzuki Osaka University September 19, 2012 Granada, Spain

Hybrid Activity and Plan Recognition for Video Streams Roger Granada, Ramon Fraga Pereira, Juarez

Radio continuum spectra of galaxies Uli Klein (Univ. Bonn) Ute Lisenfeld (Univ. de Granada)

19 th ESCAMPIG, Granada, Spain 15-19 July 2008 Aim of this work To study the energy

Landscape Ecology Landscape ecology in Spain Alberto Gmez Rodrguez Spain Spain is

Updating & Improving optim(): Unifying optimization algorithms in R for smooth, nonlinear

State Engineer John DAntonio Jr., P.E. Two Nations One Water April 25, 2019 The agencys

Anglo-Saxon Literature 05.23.13 || English 2322: British Literature: Anglo-Saxon Mid 18th

Human-Robot Interaction Elective in Artificial Intelligence Lecture 9 Motion control for HRI

Electronic Trial Master File Standard Technical Committee Meeting Agenda Comment Review Period

Olympic Park site Chinese media delegation briefing James Apted, Ian Morrissey, Ian Mead 11

Metric Distances 28 Great Circle Distances North Pole (90N lat) North Pole C Prime

13 th February 2020 Bristol Temple Quarter AGENDA Purpose To provide an update to the

CERI 2016, Granada, Spain Injecting Multiple Psychological Features - PowerPoint PPT Presentation

CERI 2016, Granada, Spain Injecting Multiple Psychological Features into Standard Text Summarisers David Losada and Javier Parapar @davidelosada @jparapar IRLab and CITIUS @IRLab_UDC @citiususc Univ. Corua, Univ. Santiago Spain Outline

Granada University Hospitals: Our experience Dr Juan Antonio Retamero Pathologist Granada

GRANADA (SPAIN) 12-15 September 2012 30th European Conference on Acoustic Emission Testing &amp;

Chapter 4 SQL McGraw-Hill and Atzeni, Ceri, Paraboschi, Torlone 1999 1 Database Systems

PRESENTATION OF THE COUNTY COUNCIL OF GRANADA AND ITS PROVINCE Project CHRISTA - INTERREG

Presentation on the Experience of Staff Mobility in University of Granada, Spain (CHINLONE WP3

ELECTRICITY STORAGE: APPLICATIONS AND BUSINESS CASES CERI Breakfast Overview Allan Fogwill,

in ARIADNEplus Ceri Binding and Douglas Tudhope Hypermedia Research Group University of South

Social Media in Rulemaking an Example: The RegulationRoom partnership between CFPB and CeRI

APPLICATION TO JOIN DUNE Bruno Zamorano on behalf of the Granada group 29 th January 2019 The

ON THE WEIGHT OF NUMERICAL SEMIGROUPS IBERIAN MEETING ON NUMERICAL SEMIGROUPS GRANADA 2010

Serve-Learn-Sustain in Spain SPRING SEMESTER 2022, JANUARY 10 MAY 3 A FACULTY-LED GEORGIA TECH

The Bayesian Chow-Liu Algorithm Joe Suzuki Osaka University September 19, 2012 Granada, Spain

Hybrid Activity and Plan Recognition for Video Streams Roger Granada, Ramon Fraga Pereira, Juarez

Radio continuum spectra of galaxies Uli Klein (Univ. Bonn) Ute Lisenfeld (Univ. de Granada)

19 th ESCAMPIG, Granada, Spain 15-19 July 2008 Aim of this work To study the energy

Landscape Ecology Landscape ecology in Spain Alberto Gmez Rodrguez Spain Spain is

Updating &amp; Improving optim(): Unifying optimization algorithms in R for smooth, nonlinear

State Engineer John DAntonio Jr., P.E. Two Nations One Water April 25, 2019 The agencys

Anglo-Saxon Literature 05.23.13 || English 2322: British Literature: Anglo-Saxon Mid 18th

Human-Robot Interaction Elective in Artificial Intelligence Lecture 9 Motion control for HRI

Electronic Trial Master File Standard Technical Committee Meeting Agenda Comment Review Period

Olympic Park site Chinese media delegation briefing James Apted, Ian Morrissey, Ian Mead 11

Metric Distances 28 Great Circle Distances North Pole (90N lat) North Pole C Prime

13 th February 2020 Bristol Temple Quarter AGENDA Purpose To provide an update to the

GRANADA (SPAIN) 12-15 September 2012 30th European Conference on Acoustic Emission Testing &

Updating & Improving optim(): Unifying optimization algorithms in R for smooth, nonlinear