reading the news through its structure new hybrid
play

READING THE NEWS THROUGH ITS STRUCTURE: NEW HYBRID CONNECTIVITY - PowerPoint PPT Presentation

READING THE NEWS THROUGH ITS STRUCTURE: NEW HYBRID CONNECTIVITY BASED APPROACHES Programa de Doutoramento em Cincias da Complexidade Doctoral Programme in Complexity Sciences Orientador / Advisor: Professor Jorge Manuel Anacleto Lou


  1. READING THE NEWS THROUGH ITS STRUCTURE: NEW HYBRID CONNECTIVITY BASED APPROACHES Programa de Doutoramento em Ciências da Complexidade Doctoral Programme in Complexity Sciences Orientador / Advisor: Professor Jorge Manuel Anacleto Louçã David Manuel de Sousa Rodrigues March 17, 2014

  2. Reading the News Through its Structure: New Hybrid 2 / 40 17 March 2014 Connectivity Based Approaches Outline of presentation • Context of this work and Related Work • Newspapers • Adaptive Networks • Q-analysis • Community detection • Ant Colony Optimisation • Hybrid Connectivity Based Approaches • Variation of Information and Dynamic Networks • Clustering News: Timelines with k-means • Clustering News: Community finding with Q-analysis filtering • Hamiltonian Paths in Q-analysis eccentricity matrices • Conclusions

  3. Reading the News Through its Structure: New Hybrid 3 / 40 17 March 2014 Connectivity Based Approaches Objectives • The thesis presents four approaches to the problem of identifying meaningful structure in the news published online. • This is a hard problem due to the high volume of produced data and to the possible high dimensionality of the data collected.

  4. Reading the News Through its Structure: New Hybrid 4 / 40 17 March 2014 Connectivity Based Approaches Contributions • The thesis shows how Hybrid Connectivity Based Approaches give insights to news structure. • Adaptive Networks and Mutual Information • Clustering with k-means and feature vectors • Clustering news with pre-filtering with Q-analysis • Creating Hamiltonian paths of news using Q-analysis eccentricity as distances. • New Ant Colony Optimisation Algorithm

  5. Reading the News Through its Structure: New Hybrid 5 / 40 17 March 2014 Connectivity Based Approaches CONTEXT AND RELATED Part I

  6. Reading the News Through its Structure: New Hybrid 6 / 40 17 March 2014 Connectivity Based Approaches Context: newspapers (print) Portuguese circulation UK Circulation

  7. Reading the News Through its Structure: New Hybrid 7 / 40 17 March 2014 Connectivity Based Approaches Context: newspapers (electronic) Internet overtakes print as news Internet traffic outlet

  8. Reading the News Through its Structure: New Hybrid 8 / 40 17 March 2014 Connectivity Based Approaches Related work: Document analysis • Categorisation of documents (Supervised) • Machine learning • K-neighbours, SVM, NN, etc … • Clustering (unsupervised) • Document navigation • Sometimes associated with clustering • Information retrieval

  9. Reading the News Through its Structure: New Hybrid 9 / 40 17 March 2014 Connectivity Based Approaches Related work: Networks • Network Science • Adaptive Networks • (interplay of topology dynamics and local dynamics of networks) • Community Detection in Graphs • Clustering nodes of graphs • Q -analysis • Topological description of the high dimensionality of structures.

  10. Reading the News Through its Structure: New Hybrid 10 / 40 17 March 2014 Connectivity Based Approaches Related: Bio-inspired • Swarm Intelligence algorithms • Ant Systems • Ant Colony Optimisation • Travelling Salesman Problem • Anti-pheromone ideas • subtractive anti-pheromone (SAP) • 1 pheromone – subtracted from poor solutions • preferential anti-pheromone (PAP) • 2 pheromones but to solve bi-criterion optimisation problems

  11. Reading the News Through its Structure: New Hybrid 11 / 40 17 March 2014 Connectivity Based Approaches HYBRID CONNECTIVITY BASED APPROACHES Part II

  12. Reading the News Through its Structure: New Hybrid 12 / 40 17 March 2014 Connectivity Based Approaches Research Opportunities • Finding Patterns in Data • Community Detection and Adaptive Networks • Q -analysis to describe high dimensional structures • Bio-inspired heuristics to solve • Combining Different Techniques to produce better algorithms for existing problems.

  13. Reading the News Through its Structure: New Hybrid 13 / 40 17 March 2014 Connectivity Based Approaches Hybrid Connectivity approaches • Hybrid? • This thesis proposes approaches that involve multiple techniques Usually two techniques are used. • Connectivity? • Data is represented by entities and relations between them. • Binary relations (graphs) • n-ary relations (hypergraphs, etc..)

  14. Reading the News Through its Structure: New Hybrid 14 / 40 17 March 2014 Connectivity Based Approaches TOPIC MONITORING WITH VARIATION OF INFORMATION AND DYNAMIC NETWORKS

  15. Reading the News Through its Structure: New Hybrid 15 / 40 17 March 2014 Connectivity Based Approaches Description

  16. Reading the News Through its Structure: New Hybrid 16 / 40 17 March 2014 Connectivity Based Approaches Main Results

  17. Reading the News Through its Structure: New Hybrid 17 / 40 17 March 2014 Connectivity Based Approaches CLUSTERING NEWS: constructing timelines of news with k- means

  18. Reading the News Through its Structure: New Hybrid 18 / 40 17 March 2014 Connectivity Based Approaches Clustering with k-means • Objective: create clustered timelines of news to see time- dependence of news. • Possibility to track back in time origins of stories • Create an interface for story navigation • Approach: tf.idf feature vectors clustered with k -means • Write interactive software for news navigation (part of Theseus)

  19. Reading the News Through its Structure: New Hybrid 19 / 40 17 March 2014 Connectivity Based Approaches Clustering with k-means

  20. Reading the News Through its Structure: New Hybrid 20 / 40 17 March 2014 Connectivity Based Approaches CLUSTERING NEWS: finding communities with Q- analysis filtering

  21. Reading the News Through its Structure: New Hybrid 21 / 40 17 March 2014 Connectivity Based Approaches Clustering with no filtering

  22. Reading the News Through its Structure: New Hybrid 22 / 40 17 March 2014 Connectivity Based Approaches Fraction of vertices in resulting graphs

  23. Reading the News Through its Structure: New Hybrid 23 / 40 17 March 2014 Connectivity Based Approaches Fraction of vertices in maximal cluster in relation to that particular subgraph

  24. Reading the News Through its Structure: New Hybrid 24 / 40 17 March 2014 Connectivity Based Approaches Number of Clusters

  25. Reading the News Through its Structure: New Hybrid 25 / 40 17 March 2014 Connectivity Based Approaches Modularity of the resulting clustering

  26. Reading the News Through its Structure: New Hybrid 26 / 40 17 March 2014 Connectivity Based Approaches Software developed for visualisation of case study (on CD)

  27. Reading the News Through its Structure: New Hybrid 27 / 40 17 March 2014 Connectivity Based Approaches HAMILTONIAN PATHS IN Q -ANALYSIS ECCENTRICITY MATRICES

  28. Reading the News Through its Structure: New Hybrid 28 / 40 17 March 2014 Connectivity Based Approaches Two threads • Development of a novel Travelling Salesman Problem algorithm • In collaboration with Vitorino Ramos [Rodrigues, 2011, Ramos 2011, Ramos 2013] • Application of Q -analysis eccentricities matrices as distance matrices in the construction of Directed Hamiltonian Paths in the TSP problem.

  29. Reading the News Through its Structure: New Hybrid 29 / 40 17 March 2014 Connectivity Based Approaches 2 nd Order Swarm Intelligence • Pharaoh's ants ( Monomorium pharaonis ) deposit a pheromone as a ' no entry ' signal to mark unrewarding foraging paths. • Double Pheromone Model on top of traditional ACS. • Traditional positive reinforcement pheromone • Use of Negative Pheromone to block bad paths.

  30. Reading the News Through its Structure: New Hybrid 30 / 40 17 March 2014 Connectivity Based Approaches Results – Static problems

  31. Reading the News Through its Structure: New Hybrid 31 / 40 17 March 2014 Connectivity Based Approaches Influence of negative pheromone

  32. Reading the News Through its Structure: New Hybrid 32 / 40 17 March 2014 Connectivity Based Approaches Application to dynamic problems: recovery patterns

  33. Reading the News Through its Structure: New Hybrid 33 / 40 17 March 2014 Connectivity Based Approaches Application to the News

  34. Reading the News Through its Structure: New Hybrid 34 / 40 17 March 2014 Connectivity Based Approaches Software Developed (on CD)

  35. Reading the News Through its Structure: New Hybrid 35 / 40 17 March 2014 Connectivity Based Approaches CONCLUSIONS

  36. Reading the News Through its Structure: New Hybrid 36 / 40 17 March 2014 Connectivity Based Approaches Main Contributions of this work • 4 approaches based on the connectivity of the system that reveal the underlying structure of the news. • Each as advantages and disadvantages

  37. Reading the News Through its Structure: New Hybrid 37 / 40 17 March 2014 Connectivity Based Approaches

  38. Reading the News Through its Structure: New Hybrid 38 / 40 17 March 2014 Connectivity Based Approaches Main Contributions of this work • 4 approaches based on the connectivity o the system that reveal the underlying structure of the news. • Each as advantages and disadvantages • New Optimisation bio-inspired algorithm for TSP problems (adaptable to new problems) • Software for dealing with gathering, processing, and visualising these systems (Theseus)

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend