dynamics complexity
play

Dynamics, Complexity install R and RStudio, linked from here: - PowerPoint PPT Presentation

Plan (15:00-16:20) Social and Cognitive Dynamics with Natural Data 10 minutes - theoretical preamble 20 minutes - text as a source of dynamics Rick Dale and David W. Vinson 20 minutes - bridging variables in rich social data


  1. Plan (15:00-16:20) Social and Cognitive Dynamics with Natural Data • 10 minutes - theoretical preamble • 20 minutes - text as a source of dynamics Rick Dale and David W. Vinson • 20 minutes - bridging variables in rich social data Cognitive & Information Sciences University of California, Merced • 10 minutes - other tools to share cognaction.org • 20 minutes - questions and hands-on play SCHOOL OF SOCIAL SCIENCES c gsCI tinyurl.com/icps-workshop HUMANITIES AND ARTS Dynamics, Complexity install R and RStudio, linked from here: tinyurl.com/icps-workshop i’m attending icps for the first time and so far it is quite exciting and even as i present this workshop right now i’m missing some other fascinating workshops and talks… but i shouldn’t remind you here of that either! dynamics: structure in time complexity: interdependent processes

  2. “Big Data” Lots of Data natural data in large volumes , and sometimes at high velocity , variety + issues of validity (“Four V’s”) Plan (15:00-16:20) Text as Dynamics • 10 minutes - theoretical preamble • 20 minutes - text as a source of dynamics • 20 minutes - bridging variables in rich social data • 10 minutes - other tools to share • 20 minutes - questions and hands-on play cognaction.org/rick/icps-workshop Laura Allen, ASU (McNamara lab) 
 Nick Duran, ASU 
 (Ph.D. student) (Assistant Professor)

  3. 140 General Strategy And love in my family 120 (116,116) All I really need is a song in my heart (19,99) Treat text as a source of temporal patterns of behavior 100 80 Time (letter) All I really need is a song in my heart Food in my belly and love in my family 60 Transform the data in a manner that can 40 be subjected to new, emerging dynamic analysis 20 (65,6) recurrence quantification analysis 0 (RQA) 0 20 40 60 80 100 120 140 Time (letter) All I really need is a song in my heart Food in my belly and love in my family All I really need is a song in my heart And love in my family line of identity Temporal Patterns 140 And love in my family 120 All I really need is a song in my heart Quote Lyrics 100 Time (letter) 80 Recurrence Plot (RP) 400 All I really need is a song in my heart Food in my belly and love in my family 300 60 300 Time (letter) Time (letter) 200 40 200 100 20 100 0 0 0 0 100 200 300 0 100 200 300 400 0 20 40 60 80 100 120 140 Time (letter) Time (letter) Time (letter) All I really need is a song in my heart Food in my belly and love in my family All I really need is a song in my heart And love in my family

  4. RQA Measures RQA Measures %DET (or, DET) %REC (or, RR) Quote Lyrics Quote Lyrics plot occupied by points. % 400 400 % 300 Percent determinism 300 Recurrence rate ( %REC ): 300 300 ( %DET ): Percentage of Total percentage of the Time (letter) Time (letter) the points on the plot Time (letter) Time (letter) 200 200 200 200 that fall on diagonal lines (length > 1). 100 100 100 100 0 0 0 0 0 100 200 300 0 100 200 300 400 0 100 200 300 0 100 200 300 400 Time (letter) Time (letter) Time (letter) Time (letter) 7.0% 8.2% 18.1% 26.4% http://www.recurrence-plot.tk http://www.recurrence-plot.tk Mean Recurrence Rate (RR) TASA Corpus (Touchstone Applied Science Associates, Inc.) language arts social studies science 37,808 paragraphs from books and textbooks 2 R =10% **** 2.1 Each approximately 300-400 words **** X Language Arts, Science, Social Studies 1.825 Mean RR (%) Includes reading difficulty score 1.55 word-stem dynamics 1.275 reflect genre and reading di ffi culty? 1 Overall Easy Difficult

  5. Local vs. Global Cohesion? Text as Dynamics language arts social studies science 60 **** ● 24 ● 50 ● 40 Mean DET (%) 22 30 ● ● ● 20 ● ● ● ● ● ● ● ● ● ● ● ● 20 ● ● 10 ● Recurrent patterns in word usage could be 0 predictive of how texts 18 20 40 60 80 100 120 140 support learning; Window size Laura Allen, ASU (McNamara lab) 
 Nick Duran, ASU 
 interacts with genre… (Ph.D. student) (Assistant Professor) 16 Overall Easy ficult Plan (15:00-16:20) Dynamics, Complexity • 10 minutes - theoretical preamble • 20 minutes - text as a source of dynamics • 20 minutes - bridging variables in rich social data • 10 minutes - other tools to share i’m attending icps for the first time and so far it is quite exciting and even as i present this workshop right now i’m missing • 20 minutes - questions and hands-on play some other fascinating workshops and talks… but i shouldn’t remind you here of that either! tinyurl.com/icps-workshop dynamics: structure in time complexity: interdependent processes

  6. General Strategy Yelp, Inc. Dataset Fuse measures of cognition with context variables Dave Vinson, Ph.D. student, UCM IBM Ph.D. Fellow Sample from the large data set, in a manner that permits exploration of interdependence information and graph theory Yelp Data Format { 'type': 'user', Friends 'user_id': (encrypted user id), 'friends': [(friend user_ids)], 'stars': (star rating,1-5), Adam T.’s network 'text': (review text), 'name': (first name), 'review_count': (review count), Text 'votes': {(vote type): (count)}, 'fans': (num_fans), … } JSON

  7. Lexical Richness community 0.5 0.5 innovation 5000 Residual ACI Residual ACI Frequency 0.0 0.0 2000 High richness (low ACI) : … unprecedented photography, modern community 0 9 10 11 12 and contemporary art conceived before alignment bits AUI (bits / word) − 0.5 − 0.5 World War II, to American and European fashion designs portraying the 18th Low richness (low ACI) : … century. Fortunately for the flexible Baseline Baseline This is a great place for museum hours, one may lolly-gag − 1.0 − 1.0 True Network True Network Tr lunch and dinner. The food through the corridors on is great, the price is good Wednesday ...one of the most 0.0 0.0 2.5 2.5 and the service is friendly Edges (z-score) Edges Edges captivating subjects of art... juxtaposes and quick. racist caricatures such as Malcolm X and Martin Luther King Jr. Interestingly, this mixed media, graffiti style display Social context may modulate subtle withheld a deep meaning of America's aspects of communication strategy, religious fervor and external cultural N 1 measurable in large natural data. X adolescence…. ACI j = − log 2 p ( w i | w i − 1 ) N − 1 i =2 Plan (15:00-16:20) • 10 minutes - theoretical preamble • 20 minutes - text as a source of dynamics • 20 minutes - bridging variables in rich social data • 10 minutes - other tools to share • 20 minutes - questions and hands-on play 2017 tinyurl.com/icps-workshop

  8. crqa Other Resources • Led by Moreno Coco, a version of tools for conducting RQA on Focus on libraries designed for larger categorical and continuous time datasets. series . Among many: dplyr a clear example • With help of the R community, crqa (for managing, filtering, transforming is almost twice as fast as its standard comparison toolbox, in MATLAB. large data frames). tinyurl.com/icps-workshop • See paper by Coco & Dale (2014) for summary (linked on workshop Two from my lab: crqa and cmscu website). http://languagegoldmine.com cmscu • Fast and memory-efficient way of generating frequency tables for n - grams. • Vinson, Davis, Sindi & Dale (2016) describe how you can be certain of arbitrarily small error in estimating frequency of (say) unigrams and bigrams in large corpora. • Quick deployment on corpora, and can be used to estimate wide variety of statistical and information-theoretic measures. cmscu vs. tm http://www.dataonthemind.org

  9. Plan (15:00-16:20) Some Code • 10 minutes - theoretical preamble • 20 minutes - text as a source of dynamics tinyurl.com/icps-workshop • 20 minutes - bridging variables in rich social data part I • 10 minutes - other tools to share part 2 • 20 minutes - questions and hands-on play cognaction.org/rick/icps-workshop Thanks David Vinson (UC Merced) Laura Allen (ASU) Nick Duran (ASU) Moreno Coco (Edinburgh) Alexandra Paxton (UC Berkeley) Danielle McNamara (ASU) SCHOOL OF SOCIAL SCIENCES c gsCI HUMANITIES AND ARTS BCS-0826825 BCS-1344279

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend