On Data-Driven Creativity Lav R. Varshney - - PowerPoint PPT Presentation

on data driven creativity
SMART_READER_LITE
LIVE PREVIEW

On Data-Driven Creativity Lav R. Varshney - - PowerPoint PPT Presentation

On Data-Driven Creativity Lav R. Varshney ECE/CSL/Beckman/CS/Neuroscience/ILEE University of Illinois at Urbana-Champaign January 5, 2017 Understanding sociotechnical systems General purpose technologies of past centuries such as communication


slide-1
SLIDE 1

On Data-Driven Creativity

Lav R. Varshney

ECE/CSL/Beckman/CS/Neuroscience/ILEE University of Illinois at Urbana-Champaign January 5, 2017

slide-2
SLIDE 2

Understanding sociotechnical systems

General purpose technologies of past centuries such as communication networks and engines give rise to new engineering challenges that are not just technical but sociotechnical in scope

2

slide-3
SLIDE 3

Obesity in social capital deserts

3

Obesity surveillance using Foursquare data

  • Venues (opportunity for

social interaction)

  • Checkins (actual social

activity) associated with obesity rates in New York City neighborhoods

  • H. Bai, R. Chunara, and L. R. Varshney, "Social Capital Deserts: Obesity Surveillance using a Location-Based Social Network," in Proc. Data

for Good Exchange (D4GX), New York, 28 Sept. 2015. (NYC Media Lab - Bloomberg Data for Good Exchange Paper Award)

slide-4
SLIDE 4

4

  • A. J. Gross, D. Murthy, and L. R. Varshney, "Pace of Life in Cities and the Emergence of

Town Tweeters," presented at International Conference on Computational Social Science (IC2S2), Helsinki, 8-11 June 2015.

Pace of life in cities and the emergence of ‘town tweeters’

  • Contrary to superlinear

scaling of productivity with city population, total volume of tweets scales sublinearly

  • Looking at individuals,

however, greater population density associated with smaller inter-tweet intervals

  • Concentrated core of

more active users that serve an information broadcast function, an emerging group of town tweeters

slide-5
SLIDE 5

DATA WITHIN US DATA BETWEEN US DATA ABOUT US

[Rinie van Est, Intimate Technology: The battle for our body and behaviour, Rathenau Instituut, The Hague, The Netherlands, Jan. 2014.]

Building personalized data-driven technologies

5

slide-6
SLIDE 6

Memory Deductive reasoning Association Perception Introspection Abductive reasoning Inductive reasoning Problem solving Language Attention Creativity

6

Augmenting intelligence

slide-7
SLIDE 7

Outline

  • Evolution of a data-driven culinary computational

creativity system

  • Design principles of a data-driven culinary computational

creativity system

  • Beyond culinary: computational creativity as a general

purpose technology

  • Fundamental limits of creativity

7

slide-8
SLIDE 8

Creativity is the generation of an idea or artifact that is judged to be novel and also to be appropriate, useful, or valuable by a suitably knowledgeable social group.

8

slide-9
SLIDE 9

Creativity is the generation of an idea or artifact that is judged to be novel and also to be appropriate, useful, or valuable by a suitably knowledgeable social group.

9

slide-10
SLIDE 10

[The New York Times, 27 Feb. 2013] [San Jose Mercury News, 28 Feb. 2013] [IEEE Spectrum, 31 May 2013] [Wired, 1 Oct. 2013]

1

slide-11
SLIDE 11

11

slide-12
SLIDE 12

12

slide-13
SLIDE 13

13

slide-14
SLIDE 14

https://www.ibmchefwatson.com

14

slide-15
SLIDE 15

15

slide-16
SLIDE 16

16

slide-17
SLIDE 17

Consensual assessment technique

17

slide-18
SLIDE 18

Lovelace: “only when computers originate things should they be believed to have minds” Beyond the Turing Test: Lovelace 2.0

LOVELACE FERRUCCI RIEDL

Lovelace 1.0: an artificial agent possesses intelligence in terms of whether it can “take us by surprise”

Lovelace 2.0: An artificial agent must create artifact o of type t where:

  • artifact o conforms to constraints C where ci ∈ C is any criterion

expressible in natural language

  • human evaluator h, having chosen t and C, is satisfied o is valid instance
  • f t and meets C, and
  • human referee r determines combination of t and C to not be impossible

18

slide-19
SLIDE 19

Address sustainability and public health One-third of all food produced worldwide, worth around US$1 trillion, gets lost

  • r wasted in food production

and consumption systems One-third (78.6 million) of U.S. adults are obese, but 800 million people in the world do not have enough food to lead a healthy active life

19

slide-20
SLIDE 20

Food and Data Workshop: Interoperability through the Food Pipeline

20

slide-21
SLIDE 21

Outline

  • Evolution of a data-driven culinary computational

creativity system

  • Design principles of a data-driven culinary computational

creativity system

  • Beyond culinary: computational creativity as a general

purpose technology

  • Fundamental limits of creativity

21

slide-22
SLIDE 22

22

slide-23
SLIDE 23
  • 1. Find

Problem

  • 2. Acquire

Knowledge

  • 3. Gather

Related Information

  • 4. Incubation
  • 5. Generate

Ideas

  • 6. Combine

Ideas

  • 7. Select Best

Ideas

  • 8. Externalize

Ideas

[Sawyer, 2012]

23

slide-24
SLIDE 24

Many previous attempts at computational creativity have

  • nly had computational divergent thinking, but not

computational convergent thinking Harold Cohen: AARON David Cope: music Doug Lenat: Automated Mathematician (AM) Building big data oriented models of human hedonic perception / cognition allows us to not only generate promising ideas but also to rank the best ones among them

24

slide-25
SLIDE 25
  • 1. Sample from state space,

using culturally well- chosen sampling distribution

  • 2. Rank according to

psychophysical predictors

  • f novelty and flavor
  • 3. Select either automatically
  • r semi-automatically

depending on human- computer interaction model

25

slide-26
SLIDE 26

Joint histogram of surprise and pleasantness for 10000 generated Caymanian Plantain Dessert recipes. Values for the selected/tested recipe indicated with red dashed line.

26

slide-27
SLIDE 27

Data Engineering and Natural Language Processing to Understand the Domain

PARSER

Generative, Selective, and Planning Algorithms to Create the Best New Ideas

DOMAIN KNOWLEDGE DATABASE DYNAMIC PLANNER COMBINATORIAL DESIGNER COGNITIVE ASSESSOR

NOVEL RECIPE

27

slide-28
SLIDE 28

Recipe Corpus

28

slide-29
SLIDE 29

500 1000 1500 2000 2500 3000 50

nb_recipes

SOURCE: 27697 recipes from Wikia dataset

Number of steps Number of recipes

  • Recipes typically require about

eight steps (similar to # ingredients)

  • NLP tokens include cooking

techniques, tools, and ingredients

  • Data model supports

analytics algorithms

Natural language processing is difficult since recipes are out-of- domain for standard tools trained on general corpora

29

slide-30
SLIDE 30

[Shepherd, 2006]

Neurogastronomy

30

slide-31
SLIDE 31

Saffron (Crocus sativus L.)

phenethyl alcohol safranal isophorone

Food Chemistry

31

slide-32
SLIDE 32

Black Tea Bantu Beer Beer Strawberry White Wine Cooked Apple

PLEASANTNESS INTENSITY FAMILIARITY R2 = 0.374

Chemical Compound Ingredient Recipe

Linear Pleasantness Hypothesis DATA Chemistry: molecular properties Psychology: human-labeled pleasantness rating Psychophysical Pleasantness Chemistry [TPSA, heavy atom count, complexity,

rotatable bond count, hydrogen bond acceptor count]

Hedonic Psychophysics

32

slide-33
SLIDE 33

[Ahn, Ahnert, et al., 2011]

Flavor Networks

33

slide-34
SLIDE 34

[Itti and Baldi, 2006]

𝑇 𝑆, ℬ = 𝐸 𝑄𝐶|𝑆||𝑄𝐶 = 𝑄𝐶|𝑆 log 𝑄𝐶|𝑆 𝑄𝐶 𝑒𝐶

newly created recipe personalized repository of prior food experience prior beliefs posterior beliefs

Latent Dirichlet Allocation (LDA) Model

Bayesian Surprise and Attention

34

slide-35
SLIDE 35
  • 1. Find

Problem

  • 2. Acquire

Knowledge

  • 3. Gather

Related Information

  • 4. Incubation
  • 5. Generate

Ideas

  • 6. Combine

Ideas

  • 7. Select Best

Ideas

  • 8. Externalize

Ideas

[Sawyer, 2012]

Learn data-driven cognitive models Use models for creativity

35

slide-36
SLIDE 36

WIKIA ICE US NAVY PARSER

...

RECIPE DB RECIPE PLANNER RECIPE DESIGNER COGNITIVE RECIPE ASSESSOR COOKING PLAN Crowds & Experts Natural Language Processing Databases Operations Research Creativity Analytics Predictive Analytics Human- Computer Interaction

  • 5. Generate

Ideas

  • 7. Select

Best Ideas 8. Externalize Ideas

  • 6. Combine

Ideas

36

slide-37
SLIDE 37

Outline

  • Evolution of a data-driven culinary computational

creativity system

  • Design principles of a data-driven culinary computational

creativity system

  • Beyond culinary: computational creativity as a general

purpose technology

  • Fundamental limits of creativity

37

slide-38
SLIDE 38

From spices to silks, materials, and education

38

slide-39
SLIDE 39

39

slide-40
SLIDE 40

40

slide-41
SLIDE 41

Creativity for Technology, Drug Cocktails, …

41

slide-42
SLIDE 42

[K. Haase, Discovery Systems: From AM to CYRANO, MIT AI Lab Working Paper 293, Mar. 1987]

Constructive machine learning: Discovering concepts

42

slide-43
SLIDE 43

Discovering concepts: Music theory from Bach’s chorales

  • Interpretable rule hierarchy

learning by iterative, alternating

  • ptimization of Bayesian surprise

(against current ruleset) and informativeness in Shannon’s sense

  • H. Yu, L. R. Varshney, G. E. Garnett, and R. Kumar, "Learning Interpretable Musical Compositional Rules and Traces," in 2016 ICML

Workshop on Human Interpretability in Machine Learning (WHI 2016), New York, New York, 23 June 2016.

  • H. Yu, L. R. Varshney, G. Garnett, and R. Kumar, "MUS-ROVER: A Self-Learning System for Musical Compositional Rules," in Proceedings of

the 4th International Workshop on Musical Metacreation (MUME 2016), Paris, France, 27 June 2016.

43

slide-44
SLIDE 44

Outline

  • Evolution of a data-driven culinary computational

creativity system

  • Design principles of a data-driven culinary computational

creativity system

  • Beyond culinary: computational creativity as a general

purpose technology

  • Fundamental limits of creativity

44

slide-45
SLIDE 45

Google Magenta

45

slide-46
SLIDE 46

Google Magenta

46

slide-47
SLIDE 47

Cyclic ordering of cards for mind-reading card trick

47

slide-48
SLIDE 48

48

slide-49
SLIDE 49

From engineering to engineering theory

  • Multifarious algorithmic ideas for computational creativity

– Supervised learning (trained on complete artifacts)

  • Collaborative filtering

– Optimization with data-driven psychophysical objectives

  • Genetic algorithms
  • Simulated annealing
  • Stochastic sampling + ranking + selection
  • Consensual assessment technique allows characterization of

whether artifacts are creative or not

  • Lovelace 2.0 test determines whether a machine is as creative as a

human

Are there fundamental limits to how creative

any system can be in a given domain?

49

slide-50
SLIDE 50

Carnot established fundamental limits on efficiency of engines Shannon established fundamental limits of communication in the presence of noise Karaman and Frazzoli established fundamental speed limit of flight in forests without crashing

50

slide-51
SLIDE 51

Creativity is the generation of an artifact that is judged to be novel and also to be appropriate, useful, or valuable by a suitably knowledgeable social group.

51

slide-52
SLIDE 52

Creativity is the generation of an artifact that is judged to be novel and also to be appropriate, useful, or valuable by a suitably knowledgeable social group.

Towards a formalism

52

slide-53
SLIDE 53

Towards a formalism

Artifact An unordered combinatorial object 𝛽 selected from the power set 2Ω of possible components, Ω, that define the creative domain. (assume all possible components known) Known Set Set of artifacts that are already known in the creative domain, Θ ⊆ 2Ω ∈ 22Ω, also called the inspiration set. Novelty A non-negative function 𝑡: 2Ω × 22Ω → ℝ+ that measures the surprise of a given artifact 𝛽0 in the presence of a known set Θ, e.g. the empirical Bayesian surprise. Utility A non-negative function 𝑟: 2Ω → ℝ+ that measures the quality

  • f a given artifact 𝛽0, e.g. through the psychophysical properties of

components and their combining rules.

53

slide-54
SLIDE 54

Towards a formalism

  • A coding scheme for channel coding may be thought of as a test

source with an input distribution, for informational characterization

  • Similarly, think of a creativity algorithm as a (possibly degenerate)

probabilistic process 𝑄

𝐵 𝛽 for mathematical characterization

54

slide-55
SLIDE 55

Basic Tradeoff in Creativity: Average Case

𝑇 𝑅 = max

𝑄𝐵 𝛽 :𝐹 𝑟 𝐵 ≥𝑅 𝐹 𝑡 𝐵, Θ

Novelty-Quality tradeoff in Creativity

55

slide-56
SLIDE 56

Basic Tradeoff in Creativity: Average Case

𝑇 𝑅 = max

𝑄𝐵 𝛽 :𝐹 𝑟 𝐵 ≥𝑅 𝐹 𝑡 𝐵, Θ

Novelty-Quality tradeoff in Creativity

Lemma [Varshney, 2013]

𝐹 𝑡 𝐵, Θ = 𝐽 𝐵, Θ . 𝑇 𝑅 = max

𝑄𝐵 𝛽 :𝐹 𝑟 𝐵 ≥𝑅 𝐽 𝐵, Θ

(Shannon’s capacity-cost function)

Corollary

56

slide-57
SLIDE 57

Basic Tradeoff in Creativity: Probabilities

𝑇 𝑅 = max

𝑄𝐵 𝛽 :Pr 𝑟 𝐵 >𝜇𝑟 ≥𝑅 Pr 𝑡 𝐵, Θ > 𝜇𝑡

Rather than wanting algorithm that performs well on average, consider algorithm that produces novel and high-quality artifacts with probabilities above thresholds 𝜇𝑡 and 𝜇𝑟 Make use of information geometry techniques Lemma [Varshney, 2013] Shannon capacity C for channel 𝑞𝑍|𝑌 is: 𝐷 = min

𝑞𝑍 𝑧 max 𝒴 𝑡 𝑦

Geometrically, the unconstrained optimal output distribution will be the center of a “sphere” with radius measured by Bayesian surprise, as derived from the KKT conditions

57

slide-58
SLIDE 58

Optimal Creativity Algorithms

The extremal 𝑄

𝐵 𝛽 describes an optimal stochastic sampling algorithm

for computational creativity Optimal sequential selection can be analyzed using the theory of concomitants of order statistics

58

STOCHASTIC SAMPLING SEQUENTIAL SELECTION IDEAS IDEA

slide-59
SLIDE 59

Maturity of the field

  • Initially when Θ is very small, 𝑄𝜄|𝛽 may not be absolutely continuous

with respect to 𝑄𝜄, so relative entropy in surprise would be infinite

  • After many artifacts are created and known, the effect of the

Bayesian belief update due to the new artifact is small – Noisier channel shifts curve to left – All low-hanging fruits already created

59

slide-60
SLIDE 60

SOURCE: Youn, et al. (2014).

  • Broad patents were prevalent after WWII, but narrow patents now

predominate – Growing the component alphabet?

  • Time is ripe for broad systems-level inventions, which make use of

the fertile resource of narrow inventions

Maturity of the field

60

slide-61
SLIDE 61
  • Lack of absolute continuity in Bayesian surprise expression yields

an infinite value – Do new components yield transformational creativity that is different in kind from combinatorial creativity?

  • Scientific discovery provides new “ingredients” for creating artifacts

and ideas, especially if they are high-quality

Discovery: Growing the component alphabet

61

slide-62
SLIDE 62

62

“Each new machine or technique, in a sense, changes all existing machines and techniques, by permitting us to put them together into new

  • combinations. The number of possible combinations rises exponentially as

the number of new machines or techniques rises arithmetically. Indeed, each new combination may, itself, be regarded as a new super-machine.” ‒ Alvin Toffler, Future Shock (1970), pp. 28–29

slide-63
SLIDE 63

Email: varshney@illinois.edu Twitter: @lrvarshney

63

On Data-Driven Creativity

Lav R. Varshney University of Illinois at Urbana-Champaign