Computational Data Analytics on the Web for Better Food Decision - - PowerPoint PPT Presentation

computational data analytics on the web for better food
SMART_READER_LITE
LIVE PREVIEW

Computational Data Analytics on the Web for Better Food Decision - - PowerPoint PPT Presentation

Computational Data Analytics on the Web for Better Food Decision Making Assoc. Prof. Dr. DI Christoph Trattner InfoMedia @ UiB 2. October 2019 . Christoph Trattner 1 Where do I come from? 2. October 2019 . Christoph Trattner 2


slide-1
SLIDE 1

1

. Christoph Trattner

  • 2. October 2019

Computational Data Analytics on the Web for Better Food Decision Making

  • Assoc. Prof. Dr. DI Christoph Trattner

InfoMedia @ UiB

slide-2
SLIDE 2

2

. Christoph Trattner

  • 2. October 2019

Where do I come from?

slide-3
SLIDE 3

3

. Christoph Trattner

  • 2. October 2019

Research Focus

Big Data Data Analytics Predictive Modeling Social Network Data Online Communities Data Web Data Open Data Understand how people behave Theories e.g. social psychology RecSys Behavioral Data Science & RecSys

slide-4
SLIDE 4

4

. Christoph Trattner

  • 2. October 2019

Agenda

  • 1. Motivation
  • 2. DS: Healthiness of Online Food (recipes)
  • 3. RS: State-of-the-art & Health-aware Food RecSys
  • 4. DS: Linking Online to Offline
  • 5. DS: Factors Influencing Food Choice
  • 6. RS: Altering Food Choice with RecSys
  • 7. The Future & Conclusions
slide-5
SLIDE 5

5

. Christoph Trattner

  • 2. October 2019

Part 1: Motivation

slide-6
SLIDE 6

6

. Christoph Trattner

  • 2. October 2019

Why is research into Food Recsys Important?

slide-7
SLIDE 7

7

. Christoph Trattner

  • 2. October 2019

Why is that important?

§ Food is one the main concepts that shapes how good we feel and how healthy we are § According to the WHO, if common lifestyle risk factors, among others diet-related ones, were eliminated, around 80% of cases of heart disease, strokes and type 2 diabetes, and 40% of cancers, could be avoided (European Comission Recommendation C(2010) 2587 final, 2010).

slide-8
SLIDE 8

8

. Christoph Trattner

  • 2. October 2019

Problem

slide-9
SLIDE 9

9

. Christoph Trattner

  • 2. October 2019

The approaches I am discussing today are all online food recommender approaches! Why Online?

slide-10
SLIDE 10

10

. Christoph Trattner

  • 2. October 2019

Most food interactions nowadays online

According to recent market research over 50%

slide-11
SLIDE 11

11

. Christoph Trattner

  • 2. October 2019

Amazon

slide-12
SLIDE 12

12

. Christoph Trattner

  • 2. October 2019

Part 2: Healthiness of Online Food (Recipes)

slide-13
SLIDE 13

13

. Christoph Trattner

  • 2. October 2019

RQ: How healthy are online food items (recipes) actually?

slide-14
SLIDE 14

14

. Christoph Trattner

  • 2. October 2019

Nutrition Facts Basic statistics: § 60,983 recipes § 1,032,226 ratings § 17,190,534 bookmarks http://allrecipes.com

slide-15
SLIDE 15

15

. Christoph Trattner

  • 2. October 2019

According to Alexa.com

Allrecipes.com popularity

slide-16
SLIDE 16

16

. Christoph Trattner

  • 2. October 2019

How can we determine the healthiness of

  • nline recipes?

Trattner, C. Elsweiler, D. and Simon, H. Estimating the Healthiness of Internet Recipes: A Cross-Sectional Study. Frontiers in Public Health, 2017. Trattner, C. and Elsweiler, D. Investigating the Healthiness of Internet-Sourced Recipes: Implications for Meal Planning and Recommender Systems. In Proceedings of the World Wide Web Conference (WWW), 2017.

slide-17
SLIDE 17

17

. Christoph Trattner

  • 2. October 2019

FSA food health criteria

Determining the healthiness of recipes

slide-18
SLIDE 18

18

. Christoph Trattner

  • 2. October 2019

WHO food health criteria

  • Who. Diet, nutrition and the prevention of chronic diseases. World Health Organ

TechRep Ser, 916(i-viii), 2003.

Determining the healthiness of recipes

slide-19
SLIDE 19

19

. Christoph Trattner

  • 2. October 2019

Results

slide-20
SLIDE 20

20

. Christoph Trattner

  • 2. October 2019

Trattner, C. Elsweiler, D. and Simon, H. Estimating the Healthiness of Internet Recipes: A Cross-Sectional Study. Frontiers in Public Health, 2017.

Online food is unhealthy L

e.g. Tesco Allrecipes

slide-21
SLIDE 21

21

. Christoph Trattner

  • 2. October 2019

FSA criteria

Online food (recipes) is unhealthy L

slide-22
SLIDE 22

22

. Christoph Trattner

  • 2. October 2019

Online food is unhealthy L

slide-23
SLIDE 23

23

. Christoph Trattner

  • 2. October 2019

User perception

Results when asking users how healthy categories are on Allrecipes.com

(Kappa κ = .165, z = 42, p < .001)

slide-24
SLIDE 24

24

. Christoph Trattner

  • 2. October 2019

With which types of recipes do user interact the most?

slide-25
SLIDE 25

25

. Christoph Trattner

  • 2. October 2019

People seem to like unhealthy recipes

slide-26
SLIDE 26

26

. Christoph Trattner

  • 2. October 2019

Part 3: State-of-the-art & Health-aware Food RecSys

slide-27
SLIDE 27

27

. Christoph Trattner

  • 2. October 2019

How healthy are recommendations produced by std. recommender systems algorithms in terms of health?

slide-28
SLIDE 28

28

. Christoph Trattner

  • 2. October 2019

What is actually the current state-of-the- art in Food Recommenders?

Food Recommender Systems: Important Contributions, Challenges and Future Research Directions. Trattner, C. and Elsweiler, D. Collaborative Recommendations: Algorithms, Practical Challenges and Applications, World Scientific Publishing Co. Pte. Ltd., 2018

slide-29
SLIDE 29

29

. Christoph Trattner

  • 2. October 2019
slide-30
SLIDE 30

30

. Christoph Trattner

  • 2. October 2019

*** p < .001

Results: Recommender Experiment

L

Libray: LibRec Eval: 10 fold-cross validation ∆ = train − pred

slide-31
SLIDE 31

31

. Christoph Trattner

  • 2. October 2019

Can we improve std. recommender systems in terms of health?

slide-32
SLIDE 32

32

. Christoph Trattner

  • 2. October 2019

Post-Filter scoring functions Linear combinations as discussed in Elsweiler et al. (2015) did not work L

Re-ranking for health

  • D. Elsweiler, M. Harvey, B. Ludwig, and A. Said. Bringing the "healthy" into food
  • recommenders. In Proc. of DRMS’15., pages 33–36.
slide-33
SLIDE 33

33

. Christoph Trattner

  • 2. October 2019

Results: Recommender (2)

Note: similar results with bookmarks

L J

slide-34
SLIDE 34

34

. Christoph Trattner

  • 2. October 2019

Conclusions

§ Only a small percentage of Allrecipes.com recipes can be considered healthy according to WHO and FSA guidelines. § Users are to some extent able to judge how healthy categories will be, but often disagree. § Interaction data reveals that people are most positive about the unhealthy recipes. § Current state-of-the-art recommender algorithms in general produce unhealthy recommendations.

slide-35
SLIDE 35

35

. Christoph Trattner

  • 2. October 2019

Part 4: Linking Online & Offline

slide-36
SLIDE 36

36

. Christoph Trattner

  • 2. October 2019

Can we find a link between the online and

  • ffline world?
slide-37
SLIDE 37

37

. Christoph Trattner

  • 2. October 2019

Abbar, S., Mejova, Y., & Weber, I. (2015). You tweet what you eat: Studying food consumption through

  • twitter. ACM CHI 2015.
slide-38
SLIDE 38

38

. Christoph Trattner

  • 2. October 2019

Correlation between food mentions on Twitter & Obese

p=.772 s=.784

Abbar, S., Mejova, Y., & Weber, I. (2015). You tweet what you eat: Studying food consumption through twitter. ACM CHI 2015.

http://www.caloriecount.com/ § 50 million tweets § Food related keywords

slide-39
SLIDE 39

39

. Christoph Trattner

  • 2. October 2019

…in RecSys, we typically use other types

  • f signals…

Trattner, C., Parra, D. and Elsweiler, D. Monitoring obesity prevalence in the United States through bookmarking activities in online food

  • portals. PLOS ONE 12(6), 2017.

Trattner, C. and Elsweiler, D. What online data say about eating

  • habits. NATURE Sustainability, 2019.
slide-40
SLIDE 40

40

. Christoph Trattner

  • 2. October 2019

Research Questions

§ RQ1. To what extent do the nutritional properties of bookmarked recipes on Allrecipes.com correlate with

  • besity levels in the US?

§ RQ2. To what extent can temporal or geographical factors help in explaining obesity patterns? § RQ3. To what extent do nutrition factors explain the variance in obesity rates across the US?

slide-41
SLIDE 41

41

. Christoph Trattner

  • 2. October 2019

Dataset

slide-42
SLIDE 42

42

. Christoph Trattner

  • 2. October 2019

Dataset in detail

slide-43
SLIDE 43

43

. Christoph Trattner

  • 2. October 2019

Variables

Dependent Variable

  • Obesity prevalence (state / county level)

Independent Variables

  • Fat (of recipe)
  • Saturated Fat (of recipe)
  • Sugar (of recipe)
  • Sodium (of recipe)
  • Healthiness (of recipe)
slide-44
SLIDE 44

44

. Christoph Trattner

  • 2. October 2019

Results

slide-45
SLIDE 45

45

. Christoph Trattner

  • 2. October 2019

Trends over time

slide-46
SLIDE 46

46

. Christoph Trattner

  • 2. October 2019

Trends over time (zoom in)

slide-47
SLIDE 47

47

. Christoph Trattner

  • 2. October 2019
  • RQ1. To what extent do the nutritional properties of

bookmarked recipes on Allrecipes.com correlate with

  • besity levels in the US?
slide-48
SLIDE 48

48

. Christoph Trattner

  • 2. October 2019

State Level Correlations

slide-49
SLIDE 49

49

. Christoph Trattner

  • 2. October 2019

Baseline Baseline+ Time Baseline+ Time + FSA Baseline+ Time + Fat + Sugar

slide-50
SLIDE 50

50

. Christoph Trattner

  • 2. October 2019

Conclusion

§ We demonstrate significant and meaningful (i.e. sensibly interpretable) relationships between the nutritional properties of bookmarked recipes (sugar content, fat content and a combined FSA-score for recipes) and obesity incidence. § The good fit achieved by our models suggests that combining interaction data, geographical data and temporal data can be a useful in monitoring

  • besity incidence.
slide-51
SLIDE 51

51

. Christoph Trattner

  • 2. October 2019

Part 5: Factors Influencing Online Food Choice

slide-52
SLIDE 52

52

. Christoph Trattner

  • 2. October 2019

Why do people like the unhealthy recipes more?

Trattner, C., Moesslang, D. and Elsweiler, D. On the Predictability of the Popularity of Online Recipes. EPJ Data Science, 2018.

slide-53
SLIDE 53

53

. Christoph Trattner

  • 2. October 2019

...from the social psychology literature we know that there are several biases involved in when people cook or select food, e.g. social & cultural factors, season, healthiness, visual appeal

What makes a recipe actually to be chosen/popular?

Scheibehenne, B., Miesler, L., and Todd, P.M. (2007). Fast and frugal food choices: Uncovering individual decision heuristics. Appetite, 49, 578-589.

slide-54
SLIDE 54

54

. Christoph Trattner

  • 2. October 2019

Predicting Recipe Popularity = Item Cold-Start Prediction

slide-55
SLIDE 55

55

. Christoph Trattner

  • 2. October 2019

Datasets

  • Allrecipes.com
  • Kochbar.de
slide-56
SLIDE 56

56

. Christoph Trattner

  • 2. October 2019

Exploratory Data Analysis

slide-57
SLIDE 57

57

. Christoph Trattner

  • 2. October 2019

Temporality?

Clear temporal patterns emerge

slide-58
SLIDE 58

58

. Christoph Trattner

  • 2. October 2019

Homophilie? Location?

ingredients type

slide-59
SLIDE 59

59

. Christoph Trattner

  • 2. October 2019

Visual Attractiveness?

San Pedro J, Siersdorfer S (2009) Ranking and classifying attractiveness of photos in folksonomies. In: Proceedings of the 18th international conference on world wide web. WWW ’09. ACM, New York, pp 771–780

slide-60
SLIDE 60

60

. Christoph Trattner

  • 2. October 2019

Other factors

  • Recipe complexity
  • Instruction: Num. Words
  • Instruction: Num. Sentences
  • Entropy
  • LIX
  • Recipe innovation
slide-61
SLIDE 61

61

. Christoph Trattner

  • 2. October 2019

Predicting popularity: Kochbar.de

Social Features

Random Forrest: 90% accuracy Social factors most important

slide-62
SLIDE 62

62

. Christoph Trattner

  • 2. October 2019

Predicting popularity: Allrecipes.com

Social Features

Random Forrest 70% accuracy Innovation & Image factors important

slide-63
SLIDE 63

63

. Christoph Trattner

  • 2. October 2019

How about other food cultures?

Zhang, Q., Trattner, C., Elsweiler, D. and Ludwig, B. Identifying Cross-Cultural Visual Food Tastes with Online Recipe

  • Platforms. In Proceedings of the 11th International AAAI

conference on Web and Social Media (ICWSM), 2019.

slide-64
SLIDE 64

64

. Christoph Trattner

  • 2. October 2019

China?

slide-65
SLIDE 65

65

. Christoph Trattner

  • 2. October 2019

Cross-Country Prediction

slide-66
SLIDE 66

66

. Christoph Trattner

  • 2. October 2019

Other Factors! Gender & Food?

slide-67
SLIDE 67

67

. Christoph Trattner

  • 2. October 2019

Impact of gender

  • Prof. Dr. Eva Barlösius

Head of Leibniz Forschungszentrum Wissenschaft und Gesellschaft (LCSS)

Rocicki, M., Herder, E., Kusmierczyk, T. and Trattner, C. Plate and Prejudice: Gender Differences in Online

  • Cooking. In Proceedings of the International Conference on User Modeling and Personalisation (UMAP), 2016.
slide-68
SLIDE 68

68

. Christoph Trattner

  • 2. October 2019
  • H1. Men Are Better Cooks
  • H2. Men Cook for Impressing
  • H3. Women Prefer to Cook Sweet Dishes,

Men Prefer to Cook Meat Dishes

  • H4. Women Use Spices More Subtly
  • H5. Men Use More Gadgets for Cooking
  • H6. Men Are More Innovative

Hypotheses

Among recipes published by female cooks, 16.5% were identified as sweet dishes, significantly more than the fraction of 7.8% for male cooks

slide-69
SLIDE 69

69

. Christoph Trattner

  • 2. October 2019

To what extent can we identify the gender

  • f the recipe authors?

Feature Importance Classification Results RF=Random Forrest, LR=Logistic Regression, AB=Ada Boost Most important Higher = better

slide-70
SLIDE 70

70

. Christoph Trattner

  • 2. October 2019

Gender-aware recommendations (predicting the recipes a user will like)

Most Popular Most Popular with gender Higher = better Collaborative Filtering Collaborative Filtering with gender Gender aware methods always betterJ

slide-71
SLIDE 71

71

. Christoph Trattner

  • 2. October 2019

Part 6: Altering Food Choice

slide-72
SLIDE 72

72

. Christoph Trattner

  • 2. October 2019

Can we alter food choices with recommenders?

slide-73
SLIDE 73

73

. Christoph Trattner

  • 2. October 2019

Study

Elsweiler, D.*, Trattner, C.* and Harvey, M. (* equal contribution). Exploiting Food Choice Biases for Healthier Recipe Recommendation. In Proceedings

  • f the ACM SIGIR Conference (SIGIR), 2017.
slide-74
SLIDE 74

74

. Christoph Trattner

  • 2. October 2019

Which one of the two would you choose?

slide-75
SLIDE 75

75

. Christoph Trattner

  • 2. October 2019

User feedback Title Ingredients Image Nutrition

slide-76
SLIDE 76

76

. Christoph Trattner

  • 2. October 2019

User Study

Q: Which one of the two would you choose?

slide-77
SLIDE 77

77

. Christoph Trattner

  • 2. October 2019

User Study 1 User Study 2

slide-78
SLIDE 78

78

. Christoph Trattner

  • 2. October 2019

Nudging People Towards Healthy Food Choices

Developed an algorithm that can nudge people towards healthy food choices through images J

Less fat More fat

Exploiting Food Choice Biases for Healthier Recipe Recommendation. Elsweiler, D.*, Trattner, C.* and Harvey, M. (* equal contribution). In Proceedings of the ACM SIGIR Conference (SIGIR), 2017.

slide-79
SLIDE 79

79

. Christoph Trattner

  • 2. October 2019

Part 7: The Future & Conclusions

slide-80
SLIDE 80

80

. Christoph Trattner

  • 2. October 2019

What are we currently working on? Sustainable Food Recommender Systems

What online data say about eating habits. Trattner, C. and Elsweiler, D. NATURE Sustainability, 2019

slide-81
SLIDE 81

81

. Christoph Trattner

  • 2. October 2019

Thank you!

Assoc Prof. Christoph Trattner

Email: trattner.christoph@gmail.com Web: christophtrattner.com Twitter: @ctrattner

slide-82
SLIDE 82

82

. Christoph Trattner

  • 2. October 2019
slide-83
SLIDE 83

83

. Christoph Trattner

  • 2. October 2019

Some References

q Food Recommender Systems: Important Contributions, Challenges and Future Research Directions. Trattner, C. and Elsweiler, D. Collaborative Recommendations: Algorithms, Practical Challenges and Applications, World Scientific Publishing Co. Pte. Ltd., 2018 q Monitoring obesity prevalence in the United States through bookmarking activities in

  • nline food portals. Trattner, C., Parra, D. and Elsweiler, D. PLOS ONE 12(6), 2017.

q Exploiting Food Choice Biases for Healthier Recipe Recommendation. Elsweiler, D.*, Trattner, C.* and Harvey, M. (* equal contribution). In Proceedings of the ACM SIGIR Conference (SIGIR), 2017. q Investigating the Healthiness of Internet-Sourced Recipes: Implications for Meal Planning and Recommender Systems. Trattner, C. and Elsweiler, D. In Proceedings of the World Wide Web Conference (WWW), 2017. q Estimating the Healthiness of Internet Recipes: A Cross-Sectional Study. Trattner, C. Elsweiler, D. and Simon, H. Frontiers in Public Health, 2017. q Plate and Prejudice: Gender Differences in Online Cooking. Rocicki, M., Herder, E., Kusmierczyk, T. and Trattner, C. In Proceedings of the International Conference on User Modeling and Personalisation (UMAP), 2016. q Understanding and Predicting Online Food Production Patterns. Kusmierczyk, T., Trattner, C. and Norvag, K. In Proceedings of the ACM Conference on Hypertext and Social Media (Hypertext), 2016.