Agenda 1. Visualizing predictions in R (lab from Tuesday) 2. - - PowerPoint PPT Presentation

agenda
SMART_READER_LITE
LIVE PREVIEW

Agenda 1. Visualizing predictions in R (lab from Tuesday) 2. - - PowerPoint PPT Presentation

Agenda 1. Visualizing predictions in R (lab from Tuesday) 2. Parsimony and Occams razor 3. Illustrating overfitting with test and training data 4. Information criteria as formal measures of (over)fit 5. Review of linear model building


slide-1
SLIDE 1

Agenda

1

  • 1. Visualizing predictions in R


(lab from Tuesday)

  • 2. Parsimony and Occam’s razor
  • 3. Illustrating overfitting with test

and training data

  • 4. Information criteria as formal

measures of (over)fit

  • 5. Review of linear model building
  • 6. Comparing criteria in R
slide-2
SLIDE 2

Occam’ s razor

2

How many buildings?

slide-3
SLIDE 3

Occam’ s razor

3

M1: Four buildings M2: Five buildings

Pr(M|D) Pr(M2|D) = Pr(M) Pr(M2) Pr(D|M) Pr(D|M2)

<latexit sha1_base64="P1EKryKCeEPVWRVBfcvIFgKPcPo=">AD7nichVLbhMxFHUaHiW8WmCHQBYVUitF1UzaJrCIVJWKdtGWQF+RkijyOHdSq/bMyPa0RO7s+QJ2iC2/xE/wDdyZKVKf4kojXZ97HM8PkEihbGe97syVb1z9796Qe1h48eP3k6M/vswMSp5rDPYxnrbsAMSBHBvhVWQjfRwFQg4TA4/pDPD09AGxFHe3aSwECxcSRCwZlFaDjzrR9qxl1fMXuklevobH576J+tL2RXsEaO0Tat3bjhOn0hu4G5fnadm2PIHs7MeYu+t9RqNCk2zZWlVt40lv3lpkf9Ra+ouVKiuoMZ6d2+qOYpwoiyUzpud7iR04pq3gElA+NZAwfszG0M2YgrMwBW/LKNvERnRMNb4RZYW6MUdjiljJipAZu7VXJ3l4I2zr6VArT+CEN+kWDk1CWQKmfuysZa5pZX6SrPep/VLlu04buBE1GSWoh46TBMJbUxzR+OjoQGbuUEG8a1wEtSfsTwD1t83hrFumijMJYAz3CSW2GptPlNQ2A21WDclhiXsPuvVdziS0t5nlR1vxKWiOkaoX6Wmfey8VDFjFRJSf1Bu4TZAngEY3YGcUo5ROJ/Pr4uxsKa+hUmL6hsa4Hiht5OqAO2UT4S0bje0XES6zyKTGaXBG8/Xavef4i7YHfx7WL5EX04yfQYTLE2mYtirVDLfdruZk6conDmFMJZDRP4L2b09uagSld9D8vz62ulVEk0+QleUPmiU9aZJVskg7ZJ5z8qbyovKq8ribV79Uf1Z8ldapyvuc5uVTVX38Bn2pP6Q=</latexit><latexit sha1_base64="P1EKryKCeEPVWRVBfcvIFgKPcPo=">AD7nichVLbhMxFHUaHiW8WmCHQBYVUitF1UzaJrCIVJWKdtGWQF+RkijyOHdSq/bMyPa0RO7s+QJ2iC2/xE/wDdyZKVKf4kojXZ97HM8PkEihbGe97syVb1z9796Qe1h48eP3k6M/vswMSp5rDPYxnrbsAMSBHBvhVWQjfRwFQg4TA4/pDPD09AGxFHe3aSwECxcSRCwZlFaDjzrR9qxl1fMXuklevobH576J+tL2RXsEaO0Tat3bjhOn0hu4G5fnadm2PIHs7MeYu+t9RqNCk2zZWlVt40lv3lpkf9Ra+ouVKiuoMZ6d2+qOYpwoiyUzpud7iR04pq3gElA+NZAwfszG0M2YgrMwBW/LKNvERnRMNb4RZYW6MUdjiljJipAZu7VXJ3l4I2zr6VArT+CEN+kWDk1CWQKmfuysZa5pZX6SrPep/VLlu04buBE1GSWoh46TBMJbUxzR+OjoQGbuUEG8a1wEtSfsTwD1t83hrFumijMJYAz3CSW2GptPlNQ2A21WDclhiXsPuvVdziS0t5nlR1vxKWiOkaoX6Wmfey8VDFjFRJSf1Bu4TZAngEY3YGcUo5ROJ/Pr4uxsKa+hUmL6hsa4Hiht5OqAO2UT4S0bje0XES6zyKTGaXBG8/Xavef4i7YHfx7WL5EX04yfQYTLE2mYtirVDLfdruZk6conDmFMJZDRP4L2b09uagSld9D8vz62ulVEk0+QleUPmiU9aZJVskg7ZJ5z8qbyovKq8ribV79Uf1Z8ldapyvuc5uVTVX38Bn2pP6Q=</latexit><latexit sha1_base64="P1EKryKCeEPVWRVBfcvIFgKPcPo=">AD7nichVLbhMxFHUaHiW8WmCHQBYVUitF1UzaJrCIVJWKdtGWQF+RkijyOHdSq/bMyPa0RO7s+QJ2iC2/xE/wDdyZKVKf4kojXZ97HM8PkEihbGe97syVb1z9796Qe1h48eP3k6M/vswMSp5rDPYxnrbsAMSBHBvhVWQjfRwFQg4TA4/pDPD09AGxFHe3aSwECxcSRCwZlFaDjzrR9qxl1fMXuklevobH576J+tL2RXsEaO0Tat3bjhOn0hu4G5fnadm2PIHs7MeYu+t9RqNCk2zZWlVt40lv3lpkf9Ra+ouVKiuoMZ6d2+qOYpwoiyUzpud7iR04pq3gElA+NZAwfszG0M2YgrMwBW/LKNvERnRMNb4RZYW6MUdjiljJipAZu7VXJ3l4I2zr6VArT+CEN+kWDk1CWQKmfuysZa5pZX6SrPep/VLlu04buBE1GSWoh46TBMJbUxzR+OjoQGbuUEG8a1wEtSfsTwD1t83hrFumijMJYAz3CSW2GptPlNQ2A21WDclhiXsPuvVdziS0t5nlR1vxKWiOkaoX6Wmfey8VDFjFRJSf1Bu4TZAngEY3YGcUo5ROJ/Pr4uxsKa+hUmL6hsa4Hiht5OqAO2UT4S0bje0XES6zyKTGaXBG8/Xavef4i7YHfx7WL5EX04yfQYTLE2mYtirVDLfdruZk6conDmFMJZDRP4L2b09uagSld9D8vz62ulVEk0+QleUPmiU9aZJVskg7ZJ5z8qbyovKq8ribV79Uf1Z8ldapyvuc5uVTVX38Bn2pP6Q=</latexit><latexit sha1_base64="zrtFqBbCFjWzi1cz2t1vlLV5miw=">AD7nichVJdb9MwFE1XPkb52oA3BLKYkDYpqpKua+Gh0jQmtodtFPZVqa0qx71prdlJZDsblZd3fgFviFf+En+C38BNWqR9itFuj732Oc4PkEiuDae97s0V75z979+QeVh48eP3m6sPjsSMepYnDIYhGrTkA1CB7BoeFGQCdRQGUg4Dg4+ZDPj09BaR5HB2aSQF/SUcRDzqhBaLDwrRcqymxPUjNW0rZVtrw78M83V7IrWC3HSItUbtxwnb6S3cDcPL/OzTFkDxaWvKrvrTZrDYJNY21mTe1ul9veMSvekUtObNqDxbn9nrDmKUSIsME1bre4npW6oMZwJQPtWQUHZCR9DFNqISdN8WvywjbxEZkjBW+EWGFOjFHZKrScyQGbuV+d5eCNs69TgUpvCG+SbGychKIFDL7ZWsjs6tr7lrDb7PKpctmvBd3/IoSQ1EbOowTAUxMckfjgy5AmbEBvKFMdLEjam+IcNPm+FYF20URhLgGU4ya3QVJj8piFQkyrQdoePpl3rADruPqMCWrvUsPFOfAaKYaTcIj2tmfepgYjKY/yk7p9uw3iFNAIJXuQU6ZjFM7ny5t8xI12dzBpkbulAE5WunupDNBuC+UjHo3ctoqTWOVRpCK7JHj76Up2/0PcB7OPbxeLj+jDCqpGoIu1zmwUK4la9tNuJ7P8DIUzKxHOKpjAfzEjtzdHNUxp1f9cX1rfmGVx3npvHGWHd9pOuvOtN2Dh3m/Cm9KL0qvS4n5e/lH+WfU+pcabnuXOpyr/+Ao4bT4k=</latexit>

Model likelihood Simpler models rely less on coincidence

Pr(D|M) Pr(D|M2) >

<latexit sha1_base64="YWSdDoXWvwSYrC/3D27s0a8dK1g=">ADl3ichVLbThsxEDWkF5reoH2q+mIVQJphXYDJO1DWkQR8ABpKLdISR5ndlgYXtXthcauf6Ofk1f2/o39S7SyVoizqSpfHM2OfMzIkzrQJw58zs7U7d+/dn3tQf/jo8ZOn8wvPTnSaKwrHNOWp6sVEA2cSjg0zHqZAiJiDqfx+Ycif3oBSrNUHplpBkNBJpIljBLjQ6P5aJAoQu1AEHOmhO0qt7T1ZX8ULbu/Yo1lh9/haDS/GK5E4Wqr0cTea6vtgqnsRatNUMcrYSlLW5gVFp3tDbGYxTmguQhnKidT8KMzO0RBlGObj6INeQEXpOJtD3riQC9NCWvTn82kfGOEmVP9LgMnr9hSVC6mIfWXBV/+ZK4L/zH2uAOqDMSR+eOXNimnMc3D2086ms6vrwXozaL19ZsUTfJmaJnMcgOSVgyTnGOT4mLCeMwUMOn3iFUMd8kpmfET9n4PdSxt+s0SmIZUOczBRWSc1N0mgAxuQJt9ik8tpH0AsOKeHQ3ieGnu2l6Co31Qrl9xb1C0GAEYbL4qT+0u8AvwBMhuANFSZX2wEV+aYtNmNHBnpeEDHYUwPlyv5OL2Nte3jJ5CToqjRLVaEZwt0NwNt/V6L/n8JDMId+dynf9jwsJ2oCurxrZ2WqhMeyH/d7zrJLD+ys8GFX9wr8LTN8u3PS8CpdiQ7WFjc2KymiOfQSvUJLKEItIF2URcdI4q+om/oO/pRe1F7X9u7ValszNXb56jG1Y7+AXpfC8Y</latexit><latexit sha1_base64="YWSdDoXWvwSYrC/3D27s0a8dK1g=">ADl3ichVLbThsxEDWkF5reoH2q+mIVQJphXYDJO1DWkQR8ABpKLdISR5ndlgYXtXthcauf6Ofk1f2/o39S7SyVoizqSpfHM2OfMzIkzrQJw58zs7U7d+/dn3tQf/jo8ZOn8wvPTnSaKwrHNOWp6sVEA2cSjg0zHqZAiJiDqfx+Ycif3oBSrNUHplpBkNBJpIljBLjQ6P5aJAoQu1AEHOmhO0qt7T1ZX8ULbu/Yo1lh9/haDS/GK5E4Wqr0cTea6vtgqnsRatNUMcrYSlLW5gVFp3tDbGYxTmguQhnKidT8KMzO0RBlGObj6INeQEXpOJtD3riQC9NCWvTn82kfGOEmVP9LgMnr9hSVC6mIfWXBV/+ZK4L/zH2uAOqDMSR+eOXNimnMc3D2086ms6vrwXozaL19ZsUTfJmaJnMcgOSVgyTnGOT4mLCeMwUMOn3iFUMd8kpmfET9n4PdSxt+s0SmIZUOczBRWSc1N0mgAxuQJt9ik8tpH0AsOKeHQ3ieGnu2l6Co31Qrl9xb1C0GAEYbL4qT+0u8AvwBMhuANFSZX2wEV+aYtNmNHBnpeEDHYUwPlyv5OL2Nte3jJ5CToqjRLVaEZwt0NwNt/V6L/n8JDMId+dynf9jwsJ2oCurxrZ2WqhMeyH/d7zrJLD+ys8GFX9wr8LTN8u3PS8CpdiQ7WFjc2KymiOfQSvUJLKEItIF2URcdI4q+om/oO/pRe1F7X9u7ValszNXb56jG1Y7+AXpfC8Y</latexit><latexit sha1_base64="YWSdDoXWvwSYrC/3D27s0a8dK1g=">ADl3ichVLbThsxEDWkF5reoH2q+mIVQJphXYDJO1DWkQR8ABpKLdISR5ndlgYXtXthcauf6Ofk1f2/o39S7SyVoizqSpfHM2OfMzIkzrQJw58zs7U7d+/dn3tQf/jo8ZOn8wvPTnSaKwrHNOWp6sVEA2cSjg0zHqZAiJiDqfx+Ycif3oBSrNUHplpBkNBJpIljBLjQ6P5aJAoQu1AEHOmhO0qt7T1ZX8ULbu/Yo1lh9/haDS/GK5E4Wqr0cTea6vtgqnsRatNUMcrYSlLW5gVFp3tDbGYxTmguQhnKidT8KMzO0RBlGObj6INeQEXpOJtD3riQC9NCWvTn82kfGOEmVP9LgMnr9hSVC6mIfWXBV/+ZK4L/zH2uAOqDMSR+eOXNimnMc3D2086ms6vrwXozaL19ZsUTfJmaJnMcgOSVgyTnGOT4mLCeMwUMOn3iFUMd8kpmfET9n4PdSxt+s0SmIZUOczBRWSc1N0mgAxuQJt9ik8tpH0AsOKeHQ3ieGnu2l6Co31Qrl9xb1C0GAEYbL4qT+0u8AvwBMhuANFSZX2wEV+aYtNmNHBnpeEDHYUwPlyv5OL2Nte3jJ5CToqjRLVaEZwt0NwNt/V6L/n8JDMId+dynf9jwsJ2oCurxrZ2WqhMeyH/d7zrJLD+ys8GFX9wr8LTN8u3PS8CpdiQ7WFjc2KymiOfQSvUJLKEItIF2URcdI4q+om/oO/pRe1F7X9u7ValszNXb56jG1Y7+AXpfC8Y</latexit><latexit sha1_base64="3YAtuB5nKdy/ueksyW1v0+kFjns=">ADl3ichVLbhMxEHUbLiXcWnhCvFhUSK20qnbTNoGHQFWqtg9NSektUhJFXmc2tWJ7V7a3JTL+Dr6GV/gG/gbvJkgtUDGSpfHM2OfMzIkzrQJw59z85U7d+/dX3hQfjo8ZOni0vPznSaKwqnNOWp6sREA2cSTg0zHDqZAiJiDufx+EORP78EpVkqT8wkg74gI8kSRonxocFi1EsUobYniLlQwraVW9n50hpEq+6vWG3V4Xc4Giwuh2tRuN6o1bF36pvrjcKpbUQb9RBHa2Fpy2hm7cHS/GFvmNJcgDSUE627UZiZviXKMrBVXu5hozQMRlB17uSCNB9W/bm8GsfGeIkVf5Ig8vo9ReWCK0nIvaVBV/9Z64I/jP3eQpQ7Q0h8cMrb1ZMYp6Ds5/2tp1d3w260HjravepGiSN3LZJYbkHTKMk5NikuJoyHTAE1fOIdQhXzTWJ6QfyUjd9DFXu7TqMklgF1PlNQITk3RacJEJMr0PaAjaZe8wQ6wTElHJotYujFQXoFivrdB+WamzPuUwQNRhAmi5+6fbsP/BI8EYIPoSiZpj1wkV/ZYSNmdHDgJSGDPQUwXu0e5iL2dJseXjI5CtoqzVJVaIZwdwPw9t+V6P6n8BjMsd9dync9D8uJGoEu79pZmSrhsezHVsdZduWBnRU+7Kpegb9lhm93zmpepWvR0cby1vZMiwvoJXqFVlCEGmgL7aM2OkUfUXf0Hf0o/Ki8r6yW9mfls7Pzd48RzescvQL2C0uA=</latexit>

Simpler models are easier to interpret or more compelling A priori

Pr(M) Pr(M2) >

<latexit sha1_base64="W1/6tdpd6zNvuhYJhz4YM3iLwA=">ADk3ichVLbhMxEHUbLiXcWhBPvFhUSK20VLtJm4BQUFUq2oemBHqLlESR15lNrdrele1tiSx/BV/DK3wFf4N3t0htRcVIlsYzY58zMyfONMmDH/Pzdfu3L13f+FB/eGjx0+eLi49O9Zprigc0ZSnqh8TDZxJODLMcOhnCoiIOZzEZx+L/Mk5KM1SeWhmGYwEmUqWMEqMD40X3wTRagdCmJOlbA95Va642jV3Yg0Vh3+gKPx4nK4FoXNdqOFvdPaLYLp7EerbdCHK2FpS1vYlRab7w0vz+cpDQXIA3lROtBFGZmZIkyjHJw9WGuISP0jExh4F1JBOiRLfty+LWPTHCSKn+kwWX06gtLhNYzEfvKgq2+mSuC/8x9qwDqwkfnDlzYpZzHNw9uvOlrPNjWCjFbTfufp1iZ5O7JMZrkBSuGSc6xSXExXTxhCqjhM+8QqphvEtNT4ids/A7q2NtVGiWxDKjzmYIKybkpOk2AmFyBtntsWnmdQ+gHB5Rw6HSJoad76QUo6vcelCvuXHKvEDQYQZgsfhqM7C7wc/BECN6HoqRKe+Aiv7LNpszoYM/LQY7CuBsdbCfi9jT7Xh4yeQ06Kk0S1WhF8LdNcDbf1di8J/CAzAHfncp/+R5WE7UFHR5187KVAmPZT93+86yCw/srPBhV/cK/CszfLtz3PAqXYu+rC9vblVSRAvoJXqFVlCE2mgT7aIeOkIUfUc/0E/0q/ai9r62VduSufnLt8R9es1v0DhMstcA=</latexit><latexit sha1_base64="W1/6tdpd6zNvuhYJhz4YM3iLwA=">ADk3ichVLbhMxEHUbLiXcWhBPvFhUSK20VLtJm4BQUFUq2oemBHqLlESR15lNrdrele1tiSx/BV/DK3wFf4N3t0htRcVIlsYzY58zMyfONMmDH/Pzdfu3L13f+FB/eGjx0+eLi49O9Zprigc0ZSnqh8TDZxJODLMcOhnCoiIOZzEZx+L/Mk5KM1SeWhmGYwEmUqWMEqMD40X3wTRagdCmJOlbA95Va642jV3Yg0Vh3+gKPx4nK4FoXNdqOFvdPaLYLp7EerbdCHK2FpS1vYlRab7w0vz+cpDQXIA3lROtBFGZmZIkyjHJw9WGuISP0jExh4F1JBOiRLfty+LWPTHCSKn+kwWX06gtLhNYzEfvKgq2+mSuC/8x9qwDqwkfnDlzYpZzHNw9uvOlrPNjWCjFbTfufp1iZ5O7JMZrkBSuGSc6xSXExXTxhCqjhM+8QqphvEtNT4ids/A7q2NtVGiWxDKjzmYIKybkpOk2AmFyBtntsWnmdQ+gHB5Rw6HSJoad76QUo6vcelCvuXHKvEDQYQZgsfhqM7C7wc/BECN6HoqRKe+Aiv7LNpszoYM/LQY7CuBsdbCfi9jT7Xh4yeQ06Kk0S1WhF8LdNcDbf1di8J/CAzAHfncp/+R5WE7UFHR5187KVAmPZT93+86yCw/srPBhV/cK/CszfLtz3PAqXYu+rC9vblVSRAvoJXqFVlCE2mgT7aIeOkIUfUc/0E/0q/ai9r62VduSufnLt8R9es1v0DhMstcA=</latexit><latexit sha1_base64="W1/6tdpd6zNvuhYJhz4YM3iLwA=">ADk3ichVLbhMxEHUbLiXcWhBPvFhUSK20VLtJm4BQUFUq2oemBHqLlESR15lNrdrele1tiSx/BV/DK3wFf4N3t0htRcVIlsYzY58zMyfONMmDH/Pzdfu3L13f+FB/eGjx0+eLi49O9Zprigc0ZSnqh8TDZxJODLMcOhnCoiIOZzEZx+L/Mk5KM1SeWhmGYwEmUqWMEqMD40X3wTRagdCmJOlbA95Va642jV3Yg0Vh3+gKPx4nK4FoXNdqOFvdPaLYLp7EerbdCHK2FpS1vYlRab7w0vz+cpDQXIA3lROtBFGZmZIkyjHJw9WGuISP0jExh4F1JBOiRLfty+LWPTHCSKn+kwWX06gtLhNYzEfvKgq2+mSuC/8x9qwDqwkfnDlzYpZzHNw9uvOlrPNjWCjFbTfufp1iZ5O7JMZrkBSuGSc6xSXExXTxhCqjhM+8QqphvEtNT4ids/A7q2NtVGiWxDKjzmYIKybkpOk2AmFyBtntsWnmdQ+gHB5Rw6HSJoad76QUo6vcelCvuXHKvEDQYQZgsfhqM7C7wc/BECN6HoqRKe+Aiv7LNpszoYM/LQY7CuBsdbCfi9jT7Xh4yeQ06Kk0S1WhF8LdNcDbf1di8J/CAzAHfncp/+R5WE7UFHR5187KVAmPZT93+86yCw/srPBhV/cK/CszfLtz3PAqXYu+rC9vblVSRAvoJXqFVlCE2mgT7aIeOkIUfUc/0E/0q/ai9r62VduSufnLt8R9es1v0DhMstcA=</latexit><latexit sha1_base64="tpCG78nxbqVCY3r65DR0hD3wgD8=">ADk3ichVLbhMxEHUbLiXcWhBPvFhUSK20VLtpm4BQUFUq2oemBHqLlESR15lNrNjele1tiSx/BV/DK3wFf4N3E6S2omIkS+OZsc+ZmRNnGkThr8XFit37t67v/Sg+vDR4ydPl1enek0VxROacpT1YmJBs4knBpmOHQyBUTEHM7jycif34BSrNUnphpBn1BRpIljBLjQ4PlN71EWp7gpixErat3FprEK27G5HausMfcDRYXg03onCzUatj79S3NxuFU9uKtuohjbC0lbR3NqDlcWj3jCluQBpKCdad6MwM31LlGUg6v2cg0ZoRMygq53JRGg+7bsy+HXPjLESar8kQaX0asvLBFaT0XsKwu2+mauCP4z920GUO0NIfGDK29WTGOeg7Nf93ed3dwOtutB452rXqdokrd9y2SWG5B0xjDJOTYpLqaLh0wBNXzqHUIV801iOiZ+wsbvoIq9XaVREsuAOp8pqJCcm6LTBIjJFWh7yEYzr3kCneCYEg7NFjF0fJhegqJ+70G54uac+wxBgxGEyeKnbt8eAL8AT4TgIyhKZmkPXOTX9tiIGR0cejnIYF8BTNa7R7mIPd2mh5dMjoK2SrNUFXoh3F0DvP13Jbr/KTwGc+x3l/JPnoflRI1Al3ftrEyV8Fj2c6vjLv0wM4KH3ZVr8C/MsO3O2c1r9KN6MvW6s7uXItL6CV6hdZQhBpoBx2gNjpFH1HP9BP9KvyovK+slvZm5UuLszfPEfXrNL6A3N8LRA=</latexit>
slide-4
SLIDE 4

Overfitting

4

  • −1

1 2 4 6 8 10 12

Age (standardized) Log income

  • −1

1 2 4 6 8 10 12

Age (standardized) Log income

  • −1

1 2 4 6 8 10 12

Age (standardized) Log income

  • −1

1 2 4 6 8 10 12

Age (standardized) Log income

Linear Cubic Quadratic Order-10 polynomial

slide-5
SLIDE 5

Test and training data

5

D=22.66

Fit model on half of the data. Training data

−1 1 2 7 8 9 10 11

Age (standardized) Log income

D=4885.21

Assess fit on the other half of the data. Test data

−1 1 2 7 8 9 10 11

Age (standardized) Log income

slide-6
SLIDE 6

Akaike information criterion

6

Interpretation 2 Model the average difference in deviance between training and test data. Penalize deviance score for each added parameter by some ‘reasonable’ value. Interpretation 1

Sample size ≫ number of parameters (k)
 Priors have minimal influence (flat or lots of data)
 Posterior is approximately (multivariate) normal

D = −2 log(Pr(data|θ)Pr(θ)) AIC = −2 log(Pr(data|θ)Pr(θ)) + 2k

= D + 2k

<latexit sha1_base64="Ff7rA4zOCWug/tEr/Ng975e+pus=">AEHicpVLdbtMwFE5Xfkb46+CSG4uJaROhStKt3S4qyjaxIW2jMLZVaqrJcU9Tq86PbGejMnkJHoMn4A5xyxvwNjJmFZ+xAVHjvT5fMfnfI4/P2FUSNv+XpmrXrt+4+b8LfP2nbv37tcWHhyLOUEjkjMYt7zsQBGIziSVDLoJRxw6DM48SdbOX9yBlzQOHonpwkMQhxEdEQJljp1WvkhViOeai2s6X2M9djcbD8M9Xl2SUeYomzD54cg8QrMwVlasXzHxdMC9ebf1nP/QUuRPdcamNtgtsntYW7brTajnNBrLr7nrLtR0NGutuw3GRU7eLWOwgo4ju6cLcgTeMSRpCJAnDQvQdO5EDhbmkhEFmeqmABJMJDqCvYRDEANV/NQMPdGZIRrFXH+REX26gmFQyGmoa8rc/niVy5P/pF7Xw4wvSGM9KsVOxVOfZCpt7ubGaqsWatNa3WRmbOSpSj9YGiUZJKiEipcJQyJGOUPy0aUg5EsqkGmHCqL4nIGHNMpDaAiXRclVEIS4Bkmsml4JTJ/KYjwDLlINQeDUrUfgc965BgBu19LMl4Lz4HTrTprMJf7Qvt5QBMsQ0yjv1B2oX2BloIRgdQF5S0npwzi9v04BKYe1pL0bWDgeYrPQP0tDXct6fESjwOryOIl5blbMspmBf+/Ow/4/Cg9BHuq3i9lLrUMxzAMQxV5kKop5qGep1/u9TNFzPThToU5npQM38mhe+u13cOzWnUZ9c3qYmeztKIxbzwyHhvLhmO0jI6xa3SNI4NUapVm5XmlU/1Y/Vz9Uv1als5VLs48NGai+u0Hhc1bHg=</latexit>
slide-7
SLIDE 7

Akaike information criterion

7

Measure of model fit Penalty for model complexity

D 2 log Pr data θ Pr θ AIC = −2 log(Pr(data|θ)Pr(θ)) + 2k D 2k

<latexit sha1_base64="Ff7rA4zOCWug/tEr/Ng975e+pus=">AEHicpVLdbtMwFE5Xfkb46+CSG4uJaROhStKt3S4qyjaxIW2jMLZVaqrJcU9Tq86PbGejMnkJHoMn4A5xyxvwNjJmFZ+xAVHjvT5fMfnfI4/P2FUSNv+XpmrXrt+4+b8LfP2nbv37tcWHhyLOUEjkjMYt7zsQBGIziSVDLoJRxw6DM48SdbOX9yBlzQOHonpwkMQhxEdEQJljp1WvkhViOeai2s6X2M9djcbD8M9Xl2SUeYomzD54cg8QrMwVlasXzHxdMC9ebf1nP/QUuRPdcamNtgtsntYW7brTajnNBrLr7nrLtR0NGutuw3GRU7eLWOwgo4ju6cLcgTeMSRpCJAnDQvQdO5EDhbmkhEFmeqmABJMJDqCvYRDEANV/NQMPdGZIRrFXH+REX26gmFQyGmoa8rc/niVy5P/pF7Xw4wvSGM9KsVOxVOfZCpt7ubGaqsWatNa3WRmbOSpSj9YGiUZJKiEipcJQyJGOUPy0aUg5EsqkGmHCqL4nIGHNMpDaAiXRclVEIS4Bkmsml4JTJ/KYjwDLlINQeDUrUfgc965BgBu19LMl4Lz4HTrTprMJf7Qvt5QBMsQ0yjv1B2oX2BloIRgdQF5S0npwzi9v04BKYe1pL0bWDgeYrPQP0tDXct6fESjwOryOIl5blbMspmBf+/Ow/4/Cg9BHuq3i9lLrUMxzAMQxV5kKop5qGep1/u9TNFzPThToU5npQM38mhe+u13cOzWnUZ9c3qYmeztKIxbzwyHhvLhmO0jI6xa3SNI4NUapVm5XmlU/1Y/Vz9Uv1als5VLs48NGai+u0Hhc1bHg=</latexit>
slide-8
SLIDE 8

Information criteria

8

Akaike Information Criterion (AIC) Deviance at MAP estimate Number of parameters “Bayesian” Information Criterion (BIC) Deviance at MAP estimate #parameters times log(#observations) Deviance Information Criterion (DIC) Deviance averaged across posterior “Effective” #parameters
 (posterior) Widely Applicable Information Criterion (WAIC) Deviance averaged across observations “Effective” #parameters
 (posterior and obs.)

Criterion Fit Penalty

slide-9
SLIDE 9

Using information criteria

9

Pick the model with the lowest value. Strategy 1

WAIC(M1) = 209.0; WAIC(M2) = 208.1
 M2 is the winner

Report several models along with values. Strategy 2

Multi-model table showing estimates for different combinations of coefficients, along with WAIC

Average predictions across models. Strategy 3

Simultaneous posterior predictions from all models, weighted by WAIC

slide-10
SLIDE 10

Building linear models

10

Independent variables chosen address theoretical concerns Theoretical relevance

Test theoretical predictions, account for theorized connections

Independent variables chosen to make robust causal claims Causal inference

Worry about including confounders, omitting colliders, and thinking through role of moderating and mediating variables

Independent variables chosen to maximize predictive power Predictive accuracy

Accuracy of out-of-sample predictions;
 Interpretation of models with many moving parts

Considerations when choosing covariates

Information criteria are for this