Unsupervised Learning, K Means March 12, 2020 Data Science CSCI - - PowerPoint PPT Presentation

unsupervised learning k means
SMART_READER_LITE
LIVE PREVIEW

Unsupervised Learning, K Means March 12, 2020 Data Science CSCI - - PowerPoint PPT Presentation

Unsupervised Learning, K Means March 12, 2020 Data Science CSCI 1951A Brown University Instructor: Ellie Pavlick HTAs: Josh Levin, Diane Mutako, Sol Zitter 1 Announcements Here we go! Get cozy..PJs, coffee-in-hand, ready to talk ML :)


slide-1
SLIDE 1

Unsupervised Learning, K Means

March 12, 2020 Data Science CSCI 1951A Brown University Instructor: Ellie Pavlick HTAs: Josh Levin, Diane Mutako, Sol Zitter

1

slide-2
SLIDE 2

Announcements

  • Here we go! Get cozy..PJs, coffee-in-hand, ready

to talk ML :)

  • Use the “raise hand” feature (under “participants”)
  • I’ll scroll through periodically and see if here are

any questions; if I call on you, unmute yourself

2

slide-3
SLIDE 3

How’s everyone feeling? <3 (a) Super! (b) Kinda freaked out but healthy (c) A little sick (d) Very sick, very scared

3

slide-4
SLIDE 4

Today

  • Gradient Descent
  • Supervised vs. Unsupervised Learning
  • K-Means and EM

4

slide-5
SLIDE 5

Training with Gradient Descent

=

n

X

i=1

(Yi − ˆ Y )2

<latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit>

minimize

5

slide-6
SLIDE 6

Training with Gradient Descent

=

n

X

i=1

(Yi − ˆ Y )2

<latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit>

minimize

6

slide-7
SLIDE 7

Training with Gradient Descent

=

n

X

i=1

(Yi − ˆ Y )2

<latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit>

minimize

7

slide-8
SLIDE 8

Training with Gradient Descent

=

n

X

i=1

(Yi − ˆ Y )2

<latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit>

minimize Cov(X, Y ) V ar(X)

<latexit sha1_base64="WPNm1YuVK0X591E2DeKZG6XKYA=">ACB3icbVBNS8NAEN3Urxq/oh4FWSxC1ISEfQiFHvxWMG2kTaUzXbTLt1swu6mUEJuXvwrXjwo4tW/4M1/47bNQVsfDzem2Fmnh8zKpVtfxuFldW19Y3iprm1vbO7Z+0ftGSUCEyaOGKRcH0kCaOcNBVjLixICj0GWn7o/rUb4+JkDTi92oSEy9EA04DipHSUs869uE17AYC4dSsR+OyewYfKlqtpAou5WsZ5Xsqj0DXCZOTkogR6NnfX7EU5CwhVmSMqOY8fKS5FQFDOSmd1EkhjhERqQjqYchUR6eyPDJ5qpQ+DSOjiCs7U3xMpCqWchL7uDJEaykVvKv7ndRIVXHkp5XGiCMfzRUHCoIrgNBTYp4JgxSaICyovhXiIdKhKB2dqUNwFl9eJq3zqmNXnbuLUu0mj6MIjsAJKAMHXIauAUN0AQYPIJn8ArejCfjxXg3PuatBSOfOQR/YHz+ADg1lvo=</latexit><latexit sha1_base64="WPNm1YuVK0X591E2DeKZG6XKYA=">ACB3icbVBNS8NAEN3Urxq/oh4FWSxC1ISEfQiFHvxWMG2kTaUzXbTLt1swu6mUEJuXvwrXjwo4tW/4M1/47bNQVsfDzem2Fmnh8zKpVtfxuFldW19Y3iprm1vbO7Z+0ftGSUCEyaOGKRcH0kCaOcNBVjLixICj0GWn7o/rUb4+JkDTi92oSEy9EA04DipHSUs869uE17AYC4dSsR+OyewYfKlqtpAou5WsZ5Xsqj0DXCZOTkogR6NnfX7EU5CwhVmSMqOY8fKS5FQFDOSmd1EkhjhERqQjqYchUR6eyPDJ5qpQ+DSOjiCs7U3xMpCqWchL7uDJEaykVvKv7ndRIVXHkp5XGiCMfzRUHCoIrgNBTYp4JgxSaICyovhXiIdKhKB2dqUNwFl9eJq3zqmNXnbuLUu0mj6MIjsAJKAMHXIauAUN0AQYPIJn8ArejCfjxXg3PuatBSOfOQR/YHz+ADg1lvo=</latexit><latexit sha1_base64="WPNm1YuVK0X591E2DeKZG6XKYA=">ACB3icbVBNS8NAEN3Urxq/oh4FWSxC1ISEfQiFHvxWMG2kTaUzXbTLt1swu6mUEJuXvwrXjwo4tW/4M1/47bNQVsfDzem2Fmnh8zKpVtfxuFldW19Y3iprm1vbO7Z+0ftGSUCEyaOGKRcH0kCaOcNBVjLixICj0GWn7o/rUb4+JkDTi92oSEy9EA04DipHSUs869uE17AYC4dSsR+OyewYfKlqtpAou5WsZ5Xsqj0DXCZOTkogR6NnfX7EU5CwhVmSMqOY8fKS5FQFDOSmd1EkhjhERqQjqYchUR6eyPDJ5qpQ+DSOjiCs7U3xMpCqWchL7uDJEaykVvKv7ndRIVXHkp5XGiCMfzRUHCoIrgNBTYp4JgxSaICyovhXiIdKhKB2dqUNwFl9eJq3zqmNXnbuLUu0mj6MIjsAJKAMHXIauAUN0AQYPIJn8ArejCfjxXg3PuatBSOfOQR/YHz+ADg1lvo=</latexit><latexit sha1_base64="WPNm1YuVK0X591E2DeKZG6XKYA=">ACB3icbVBNS8NAEN3Urxq/oh4FWSxC1ISEfQiFHvxWMG2kTaUzXbTLt1swu6mUEJuXvwrXjwo4tW/4M1/47bNQVsfDzem2Fmnh8zKpVtfxuFldW19Y3iprm1vbO7Z+0ftGSUCEyaOGKRcH0kCaOcNBVjLixICj0GWn7o/rUb4+JkDTi92oSEy9EA04DipHSUs869uE17AYC4dSsR+OyewYfKlqtpAou5WsZ5Xsqj0DXCZOTkogR6NnfX7EU5CwhVmSMqOY8fKS5FQFDOSmd1EkhjhERqQjqYchUR6eyPDJ5qpQ+DSOjiCs7U3xMpCqWchL7uDJEaykVvKv7ndRIVXHkp5XGiCMfzRUHCoIrgNBTYp4JgxSaICyovhXiIdKhKB2dqUNwFl9eJq3zqmNXnbuLUu0mj6MIjsAJKAMHXIauAUN0AQYPIJn8ArejCfjxXg3PuatBSOfOQR/YHz+ADg1lvo=</latexit>

m =

= ¯ Y − m ¯ X

<latexit sha1_base64="hwHBCHxlXmNAb1i+1nKjhGPDZ7s=">ACAHicbZDLSgMxFIbP1Fut1EXLtwEi+DGMiOCboSiG5cV7EXaoWTSTBuaZIYkI5RhNr6KGxeKuPUx3Pk2peFtv4Q+PjPOZycP0w408bzvp3C0vLK6lpxvbSxubW94+7uNXScKkLrJOaxaoVYU84krRtmOG0limIRctoMhzfjevORKs1ieW9GCQ0E7ksWMYKNtbruQYiuUCfEKnvI0SkSU27lXbfsVbyJ0CL4MyjDTLWu+9XpxSQVBrCsdZt30tMkGFlGOE0L3VSTRNMhrhP2xYlFlQH2eSAHB1bp4eiWNknDZq4vycyLQeidB2CmwGer42Nv+rtVMTXQYZk0lqCTRVHKkYnROA3UY4oSw0cWMFHM/hWRAVaYGJtZyYbgz5+8CI2ziu9V/LvzcvV6FkcRDuEITsCHC6jCLdSgDgRyeIZXeHOenBfn3fmYthac2cw+/JHz+QOthJUt</latexit><latexit sha1_base64="hwHBCHxlXmNAb1i+1nKjhGPDZ7s=">ACAHicbZDLSgMxFIbP1Fut1EXLtwEi+DGMiOCboSiG5cV7EXaoWTSTBuaZIYkI5RhNr6KGxeKuPUx3Pk2peFtv4Q+PjPOZycP0w408bzvp3C0vLK6lpxvbSxubW94+7uNXScKkLrJOaxaoVYU84krRtmOG0limIRctoMhzfjevORKs1ieW9GCQ0E7ksWMYKNtbruQYiuUCfEKnvI0SkSU27lXbfsVbyJ0CL4MyjDTLWu+9XpxSQVBrCsdZt30tMkGFlGOE0L3VSTRNMhrhP2xYlFlQH2eSAHB1bp4eiWNknDZq4vycyLQeidB2CmwGer42Nv+rtVMTXQYZk0lqCTRVHKkYnROA3UY4oSw0cWMFHM/hWRAVaYGJtZyYbgz5+8CI2ziu9V/LvzcvV6FkcRDuEITsCHC6jCLdSgDgRyeIZXeHOenBfn3fmYthac2cw+/JHz+QOthJUt</latexit><latexit sha1_base64="hwHBCHxlXmNAb1i+1nKjhGPDZ7s=">ACAHicbZDLSgMxFIbP1Fut1EXLtwEi+DGMiOCboSiG5cV7EXaoWTSTBuaZIYkI5RhNr6KGxeKuPUx3Pk2peFtv4Q+PjPOZycP0w408bzvp3C0vLK6lpxvbSxubW94+7uNXScKkLrJOaxaoVYU84krRtmOG0limIRctoMhzfjevORKs1ieW9GCQ0E7ksWMYKNtbruQYiuUCfEKnvI0SkSU27lXbfsVbyJ0CL4MyjDTLWu+9XpxSQVBrCsdZt30tMkGFlGOE0L3VSTRNMhrhP2xYlFlQH2eSAHB1bp4eiWNknDZq4vycyLQeidB2CmwGer42Nv+rtVMTXQYZk0lqCTRVHKkYnROA3UY4oSw0cWMFHM/hWRAVaYGJtZyYbgz5+8CI2ziu9V/LvzcvV6FkcRDuEITsCHC6jCLdSgDgRyeIZXeHOenBfn3fmYthac2cw+/JHz+QOthJUt</latexit><latexit sha1_base64="hwHBCHxlXmNAb1i+1nKjhGPDZ7s=">ACAHicbZDLSgMxFIbP1Fut1EXLtwEi+DGMiOCboSiG5cV7EXaoWTSTBuaZIYkI5RhNr6KGxeKuPUx3Pk2peFtv4Q+PjPOZycP0w408bzvp3C0vLK6lpxvbSxubW94+7uNXScKkLrJOaxaoVYU84krRtmOG0limIRctoMhzfjevORKs1ieW9GCQ0E7ksWMYKNtbruQYiuUCfEKnvI0SkSU27lXbfsVbyJ0CL4MyjDTLWu+9XpxSQVBrCsdZt30tMkGFlGOE0L3VSTRNMhrhP2xYlFlQH2eSAHB1bp4eiWNknDZq4vycyLQeidB2CmwGer42Nv+rtVMTXQYZk0lqCTRVHKkYnROA3UY4oSw0cWMFHM/hWRAVaYGJtZyYbgz5+8CI2ziu9V/LvzcvV6FkcRDuEITsCHC6jCLdSgDgRyeIZXeHOenBfn3fmYthac2cw+/JHz+QOthJUt</latexit>

b =

8

slide-9
SLIDE 9

Training with Gradient Descent

=

n

X

i=1

(Yi − ˆ Y )2

<latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit>

minimize

9

slide-10
SLIDE 10

Training with Gradient Descent

=

n

X

i=1

(Yi − ˆ Y )2

<latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit>

minimize

∂Q ∂m =

n

X

i=1

−2Xi(Yi − b − mXi) = 0

<latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit><latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit><latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit><latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit>

10

slide-11
SLIDE 11

Training with Gradient Descent

=

n

X

i=1

(Yi − ˆ Y )2

<latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit><latexit sha1_base64="r/S0kziZA4N57u3eiAn1RfFZ4w=">ACDHicbVDLSsNAFJ3UV62vqks3g0WoC0tSBN0Uim5ctmBfNGmYTCft0MkzEyEvIBbvwVNy4UcesHuPNvnLZaOuBgcM53LnHi9iVCrT/DZya+sbm1v57cLO7t7+QfHwqC3DWGDSwiELRdDkjDKSUtRxUg3EgQFHiMdb3I78zsPREga8ns1jYgToBGnPsVIacktlpqwBm0ZB25Ca1Y6SHha7rkUXkB7jFTS8HVZ0yK+YcJVYGSmBDA23+GUPQxwHhCvMkJR9y4yUkyChKGYkLdixJBHCEzQifU05Coh0kvkxKTzTyhD6odCPKzhXf08kKJByGng6GSA1lsveTPzP68fKv3YSyqNYEY4Xi/yYQRXCWTNwSAXBik01QVhQ/VeIx0grHR/BV2CtXzyKmlXK5ZsZqXpfpNVkcenIBTUAYWuAJ1cAcaoAUweATP4BW8GU/Gi/FufCyiOSObOQZ/YHz+ADvfmdM=</latexit>

minimize

∂Q ∂m =

n

X

i=1

−2Xi(Yi − b − mXi) = 0

<latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit><latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit><latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit><latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit>

11

slide-12
SLIDE 12

Training with Gradient Descent

https://independentseminarblog.com/2018/01/12/moving-below-the-surface-3-gradient-descent-william/ 12

slide-13
SLIDE 13

Training with Gradient Descent

https://independentseminarblog.com/2018/01/12/moving-below-the-surface-3-gradient-descent-william/

∂Q ∂b =

n

X

i=1

−2(Yi − mXi − b) = 0

<latexit sha1_base64="9cIJCpfvS+XsZ9OkPHz9MbmKXfE=">ACLnicbVBdS8MwFE3n15xfVR9CQ5hPjaIeiLIrg4wZuTtZa0ix1YUlaklQYpb/IF/+KPgq4qs/w3QO1OmBkM593LvPWHCqNKO82yVZmbn5hfKi5Wl5ZXVNXt9o6PiVGLSxjGLZTdEijAqSFtTzUg3kQTxkJHLcHha+Je3RCoaiws9SojP0Y2gEcVIGymwz7xIpx5CZKaIgZb+TcPc3gEPZXyIKNHbn6diRzuNWpXAYV7kHfHX7hrapzArjp1Zwz4l7gTUgUTNAP70evHOVEaMyQUj3XSbSfFYMxI3nFSxVJEB6iG9IzVCBOlJ+Nz83hjlH6MIqleULDsfqzI0NcqRE36+9wpAdq2ivE/7xeqNDP6MiSTUR+GtQlDKoY1hkB/tUEqzZyBCEJTW7QjxAJj9tEq6YENzpk/+STqPuOnW3tV89PpnEUQZbYBvUgAsOwDE4B03QBhjcgQfwAl6te+vJerPev0pL1qRnE/yC9fEJufumUQ=</latexit><latexit sha1_base64="9cIJCpfvS+XsZ9OkPHz9MbmKXfE=">ACLnicbVBdS8MwFE3n15xfVR9CQ5hPjaIeiLIrg4wZuTtZa0ix1YUlaklQYpb/IF/+KPgq4qs/w3QO1OmBkM593LvPWHCqNKO82yVZmbn5hfKi5Wl5ZXVNXt9o6PiVGLSxjGLZTdEijAqSFtTzUg3kQTxkJHLcHha+Je3RCoaiws9SojP0Y2gEcVIGymwz7xIpx5CZKaIgZb+TcPc3gEPZXyIKNHbn6diRzuNWpXAYV7kHfHX7hrapzArjp1Zwz4l7gTUgUTNAP70evHOVEaMyQUj3XSbSfFYMxI3nFSxVJEB6iG9IzVCBOlJ+Nz83hjlH6MIqleULDsfqzI0NcqRE36+9wpAdq2ivE/7xeqNDP6MiSTUR+GtQlDKoY1hkB/tUEqzZyBCEJTW7QjxAJj9tEq6YENzpk/+STqPuOnW3tV89PpnEUQZbYBvUgAsOwDE4B03QBhjcgQfwAl6te+vJerPev0pL1qRnE/yC9fEJufumUQ=</latexit><latexit sha1_base64="9cIJCpfvS+XsZ9OkPHz9MbmKXfE=">ACLnicbVBdS8MwFE3n15xfVR9CQ5hPjaIeiLIrg4wZuTtZa0ix1YUlaklQYpb/IF/+KPgq4qs/w3QO1OmBkM593LvPWHCqNKO82yVZmbn5hfKi5Wl5ZXVNXt9o6PiVGLSxjGLZTdEijAqSFtTzUg3kQTxkJHLcHha+Je3RCoaiws9SojP0Y2gEcVIGymwz7xIpx5CZKaIgZb+TcPc3gEPZXyIKNHbn6diRzuNWpXAYV7kHfHX7hrapzArjp1Zwz4l7gTUgUTNAP70evHOVEaMyQUj3XSbSfFYMxI3nFSxVJEB6iG9IzVCBOlJ+Nz83hjlH6MIqleULDsfqzI0NcqRE36+9wpAdq2ivE/7xeqNDP6MiSTUR+GtQlDKoY1hkB/tUEqzZyBCEJTW7QjxAJj9tEq6YENzpk/+STqPuOnW3tV89PpnEUQZbYBvUgAsOwDE4B03QBhjcgQfwAl6te+vJerPev0pL1qRnE/yC9fEJufumUQ=</latexit><latexit sha1_base64="9cIJCpfvS+XsZ9OkPHz9MbmKXfE=">ACLnicbVBdS8MwFE3n15xfVR9CQ5hPjaIeiLIrg4wZuTtZa0ix1YUlaklQYpb/IF/+KPgq4qs/w3QO1OmBkM593LvPWHCqNKO82yVZmbn5hfKi5Wl5ZXVNXt9o6PiVGLSxjGLZTdEijAqSFtTzUg3kQTxkJHLcHha+Je3RCoaiws9SojP0Y2gEcVIGymwz7xIpx5CZKaIgZb+TcPc3gEPZXyIKNHbn6diRzuNWpXAYV7kHfHX7hrapzArjp1Zwz4l7gTUgUTNAP70evHOVEaMyQUj3XSbSfFYMxI3nFSxVJEB6iG9IzVCBOlJ+Nz83hjlH6MIqleULDsfqzI0NcqRE36+9wpAdq2ivE/7xeqNDP6MiSTUR+GtQlDKoY1hkB/tUEqzZyBCEJTW7QjxAJj9tEq6YENzpk/+STqPuOnW3tV89PpnEUQZbYBvUgAsOwDE4B03QBhjcgQfwAl6te+vJerPev0pL1qRnE/yC9fEJufumUQ=</latexit>

Q =

n

X

i=1

(Yi − (mXi + b))2

<latexit sha1_base64="q3o/vVOTz7GUx+RiBjGcRGq1f0o=">ACD3icbVDLSgMxFM3UV62vUZdugkVpEctMEXRTKLpx2YJ9SB9DJk3b0CQzJBmhDP0DN/6KGxeKuHXrzr8xbWehrQcu93DOvST3+CGjSjvOt5VaWV1b30hvZra2d3b37P2DugoiUkNByQTR8pwqgNU01I81QEsR9Rhr+6GbqNx6IVDQd3ockg5HA0H7FCNtJM8+rcISbKuIezEtuZNuLCa5e4/Cc5jTdPoJ/Pd4uenXUKzgxwmbgJyYIEFc/+avcCHEiNGZIqZbrhLoTI6kpZmSaUeKhAiP0IC0DBWIE9WJZ/dM4IlRerAfSFNCw5n6eyNGXKkx980kR3qoFr2p+J/XinT/qhNTEUaCDx/qB8xqAM4DQf2qCRYs7EhCEtq/grxEmEtYkwY0JwF09eJvViwXUKbvUiW75O4kiDI3AMcsAFl6AMbkEF1AGj+AZvI368l6sd6tj/loykp2DsEfWJ8/aLmZsg=</latexit><latexit sha1_base64="q3o/vVOTz7GUx+RiBjGcRGq1f0o=">ACD3icbVDLSgMxFM3UV62vUZdugkVpEctMEXRTKLpx2YJ9SB9DJk3b0CQzJBmhDP0DN/6KGxeKuHXrzr8xbWehrQcu93DOvST3+CGjSjvOt5VaWV1b30hvZra2d3b37P2DugoiUkNByQTR8pwqgNU01I81QEsR9Rhr+6GbqNx6IVDQd3ockg5HA0H7FCNtJM8+rcISbKuIezEtuZNuLCa5e4/Cc5jTdPoJ/Pd4uenXUKzgxwmbgJyYIEFc/+avcCHEiNGZIqZbrhLoTI6kpZmSaUeKhAiP0IC0DBWIE9WJZ/dM4IlRerAfSFNCw5n6eyNGXKkx980kR3qoFr2p+J/XinT/qhNTEUaCDx/qB8xqAM4DQf2qCRYs7EhCEtq/grxEmEtYkwY0JwF09eJvViwXUKbvUiW75O4kiDI3AMcsAFl6AMbkEF1AGj+AZvI368l6sd6tj/loykp2DsEfWJ8/aLmZsg=</latexit><latexit sha1_base64="q3o/vVOTz7GUx+RiBjGcRGq1f0o=">ACD3icbVDLSgMxFM3UV62vUZdugkVpEctMEXRTKLpx2YJ9SB9DJk3b0CQzJBmhDP0DN/6KGxeKuHXrzr8xbWehrQcu93DOvST3+CGjSjvOt5VaWV1b30hvZra2d3b37P2DugoiUkNByQTR8pwqgNU01I81QEsR9Rhr+6GbqNx6IVDQd3ockg5HA0H7FCNtJM8+rcISbKuIezEtuZNuLCa5e4/Cc5jTdPoJ/Pd4uenXUKzgxwmbgJyYIEFc/+avcCHEiNGZIqZbrhLoTI6kpZmSaUeKhAiP0IC0DBWIE9WJZ/dM4IlRerAfSFNCw5n6eyNGXKkx980kR3qoFr2p+J/XinT/qhNTEUaCDx/qB8xqAM4DQf2qCRYs7EhCEtq/grxEmEtYkwY0JwF09eJvViwXUKbvUiW75O4kiDI3AMcsAFl6AMbkEF1AGj+AZvI368l6sd6tj/loykp2DsEfWJ8/aLmZsg=</latexit><latexit sha1_base64="q3o/vVOTz7GUx+RiBjGcRGq1f0o=">ACD3icbVDLSgMxFM3UV62vUZdugkVpEctMEXRTKLpx2YJ9SB9DJk3b0CQzJBmhDP0DN/6KGxeKuHXrzr8xbWehrQcu93DOvST3+CGjSjvOt5VaWV1b30hvZra2d3b37P2DugoiUkNByQTR8pwqgNU01I81QEsR9Rhr+6GbqNx6IVDQd3ockg5HA0H7FCNtJM8+rcISbKuIezEtuZNuLCa5e4/Cc5jTdPoJ/Pd4uenXUKzgxwmbgJyYIEFc/+avcCHEiNGZIqZbrhLoTI6kpZmSaUeKhAiP0IC0DBWIE9WJZ/dM4IlRerAfSFNCw5n6eyNGXKkx980kR3qoFr2p+J/XinT/qhNTEUaCDx/qB8xqAM4DQf2qCRYs7EhCEtq/grxEmEtYkwY0JwF09eJvViwXUKbvUiW75O4kiDI3AMcsAFl6AMbkEF1AGj+AZvI368l6sd6tj/loykp2DsEfWJ8/aLmZsg=</latexit>

∂Q ∂m =

n

X

i=1

−2Xi(Yi − b − mXi) = 0

<latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit><latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit><latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit><latexit sha1_base64="XMJx4vTyfUZ86d4j9mLjvojhdw=">ACMXicbVDLSgMxFM34rPVdekmWIS6aJkpgm4KRTdtmAf0qlDJs20oUlmSDJCGeaX3Pgn4qYLRdz6E6YPUFsPXDg515u7vEjRpW27Ym1tr6xubWd2cnu7u0fHOaOjlsqjCUmTRyUHZ8pAijgjQ1Yx0IkQ9xlp+6Pbqd9+JFLRUNzpcUR6HA0EDShG2kheruYGEuHEjZDUFDHYSH84T2EFuirmXkIrTvqQiBQWyx2PFu49CovQN8XN8K02V4ub5fsGeAqcRYkDxaoe7kXtx/imBOhMUNKdR070r1kuhszkmbdWJEI4REakK6hAnGiesns4hSeG6UPg1CaEhrO1N8TCeJKjblvOjnSQ7XsTcX/vG6sg+teQkUayLwfFEQM6hDOI0P9qkWLOxIQhLav4K8RCZCLUJOWtCcJZPXiWtcsmxS07jMl+9WcSRAafgDBSA65AFdRAHTQBk/gFbyBd+vZmlgf1ue8dc1azJyAP7C+vgE9U6ea</latexit>

Helpful equations for following along in the jupyter notebook = Cov(X, Y ) V ar(X)

<latexit sha1_base64="WPNm1YuVK0X591E2DeKZG6XKYA=">ACB3icbVBNS8NAEN3Urxq/oh4FWSxC1ISEfQiFHvxWMG2kTaUzXbTLt1swu6mUEJuXvwrXjwo4tW/4M1/47bNQVsfDzem2Fmnh8zKpVtfxuFldW19Y3iprm1vbO7Z+0ftGSUCEyaOGKRcH0kCaOcNBVjLixICj0GWn7o/rUb4+JkDTi92oSEy9EA04DipHSUs869uE17AYC4dSsR+OyewYfKlqtpAou5WsZ5Xsqj0DXCZOTkogR6NnfX7EU5CwhVmSMqOY8fKS5FQFDOSmd1EkhjhERqQjqYchUR6eyPDJ5qpQ+DSOjiCs7U3xMpCqWchL7uDJEaykVvKv7ndRIVXHkp5XGiCMfzRUHCoIrgNBTYp4JgxSaICyovhXiIdKhKB2dqUNwFl9eJq3zqmNXnbuLUu0mj6MIjsAJKAMHXIauAUN0AQYPIJn8ArejCfjxXg3PuatBSOfOQR/YHz+ADg1lvo=</latexit><latexit sha1_base64="WPNm1YuVK0X591E2DeKZG6XKYA=">ACB3icbVBNS8NAEN3Urxq/oh4FWSxC1ISEfQiFHvxWMG2kTaUzXbTLt1swu6mUEJuXvwrXjwo4tW/4M1/47bNQVsfDzem2Fmnh8zKpVtfxuFldW19Y3iprm1vbO7Z+0ftGSUCEyaOGKRcH0kCaOcNBVjLixICj0GWn7o/rUb4+JkDTi92oSEy9EA04DipHSUs869uE17AYC4dSsR+OyewYfKlqtpAou5WsZ5Xsqj0DXCZOTkogR6NnfX7EU5CwhVmSMqOY8fKS5FQFDOSmd1EkhjhERqQjqYchUR6eyPDJ5qpQ+DSOjiCs7U3xMpCqWchL7uDJEaykVvKv7ndRIVXHkp5XGiCMfzRUHCoIrgNBTYp4JgxSaICyovhXiIdKhKB2dqUNwFl9eJq3zqmNXnbuLUu0mj6MIjsAJKAMHXIauAUN0AQYPIJn8ArejCfjxXg3PuatBSOfOQR/YHz+ADg1lvo=</latexit><latexit sha1_base64="WPNm1YuVK0X591E2DeKZG6XKYA=">ACB3icbVBNS8NAEN3Urxq/oh4FWSxC1ISEfQiFHvxWMG2kTaUzXbTLt1swu6mUEJuXvwrXjwo4tW/4M1/47bNQVsfDzem2Fmnh8zKpVtfxuFldW19Y3iprm1vbO7Z+0ftGSUCEyaOGKRcH0kCaOcNBVjLixICj0GWn7o/rUb4+JkDTi92oSEy9EA04DipHSUs869uE17AYC4dSsR+OyewYfKlqtpAou5WsZ5Xsqj0DXCZOTkogR6NnfX7EU5CwhVmSMqOY8fKS5FQFDOSmd1EkhjhERqQjqYchUR6eyPDJ5qpQ+DSOjiCs7U3xMpCqWchL7uDJEaykVvKv7ndRIVXHkp5XGiCMfzRUHCoIrgNBTYp4JgxSaICyovhXiIdKhKB2dqUNwFl9eJq3zqmNXnbuLUu0mj6MIjsAJKAMHXIauAUN0AQYPIJn8ArejCfjxXg3PuatBSOfOQR/YHz+ADg1lvo=</latexit><latexit sha1_base64="WPNm1YuVK0X591E2DeKZG6XKYA=">ACB3icbVBNS8NAEN3Urxq/oh4FWSxC1ISEfQiFHvxWMG2kTaUzXbTLt1swu6mUEJuXvwrXjwo4tW/4M1/47bNQVsfDzem2Fmnh8zKpVtfxuFldW19Y3iprm1vbO7Z+0ftGSUCEyaOGKRcH0kCaOcNBVjLixICj0GWn7o/rUb4+JkDTi92oSEy9EA04DipHSUs869uE17AYC4dSsR+OyewYfKlqtpAou5WsZ5Xsqj0DXCZOTkogR6NnfX7EU5CwhVmSMqOY8fKS5FQFDOSmd1EkhjhERqQjqYchUR6eyPDJ5qpQ+DSOjiCs7U3xMpCqWchL7uDJEaykVvKv7ndRIVXHkp5XGiCMfzRUHCoIrgNBTYp4JgxSaICyovhXiIdKhKB2dqUNwFl9eJq3zqmNXnbuLUu0mj6MIjsAJKAMHXIauAUN0AQYPIJn8ArejCfjxXg3PuatBSOfOQR/YHz+ADg1lvo=</latexit>

m =

<latexit sha1_base64="mxdaQB8GmfgkBu9RKA5ZbS6fR8=">ACkHicdVHdSsMwGE3r/ybeulNcAjbxUY7Bb1wOPVGvJrgdLJuJc1SDSZpSVJhlD6P7+Odb2O6daCbfhA4Od/5TpKTIGZUacf5suyl5ZXVtfWN0ubW9s5ueW/UWJxKSLIxbJXoAUYVSQrqakV4sCeIBI0/B203ef3onUtFIPOhxTAYcvQgaUoy0ofzyB4ct6IUS4dRTCfdT2nKzYSoyWH32ac+nsA69AMn0OTObWjavMuSwOdP0ck32j2E1NzOetToUhbowXjCtTlxr9Zlu2Mz8csVpOJOCi8AtQAU1fHLn94owgknQmOGlOq7TqwHKZKaYkaykpcoEiP8hl5I30CBOFGDdBJoBo8NM4JhJM0SGk7YnxMp4kqNeWCUHOlXNd/Lyb96/USH54OUijRODpQWHCoI5g/jtwRCXBmo0NQFhSc1eIX5EJU5s/LJkQ3PknL4LHZsN1Gu79aV9XcSxDg7BEagCF5yBNrgFHdAF2Nq2TqwLq2Xv2+f2pX01ldpWMXMAfpV9w3+Zca5</latexit><latexit sha1_base64="mxdaQB8GmfgkBu9RKA5ZbS6fR8=">ACkHicdVHdSsMwGE3r/ybeulNcAjbxUY7Bb1wOPVGvJrgdLJuJc1SDSZpSVJhlD6P7+Odb2O6daCbfhA4Od/5TpKTIGZUacf5suyl5ZXVtfWN0ubW9s5ueW/UWJxKSLIxbJXoAUYVSQrqakV4sCeIBI0/B203ef3onUtFIPOhxTAYcvQgaUoy0ofzyB4ct6IUS4dRTCfdT2nKzYSoyWH32ac+nsA69AMn0OTObWjavMuSwOdP0ck32j2E1NzOetToUhbowXjCtTlxr9Zlu2Mz8csVpOJOCi8AtQAU1fHLn94owgknQmOGlOq7TqwHKZKaYkaykpcoEiP8hl5I30CBOFGDdBJoBo8NM4JhJM0SGk7YnxMp4kqNeWCUHOlXNd/Lyb96/USH54OUijRODpQWHCoI5g/jtwRCXBmo0NQFhSc1eIX5EJU5s/LJkQ3PknL4LHZsN1Gu79aV9XcSxDg7BEagCF5yBNrgFHdAF2Nq2TqwLq2Xv2+f2pX01ldpWMXMAfpV9w3+Zca5</latexit><latexit sha1_base64="mxdaQB8GmfgkBu9RKA5ZbS6fR8=">ACkHicdVHdSsMwGE3r/ybeulNcAjbxUY7Bb1wOPVGvJrgdLJuJc1SDSZpSVJhlD6P7+Odb2O6daCbfhA4Od/5TpKTIGZUacf5suyl5ZXVtfWN0ubW9s5ueW/UWJxKSLIxbJXoAUYVSQrqakV4sCeIBI0/B203ef3onUtFIPOhxTAYcvQgaUoy0ofzyB4ct6IUS4dRTCfdT2nKzYSoyWH32ac+nsA69AMn0OTObWjavMuSwOdP0ck32j2E1NzOetToUhbowXjCtTlxr9Zlu2Mz8csVpOJOCi8AtQAU1fHLn94owgknQmOGlOq7TqwHKZKaYkaykpcoEiP8hl5I30CBOFGDdBJoBo8NM4JhJM0SGk7YnxMp4kqNeWCUHOlXNd/Lyb96/USH54OUijRODpQWHCoI5g/jtwRCXBmo0NQFhSc1eIX5EJU5s/LJkQ3PknL4LHZsN1Gu79aV9XcSxDg7BEagCF5yBNrgFHdAF2Nq2TqwLq2Xv2+f2pX01ldpWMXMAfpV9w3+Zca5</latexit><latexit sha1_base64="mxdaQB8GmfgkBu9RKA5ZbS6fR8=">ACkHicdVHdSsMwGE3r/ybeulNcAjbxUY7Bb1wOPVGvJrgdLJuJc1SDSZpSVJhlD6P7+Odb2O6daCbfhA4Od/5TpKTIGZUacf5suyl5ZXVtfWN0ubW9s5ueW/UWJxKSLIxbJXoAUYVSQrqakV4sCeIBI0/B203ef3onUtFIPOhxTAYcvQgaUoy0ofzyB4ct6IUS4dRTCfdT2nKzYSoyWH32ac+nsA69AMn0OTObWjavMuSwOdP0ck32j2E1NzOetToUhbowXjCtTlxr9Zlu2Mz8csVpOJOCi8AtQAU1fHLn94owgknQmOGlOq7TqwHKZKaYkaykpcoEiP8hl5I30CBOFGDdBJoBo8NM4JhJM0SGk7YnxMp4kqNeWCUHOlXNd/Lyb96/USH54OUijRODpQWHCoI5g/jtwRCXBmo0NQFhSc1eIX5EJU5s/LJkQ3PknL4LHZsN1Gu79aV9XcSxDg7BEagCF5yBNrgFHdAF2Nq2TqwLq2Xv2+f2pX01ldpWMXMAfpV9w3+Zca5</latexit>

b = ¯ Y − m ¯ X

<latexit sha1_base64="hwHBCHxlXmNAb1i+1nKjhGPDZ7s=">ACAHicbZDLSgMxFIbP1Fut1EXLtwEi+DGMiOCboSiG5cV7EXaoWTSTBuaZIYkI5RhNr6KGxeKuPUx3Pk2peFtv4Q+PjPOZycP0w408bzvp3C0vLK6lpxvbSxubW94+7uNXScKkLrJOaxaoVYU84krRtmOG0limIRctoMhzfjevORKs1ieW9GCQ0E7ksWMYKNtbruQYiuUCfEKnvI0SkSU27lXbfsVbyJ0CL4MyjDTLWu+9XpxSQVBrCsdZt30tMkGFlGOE0L3VSTRNMhrhP2xYlFlQH2eSAHB1bp4eiWNknDZq4vycyLQeidB2CmwGer42Nv+rtVMTXQYZk0lqCTRVHKkYnROA3UY4oSw0cWMFHM/hWRAVaYGJtZyYbgz5+8CI2ziu9V/LvzcvV6FkcRDuEITsCHC6jCLdSgDgRyeIZXeHOenBfn3fmYthac2cw+/JHz+QOthJUt</latexit><latexit sha1_base64="hwHBCHxlXmNAb1i+1nKjhGPDZ7s=">ACAHicbZDLSgMxFIbP1Fut1EXLtwEi+DGMiOCboSiG5cV7EXaoWTSTBuaZIYkI5RhNr6KGxeKuPUx3Pk2peFtv4Q+PjPOZycP0w408bzvp3C0vLK6lpxvbSxubW94+7uNXScKkLrJOaxaoVYU84krRtmOG0limIRctoMhzfjevORKs1ieW9GCQ0E7ksWMYKNtbruQYiuUCfEKnvI0SkSU27lXbfsVbyJ0CL4MyjDTLWu+9XpxSQVBrCsdZt30tMkGFlGOE0L3VSTRNMhrhP2xYlFlQH2eSAHB1bp4eiWNknDZq4vycyLQeidB2CmwGer42Nv+rtVMTXQYZk0lqCTRVHKkYnROA3UY4oSw0cWMFHM/hWRAVaYGJtZyYbgz5+8CI2ziu9V/LvzcvV6FkcRDuEITsCHC6jCLdSgDgRyeIZXeHOenBfn3fmYthac2cw+/JHz+QOthJUt</latexit><latexit sha1_base64="hwHBCHxlXmNAb1i+1nKjhGPDZ7s=">ACAHicbZDLSgMxFIbP1Fut1EXLtwEi+DGMiOCboSiG5cV7EXaoWTSTBuaZIYkI5RhNr6KGxeKuPUx3Pk2peFtv4Q+PjPOZycP0w408bzvp3C0vLK6lpxvbSxubW94+7uNXScKkLrJOaxaoVYU84krRtmOG0limIRctoMhzfjevORKs1ieW9GCQ0E7ksWMYKNtbruQYiuUCfEKnvI0SkSU27lXbfsVbyJ0CL4MyjDTLWu+9XpxSQVBrCsdZt30tMkGFlGOE0L3VSTRNMhrhP2xYlFlQH2eSAHB1bp4eiWNknDZq4vycyLQeidB2CmwGer42Nv+rtVMTXQYZk0lqCTRVHKkYnROA3UY4oSw0cWMFHM/hWRAVaYGJtZyYbgz5+8CI2ziu9V/LvzcvV6FkcRDuEITsCHC6jCLdSgDgRyeIZXeHOenBfn3fmYthac2cw+/JHz+QOthJUt</latexit><latexit sha1_base64="hwHBCHxlXmNAb1i+1nKjhGPDZ7s=">ACAHicbZDLSgMxFIbP1Fut1EXLtwEi+DGMiOCboSiG5cV7EXaoWTSTBuaZIYkI5RhNr6KGxeKuPUx3Pk2peFtv4Q+PjPOZycP0w408bzvp3C0vLK6lpxvbSxubW94+7uNXScKkLrJOaxaoVYU84krRtmOG0limIRctoMhzfjevORKs1ieW9GCQ0E7ksWMYKNtbruQYiuUCfEKnvI0SkSU27lXbfsVbyJ0CL4MyjDTLWu+9XpxSQVBrCsdZt30tMkGFlGOE0L3VSTRNMhrhP2xYlFlQH2eSAHB1bp4eiWNknDZq4vycyLQeidB2CmwGer42Nv+rtVMTXQYZk0lqCTRVHKkYnROA3UY4oSw0cWMFHM/hWRAVaYGJtZyYbgz5+8CI2ziu9V/LvzcvV6FkcRDuEITsCHC6jCLdSgDgRyeIZXeHOenBfn3fmYthac2cw+/JHz+QOthJUt</latexit>

13

slide-14
SLIDE 14

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

14

slide-15
SLIDE 15

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

15

slide-16
SLIDE 16

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

16

slide-17
SLIDE 17

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

17

slide-18
SLIDE 18

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

18

slide-19
SLIDE 19

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

19

slide-20
SLIDE 20

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

20

slide-21
SLIDE 21

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

Today

21

slide-22
SLIDE 22

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

Today

22

slide-23
SLIDE 23

Supervised vs. Unsupervised Learning

  • Supervised: Explicit data labels
  • Sentiment analysis—review text -> star ratings
  • Image tagging—image -> caption
  • Unsupervised: No explicit labels
  • Clustering—find groups similar customers
  • Dimensionality Reduction—find features that

differentiate individuals

Tuesday

23

slide-24
SLIDE 24

Oh you thought it was that simple? How cute…

  • Semi Supervised—Combining large amounts of

unlabelled with smaller amounts of labelled (pretraining)

  • Weakly/Distantly Supervised—using noisy labels or

partial labels (bootstrapping, automatically-labeled data)

  • Reinforcement Learning—label on the result of a

sequence of actions, but not on each action (games, robotics)

  • “Found” Data… (?)

24

slide-25
SLIDE 25

Oh you thought it was that simple? How cute…

  • Semi Supervised—Combining large amounts of

unlabelled with smaller amounts of labelled (pretraining)

  • Weakly/Distantly Supervised—using noisy labels or

partial labels (bootstrapping, automatically-labeled data)

  • Reinforcement Learning—label on the result of a

sequence of actions, but not on each action (games, robotics)

  • “Found” Data… (?)

25

slide-26
SLIDE 26

Oh you thought it was that simple? How cute…

  • Semi Supervised—Combining large amounts of

unlabelled with smaller amounts of labelled (pretraining)

  • Weakly/Distantly Supervised—using noisy labels or

partial labels (bootstrapping, automatically-labeled data)

  • Reinforcement Learning—label on the result of a

sequence of actions, but not on each action (games, robotics)

  • “Found” Data… (?)

26

slide-27
SLIDE 27

Oh you thought it was that simple? How cute…

  • Semi Supervised—Combining large amounts of

unlabelled with smaller amounts of labelled (pretraining)

  • Weakly/Distantly Supervised—using noisy labels or

partial labels (bootstrapping, automatically-labeled data)

  • Reinforcement Learning—label on the result of a

sequence of actions, but not on each action (games, robotics)

  • “Found” Data… (?)

27

slide-28
SLIDE 28

Oh you thought it was that simple? How cute…

  • Semi Supervised—Combining large amounts of

unlabelled with smaller amounts of labelled (pretraining)

  • Weakly/Distantly Supervised—using noisy labels or

partial labels (bootstrapping, automatically-labeled data)

  • Reinforcement Learning—label on the result of a

sequence of actions, but not on each action (games, robotics)

  • “Found” Data… (?)

28

slide-29
SLIDE 29

Unsupervised Learning

  • “Finding structure in data”
  • In data science, this is typically for “exploratory

analysis”. “What the $@%! is this data even?! Enlighten me.”

  • Or for preprocessing/featurizing—e.g. so you can use

article “topics” to predict clicks.

  • In ML, right now, used extensively for “pretraining” (e.g.

autoencoding, dimensionality reduction, language modeling*)

29

slide-30
SLIDE 30

Unsupervised Learning

  • “Finding structure in data” (vs. predicting labels)
  • In data science, this is typically for “exploratory

analysis”. “What the $@%! is this data even?! Enlighten me.”

  • Or for preprocessing/featurizing—e.g. so you can use

article “topics” to predict clicks.

  • In ML, right now, used extensively for “pretraining” (e.g.

autoencoding, dimensionality reduction, language modeling*)

30

slide-31
SLIDE 31

Unsupervised Learning

  • “Finding structure in data” (vs. predicting labels)
  • In data science, this is typically for “exploratory

analysis”. “What the $@%! is this data even?! Enlighten me.”

  • Or for preprocessing/featurizing—e.g. so you can use

article “topics” to predict clicks.

  • In ML, right now, used extensively for “pretraining” (e.g.

autoencoding, dimensionality reduction, language modeling*)

31

slide-32
SLIDE 32

Unsupervised Learning

  • “Finding structure in data” (vs. predicting labels)
  • In data science, this is typically for “exploratory

analysis”. “What the $@%! is this data even?! Enlighten me.”

  • Or for preprocessing/featurizing—e.g. so you can use

article “topics” to predict clicks.

  • In ML, right now, used extensively for “pretraining” (e.g.

autoencoding, dimensionality reduction, language modeling*)

32

slide-33
SLIDE 33

Unsupervised Learning

  • “Finding structure in data” (vs. predicting labels)
  • In data science, this is typically for “exploratory

analysis”. “What the $@%! is this data even?! Enlighten me.”

  • Or for preprocessing/featurizing—e.g. so you can use

article “topics” to predict clicks.

  • In ML, right now, used extensively for “pretraining” (e.g.

autoencoding, dimensionality reduction, language modeling*)

33

slide-34
SLIDE 34

Clicker Question!

34

slide-35
SLIDE 35

Clustering

Discussion Question! What is it good for…?

(…because those free-form answers were enlightening last time…)

35

slide-36
SLIDE 36

Clustering

  • Find groups of customers with similar tastes
  • Find topics within a set of news articles
  • Find genres within a library of music
  • Extrapolating—make predictions about your new

business based on behavior of similar old businesses

36

slide-37
SLIDE 37

Clustering

tempo harmonic complexity

37

slide-38
SLIDE 38

Clustering

tempo harmonic complexity

38

slide-39
SLIDE 39

K Means

define parameters: K, max_iter, min_diff iter = 0 change = inf means = [random() for _ in range(K)] while iter < max_iter and change > min_diff: update_assignments() compute_new_means() change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

39

slide-40
SLIDE 40

K Means

define parameters: K, max_iter, min_diff iter = 0 change = inf means = [random() for _ in range(K)] while iter < max_iter and change > min_diff: update_assignments() compute_new_means() change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

“Hyperparameters” (i.e. not model parameters)

40

slide-41
SLIDE 41

K Means

define parameters: K, max_iter, min_diff iter = 0 change = inf means = [random() for _ in range(K)] while iter < max_iter and change > min_diff: update_assignments() compute_new_means() change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

How many clusters we want to find

41

slide-42
SLIDE 42

K Means

define parameters: K, max_iter, min_diff iter = 0 change = inf means = [random() for _ in range(K)] while iter < max_iter and change > min_diff: update_assignments() compute_new_means() change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

When to quit. Things aren’ t changing,

  • r we have gotten bored.

42

slide-43
SLIDE 43

K Means

define parameters: K, max_iter, min_diff iter = 0 change = inf means = [random() for _ in range(K)] while iter < max_iter and change > min_diff: update_assignments() compute_new_means() change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

Randomly guess what the means are (lots of ways to do this)

43

slide-44
SLIDE 44

K Means

define parameters: K, max_iter, min_diff iter = 0 change = inf means = [random() for _ in range(K)] while iter < max_iter and change > min_diff: update_assignments() compute_new_means() change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

Repeat until your hyperparameters say to stop

44

slide-45
SLIDE 45

K Means

define parameters: K, max_iter, min_diff iter = 0 change = inf means = [random() for _ in range(K)] while iter < max_iter and change > min_diff: update_assignments() compute_new_means() change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

Assign each point to its closest mean

45

slide-46
SLIDE 46

K Means

define parameters: K, max_iter, min_diff iter = 0 change = inf means = [random() for _ in range(K)] while iter < max_iter and change > min_diff: update_assignments() compute_new_means() change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

Recompute the means to be the mean

  • f the points assigned to each cluster

46

slide-47
SLIDE 47

tempo harmonic complexity

K Means

47

slide-48
SLIDE 48

tempo harmonic complexity guess what the means are

X X

K Means

48

slide-49
SLIDE 49

tempo harmonic complexity Assign each point to closest mean

X X

K Means

49

slide-50
SLIDE 50

tempo harmonic complexity re-compute means to be center of clusters

X X

K Means

50

slide-51
SLIDE 51

tempo harmonic complexity

X X

Assign each point to closest mean

K Means

51

slide-52
SLIDE 52

tempo harmonic complexity

X X

re-compute means to be center of clusters

K Means

52

slide-53
SLIDE 53

tempo harmonic complexity

X X

Assign each point to closest mean

K Means

53

slide-54
SLIDE 54

tempo harmonic complexity

X X

re-compute means to be center of clusters

K Means

54

slide-55
SLIDE 55

tempo harmonic complexity

X X

Converged!

K Means

55

slide-56
SLIDE 56

Clicker Question!

56 56

slide-57
SLIDE 57

Clicker Question! (a)Number of clusters (b)Distance of points to their respective clusters (c) Distance between clusters (d)Probability of observed data

57

What is the “loss” that we are trying to minimize here?

57

slide-58
SLIDE 58

Clicker Question! (a)Number of clusters (b)Distance of points to their respective clusters (c) Distance between clusters (d)Probability of observed data

58

What is the “loss” that we are trying to minimize here?

58

slide-59
SLIDE 59

Clicker Question! (a)Number of clusters (b)Distance of points to their respective clusters (c) Distance between clusters (d)Probability of observed data

59

What is the “loss” that we are trying to minimize here?

This in just a few slides!

59

slide-60
SLIDE 60

Clicker(/Discussion) Question! (a)Yes (b)No (c) Sure, why not.

60

Is this a good objective?

60

slide-61
SLIDE 61

Clicker(/Discussion) Question! (a)Yes (b)No (c) Sure, why not.

61

Is this a good objective?

Potential problems? (Hint: hyperparameters, generalization…)

61

slide-62
SLIDE 62

How many clusters?

X X

mean dist to center 2 1 N 3 4 ……..

62

slide-63
SLIDE 63

How many clusters?

X X

mean dist to center 2 1 N 3 4 ……..

63

slide-64
SLIDE 64

How many clusters?

X

mean dist to center 2 1 N 3 4 ……..

64

slide-65
SLIDE 65

How many clusters?

X

mean dist to center

X X X X X X X X X X X X X

2 1 N 3 4 ……..

65

slide-66
SLIDE 66

How many clusters?

mean dist to center 2 1 N 3 4 …….. “Elbow Point”

X X

66

slide-67
SLIDE 67

How many clusters?

mean dist to center 2 1 N 3 4 …….. “Elbow Point”

X X

Other techniques:

  • Silhouette
  • Intuition/Divine Intervention
  • LGTM

67

slide-68
SLIDE 68

How many clusters?

mean dist to center 2 1 N 3 4 …….. “Elbow Point”

X X

Other techniques:

  • Silhouette
  • Intuition/Divine Intervention
  • LGTM

distance to own cluster distance to next best cluster

68

slide-69
SLIDE 69

69

slide-70
SLIDE 70

Expectation Maximization (EM)

70

slide-71
SLIDE 71

Expectation Maximization (EM)

define parameters: K, max_iter, min_diff iter = 0 change = inf means = [random() for _ in range(K)] while iter < max_iter and change > min_diff: update_assignments() compute_new_means() change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

71

slide-72
SLIDE 72

Expectation Maximization (EM)

define parameters: K, max_iter, min_diff iter = 0 change = inf randomly initialize params while not converged: data = estimate_likelihood(params) params = maximize_likelihood(data) change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

72

slide-73
SLIDE 73

Expectation Maximization (EM)

define parameters: K, max_iter, min_diff iter = 0 change = inf randomly initialize params while not converged: data = estimate_likelihood(params) params = maximize_likelihood(data) change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

E Step: estimate the likelihood of data under current parameter setting

73

slide-74
SLIDE 74

Expectation Maximization (EM)

define parameters: K, max_iter, min_diff iter = 0 change = inf randomly initialize params while not converged: data = estimate_likelihood(params) params = maximize_likelihood(data) change = max_i(dist(new_mean_i, old_mean_i)) iter += 1

M Step: adjust the the parameters so as to maximize the expectation of the data

74

slide-75
SLIDE 75

Expectation Maximization (EM)

tempo harmonic complexity

X X

E step: Assign each point to closest mean

http://www.dirkhovy.com/portfolio/papers/download/an_evening_with_EM.pdf

75

slide-76
SLIDE 76

tempo harmonic complexity

X X

M step: Compute means to be center of clusters

http://www.dirkhovy.com/portfolio/papers/download/an_evening_with_EM.pdf

Expectation Maximization (EM)

76

slide-77
SLIDE 77

tempo harmonic complexity

X X

EM -> “soft” K-Means, where points belong to a probability distribution over clusters…

http://www.dirkhovy.com/portfolio/papers/download/an_evening_with_EM.pdf

0.4 0.6

Expectation Maximization (EM)

77

slide-78
SLIDE 78

#tbt

  • Why was Heather

Locklear arrested? 


  • Why did the bystander

call emergency services? 


  • Where did the witness

see her acting abnormally?
 


Heather Locklear Arrested for driving under the influence of drugs The actress Heather Locklear, Amanda of the popular series Melrose Place, was arrested this weekend in Santa Barbara (California) after driving under the influence of drugs. A witness viewed her performing inappropriate maneuvers while trying to take her car out from a parking in Montecito, as revealed to People magazine by a spokesman for the Californian Highway Police. The witness stated that around 4.30pm Ms. Locklear "hit the accelerator very violently, making excessive noise while trying to take her car out from the parking with abrupt back and forth maneuvers. While reversing, she passed several times in front of his sunglasses." Shortly after, the witness, who, in a first time, apparently had not recognized the actress, saw Ms. Was arrested actress Heather Locklear because of the driving under the effect of an unknown medicine Driving while medicated The actress Heather Locklear that is known to the Amanda through the role from the series "Melrose Place" was arrested at this weekend in Santa Barbara (Californium) because of the driving under the effect of an unknown medicine. A female witness observed she attempted in quite strange way how to go from their parking space in Montecito, speaker of the traffic police of californium told the warehouse `People'. The female witness told in detail, that Locklear 'pressed `after 16:30 clock accelerator and a lot of noise did when she attempted to move their car towards behind or forward from the parking space, and when it went backwards, she pulled itself together unites Male at their sunglasses'. A little later the female witness that did probably There was a lot of noise In a parking lot

Second-Pass HIT Incentive Pay Statistical Models

Quality Control

Slide from crowdsourcing lecture

78

slide-79
SLIDE 79

#tbt

  • Why was Heather

Locklear arrested? 


  • Why did the bystander

call emergency services? 


  • Where did the witness

see her acting abnormally?
 


Heather Locklear Arrested for driving under the influence of drugs The actress Heather Locklear, Amanda of the popular series Melrose Place, was arrested this weekend in Santa Barbara (California) after driving under the influence of drugs. A witness viewed her performing inappropriate maneuvers while trying to take her car out from a parking in Montecito, as revealed to People magazine by a spokesman for the Californian Highway Police. The witness stated that around 4.30pm Ms. Locklear "hit the accelerator very violently, making excessive noise while trying to take her car out from the parking with abrupt back and forth maneuvers. While reversing, she passed several times in front of his sunglasses." Shortly after, the witness, who, in a first time, apparently had not recognized the actress, saw Ms. Was arrested actress Heather Locklear because of the driving under the effect of an unknown medicine Driving while medicated The actress Heather Locklear that is known to the Amanda through the role from the series "Melrose Place" was arrested at this weekend in Santa Barbara (Californium) because of the driving under the effect of an unknown medicine. A female witness observed she attempted in quite strange way how to go from their parking space in Montecito, speaker of the traffic police of californium told the warehouse `People'. The female witness told in detail, that Locklear 'pressed `after 16:30 clock accelerator and a lot of noise did when she attempted to move their car towards behind or forward from the parking space, and when it went backwards, she pulled itself together unites Male at their sunglasses'. A little later the female witness that did probably There was a lot of noise In a parking lot

Second-Pass HIT Incentive Pay Statistical Models

Quality Control

Slide from crowdsourcing lecture (That I don’ t think we actually covered, but its cool its fine…)

79

slide-80
SLIDE 80

worker1 worker2 worker3 worker4 worker5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam

Goal: Find “true” labels despite noisy annotations from workers…

80

slide-81
SLIDE 81

worker1 worker2 worker3 worker4 worker5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam

Goal: Find “true” labels despite noisy annotations from workers…

Easy! If you tell me how much to trust each worker, I can trivially compute labels

81

slide-82
SLIDE 82

worker1 worker2 worker3 worker4 worker5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam

Sure, just tell me the labels and I can easily figure out which workers to trust. Easy! If you tell me how much to trust each worker, I can trivially compute labels

Goal: Find “true” labels despite noisy annotations from workers…

82

slide-83
SLIDE 83

worker1 worker2 worker3 worker4 worker5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam

Sure, just tell me the labels and I can easily figure out which workers to trust. Easy! If you tell me how much to trust each worker, I can trivially compute labels

Goal: Find “true” labels despite noisy annotations from workers…

EM EVERYTHING!!!!

83

slide-84
SLIDE 84

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam ? ? not ? ? w2 spam not spam ? ? not ? ? w3 spam not spam ? ? not ? ? w4 spam not spam ? ? not ? ? w5 spam not spam ? ? not ? ?

spam not email1 ? ? email2 ? ? email3 ? ? email4 ? ? email5 ? ?

84

slide-85
SLIDE 85

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam ? ? not ? ? w2 spam not spam ? ? not ? ? w3 spam not spam ? ? not ? ? w4 spam not spam ? ? not ? ? w5 spam not spam ? ? not ? ?

spam not email1 ? ? email2 ? ? email3 ? ? email4 ? ? email5 ? ?

P(email1 is spam)

85

slide-86
SLIDE 86

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam ? ? not ? ? w2 spam not spam ? ? not ? ? w3 spam not spam ? ? not ? ? w4 spam not spam ? ? not ? ? w5 spam not spam ? ? not ? ?

spam not email1 ? ? email2 ? ? email3 ? ? email4 ? ? email5 ? ?

P(w1 says spam | not spam)

86

slide-87
SLIDE 87

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 1 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 ? ? email2 ? ? email3 ? ? email4 ? ? email5 ? ?

Assume all workers are perfect

87

slide-88
SLIDE 88

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 1 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 ? ? email2 ? ? email3 ? ? email4 ? ? email5 ? ?

Compute labels using majority vote

88

slide-89
SLIDE 89

Clicker Question!

89 89

slide-90
SLIDE 90

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 1 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 ? ? email2 ? ? email3 ? ? email4 ? ? email5 ? ?

Compute labels using majority vote

Clicker Question! (a)0.4, 0.6 (b)0.6, 0.4 (c) 0.8, 0.2 (d)1.0, 0.0

90

slide-91
SLIDE 91

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 1 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 ? ? email2 ? ? email3 ? ? email4 ? ? email5 ? ?

Compute labels using majority vote

Clicker Question! (a)0.4, 0.6 (b)0.6, 0.4 (c) 0.8, 0.2 (d)1.0, 0.0

91

slide-92
SLIDE 92

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 1 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 0.4 ? email2 ? ? email3 ? ? email4 ? ? email5 ? ?

Compute labels using majority vote

92

slide-93
SLIDE 93

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 1 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 0.4 0.6 email2 ? ? email3 ? ? email4 ? ? email5 ? ?

Compute labels using majority vote

93

slide-94
SLIDE 94

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 1 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Compute labels using majority vote

94

slide-95
SLIDE 95

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 1 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Assume these labels, and recompute confusion matrices

95

slide-96
SLIDE 96

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam not w2 spam not spam not w3 spam not spam not w4 spam not spam not w5 spam not spam not

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Assume these labels, and recompute confusion matrices

96

slide-97
SLIDE 97

Clicker Question!

97 97

slide-98
SLIDE 98

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam ? ? not w2 spam not spam not w3 spam not spam not w4 spam not spam not w5 spam not spam not

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Clicker Question! (a)0.4, 0.6 (b)0.6, 0.4 (c) 0.8, 0.2 (d)1.0, 0.0

Assume these labels, and recompute confusion matrices

98

slide-99
SLIDE 99

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam ? ? not w2 spam not spam not w3 spam not spam not w4 spam not spam not w5 spam not spam not

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Clicker Question! (a)0.4, 0.6 (b)0.6, 0.4 (c) 0.8, 0.2 (d)1.0, 0.0

Assume these labels, and recompute confusion matrices

99

slide-100
SLIDE 100

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Assume these labels, and recompute confusion matrices

100

slide-101
SLIDE 101

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Assume these labels, and recompute confusion matrices

101

slide-102
SLIDE 102

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 0.67 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Assume these labels, and recompute confusion matrices

102

slide-103
SLIDE 103

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 0.67 0.33 w2 spam not spam 1 not 1 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 1 not 1

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Assume these labels, and recompute confusion matrices

103

slide-104
SLIDE 104

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 0.67 0.33 w2 spam not spam 1 not 0.33 0.67 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 0.5 0.5 not 1

spam not email1 0.4 0.6 email2 1 email3 0.4 0.6 email4 0.8 0.2 email5 0.4 0.6

Assume these labels, and recompute confusion matrices

104

slide-105
SLIDE 105

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 0.67 0.33 w2 spam not spam 1 not 0.33 0.67 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 0.5 0.5 not 1

spam not email1 1.5 email2 email3 email4 email5

Recompute labels using (weighted) majority vote

105

slide-106
SLIDE 106

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 0.67 0.33 w2 spam not spam 1 not 0.33 0.67 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 0.5 0.5 not 1

spam not email1 1.5 4.34 email2 email3 email4 email5

Recompute labels using (weighted) majority vote

106

slide-107
SLIDE 107

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not 0.67 0.33 w2 spam not spam 1 not 0.33 0.67 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 0.5 0.5 not 1

spam not email1 0.26 0.74 email2 0.69 0.31 email3 0.29 0.71 email4 0.82 0.18 email5 0.26 0.74

Renormalize

107

slide-108
SLIDE 108

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam w1 spam not spam 1 not w2 spam not spam 1 not 0.33 0.67 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 0.5 0.5 not 1

spam not email1 0.26 0.74 email2 0.69 0.31 email3 0.29 0.71 email4 0.82 0.18 email5 0.26 0.74

Iterate until convergence!

108

slide-109
SLIDE 109

w1 w2 w3 w4 w5 email1 spam not not not spam email2 spam spam spam spam spam email3 not spam not not spam email4 spam spam spam spam not email5 spam not not not spam

spam not email1 0.26 0.74 email2 0.69 0.31 email3 0.29 0.71 email4 0.82 0.18 email5 0.26 0.74

(This example converges after 1 iteration)

w1 spam not spam 1 not 0.67 0.33 w2 spam not spam 1 not 0.33 0.67 w3 spam not spam 1 not 1 w4 spam not spam 1 not 1 w5 spam not spam 0.5 0.5 not 1

109

slide-110
SLIDE 110

iter == max_iter or change == min_diff

110