Intelligent Software Development, Courtesy of Intelligent Software - PDF document

K1 Keynote 11/8/17 8:45 AM Intelligent Software Development, Courtesy of Intelligent Software Presented by: Stephen Frein Comcast Brought to you by: 350 Corporate Way, Suite 400, Orange Park, FL 32073 888 --- 268 --- 8770 ·· 904 --- 278 --- 0524 - info@techwell.com - https://www.techwell.com/

Stephen Frein Comcast Stephen Frein is a senior director of software engineering at Comcast. He previously managed high profile software projects for the U.S. Department of Defense and the U.S. Treasury. For two decades, he has been leading development and testing teams to questionable success by dint of accidents he cannot reliably replicate. As an adjunct professor at Drexel and Villanova, Stephen delivers soporific lectures on machine learning, database development, and technology management to frequently inattentive students. He has presented at previous TechWell events by sneaking into unused rooms and deceiving the unsuspecting. Stephen enjoys polluting the hive mind via TechBeacon and other industry publications with questionable editorial standards. Visit his poorly maintained vanity website, where he practices writing vapid, self-congratulatory bios.

11/8/17 Intelligent Software Development Courtesy of Intelligent Software Stephen Frein About Me Sr. Director of Software Engineering @ Comcast Adjunct @ Drexel & Villanova Universities Contributor to TechBeacon.com stephen_frein@cable.comcast.com www.frein.com 1

11/8/17 Takeaways Machine learning can help you build better software Not hard to get started 2

11/8/17 What is machine learning? When machines get better at tasks through experience. What is data mining? DANG! Subset of machine learning Generation of novel el ins insights ights by discovering previously unknown Funny hats decrease the patterns chances of finding gold by 50% Not writing standard SQL (or similar) queries 3

11/8/17 “Big data” not required Supervised learning: labeled examples Predictors (x) Target (y) Feature 1 Feature 2 Feature 3 Feature 4 Target AAA 38.54 1 0.37 Yes Model y = f(x) CCC 117.16 1 -1.21 No BBB 18.68 0 .349 Yes AAA 89.41 1 0.06 No 4

11/8/17 Supervised learning: model predictions Model y = f(x) New Data (x) Feature 1 Feature 2 Feature 3 Feature 4 Target CCC 24.14 0 1.04 Yes Prediction (y) AAA 192.23 1 -0.28 No BBB 84.01 0 .551 Yes Supervised learning: model interpretation Feature 1 = AAA: Prob No Feature 3 = 1 : Prob No Feature 4 > 0 : Prob Yes Interpretation not always feasible 5

11/8/17 Unsupervised learning: no labels, find patterns We do it all the time for others. 6

11/8/17 We rarely do it for ourselves. Problem: Missed Source Changes File 1 --function foo (float x){ ++function foo (int x){ … } Should be in sync File 2 float param = 7.0; foo(param); 7

11/8/17 Market basket analysis � (a.k.a. association rules) “People who buy waffles are three times more likely to buy syrup than the average shopper.” What is “Lift”? Sample Transactions Support : nails appear in 60% (3/5) of all transactions {hammer, nails} {hammer, nails, rope} Confidence : nails appear 67% {nails, ladder} (2/3) of the time when hammers do {ladder, rope} {screwdriver, Lift: nails are 11% ((67-60)/60) hammer} more likely to appear when hammers do Hammers make it 1.11 times as likely that nails appear 8

11/8/17 Treat code check-ins like shopping baskets? frein@ubuntu:~$ git commit -m “leaving for Orlando; good luck finding these bugs, suckers" Does the presence of some files make others more likely? Sa Sample le Analysis is Tomcat changes 9

11/8/17 Highest Lift Rules Lift {/ajp/AjpAprProcessor.java} => {/ajp/AjpProcessor.java} 93.87 {/ajp/AjpProcessor.java} => {/ajp/AjpAprProcessor.java} 93.87 {/http11/Http11NioProcessor.java} => {/http11/Http11Processor.java} 46.42 {/http11/Http11Processor.java} => {/http11/Http11NioProcessor.java} 46.42 {/http11/Http11Processor.java} => {/http11/Http11AprProcessor.java} 43.10 {/http11/Http11AprProcessor.java} => {/http11/Http11Processor.java} 43.10 If you change AjpProcessor.java, you may need to change AjpAprProcessor.java. Operatio ionaliz lize It CI Rep o Change Rules Rule Builde Warnings r 10

11/8/17 Just Use a Database? Could, but Hard Problem: Predicting Defects 11

11/8/17 Why try to predict defects? Test Effort Complexity Experience Exploration Targeting What would we do if we could predict defects? Peer Review Pairing 12

11/8/17 How can we predict defects? Code Measures Defect History Requirements Using words in stories Id Story Defect? 101 As a customer, I want to order doughnuts with sprinkles. Yes 102 As a customer, I want to pay with a credit card. No 103 As admin, I want to configure available doughnut types. Yes 104 As a customer, I want to order cupcakes with sprinkles. No 105 As a vendor, I want to submit a doughnut invoice. No 13

11/8/17 Sa Sample le Analysis is Re Reason to thin ink k this is will ill work 14

11/8/17 Bayes’ Theorem 𝒐𝒅𝒇 = 𝑸𝒔 𝑸𝒔𝒑𝒄𝒃𝒄𝒋𝒎𝒋𝒖𝒛𝑰𝒛𝒒𝒑𝒖𝒊𝒇𝒕𝒋 𝒕𝒋𝒕 ⁠ 𝑭𝒘𝒋𝒆𝒇𝒐𝒅 ) 𝑸𝒔 𝑸𝒔𝒑𝒄𝒃𝒄𝒎𝒋𝒎𝒖𝒛(𝑰𝒛𝒒𝒑𝒖𝒊𝒇𝒕𝒋 𝒕𝒋𝒕) ∗ 𝑸𝒔 𝑸𝒔𝒑𝒄𝒃𝒄𝒋𝒎𝒋𝒖𝒛 ( 𝑭𝒘𝒋𝒆𝒇𝒐𝒅 𝒐𝒅𝒇 | 𝑰𝒛𝒒𝒑𝒖𝒊𝒇𝒕𝒋 𝒕𝒋𝒕) 𝒕) /𝑸𝒔 𝑸𝒔𝒑𝒄𝒃𝒄𝒋𝒎𝒋 𝐮𝐳 𝐮𝐳( 𝑭𝒘𝒋𝒆𝒇𝒐𝒅 𝒐𝒅𝒇) Heavil ily used in in spam m fi filt lters rs Training and Testing X Y Training Model Y = f(X) 80% Predicted X 20% Data Test Inputs Compare 15

11/8/17 44 % (32/73) of predicted Results defects really were defects. 21% of the stories are I found 27% of the defects. associated with a defect. Understanding Factors with Decision Trees (# of hours) goes to a sub-tree TaskActualTotal TaskActualTotal > 28: [S1] > 28: [S1] TaskActualTotal TaskActualTotal <= 28: <= 28: :... :...TaskActualTotal TaskActualTotal <= 5.5: <= 5.5: No_Defect No_Defect (42.3) (42.3) TaskActualTotal TaskActualTotal > 5.5: > 5.5: :... :...DaysInProgress DaysInProgress <= 1: <= 1: No_Defect No_Defect (20.3) (20.3) DaysInProgress DaysInProgress > 1: > 1: :...DaysTillAcceptance :... DaysTillAcceptance <= 3: <= 3: Has_Defect Has_Defect (19.9/6) (19.9/6) DaysTillAcceptance DaysTillAcceptance > 3: [S2] > 3: [S2] 16

11/8/17 Other Things to Try Which test cases will find defects? Will we finish this story in one sprint? Is the application about to go down? Will this feature get used? Technology Choices 17

11/8/17 Data Science for Developers – TechBeacon.com 18

11/8/17 Takeaways Machine learning can help you build better software Not hard to get started 19

Intelligent Software Development, Courtesy of Intelligent Software - PDF document

K1 Keynote 11/8/17 8:45 AM Intelligent Software Development, Courtesy of Intelligent Software Presented by: Stephen Frein Comcast Brought to you by: 350 Corporate Way, Suite 400, Orange Park, FL 32073 888 --- 268 --- 8770 904 --- 278 ---

Carbon Capture, Utilization, and Storage Presentation to the USEA Courtesy NREL Courtesy GRC

Low Tide Courtesy: Interboro Rebuild by Design Team Courtesy: Interboro Rebuild by Design Team

business need? improve image courtesy of iStockphoto recurrent image courtesy of Microsoft

Courtesy: SCAPE / Landscape Architecture Rebuild by Design Team Courtesy: SCAPE / Landscape

Courtesy: MIT CAU + ZUS + URBANISTEN Rebuild by Design Team Courtesy: MIT CAU + ZUS + URBANISTEN

Stephenie Kramer Willamette Cultural Resources Associates, Ltd. Photo courtesy of DAHP Photos

The Branches of the Government Lanca Amankwaah Fourth Grade Mrs. Francolino Courtesy of

CSE 2221 Software I: Software Components and CSE 2231 Software II: Software Development and

Introduction to Software Testing Software Testing - Module 1 Part 1 The Software Engineering

Photo courtesy of Casbr(@flickr.com) - granted under creative commons licence - attribution Photo

Dr. Randy Lovelace and the Astronauts Loretta Hall Author NSS Space Ambassador 3:17 PM

The 4 th Industrial Revolution and you Paolo Gallo Courtesy of cargocollective Courtesy of

GOVERNMENT PROJECT By Christopher DiPoalo 4grade Mrs. Francolino Judicial Branch Courtesy of

The Athenians and their Gods Image courtesy of Jack Versloot on flickr. License CC BY. 1 Image

Master Theorem Carola Wenk Slides courtesy of Charles Leiserson with small Slides courtesy of

From Lincoln and the Jews , courtesy of Jonathan D. Sarna and Benjamin Shapell. From Lincoln and

A government for or by the citizens? Examples from Uppsala Maria Gardfjell Deputy Mayor of

Investor Presentation Baie-Comeau Silicon Project November, 2018 www.canadianmetalsinc.com

WARD KLEIN WARD KLEIN WARD KLEIN WARD KLEIN ti Chi f E Forward-Looking Statements The

B Y M I N I 901 | MNR MALIBU MUD 902 | MNR CALIFORNIA CARAMEL 903 | MNR ELM STORM 904 | MNR

Hold audience attention Why is it so hard to hold audience attention? Too many distractions A

User test Kim Sherwood, M. Sc. Pharm., Product Information Assessor Medical Products Agency,

Pain and Work: whats known Chronic Pain 1. 20% adult EU citizens 2. Major economic impact

HR Community of Practice April 16, 2020 2:00PM-3:00PM AGENDA HR updates Mark Schmelz,