qian zhang jiyuan muhammad rohan miryung wang ali gulzar
play

Qian Zhang Jiyuan Muhammad Rohan Miryung Wang Ali Gulzar - PowerPoint PPT Presentation

Qian Zhang Jiyuan Muhammad Rohan Miryung Wang Ali Gulzar Padhye Kim ... val locations = sc.textFile("zipcode.csv") . map { s => val cols= s.split(",") (cols(0), cols(1)) } . filter { s => s._2.equals("New


  1. Qian Zhang Jiyuan Muhammad Rohan Miryung Wang Ali Gulzar Padhye Kim

  2. ... val locations = sc.textFile("zipcode.csv") . map { s => val cols= s.split(",") (cols(0), cols(1)) } . filter { s => s._2.equals("New York") } ...

  3. ... val locations = sc.textFile("zipcode.csv") .map {s => val cols = s.split(",") (cols(0), cols(1) } .filter {s => s._2 == "New York"} ...

  4. ... val locations = sc.textFile("zipcode.csv") .map {s => val cols = s.split(",") (cols(0), cols(1) } .filter {s => s._2 == "New York"} ...

  5. ... val locations = sc.textFile("zipcode.csv") .map {s => val cols = s.split(",") (cols(0), cols(1) } .filter {s => s._2 == "New York"} ... public class Map1 { static final Map1 apply(String line2) { String cols[]=line2.split(","); return new Map1(cols[0],cols[1]); }

  6. ... ... val locations = ArrayList <Map1> results1 =LoanSpec.map1 sc.textFile("zipcode.csv") (inputs); .map {s => ArrayList <Map1> results2 =LoanSpec.filter2 val cols = s.split(",") (results1) (cols(0), cols(1) } ... .filter {s => s._2 == "New York"} ... public class Map1 { public ArrayList <Map1> static final Map1 apply(String line2) map1( ArrayList < String > input){ { ArrayList <Map1> output = new ArrayList <>(); String cols[]=line2.split(","); for (String item: input){ return new Map1(cols[0],cols[1]); output.add(Map1.apply(item) );} } return output;}

  7. • val pair = data.filter{ data if (s._1 == 90024) A ; else B ; • } filter False True ... ... •

  8. • val pair = data.filter{ data if (s._1 == 90024) A ; else B ; • } filter False True ... ... •

  9. • integer[0-30] integer[0-30]

  10. • .collect().foreach(pri ntln) Division by zero str.split(“\t”)[1] str.split(“,”)[1] str.substring(1,0) If(age>10 && age<9) LeftOuterJoin (Value, Key) Spark word2vec one row join in spark

  11. • • • •

  12. AFL (9216M memory and 100s timeout) runs at an extremely low speed 9.68 execs_per_sec on average

  13. BigFuzz speeds up to 1477x times with framework abstraction

  14. BigFuzz provides up to a 3.71X improvement on code coverage

  15. BigFuzz achieves up to a 2.57X improvement on error detection

  16. In comparison to a symbolic execution based approach BigTest , BigFuzz detects 80.6% more injected errors Muhammad Ali Gulzar, Shaghayegh Mardani, Madanlal Musuvathi, and Miryung Kim. 2019. White-Box Testing of Big Data Analytics with Complex User-Defined Functions. In Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2019) .

  17. • • • • • •

  18. ● ● ○ ○ ○ 27

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend