my project matter
play

My Project Matter? 2 2 1 1 1 Ying Wang , Ming Wen, Zhenwei Liu, - PowerPoint PPT Presentation

Do the Dependency Conflicts in My Project Matter? 2 2 1 1 1 Ying Wang , Ming Wen, Zhenwei Liu, Rongxin Wu, Rui Wang, 1 1* 2* 1 Bo Yang, Hai Yu, Zhiliang Zhu and Shing-Chi Cheung 1. Northeastern University 2. The Hong Kong University


  1. Do the Dependency Conflicts in My Project Matter? 2 2 1 1 1 Ying Wang , Ming Wen, Zhenwei Liu, Rongxin Wu, Rui Wang, 1 1* 2* 1 Bo Yang, Hai Yu, Zhiliang Zhu and Shing-Chi Cheung 1. Northeastern University 2. The Hong Kong University of Science and Technology Ying ng Wang ng 2018-12 12-19 19

  2. Example xample 1 foo.class foo.class java –cp a.jar; b.jar … -1-

  3. Exam xample ple 2 lib1 lib2 -2-

  4. Observ Ob servat ations ions 2289 Popular Java projects 1457(64%) Projects contain the same library of different versions 1003(44%) Projects contain the duplicate classes in different libraries 954(42%) Projects contain both conflicting classes and libraries -3-

  5. -4-

  6. Example xample 3 176 downstream clients! 879 days! Maven-shade-plugin -5-

  7. Motivation ivation Dependency Conflict ( DC ) problem is very common in practice. Building tools do not differentiate benign from harmful (e.g., causing runtime exceptions) DC warnings. Most building tools do not guarantee loading the most appropriate class for the client project. -6-

  8. Our wo Ou r work rk Empirical study Manifestation Empirical study Patterns Automated diagnosis Fixing Patterns Detection Empirical study Assessing DC Evaluation severity levels Empirical study Effectiveness Usefulness -7-

  9. Emp mpir irical ical st stud udy--- --- Re Research search ques estio tions ns RQ1(Issue manifestation patterns): What are the common manifestations of DC issues? Are there patterns that can be extracted to enable automated detection of these problems? RQ2(Issue fixing patterns): How do developers fix DC issues in practice? Are there factors that affect developers’ choices of different fixing solutions? -8-

  10. Emp mpir irical ical st stud udy--- --- Data Data Co Colle lectio ction Java open source projects built by Maven from the Apache ecosystem are selected as the subjects for our empirical study, due to the following reasons: Key words: 1) “library”, “dependency” or “compatibility”, etc. 2) “conflict” or “ NoSuchMethodError ”, etc. 135 DC issues (128 of them have been fixed) -9-

  11. Emp mpir irical ical st stud udy- RQ1: RQ1: Iss ssue e man anife ifestatio station n pat atterns erns A. Conflicts in library versions C: 4% B. Conflicts in classes among libraries A: 29% C. Conflicts in classes between B: 67% host project and libraries -10-

  12. Emp mpir irical ical st stud udy- RQ1: RQ1: Iss ssue e man anife ifestatio station n pat atterns erns A. Conflicts in library versions (39 out of 135 issues) ⚫ If there are multiple versions of the same library, according to Maven’s nearest wins strategy , Maven chooses the version that appears at the nearest to the root (host project) of the dependency tree. If the host project references the NoClassDefFoundError features only defined in the System Failure shadowed library (i.e., Lib2 v2.0 ), a NoSuchMethodError runtime exception will occur. -11-

  13. Emp mpir irical ical st stud udy- RQ1: RQ1: Iss ssue e man anife ifestatio station n pat atterns erns B. Conflicts in classes among libraries (90 out of 135 issues) ⚫ Based on the Maven’s first declaration wins strategy , the duplicate classes within the first declared library (i.e., lib2) will shadow the ones included in the others (lib1). If the host project references the NoSuchMethodError features only defined in the System Failure shadowed classes (i.e., class A, B, C in Lib1), a runtime exception will occur. -12-

  14. Emp mpir irical ical st stud udy- RQ1: RQ1: Iss ssue e man anife ifestatio station n pat atterns erns C. Conflicts in classes between host project and libraries (6 out of 135 issues) ⚫ If the host project and its referenced library (i.e., Lib1) include duplicate classes (i.e., A, B and C), then only those included in the library (i.e., Lib1) will be included during the packaging process. The classes included in library NoSuchMethodError Lib1 shadowed those defined in System Failure the host project, which leaded to a runtime failure. -13-

  15. Emp mpir irical ical st stud udy- RQ1: RQ1: Iss ssue e man anife ifestatio station n pat atterns erns Loaded Referenced -14-

  16. Emp mpir irical ical st stud udy- RQ2: RQ2: Iss ssue e fixing xing pat atterns erns Maven-Shade-Plugin provides the capability to package the project in an Uber Jar , including its Pattern 1 : Shading the conflicting libraries third party libraries. It will also shade (i.e., rename) (25 out of 128 solutions) the packages of some of the libraries. Forcing a particular dependency order on the Pattern 2 : Adjusting the classpath order classpath is a strategy commonly used by of dependencies (42 out of 128 solutions) developers for fixing DC issues at a relatively low cost. #HDFS-10570 Hdfsproxy Netty 2.0 HDFS Hadoop Netty 2.8 -15-

  17. Emp mpir irical ical st stud udy- RQ2: RQ2: Iss ssue e fixing xing pat atterns erns Maven-Shade-Plugin provides the capability to package the project in an Uber Jar , including its Pattern 1 : Shading the conflicting libraries third party libraries. It will also shade (i.e., rename) (25 out of 128 solutions) the packages of some of the libraries. Pattern 2 : Adjusting the classpath order of Forcing a particular dependency order on the classpath is a strategy commonly used by dependencies (42 out of 128 solutions) developers for fixing DC issues at a relatively low cost. Pattern 3 : Harmonizing library versions Solutions of this pattern upgrade or downgrade (51 out of 128 solutions) some of the JARs to resolve the version inconsistencies. -16-

  18. Emp mpir irical ical st stud udy- RQ2: RQ2: Iss ssue e fixing xing pat atterns erns Maven-Shade-Plugin provides the capability to package the project in an Uber Jar , including its Pattern 1 : Shading the conflicting libraries third party libraries. It will also shade (i.e., rename) the packages of some of the libraries. (25 out of 128 solutions) Forcing a particular dependency order on the Pattern 2 : Adjusting the classpath order of classpath is a strategy commonly used by dependencies (42 out of 128 solutions) developers for fixing DC issues at a relatively low cost. Pattern 3 : Harmonizing library versions Solutions of this pattern upgrade or downgrade (51 out of 128 solutions) some of the JARs to resolve the version inconsistencies. Pattern 4 : Classloader customization This solution uses dynamic module system frameworks such as OSGI and Wildfly , to allow different versions of (5 out of 128 solutions) the same libraries or classes coexist in one project by creating multiple classloaders. Pattern 5 : Other workarounds (5 out of 128 solutions) The remaining issues are resolved in miscellaneous ways -17-

  19. De Depe penden ndency cy conflict nflict diag iagno nosis sis Detect Manifestaion dependency patterns conflict issues Maintenance efforts Assess their on fixing solutions severity levels -18-

  20. De Depe penden ndency cy conflict nflict diag iagno nosis sis Library Dependency Binary Code File Management Script 1 2 Identify Lib3 v1.0 Lib2 Lib2 Lib3 v1.0 Lib2 Lib2 Extract library Duplicate Lib4 Lib4 dependency Libraries or tree Lib3 v2.0 Lib5 Lib3 v2.0 Lib5 Classes 3 Analyze Relations Between Different Feature Set Loaded Shadowed Referenced 4 Lib3 v1.0 Lib3 v2.0 L3 L4 L2 L1 Assessing Warning Severity Levels -19-

  21. Eva valuat luation ion RQ3 (Effectiveness): How effective can Decca detect real DC issues and assess their severity levels? RQ4 (Usefulness): Can Decca detect unknown DC issues in real-world projects and facilitate developers in diagnosing them? -20-

  22. Eva valuat luation: ion: Effectiveness of Decca Subjects: A high quality dataset containing high-severity (i.e., Level 3 and 4 ) and low-severity (i.e., Level 1 and 2 ) DC issues. Assumption: Bugs are usually repaired within 2 years across different projects since they were introduced to the project True Positive (TP) : the conflict identified as a high-severity issue (i.e., Precision = TP/(TP + FP) Level 3 or Level 4 ) is a high-severity issue. Recall = TP/(TP + FN ) False Positive (FP) : the conflict identified as a high-severity issue (i.e., Level 3 or Level 4 ) is a low-severity issue. F-measure =2 × Precision × Recall True Negative (TN) : the conflict identified as a low-severity issue (i.e., /(Precision + Recall) Level 1 or Level 2 ) is a low-severity issue. False Negative (FN) : the conflict identified as a low-severity issue (i.e., Precision : 0.923, Recall : 0.766 and Level 1 or Level 2 ) is a high-severity issue. F - measure : 0.837 -21-

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend