SLIDE 18 Conclusion
Par
titioning Ske w is a c halle nge for MapRe duc e - base d applic ations:
T
sity of Data- inte nsive applic ations Soc ia l Ne twork, Se a rc h e ng ine , Sc ie ntific Ana lysis , e tc
Par
titioning Ske w is due to two fac tor s: Sig nific a nt va ria nc e in inte rme dia te ke ys’ fre que nc ie s Sig nific a nt va ria nc e in inte rme dia te ke y’s distributions a mong the
diffe re nt da ta .
Our
solution is to e xte nd the L
- c ality c onc e pt to the r
e duc e phase Pa rtition the Ke ys a c c ording to the ir hig h fre que nc ie s F
a irne ss in da ta distribution a mong diffe re nt da ta node s
Up to 40% impr
- ve me nt using simple applic ation e xample !
F
utur e wor k
Apply L
E E N to diffe r e nt ke y and value s size
18