0000000
play

0000000 ooo numerical data e.g alphabetic order names w grades - PDF document

Space efficient quantile selection Where U has order C an C U stream Input 9 0000000 ooo numerical data e.g alphabetic order names w grades allowed multiple passes median return the Goal minimum w 6 space passes a quantile queries


  1. Space efficient quantile selection Where U has order C an C U stream Input 9 0000000 ooo numerical data e.g alphabetic order names w grades allowed multiple passes median return the Goal minimum w 6 space passes a quantile queries generally more k element select rank kth largest

  2. Input I pass oilskin O n

  3. i iiiiiiw.am

  4. Passe Spacesort and select quick select random pivot

  5. Approximations and KEEN rank E given 0 param element k rank return En w Sampling median for elements sample b of sample return dn k rank for Deterministic

  6. Quartiles space efficient mergable E approximate quantile queries answer l elements S L E 9,692L ge intervals Ilg along w O h minimax By tracking specially assume Ge ran 149 rankCge 4 n J j g O

  7. E E Querying e y n O l I l I l I I k If EEK 2En kt2EnT for some Ica 9i then return 9 I I I l I I E 9 l I I l i Kt2en K K 2En t k how to such exists ensure 9

  8. iemma 9 7 ti E 9 width 2En E total an interval Ilg ten every query K contains Proofy two cases I 9 KEI Gi 9 for some I LL 9 it 1 3 E 9 K ICg fg I I k

  9. KE IG for i some Suppose done E Ik then if Icg 2g kt2E at else look 9 it

  10. suppose KotICg ti E I 5 3 I 3 I 1 T I I I i 0 In combined intervals the In cover pick 1 covering k inside must lie the intervals of one

  11. invariant consecutive two Key any 2En have width intervals E E APX quantile summary

  12. E APX quantile given two Merging E APX 2 streams want summaries over combinedstream over summary s www.mszrrrrhhmnz to get summary of combine want QQ to www.mtszrrrrhhnhrrnz s s a Q'to I Cgi 3 ai I 9 Cgi

  13. denote I'Gil I'Cg'd Q oh 9 I lgz Q Cg I s g I g m I Cgj bounds rank gj.hr rHSz let 9j C Q bound sits rank what goal 9J dit l 9 is 9 15,1 I 3 4 2 N rank S of 9J in so min I'Coptmin I Cgj set I Cgj I G'it t I Cg Max max

  14. Q gel 94 3 oh intervals an's w E APX need show to to show Q is 2Ers width property in Q's Take intervals two consecutive Two cases from diff elements sets elements from sets same

  15. diff sets A HAD a

  16. sets same

  17. that merging 2 QS's E APX This shows E APX combined of streams QS gives size

  18. Pruning Input O h too many E approximate quantile summary w points sparser summary that's still very good Goal

  19. O 3 4 15 G 4 4 e e e e e e Eta claim resulting quantile APX is Proof rank K suppose we query a f k

  20. Recap we can E APX quantile summaries combine to get E APX quantile summary of whole thing E APX quantile summary sparsify to w Kpoints tat quantile summary APX to address Remains all how make at to one

  21. what l if n all that's need claim I we APX so o o or o or o o o o o o k take µ login at root the O E approximate quartiles

  22. Space D D D D D D D D D D D D D D D root only keep summaries

  23. theorem 1 pass OC log4n E space deterministic stream quantile over E APX t dyadic intervals trick idea mergability slightly better

  24. s a a a a a o or o o o o o o og n u Ye points have first level contain D D D D D D D D D D D D D D D D D D D

  25. Theoremett 1 pass 0C logic Eh 1E space deterministic stream quantile over E APX better Even 2001T Greenwald Khanna space loafers more sophisticated quantile summary merging trick interval

  26. and other ranks the median Finding in passes p p 2 for simplicity Fix OC ftp.olyloycnD space goal k rank suppose we are querying build E APX 1st quantile summary pass Yin for VJ login 6K E space w Kt in K VF a b query O n rank 6 E Kt 2in E KE K 2rn Erankca

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend