DNSql Processing Massive DNS Collections

Apr 29, 2023

DNSql Processing Massive DNS Collections
Stephen Herwig, Dave Levin, Bobby Bhattacharjee, Neil Spring
University of Maryland, College Park

D-root
  Operated by UMD
  Anycast with 109 replicas
  Hourly sampled collection by replica
  Replicas: global, local

Problem
  Lots of data: ~140 GiB / day
  Serial processing is slow: ~8h to read a month’s worth of collection for the CPMD replica
  Diverse analyses: short-term and long-term; aggregation by source, replica, geography, topology

Approach
  pcap.gz -> dnsqlite3c -> sqlite3 -> MapReduce

  CREATE TABLE queryresp (
      id     INTEGER PRIMARY KEY,
      sec    INTEGER,
      usec   INTEGER,
      src    BLOB,
      sport  INTEGER,
      opcode INTEGER,
      qclass INTEGER,
      qtype  INTEGER,
      rcode  INTEGER,
      qname  TEXT
  );
  CREATE INDEX qname_index ON queryresp(qname);
  CREATE INDEX src_index ON queryresp(src);
  CREATE TABLE qps (sec INTEGER, n INTEGER);
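
To make the schema concrete, here is a minimal sketch in Python using the standard `sqlite3` module. The schema is taken from the slide. Everything else is an assumption made for illustration: the sample rows, the choice to store `src` as a 4-byte packed IPv4 address, and the way `qps` is filled from `queryresp`. The slides do not show how dnsqlite3c encodes addresses or populates `qps`.

```python
import sqlite3

# Schema as given on the slide.
SCHEMA = """
CREATE TABLE queryresp (
    id INTEGER PRIMARY KEY,
    sec INTEGER, usec INTEGER,
    src BLOB, sport INTEGER,
    opcode INTEGER, qclass INTEGER, qtype INTEGER, rcode INTEGER,
    qname TEXT
);
CREATE INDEX qname_index ON queryresp(qname);
CREATE INDEX src_index ON queryresp(src);
CREATE TABLE qps (sec INTEGER, n INTEGER);
"""


def build_demo_db():
    """Create an in-memory database with a few made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    # Hypothetical sample data; src is assumed to be a packed IPv4 address.
    rows = [
        (1425168000, 10, bytes([192, 0, 2, 1]), 5353, 0, 1, 1, 0, "example.com."),
        (1425168000, 20, bytes([192, 0, 2, 2]), 5354, 0, 1, 28, 3, "nonexistent.invalid."),
        (1425168001, 5, bytes([192, 0, 2, 1]), 5355, 0, 1, 1, 0, "example.org."),
    ]
    conn.executemany(
        "INSERT INTO queryresp"
        " (sec, usec, src, sport, opcode, qclass, qtype, rcode, qname)"
        " VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)",
        rows,
    )
    # Assumed derivation of the qps table: one row per second with a count.
    conn.execute(
        "INSERT INTO qps (sec, n) SELECT sec, COUNT(*) FROM queryresp GROUP BY sec"
    )
    conn.commit()
    return conn


conn = build_demo_db()
qps_rows = conn.execute("SELECT sec, n FROM qps ORDER BY sec").fetchall()
distinct_sources = conn.execute(
    "SELECT COUNT(DISTINCT src) FROM queryresp"
).fetchone()[0]
print(qps_rows, distinct_sources)
```

The `src_index` and `qname_index` indexes are what would let per-source and per-name queries avoid a full table scan on a real collection.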

Processing Speed (CPMD, March 2015)
  [Figure: bar chart. y-axis: resp (K) / sec. Series: zcat | tcpdump; dnsqlite3c aggregate.db; parallel dnsqlite3c. Groups: single pcap.gz; month of pcap.gzs.]

Database Size (CPMD, March 2015)
  [Figure: bar chart. y-axis: GiB. Series: month of pcaps; month of SQLite3 shards; aggregate.db. Groups: normal; gzip'd.]

Query Speed (CPMD, March 2015)
  [Figure: bar chart. y-axis: minutes. Series: aggregate.db; mapreduce. Query labels: distinct source IP count distinct QPS source IPs frequency hashed qnames.]
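
The comparison above sets a single aggregate database against a MapReduce pass over shards. The sketch below shows one way a distinct-source count could be split into a map step (one query per shard) and a reduce step (set union). It is an illustration under stated assumptions, not the authors' implementation: the shard file names, the trimmed-down table, the thread pool, and the helper names `make_shard`, `map_distinct_sources`, `reduce_union`, and `count_distinct_sources` are all hypothetical.

```python
import os
import sqlite3
import tempfile
from concurrent.futures import ThreadPoolExecutor


def make_shard(path, sources):
    """Create a tiny shard holding only the columns this example needs."""
    conn = sqlite3.connect(path)
    conn.execute("CREATE TABLE queryresp (id INTEGER PRIMARY KEY, src BLOB)")
    conn.executemany(
        "INSERT INTO queryresp (src) VALUES (?)", [(s,) for s in sources]
    )
    conn.commit()
    conn.close()


def map_distinct_sources(path):
    """Map step: return the set of distinct sources seen in one shard."""
    conn = sqlite3.connect(path)
    try:
        return {row[0] for row in conn.execute("SELECT DISTINCT src FROM queryresp")}
    finally:
        conn.close()


def reduce_union(partials):
    """Reduce step: merge per-shard sets so repeated sources count once."""
    merged = set()
    for partial in partials:
        merged |= partial
    return merged


def count_distinct_sources(shard_paths, workers=4):
    # Each worker opens its own connection, so no connection is shared
    # between threads.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(map_distinct_sources, shard_paths))
    return len(reduce_union(partials))


# Hypothetical data: three sources, one of which appears in both shards.
a, b, c = (bytes([192, 0, 2, i]) for i in (1, 2, 3))
with tempfile.TemporaryDirectory() as tmp:
    shard_paths = [os.path.join(tmp, f"shard{i}.db") for i in range(2)]
    make_shard(shard_paths[0], [a, b, a])
    make_shard(shard_paths[1], [b, c])
    total_sources = count_distinct_sources(shard_paths)

print(total_sources)
```

Summing per-shard `COUNT(DISTINCT src)` values would overcount any source that appears in more than one shard, which is why the reduce step merges sets rather than adding counts.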

Additional Data Sources
  [Figure: Percent of Queries to CPMD By Country (March 2015)]
  MaxMind GeoLite database
  7m query time

Per-Source Metrics
  466,021 unique sources
  1h 10m query time
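
The slide does not list which per-source metrics were computed. As a hedged illustration, the sketch below groups a trimmed-down `queryresp` table by `src` and derives three example metrics: query count, distinct query names, and NXDOMAIN responses (`rcode` 3 in the DNS protocol). The metric choice, the sample rows, and the variable names are assumptions.

```python
import sqlite3

# Only the columns this example needs; the full schema has more.
metrics_conn = sqlite3.connect(":memory:")
metrics_conn.execute(
    "CREATE TABLE queryresp ("
    " id INTEGER PRIMARY KEY, sec INTEGER, src BLOB, rcode INTEGER, qname TEXT)"
)
metrics_conn.execute("CREATE INDEX src_index ON queryresp(src)")

# Hypothetical sample data.
src_a = bytes([192, 0, 2, 1])
src_b = bytes([192, 0, 2, 2])
metrics_conn.executemany(
    "INSERT INTO queryresp (sec, src, rcode, qname) VALUES (?, ?, ?, ?)",
    [
        (1425168000, src_a, 0, "example.com."),
        (1425168001, src_a, 0, "example.com."),
        (1425168002, src_a, 3, "foo.invalid."),
        (1425168002, src_b, 0, "example.org."),
    ],
)

# One row per source. In SQLite, (rcode = 3) evaluates to 0 or 1, so SUM
# counts the NXDOMAIN responses for that source.
per_source = metrics_conn.execute(
    """
    SELECT src,
           COUNT(*)              AS n_queries,
           COUNT(DISTINCT qname) AS n_qnames,
           SUM(rcode = 3)        AS n_nxdomain
    FROM queryresp
    GROUP BY src
    ORDER BY n_queries DESC, src
    """
).fetchall()

for src, n_queries, n_qnames, n_nxdomain in per_source:
    print(".".join(str(octet) for octet in src), n_queries, n_qnames, n_nxdomain)
```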

Discussion
  Additional queries?
  Optimizations?
  Extension to non-root servers?
