PGCon-2020 Online 2020-05-26 – PowerPoint PPT Presentation



SLIDE 1

PGCon-2020 Online 2020-05-26

SLIDE 2

  • Database systems:
    • 2002-2005:
    • since 2005:
  • Worked on XML data type and functions (2005-2007)
  • Long-term community activist – #RuPostgres, 2000+ members
  • Conferences Program Committee
  • Current business:

etc.

SLIDE 3

  • — boost development of fast-growing PostgreSQL-based projects using thin cloning and a high level of automation

Clients and partners:

CHEWY.COM

SLIDE 4
  • Get an overview of numerous tools and methods in the Postgres ecosystem
  • Focus on remembering what can be done rather than how
  • Learn about a new methodology, “Seamless SQL Optimization”, to build a powerful and scalable optimization process in your organization
  • Bonus: PostgreSQL and tools developers might find some new (or old) ideas
SLIDE 5
  • SQL Performance analysis. Methodologies
  • Macro-analysis: tools, metrics, visualization
  • Switching from Macro- to Micro-analysis (and back)
  • Micro-analysis: tools, metrics, visualization
  • How do we optimize? Where?
  • 3 key principles of Seamless SQL Optimization methodology
SLIDE 6

bit.ly/pgcon2020

Just launch it, do not follow the steps (yet). Watch what I’m doing

SLIDE 7

(inherited from Brendan Gregg’s talks on Performance Analysis)

  • “Zero methodology”: lack of tools and analysis / optimization activities.
SLIDE 8

(inherited from Brendan Gregg’s talks on Performance Analysis)

  • “Zero methodology”: lack of tools and analysis / optimization activities.
  • “Street Light” anti-method:
  • Pick a query that is known to be slow (e.g., a random slow query from logs)
  • Use familiar tools and metrics, without understanding the whole picture
  • Avoid query analysis; focus on config tuning, scaling-in, or partitioning / sharding
SLIDE 9

(inherited from Brendan Gregg’s talks on Performance Analysis)

  • “Zero methodology”: lack of tools and analysis / optimization activities.
  • “Street Light” anti-method.
  • “Drunk man” anti-method. Tune random things. For example:
    • Increase shared_buffers. Or work_mem. Or random/seq_page_cost. Or effective_cache_size
    • Build indexes for all columns
    • Put VACUUM FULL in cron
    • Pick a random inefficient query and optimize it, then iterate
    • Use Google, StackOverflow, etc.

… until the problems go away

SLIDE 10

(inherited from Brendan Gregg’s talks on Performance Analysis)

  • “Zero methodology”: lack of tools and analysis / optimization activities.
  • “Street Light” anti-method.
  • “Drunk man” anti-method.
  • “Blame someone else” anti-method:
  • “Database is slow again!”
  • “It’s not database, it’s network!”
  • “Our configuration is fine, it’s AWS/GCP/… problem!”
SLIDE 11
  • USE (Utilization, Saturation, Errors) by Brendan Gregg
  • RED (Requests Rate, Errors, Duration) by Tom Wilkie
  • The Four Golden Signals (RED + Saturation) by Rob Ewaschuk
  • Problem statement method
  • Resource analysis
  • CPU and Off-CPU FlameGraph analysis (perf, eBPF)
  • Wait Events analysis
  • Transaction and lock analysis
  • Tuple and operation stats analysis
  • Workload analysis (pg_stat_statements, log-based)
  • Single query analysis (EXPLAIN, auto_explain). Variations:
  • Planner numbers only
  • Time-centric
  • Focused on cardinality/selectivity (row counts)
  • Buffers-centric
  • Database Experiments
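Several of these methods boil down to sampling system views. As an illustrative sketch (my example, not from the talk), Wait Events analysis can start with a query like this against pg_stat_activity (wait_event columns exist since Postgres 9.6); run it repeatedly, e.g. with `\watch 1` in psql, to approximate a sampling profiler:

```sql
-- One snapshot of wait events across active backends;
-- NULL wait_event means the backend is on CPU.
SELECT wait_event_type, wait_event, count(*) AS backends
FROM pg_stat_activity
WHERE state = 'active'
GROUP BY wait_event_type, wait_event
ORDER BY backends DESC;
```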
SLIDE 12
SLIDE 13

Macro-analysis:

  • analyze the workload as a whole
  • split it into segments (e.g., group queries by removing parameters)
  • find “heavy” segments (“heavy” may have various meanings here)
  • apply macro- or micro-optimization to speed up the whole workload or a significant part of it
  • pro step: ensure that no part of the workload has slowed down

Micro-analysis:

  • analyze a single query, ignoring all others – run EXPLAIN to obtain the plan
  • find bottlenecks: inefficient nodes in the plan
  • apply micro- or macro-optimization to improve performance
  • pro step: ensure that no other queries have slowed down
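The “group queries by removing parameters” step can be sketched in a few lines. This toy normalizer is my illustration, not a tool from the talk; real tools rely on pg_stat_statements’ queryid or libpg_query fingerprinting, which parse SQL properly instead of using regexes:

```python
import re

def normalize_query(sql: str) -> str:
    """Collapse queries that differ only in literal parameters into one
    group (a toy stand-in for pg_stat_statements' queryid grouping)."""
    s = re.sub(r"'(?:[^']|'')*'", "?", sql)       # string literals -> ?
    s = re.sub(r"\b\d+(?:\.\d+)?\b", "?", s)      # numeric literals -> ?
    s = re.sub(r"\s+", " ", s).strip().lower()    # normalize whitespace and case
    return s

# Two parameter-only variants land in the same workload segment:
a = normalize_query("SELECT * FROM users WHERE id = 42")
b = normalize_query("SELECT * FROM users WHERE id = 777")
assert a == b
print(a)  # select * from users where id = ?
```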
SLIDE 14

MACRO-ANALYSIS: pgwatch2, postgres-checkup
MICRO-ANALYSIS: PEV2, EXPLAIN (BUFFERS, ANALYZE)

SLIDE 15
SLIDE 16
SLIDE 17


SLIDE 18


SLIDE 19


SLIDE 20


SLIDE 21
SLIDE 22
  • pg_stat_activity
  • pid, query (usually truncated to the default of 1024 bytes), …, xact_start, query_start, state_change, …, state, …, wait_event, wait_event_type, …

  • pg_locks
  • pg_stat_statements
  • queryid, query (entire text w/o params), calls, total_time, mean_time, …, shared_blk_***, …
  • Non-standard: pg_stat_kcache, pg_qualstats, pg_wait_sampling
  • PostgreSQL logs
  • Slow queries:
  • pid, query (full), duration (if exceeds log_min_duration_statement), ...
  • Blocked queries, with blockers (when blocking state exceeds deadlock_timeout)
  • Canceled queries (deadlocks, statement_timeout, idle_in_transaction_session_timeout)
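The log-based sources above are controlled by a handful of settings. A minimal postgresql.conf sketch with illustrative values (my example, not recommendations from the talk; tune per workload):

```
log_min_duration_statement = 500   # ms; log statements slower than this
track_activity_query_size = 32768  # bytes of query text kept in pg_stat_activity
log_lock_waits = on                # log waits exceeding deadlock_timeout, with blockers
deadlock_timeout = 1s              # threshold for deadlock checks and lock-wait logging
```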
SLIDE 23

pg_stat_activity:

  Column           | Type
  -----------------+--------------------------
  datid            | oid
  datname          | name
  pid              | integer
  usename          | name
  application_name | text
  client_addr      | inet
  client_hostname  | text
  client_port      | integer
  backend_start    | timestamp with time zone
  xact_start       | timestamp with time zone
  query_start      | timestamp with time zone
  state_change     | timestamp with time zone
  wait_event_type  | text
  wait_event       | text
  state            | text
  backend_xid      | xid
  backend_xmin     | xid
  query            | text
  backend_type     | text

pg_stat_statements (extension, in core):

  Column              | Type
  --------------------+------------------
  userid              | oid
  dbid                | oid
  queryid             | bigint
  query               | text
  calls               | bigint
  total_time          | double precision
  min_time            | double precision
  max_time            | double precision
  mean_time           | double precision
  stddev_time         | double precision
  rows                | bigint
  shared_blks_hit     | bigint
  shared_blks_read    | bigint
  shared_blks_dirtied | bigint
  shared_blks_written | bigint
  local_blks_hit      | bigint
  local_blks_read     | bigint
  local_blks_dirtied  | bigint
  local_blks_written  | bigint
  temp_blks_read      | bigint
  temp_blks_written   | bigint
  blk_read_time       | double precision
  blk_write_time      | double precision

SLIDE 24


PostgreSQL 13 – more columns (committed!):

  plans
  {total,min,max,mean,stddev}_plan_time
  {total,min,max,mean,stddev}_exec_time
  wal_records, wal_fpi, wal_bytes

3rd-party extensions:

  PoWA team:
    pg_stat_kcache – disk- and CPU-level stats
    pg_qualstats – WHERE and JOIN predicate stats
  PostgresPro (included in PoWA monitoring):
    pg_wait_sampling – sampling stats of wait events
    pgsentinel – sampling of active session history

SLIDE 25

pg_top pgCenter

SLIDE 26
SLIDE 27

https://github.com/akardapolov/ASH-Viewer https://github.com/dbacvetkov/PASH-Viewer

SLIDE 28

https://github.com/postgrespro/pg_wait_sampling https://powa.readthedocs.io/en/latest/

SLIDE 29

bit.ly/pgcon2020

Perform steps 1-2

SLIDE 30

total_time

SLIDE 31

total_time calls

SLIDE 32

total_time calls mean_time

SLIDE 33

total_time calls mean_time

shared_blks_read
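Putting these metrics together, a typical macro-analysis query over pg_stat_statements might look like the following sketch (my example; note that in Postgres 13 total_time/mean_time were split into *_plan_time and *_exec_time):

```sql
-- Top "heavy" query groups; swap the ORDER BY column for calls,
-- mean_time, or shared_blks_read to change the meaning of "heavy".
SELECT queryid,
       calls,
       total_time,
       mean_time,
       shared_blks_read
FROM pg_stat_statements
ORDER BY total_time DESC
LIMIT 20;
```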

SLIDE 34
SLIDE 35

postgres-checkup https://gitlab.com/postgres-ai/postgres-checkup

SLIDE 36

Two snapshots are needed to generate this report

SLIDE 37

The 1st “subrow” shows absolute values from pg_stat_statements & pg_stat_kcache

SLIDE 38

The 2nd “subrow” – “per second” The 3rd – “per call”

SLIDE 39

Finally, the 4th “subrow”: relative “contribution” of this query group

SLIDE 40
SLIDE 41

Every PostgreSQL monitoring tool must have query analysis.
Preferably in both forms: graphs (historical data) and tables (detailed view of all metrics).
None of the current products are perfect, but they are developing quickly.
#RuPostgres PostgreSQL monitoring review checklist: bit.ly/pgmonitoring

SLIDE 42

EXPLAIN shows only what the planner thinks about the query.
EXPLAIN ANALYZE executes the query and shows details.
Consider always using EXPLAIN (ANALYZE, BUFFERS). Why? Reasons for slowness:

  • Contention issues (may not be visible in micro-analysis)
  • A lot of work to do:
    • logical level: high number of “rows”
    • physical level: high number of buffers (Postgres 13 addition: amount of WAL generated)
  • Slow disk I/O (check with cold and warm caches)
SLIDE 43

test=# CREATE TABLE t AS SELECT * FROM generate_series(1, 100000);
CREATE
test=# \timing on
Timing is on.
test=# SELECT COUNT(*) FROM t;
 count
--------
 100000
(1 row)

Time: 8.211 ms
test=# EXPLAIN ANALYZE SELECT COUNT(*) FROM t;
                          QUERY PLAN
--------------------------------------------------------------
 Aggregate  (cost=1693.00..1693.01 rows=1 width=8) (actual time=18.753..18.753 rows=1 loops=1)
   ->  Seq Scan on t  (cost=0.00..1443.00 rows=100000 width=0) (actual time=0.011..10.451 rows=100000 loops=1)
 Planning Time: 0.034 ms
 Execution Time: 18.788 ms
(4 rows)

Time: 19.494 ms

https://www.postgresql.org/docs/current/using-explain.html#USING-EXPLAIN-CAVEATS

SLIDE 44

https://ongres.com/blog/explain_analyze_may_be_lying_to_you/

SLIDE 45

pgAdmin{3,4}

SLIDE 46

explain.depesz.com

SLIDE 47

FlameGraphs

SLIDE 48

PEV2 – explain.dalibo.com

SLIDE 49

explain.tensor.ru

SLIDE 50

bit.ly/pgcon2020

Perform steps 3-4

SLIDE 51
  • Just guess // anti-method
  • Use statistics to guess better:
    • use pg_stat_activity (but queries are truncated to track_activity_query_size bytes)
    • use slow query logging:
      • only queries longer than log_min_duration_statement
      • or auto_explain.log_min_duration // careful! BUFFERS with TIMING also adds overhead
    • PostgreSQL 12: log sampling:
      • log_transaction_sample_rate – (%) of all transactions
      • or auto_explain.sample_rate – (%) of all statements
  • If possible, consider:
    • increasing track_activity_query_size
    • using pg_qualstats or pg_wait_sampling

To match pg_stat_statements query groups with individual queries:

  • use libpg_query
  • a patch exists to add queryid to logs and pg_stat_activity
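As a sketch of the auto_explain options mentioned above (illustrative postgresql.conf values, my example; as the slide warns, log_analyze with buffers and timing adds overhead):

```
shared_preload_libraries = 'auto_explain'
auto_explain.log_min_duration = '500ms'  # log plans for statements slower than this
auto_explain.log_analyze = on            # actual row counts and timings
auto_explain.log_buffers = on            # buffer usage (needs log_analyze)
auto_explain.sample_rate = 0.1           # explain ~10% of statements
```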

SLIDE 52

bit.ly/pgcon2020

Perform steps 5-9

SLIDE 53
  • Run on production // anti-method
    • concurrent workload may affect observations
    • cannot check with cold caches
    • how to check a modifying query? (BEGIN; EXPLAIN …; ROLLBACK;) // anti-method ×2!
    • how to verify an index idea? Or a schema redesign? → impossible
  • Run on a clone:
    • easier in clouds
    • better if the clone was created at the physical level
    • for large databases, harder to iterate (very time- and resource-consuming)

→ Database Lab by Postgres.ai

https://postgres.ai/docs/database-lab/what_is_database_lab

SLIDE 54

https://www.katacoda.com/postgres-ai/

SLIDE 55

bit.ly/pgcon2020

Perform step 10

SLIDE 56

Locking issues: https://gitlab.com/snippets/1890428
Thin clones – demo of Joe Bot: https://postgres.ai/console/demo/joeinstances

SLIDE 57
SLIDE 58

@postgresmen

Database Lab, Joe Bot, postgres-checkup, and more: http://postgres.ai

SLIDE 59
SLIDE 60
SLIDE 61
SLIDE 62
SLIDE 63

Expose queryid in pg_stat_activity and log_line_prefix: https://commitfest.postgresql.org/28/2069/
Default GUCs for EXPLAIN // BUFFERS should be ON by default! https://commitfest.postgresql.org/28/2567/

SLIDE 64