
BEYOND THE CLUSTER: WAN DATA REPLICATION WITH GRIDGAIN



  1. BEYOND THE CLUSTER: WAN DATA REPLICATION WITH GRIDGAIN YAKOV ZHDANOV

  2. WHO? Yakov Zhdanov: - GridGain’s Product Development VP - With GridGain since 2010 - Apache Ignite committer and PMC member - Passion for performance & scalability - Finding ways to make the product better - St. Petersburg, Russia

  7. PLAN 1) Why replicate? 2) How do DBs solve this? 3) Replication: Monolith vs Distributed 4) GridGain DR overview – roles, features, process 5) Future plans – Sync/Async TX replication

  8. WHY REPLICATE YOUR DATA?

  13. WHY REPLICATE?  Data security  Failover  Data warehousing  Load balancing  Increasing system capacity

  14. POSTGRESQL REPLICATION  PostgreSQL is an object-relational database management system (ORDBMS)  Pioneered many concepts  High maturity level  Open source and widely used

  22. POSTGRESQL REPLICATION  Shared disk storage  File system replication  Write-Ahead Log Shipping  Logical Replication  Trigger-Based Master-Standby Replication  Statement-Based Replication Middleware  Async Multimaster Replication  Sync Multimaster Replication https://www.postgresql.org/docs/10/static/different-replication-solutions.html

  23. REPLICATION IN DISTRIBUTED SYSTEMS
                                 | Monolith | Distributed
      Data security              | +        | ?
      Failover                   | +        | +
      Load balancing             | +        | ?/+
      Increasing system capacity | +        | ?
      Data warehousing           | +        | ?

  24. REPLICATION IN GRIDGAIN  Introduced in 2012/2013  Completely new feature  Required a lot of engineering effort  Required revisiting existing logic  Async KEY/VALUE mode available  Sync/Async TX replication under development https://docs.gridgain.com/docs/data-center-replication

  25. REPLICATION IN GRIDGAIN: ROLES • Sender cache • Sender hub • Receiver hub • Receiver cache https://docs.gridgain.com/docs/data-center-replication

  26. REPLICATION IN GRIDGAIN: FEATURES • Complex topologies (up to 32 datacenters) • Failover • Pluggable conflict resolution • Filtering • Pause/Resume • Full state transfer https://docs.gridgain.com/docs/data-center-replication
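
To make "pluggable conflict resolution" concrete, here is a minimal sketch of what a pluggable resolver could look like on the receiving side. It is not the GridGain API: `Entry`, `newest_wins`, `prefer_dc`, `ReceiverCache`, and the `dc_id` and `version` fields are all hypothetical names chosen for illustration. It only assumes that each replicated update carries the ID of the datacenter that produced it and a per-key version.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass(frozen=True)
class Entry:
    key: str
    value: str
    dc_id: int    # hypothetical: datacenter that produced the update
    version: int  # hypothetical: per-key version, higher means newer


# A resolver receives the entry already stored (or None) and the incoming
# replicated entry, and returns the one that should be kept.
Resolver = Callable[[Optional[Entry], Entry], Entry]


def newest_wins(existing: Optional[Entry], incoming: Entry) -> Entry:
    """Keep the higher version; break ties by the lower datacenter ID."""
    if existing is None or incoming.version > existing.version:
        return incoming
    if incoming.version == existing.version and incoming.dc_id < existing.dc_id:
        return incoming
    return existing


def prefer_dc(preferred_dc: int) -> Resolver:
    """Build a resolver in which one datacenter always wins conflicts."""
    def resolve(existing: Optional[Entry], incoming: Entry) -> Entry:
        if existing is None:
            return incoming
        if incoming.dc_id == preferred_dc and existing.dc_id != preferred_dc:
            return incoming
        if existing.dc_id == preferred_dc and incoming.dc_id != preferred_dc:
            return existing
        return newest_wins(existing, incoming)
    return resolve


class ReceiverCache:
    """Toy receiver cache that delegates every conflict to the plugged-in resolver."""

    def __init__(self, resolver: Resolver) -> None:
        self._resolver = resolver
        self._data: Dict[str, Entry] = {}

    def apply(self, incoming: Entry) -> None:
        self._data[incoming.key] = self._resolver(self._data.get(incoming.key), incoming)

    def get(self, key: str) -> Optional[str]:
        entry = self._data.get(key)
        return None if entry is None else entry.value
```

With the same two conflicting updates, `ReceiverCache(newest_wins)` keeps the higher version, while `ReceiverCache(prefer_dc(2))` keeps whatever datacenter 2 wrote. The replication pipeline itself does not change, only the policy object passed in.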

  27. REPLICATION IN GRIDGAIN: HOW IT WORKS

  28. REPLICATION IN GRIDGAIN: COMPLEX TOPOLOGIES

  29. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Node failure in sending topology

  31. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Sender hub(s) failure

  33. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Receiver hub(s) failure

  35. REPLICATION IN GRIDGAIN: WHAT CAN GO WRONG? Node failure in receiving topology

  37. REPLICATION IN GRIDGAIN: WHAT CAN BE BETTER? Batching on per-node basis vs per-partition basis
      Per-node batching:  More efficient from memory standpoint  Batches collected quickly
      Per-partition batching:  No need for additional processing on receiving side  Less contention – honors thread-per-partition model  Probably, higher GC pressure  But still expected to perform better
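
The difference between the two batching strategies can be sketched in a few lines. This is an illustration only, under stated assumptions: the partition count, the node names, and the modulo-based `partition_of` and `node_of` mappings are hypothetical stand-ins for a real affinity function, and none of the function names come from GridGain.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Update = Tuple[int, str]  # (key, value)

PARTITIONS = 4                 # hypothetical partition count
NODES = ["node-a", "node-b"]   # hypothetical sending-cluster nodes


def partition_of(key: int) -> int:
    """Hypothetical affinity function: key -> partition."""
    return key % PARTITIONS


def node_of(partition: int) -> str:
    """Hypothetical assignment: partition -> primary node."""
    return NODES[partition % len(NODES)]


def batch_per_node(updates: List[Update]) -> Dict[str, List[Update]]:
    """One batch per sending node: few, large batches that fill quickly."""
    batches: Dict[str, List[Update]] = defaultdict(list)
    for key, value in updates:
        batches[node_of(partition_of(key))].append((key, value))
    return dict(batches)


def batch_per_partition(updates: List[Update]) -> Dict[int, List[Update]]:
    """One batch per partition: more, smaller batches, already split for the receiver."""
    batches: Dict[int, List[Update]] = defaultdict(list)
    for key, value in updates:
        batches[partition_of(key)].append((key, value))
    return dict(batches)


def split_on_receiver(node_batch: List[Update]) -> Dict[int, List[Update]]:
    """Extra receiver-side step that only a per-node batch needs:
    regroup its updates by partition before applying them."""
    return batch_per_partition(node_batch)
```

For six updates with keys 0 to 5, per-node batching yields two batches and per-partition batching yields four. The receiver has to run `split_on_receiver` on each per-node batch to arrive at the same per-partition grouping, which is the "additional processing on receiving side" that per-partition batching avoids, at the cost of keeping more batch objects alive.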

  38. REPLICATION IN GRIDGAIN: FUTURE PLANS TX Replication

  39. REPLICATION IN GRIDGAIN: FUTURE PLANS TX Replication – over stretched cluster

  40. REPLICATION IN GRIDGAIN: FUTURE PLANS TX replication mechanism

  41. REPLICATION IN GRIDGAIN: FUTURE PLANS TX Replication – sync modes  Strict SYNC mode: Main primary node (PN) → Sender → Receiver → Stand-In PN → Receiver → Sender → Main PN  Merciful SYNC mode: Main PN → Sender → Receiver → Receiver's WAL → Sender → Main PN  ASYNC mode: Main PN → Sender → Sender WAL → Main PN
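
The three acknowledgement paths can be compared by counting how many hops, and how many WAN crossings, a commit waits for before the main primary node is acknowledged. The sketch below encodes the paths exactly as listed on the slide. The `SITE` mapping is an assumption: it places the sender and its WAL in the source datacenter and the receiver, its WAL, and the stand-in primary node in the target datacenter. The helper names are hypothetical.

```python
from typing import Dict, List

# Acknowledgement paths as listed on the slide (PN = primary node).
PATHS: Dict[str, List[str]] = {
    "strict_sync": ["Main PN", "Sender", "Receiver", "Stand-In PN",
                    "Receiver", "Sender", "Main PN"],
    "merciful_sync": ["Main PN", "Sender", "Receiver", "Receiver WAL",
                      "Sender", "Main PN"],
    "async": ["Main PN", "Sender", "Sender WAL", "Main PN"],
}

# Assumption: which datacenter each component lives in.
SITE: Dict[str, str] = {
    "Main PN": "source",
    "Sender": "source",
    "Sender WAL": "source",
    "Receiver": "target",
    "Receiver WAL": "target",
    "Stand-In PN": "target",
}


def hops(mode: str) -> int:
    """Number of hand-offs before the main PN gets its acknowledgement."""
    return len(PATHS[mode]) - 1


def wan_crossings(mode: str) -> int:
    """Number of hand-offs that cross between the two datacenters."""
    path = PATHS[mode]
    return sum(1 for a, b in zip(path, path[1:]) if SITE[a] != SITE[b])


if __name__ == "__main__":
    for mode in PATHS:
        print(mode, "hops:", hops(mode), "WAN crossings:", wan_crossings(mode))
```

Under these assumptions both SYNC modes wait for a full WAN round trip (two crossings), with the strict mode adding the stand-in primary node before the acknowledgement returns. The ASYNC mode is acknowledged as soon as the update reaches the sender's WAL, with no WAN crossing on the commit path.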

  45. LESSONS LEARNED  Is replication able to solve your problem?  Pick proper settings: sync/async, physical vs logical changes.  Be aware of internals – know what makes it work.  Make sure to test, tune and monitor.

  46. CONTACTS yzhdanov@gridgain.com http://ignite.apache.org dev@ignite.apache.org user@ignite.apache.org

  47. QUESTIONS? ANY QUESTIONS?
