Check PXC/Galera-specific replication latency
For example: pt-osc, pt-table-checksum. These tools check if any slaves are behind and meter themselves accordingly.
For Galera there are a few things that are important:
- Galera uses 'flow control' as a replication lag feedback loop. If the replication queue gets too large on any node, it will use flow control to slow down writes. This causes write-stalls (by design). These tools should avoid that.
- The default queue size (gcs.fc_limit - measured in pending transactions) is 16 (which changes a bit by default depending on how many nodes you have). This can be tune up to the several hundreds. Typically any queue sizes > 0 may indicate some amount of lag on the slaves.
There are several status variables that should be useful here:
- wsrep_flow_
- wsrep_flow_
- wsrep_flow_
- wsrep_local_
- wsrep_local_
Blueprint information
- Status:
- Not started
- Approver:
- None
- Priority:
- Medium
- Drafter:
- None
- Direction:
- Needs approval
- Assignee:
- None
- Definition:
- New
- Series goal:
- Accepted for 2.2
- Implementation:
- Not started
- Milestone target:
- None
- Started by
- Completed by