Commit Graph

381 Commits

Author SHA1 Message Date
Ian Barwick
47a4b49890 Add "repmgr standby follow --upstream-node-id"
In an automatic failover situation, after a standby has been promoted
there's a risk the original primary may become available again before
"standby follow" is issued on another standby node, in which case "standby
follow" will reconnect to the original primary.

As the standby's repmgrd will have received a notification from the new
primary, it will know the primary's ID and can therefore explicitly
direct "standby follow" to follow that primary.
2017-09-04 09:11:59 +09:00
Ian Barwick
3aeceab081 "standby follow": add missing sleep() call when --wait specified 2017-09-02 13:11:03 +09:00
Ian Barwick
72e037d55b SCRIPTS_built also required for 9.4 2017-09-02 12:03:17 +09:00
Ian Barwick
4ba3e67e11 Hacky workaround for hacky workaround in 9.4
For some reason 9.4 doesn't install the repmgr/repmgrd binaries
with "SCRIPTS_built". "SCRIPTS" seems to work for both 9.4 and
later versions.
2017-09-02 11:54:07 +09:00
Ian Barwick
ab6702891a Minor fixes to cascading standby failover. 2017-09-01 13:09:17 +09:00
Ian Barwick
edb74ccef9 Various fixes to "repmgr node rejoin" 2017-09-01 11:30:31 +09:00
Ian Barwick
f9f05158d2 Fix "cluster show --csv" output 2017-09-01 00:14:01 +09:00
Ian Barwick
c7423ebb44 Various minor fixes 2017-08-31 23:54:52 +09:00
Ian Barwick
91941183bc Use replication user, if set, when checking replication connections 2017-08-31 17:54:49 +09:00
Ian Barwick
1517c06bb1 Document "replication_user" configuration file parameter. 2017-08-31 17:29:09 +09:00
Ian Barwick
04c9779561 Add documentation about passwords and recovery.conf 2017-08-31 15:24:09 +09:00
Ian Barwick
0e0b221507 Add configuration file setting "use_primary_conninfo_password"
If, for whatever reason, the upstream server password needs to be set
in "primary_conninfo", enable it to be extracted from $PGPASSWORD.
2017-08-31 14:57:07 +09:00
Ian Barwick
677cd9073a Document parameter "degraded_monitoring_timeout"
Also ensure it can be reloaded.
2017-08-31 14:48:51 +09:00
Ian Barwick
7fecc611a9 Update minimum version check in configure.in 2017-08-31 13:33:45 +09:00
Ian Barwick
ed16c32fe7 Check minimum server version for pg_rewind 2017-08-31 13:30:59 +09:00
Ian Barwick
705b52fde4 Use actual program name rather than "repmgr" when executing remote commands
It's possible some distribution packages may assign a different name to the
"repmgr" binary (typically appending a version number), so we shouldn't hard-code
into the command string.

However it's reasonable to assume the "repmgr" binary will have the same name
across a replication cluster so we won't engage in any contortions to account
for possible variations.

See: https://github.com/2ndQuadrant/repmgr/issues/323
2017-08-31 11:04:11 +09:00
Ian Barwick
ae634100a3 "standby switchover": sanity-check remote "repmgr" binary before proceeding 2017-08-31 11:00:14 +09:00
Ian Barwick
8e35a415a9 Refactor extraction of value to use for "primary_conninfo"
Also add improved error detection.

Basically in the worst case we want to enable the user to clone a standby
from Barman even if the upstream node is not running/reachable, as long as
the user explicitly provides a string to use for "primary_conninfo".
2017-08-31 09:59:30 +09:00
Ian Barwick
dfb796ca8f Check config file *before* checking command line parameters
Validity of command line parameters may depened on settings in the configuration
file.
2017-08-31 09:55:32 +09:00
Ian Barwick
13f2d46e92 "standby clone": exit with 0 after successful --dry-run 2017-08-30 10:00:48 +09:00
Ian Barwick
8b03859dbe "standby clone": add early sanity check for external configuration files
This still requires an SSH connection, so we need to check early before
the cloning starts, and also emit useful information for --dry-run.
2017-08-29 22:11:16 +09:00
Ian Barwick
16d7282c9d Update README 2017-08-29 16:32:29 +09:00
Ian Barwick
154c76e5e7 repmgrd: improve cascaded standby failover
Check primary is available.
2017-08-29 15:29:17 +09:00
Ian Barwick
6d02415d26 Ensure "log_status_interval" gets reloaded 2017-08-29 15:26:10 +09:00
Ian Barwick
9cf3508979 Log configuration parameters which change following a reload 2017-08-29 13:08:37 +09:00
Ian Barwick
e0888c1f62 repmgrd: handle SIGHUP 2017-08-29 12:55:13 +09:00
Ian Barwick
df827c6518 Update repmgrd documentation 2017-08-29 11:04:30 +09:00
Ian Barwick
0b9c538cb9 Update README1 2017-08-29 08:13:42 +09:00
Ian Barwick
e40a14289d Update README 2017-08-28 17:33:38 +09:00
Ian Barwick
725bcf3177 Update README
Document replication slots.
2017-08-28 17:26:00 +09:00
Ian Barwick
4d5ff146e0 Update README 2017-08-28 17:07:14 +09:00
Ian Barwick
0b7dbb845c Add warning for --dry-run when not effective 2017-08-28 15:10:57 +09:00
Ian Barwick
b900f9996f "repmgr standby clone": add --dry-run option 2017-08-28 15:04:50 +09:00
Ian Barwick
e05bab8284 "standby switchover": epxlictly confirm suitability for --pg-rewind
If --force-rewind requested.
2017-08-28 14:50:08 +09:00
Ian Barwick
4ba2291cdd Update README 2017-08-28 12:32:05 +09:00
Ian Barwick
3b28cfa70e Update "standby switchover" command reference 2017-08-28 11:58:10 +09:00
Ian Barwick
4b2fb7d15b Update "repmgr primary ..." --help output and command reference 2017-08-28 11:53:41 +09:00
Ian Barwick
d85b066f45 repmgrd cluster show: add warnings with information about discrepancies 2017-08-28 11:19:32 +09:00
Ian Barwick
4a11551c2f repmgrd: handle local node failure 2017-08-28 10:31:43 +09:00
Ian Barwick
754084c814 Update "repmgr standby --help" output 2017-08-26 10:27:22 +09:00
Ian Barwick
57215a8bd7 Add --help output for "standby clone" 2017-08-25 23:07:17 +09:00
Ian Barwick
c1ed248fb1 Handle "event_notifications" when reloading configuration 2017-08-25 23:07:07 +09:00
Ian Barwick
e280e2670e Add code to reload configuration file 2017-08-25 22:31:42 +09:00
Ian Barwick
5208655a35 Parse "recovery_min_apply_delay" from recovery.conf 2017-08-25 21:47:14 +09:00
Ian Barwick
5ee1eb6bf7 Convert --recovery-min-apply-delay to configuration file option
That way it only needs to be set once, and won't get lost during
follow operations etc.
2017-08-25 21:25:15 +09:00
Ian Barwick
dc172cae20 When performing a follow operation, start/restart server as appropriate
Before this we were always forcing a restart, which is technically not
a problem but produces some potentially confusing log entries along the
lines:

  pg_ctl: PID file "/path/to/postmaster.pid" does not exist
  Is server running?
  starting server anyway
2017-08-25 16:50:30 +09:00
Ian Barwick
1ba2a2462c Update README 2017-08-25 16:48:14 +09:00
Ian Barwick
4921f389b6 Fix spurious warning when executing "repmgr node rejoin"
Database connection parameters required for this.
2017-08-25 16:28:25 +09:00
Ian Barwick
a449e8512e repmgr: improve "repmgr standby switchover" log output
Particularly in --dry-run mode it's useful to get a confirmation that
various prerequisites are met.
2017-08-25 16:01:11 +09:00
Ian Barwick
1452238613 Update document "repmgrd-node-fencing.md" 2017-08-25 15:58:52 +09:00