repmgr

mirror of https://github.com/EnterpriseDB/repmgr.git synced 2026-03-22 22:56:29 +00:00

Author	SHA1	Message	Date
Ian Barwick	4d4ed3bcd6	Remove BDR 2.x support The BDR 2.x support was conceptual only and was never used in production. As BDR 2.x will be EOL'd shortly, there is no risk it will be needed.	2020-01-16 09:52:42 +09:00
Ian Barwick	7fdf2f1778	Update copyright notices to 2020	2020-01-13 14:06:20 +09:00
Ian Barwick	b74f965f54	standby clone: rename --recovery-conf-only to --replication-conf-only A more generic option name to cover pre- and post-Pg12 replication configuration methods. --recovery-conf-only is retained as an alias for backwards compatibility.	2019-10-18 14:44:57 +09:00
Ian Barwick	931da14df1	Rename some "repmgr daemon ..." commands to "repmgr service ..." "repmgr daemon" can be interpreted to mean the commands affect the local daemon process only. Rename the commands which affect the entire cluster to "repmgr service ...". The "repmgr daemon ..." form of the affected commands is retained for backwards compatibility.	2019-08-28 14:58:11 +09:00
Ian Barwick	5ca0b57d0c	Remove command-line options deprecated since repmgr 3.3 The following options have long since been deprecated, and any attempt to use them results only in a warning that they are no longer valid: --data-dir --no-conninfo-password --recovery-min-apply-delay	2019-08-05 16:26:12 +09:00
Ian Barwick	b938f10206	repmgr client: mark some options as deprecated	2019-05-13 15:45:34 +09:00
Ian Barwick	d8e4c54ea4	"standby switchover": add "--repmgrd-force-unpause" Implements GitHub #559.	2019-05-10 16:04:07 +09:00
Ian Barwick	9fe2fa2daf	daemon status: make output more like that of "cluster show" In particular make any issues with unexpected server state more obvious.	2019-04-25 14:45:41 +09:00
Ian Barwick	1615353f48	repmgrd: optionally disconnect WAL receivers during failover This is intended to ensure that all nodes have a constant LSN while making the failover decision. This feature is experimental and needs to be explicitly enabled with the configuration file option "standby_disconnect_on_failover". Note enabling this option will result in a delay in the failover decision until the WAL receiver is disconnected on all nodes.	2019-03-06 15:53:57 +09:00
Ian Barwick	48381a5b4e	Use --compact option for abbreviated display output --terse is meant for reducing log chatter.	2019-02-02 13:06:59 +09:00
Ian Barwick	d7420d7274	daemon (start\|stop): verify that repmgrd starts/stops. Note this may not always be possible for "daemon stop" if we are unable to determine the repmgrd PID.	2019-01-30 14:41:31 +09:00
Ian Barwick	32b81e7d49	"daemon start": initial implementation	2019-01-29 13:01:14 +09:00
Ian Barwick	7dce3ed234	Update copyright notices to 2019	2019-01-21 14:54:35 +09:00
Ian Barwick	0b3a310802	Add --data-directory-config option to "repmgr node check" Implements part of GitHub #523.	2019-01-16 16:03:44 +09:00
Ian Barwick	10be941298	Fix typo "node join" should be "node rejoin"	2019-01-14 15:39:13 +09:00
Ian Barwick	b5b9aacc8a	Add command line option "repmgr --version-number" Outputs the raw version number. Intended for use by scripts etc.	2019-01-08 10:08:23 +09:00
Ian Barwick	2491b8ae52	Add functionality to "pause" repmgrd In some circumstances, e.g. while performing a switchover, it is essential that repmgrd does not take any kind of failover action, as this will put the cluster into an incorrect state. Previously it was necessary to stop repmgrd on all nodes (or at least those nodes which repmgrd would consider as promotion candidates), however this is a cumbersome and potentially risk-prone operation, particularly if the replication cluster contains more than a couple of servers. To prevent this issue from occurring, this patch introduces the ability to "pause" repmgrd on all nodes wth a single command ("repmgr daemon pause") which notifies repmgrd not to take any failover action until the node is "unpaused" ("repmgr daemon unpause"). "repmgr daemon status" provides an overview of each node and whether repmgrd is running, and if so whether it is paused. "repmgr standby switchover" has been modified to automatically pause repmgrd while carrying out the switchover. See documentation for further details.	2018-09-27 16:42:10 +09:00
Ian Barwick	56919ea499	repmgr: add -q/--quiet option This suppresses log output below log level ERROR. This is useful mainly when repmgr is being executed programmatically, e.g. in a cronjob, where it's only useful to receive output if something goes wrong. Note we advise against using this option when executing repmgr commands which operate on PostgreSQL nodes (standby follow, standby promote, standby switchover, node rejoin), particularly when executed by repmgrd, as the log output will provide valuable troubleshooting information. Implements suggestion in GitHub #468.	2018-07-13 12:09:41 +09:00
Ian Barwick	080a29c33b	node check: add --missing-slots check This enables an explicit check for slots which should exist (according to the repmgr metadata) but which aren't present.	2018-06-22 17:21:40 +09:00
Ian Barwick	a3f371b8c0	"node rejoin": actively check for node to rejoin cluster Previously repmgr was relying on whatever command was configured to start PostgreSQL to determine whether the node being rejoined had started correctly. However it's preferable to actively poll the upstream to confirm it has restarted and actually attached as a standby before confirming success of the "node rejoin" action. This can be overridden with the -W/--no-wait option. (Note that for consistency with other PostgreSQL utilities, the short form of the --wait option is now "-w"; this is currently only used in "repmgr standby follow".) Also update "repmgr node rejoin" documentation with a list of supported options, and add some useful index entries for "pg_rewind". Implements GitHub #415.	2018-04-03 10:34:44 +09:00
Ian Barwick	3ccf1cf182	Enable pg_rewind to be used with PostgreSQL 9.3/9.4 pg_rewind is not part of the core distribution for those, but we provided support in repmgr 3.3 so should extend it to repmgr 4. Note that there is no check in place whether the pg_rewind binary exists, so it's up to the user to ensure it's present. Addresses GitHub #413.	2018-04-02 20:54:29 +09:00
Ian Barwick	ee98a3a58e	"standby clone": add --recovery-conf-only option This will generate "recovery.conf" for an existing standby. Typical use-case is a standby cloned manually from an external data source (e.g. Barman), where "recovery.conf" needs to be created (and if required a replication slot). The --dry-run option will check the pre-requisites but not actually create "recovery.conf" or a replication slot. This requires that the upstream node is running, a replication connection can be made and if required a replication slot can be created. Implements GitHub #382.	2018-02-22 15:50:51 +09:00
Ian Barwick	927bf038a0	"standby switchover": check demotion candidate can make replication connection Check it's actually possible for the demotion candidate to attach to the promotion candidate before executing the switchover. As with other checks of this nature, there's a faint possibility the situation could change between the time the check is carried out and the demotion candidate is restarted to connect to the promotion candidate, but there's not a lot we can do about that. The main purpose is to be able to catch existing misconfigurations before anything gets changed. Implements GitHub #370.	2018-02-09 10:00:54 +09:00
Ian Barwick	b705127a34	"repmgr standby register": add --wait-start option Implements GitHub #356.	2018-01-04 14:56:08 +09:00
Ian Barwick	26a9e848fd	Update copyright notices to 2018	2018-01-02 10:19:46 +09:00
Ian Barwick	8c121da8a1	Add diagnostic option "repmgr node check --has-passfile" This checks if the active libpq version (9.6 and later) has the "passfile" option, and returns 0 if present, 1 if not. `	2017-12-11 20:09:48 +09:00
Ian Barwick	7fffe3ed96	witness: initial code framework	2017-11-15 13:47:41 +09:00
Ian Barwick	37bdad290c	Add --help output for "repmgr node service" Addresses GitHub #329.	2017-10-20 16:44:44 +09:00
Ian Barwick	f00e6296e9	Move deprecated command line option Not required in repmgr4, we're keeping it around for backwards compatibility; a warning will be issued if used.	2017-10-17 16:07:44 +09:00
Ian Barwick	f565851de3	repmgr client: clean up command line option handling	2017-10-04 09:35:04 +09:00
Ian Barwick	ea2693bc75	Move create_recovery_file() et al to repmgr-action-standby.c As they're only ever called from there.	2017-09-18 09:53:08 +09:00
Ian Barwick	b6b31b15b2	Implement "repmgr cluster cleanup"	2017-09-11 13:48:46 +09:00
Ian Barwick	a9f4a027a7	pgindent run	2017-09-11 11:14:13 +09:00
Ian Barwick	e4f7dc8234	Add copyright notices	2017-09-08 13:27:39 +09:00
Ian Barwick	edee80cc37	Rename option "node check --is-shutdown" to "--is-shutdown-cleanly" As that's what we really want to know. Also return "UNCLEAN_SHUTDOWN" if that's the case, rather than "RUNNING" which is confusing, even though it's a command for internal use.	2017-09-07 11:15:27 +09:00
Ian Barwick	0e0b221507	Add configuration file setting "use_primary_conninfo_password" If, for whatever reason, the upstream server password needs to be set in "primary_conninfo", enable it to be extracted from $PGPASSWORD.	2017-08-31 14:57:07 +09:00
Ian Barwick	da24d883e5	Remove option "--wal-keep-segments" This is a remnant of the early repmgr days when there were no alternative mechanisms for ensuring sufficient WAL remains available while cloning a standby. The purpose of this setting was to override a check for an (arbitrary) minimum setting for "wal_keep_segments". As there's no reliable way of determining a sensible value for this, and improvements in pg_basebackup mean WALs can be streamed (possibly using a replication slot) while the backup is in progress, there's no point in keeping this around. We will however still emit a warning about setting "wal_keep_segments" if the configuration doesn't appear to provide any other way of ensuring WAL is available during/after the cloning process and "wal_keep_segments" is not set.	2017-08-17 14:45:13 +09:00
Ian Barwick	b1ba476241	Rename "archiver" check etc. to "archive-ready" Gives a better indication of what's being checked.	2017-08-17 12:23:56 +09:00
Ian Barwick	4c0d719cdb	Add replication slot check to "repmgr node check"	2017-08-16 11:17:02 +09:00
Ian Barwick	554673e83e	Add "repmgr node check --downstream"	2017-08-15 15:50:46 +09:00
Ian Barwick	10ef30096c	"node check": add server role check	2017-08-14 22:57:09 +09:00
Ian Barwick	a57fb5b50c	After switchover, enable sibling standbys to follow new primary	2017-08-10 00:06:16 +09:00
Ian Barwick	bae82318f1	No need to expose configuration file archive functions as repmgr commands	2017-08-09 13:32:15 +09:00
Ian Barwick	b1e544f962	Enable use of pg_rewind during switchover operations But only if required and --force-rewind required, and pg_rewind can actually be used.	2017-08-09 12:09:37 +09:00
Ian Barwick	f2cf46bba3	Check replication lag before attempting switchover	2017-08-08 10:16:47 +09:00
Ian Barwick	2499b42ef8	switchover: check for pending archive files on the demotion candidate If the current primary (demotion candidate) still has any files to archive, it will delay the shutdown until all files are archived. If there is a substantial number of files, and/or the archive command executes slowly, this will probably lead to an unwelcome delay in the switchover process.	2017-08-08 00:37:20 +09:00
Ian Barwick	112ca6321a	Initial switchover implementation The repmgr3 implementation required the promotion candidate (standby) to directly work with the demotion candidate's data directory, directly execute server control commands etc. Here we delegated a lot more of that work to the repmgr on the demotion candidate, which reduces the amount of back-and-forth over SSH and generally makes things cleaner and smoother. In particular the repmgr on the demotion candidate will carry out a thorough check that the node is shut down and report the last checkpoint LSN to the promotion candidate; this can then be used to determine whether pg_rewind needs to be executed on the demoted primary before reintegrating it back into the cluster (todo). Also implement "--dry-run" for this action, which will sanity-check the nodes as far as possible without executing the switchover. Additionally some of the new repmgr node commands (or command options) introduced for this can be also executed by the user to obtain additional information about the status of each node.	2017-08-03 16:38:37 +09:00
Ian Barwick	aa528dfdfb	Consolidate generation of various server control commands This is needed for better switchover control, so we can instruct the remote repmgr to issue the appropriate server command rather than trying to work out what it should be from the local node.	2017-08-02 12:01:20 +09:00
Ian Barwick	f023b9c90c	Add "repmgr node archive-config"	2017-08-01 17:38:54 +09:00
Ian Barwick	8a2e4db1bc	Add "repmgr node status" Outputs an overview of a node's status, and emits warnings if any issues detected.	2017-07-25 00:39:04 +09:00

1 2

78 Commits