repmgr

mirror of https://github.com/EnterpriseDB/repmgr.git synced 2026-03-23 15:16:29 +00:00

Author	SHA1	Message	Date
Ian Barwick	47a4b49890	Add "repmgr standby follow --upstream-node-id" In an automatic failover situation, after a standby has been promoted there's a risk the original primary may become available again before "standby follow" is issued on another standby node, in which case "standby follow" will reconnect to the original primary. As the standby's repmgrd will have received a notification from the new primary, it will know the primary's ID and can therefore explicitly direct "standby follow" to follow that primary.	2017-09-04 09:11:59 +09:00
Ian Barwick	c7423ebb44	Various minor fixes	2017-08-31 23:54:52 +09:00
Ian Barwick	91941183bc	Use replication user, if set, when checking replication connections	2017-08-31 17:54:49 +09:00
Ian Barwick	04c9779561	Add documentation about passwords and recovery.conf	2017-08-31 15:24:09 +09:00
Ian Barwick	0e0b221507	Add configuration file setting "use_primary_conninfo_password" If, for whatever reason, the upstream server password needs to be set in "primary_conninfo", enable it to be extracted from $PGPASSWORD.	2017-08-31 14:57:07 +09:00
Ian Barwick	705b52fde4	Use actual program name rather than "repmgr" when executing remote commands It's possible some distribution packages may assign a different name to the "repmgr" binary (typically appending a version number), so we shouldn't hard-code into the command string. However it's reasonable to assume the "repmgr" binary will have the same name across a replication cluster so we won't engage in any contortions to account for possible variations. See: https://github.com/2ndQuadrant/repmgr/issues/323	2017-08-31 11:04:11 +09:00
Ian Barwick	dfb796ca8f	Check config file before checking command line parameters Validity of command line parameters may depened on settings in the configuration file.	2017-08-31 09:55:32 +09:00
Ian Barwick	8b03859dbe	"standby clone": add early sanity check for external configuration files This still requires an SSH connection, so we need to check early before the cloning starts, and also emit useful information for --dry-run.	2017-08-29 22:11:16 +09:00
Ian Barwick	0b7dbb845c	Add warning for --dry-run when not effective	2017-08-28 15:10:57 +09:00
Ian Barwick	754084c814	Update "repmgr standby --help" output	2017-08-26 10:27:22 +09:00
Ian Barwick	5ee1eb6bf7	Convert --recovery-min-apply-delay to configuration file option That way it only needs to be set once, and won't get lost during follow operations etc.	2017-08-25 21:25:15 +09:00
Ian Barwick	4921f389b6	Fix spurious warning when executing "repmgr node rejoin" Database connection parameters required for this.	2017-08-25 16:28:25 +09:00
Ian Barwick	ff07763242	repmgr: update --help output Display database connection options.	2017-08-22 15:07:22 +09:00
Ian Barwick	7ca396b9cb	Add missing Barman options check	2017-08-21 14:10:08 +09:00
Ian Barwick	594e9e5007	Document upgrade process from repmgr3 Also provide unpackaged extension upgrade SQL, and a script to assist converting repmgr.conf files.	2017-08-17 23:37:31 +09:00
Ian Barwick	da24d883e5	Remove option "--wal-keep-segments" This is a remnant of the early repmgr days when there were no alternative mechanisms for ensuring sufficient WAL remains available while cloning a standby. The purpose of this setting was to override a check for an (arbitrary) minimum setting for "wal_keep_segments". As there's no reliable way of determining a sensible value for this, and improvements in pg_basebackup mean WALs can be streamed (possibly using a replication slot) while the backup is in progress, there's no point in keeping this around. We will however still emit a warning about setting "wal_keep_segments" if the configuration doesn't appear to provide any other way of ensuring WAL is available during/after the cloning process and "wal_keep_segments" is not set.	2017-08-17 14:45:13 +09:00
Ian Barwick	b1ba476241	Rename "archiver" check etc. to "archive-ready" Gives a better indication of what's being checked.	2017-08-17 12:23:56 +09:00
Ian Barwick	b1b5870d54	"repmgr node status": add --help output, fix CSV output Also ensure is executed only on local node, as it needs to read the data directory.	2017-08-17 11:27:31 +09:00
Ian Barwick	0ac16f7630	Add more --help output	2017-08-16 17:49:46 +09:00
Ian Barwick	8ff545f9ae	Add --help output for "repmgr cluster"	2017-08-16 16:33:07 +09:00
Ian Barwick	4efc8fb9ce	Add placeholder functions for "repmgr $command --help" There are now too many options to sensibly fit into general --help output; we'll add separate output for each repmgr command, e.g. "repmgr node --help".	2017-08-16 13:24:14 +09:00
Ian Barwick	4c0d719cdb	Add replication slot check to "repmgr node check"	2017-08-16 11:17:02 +09:00
Ian Barwick	554673e83e	Add "repmgr node check --downstream"	2017-08-15 15:50:46 +09:00
Ian Barwick	10ef30096c	"node check": add server role check	2017-08-14 22:57:09 +09:00
Ian Barwick	3b2158edbf	Initialise variables, where appropriate	2017-08-14 15:11:42 +09:00
Ian Barwick	4260fdf1e7	More code cleanup	2017-08-14 12:19:57 +09:00
Ian Barwick	0f31756733	General code cleanup	2017-08-14 10:04:53 +09:00
Ian Barwick	7ca68b7cc8	Standardize "primary_conninfo" generation Previously repmgr would write all the default libpq parameters into "primary_conninfo" on "standby clone", but not for "standby follow", which is inconsistent. For repmgr4 we'll determine that the upstream node's conninfo must be canonical and contain all required connection parameters, even if these are available as defaults or environment variables in the local environment, as those are transient and may not be available in all environments/situations. recovery.conf's "primary_conninfo" will be generated using the upstream's conninfo parameters, except for those specific to the downstream node. These are: - "application_name": this will always be set to the "node_name" of the downstream node - "passfile" and "servicefile": these, must of course reference files on the downstream node so will be extracted from the downstream node's conninfo, if set	2017-08-10 12:37:50 +09:00
Ian Barwick	5fb86771b1	Use stored node configuration file path when executing remote commands Makes life much easier.	2017-08-10 09:12:07 +09:00
Ian Barwick	1d99a07b43	Store configuration file in repmgr.nodes table When executing repmgr on remote nodes, we otherwise end up jumping through hoops as we can't make assumptions about where the configuration file is located, but really need to be able to provide it. From a support point of view it will also make life easier as it will be easy to specify exactly which file to provide.	2017-08-10 08:03:24 +09:00
Ian Barwick	a57fb5b50c	After switchover, enable sibling standbys to follow new primary	2017-08-10 00:06:16 +09:00
Ian Barwick	bae82318f1	No need to expose configuration file archive functions as repmgr commands	2017-08-09 13:32:15 +09:00
Ian Barwick	df425a38b7	Refactor "standby follow" functionality "standby follow" was originally co-opted to start up a demoted node; this functionality is now delegated to "node rejoin", with the core functionality of "standby follow" implemented as an internal function.	2017-08-09 13:26:27 +09:00
Ian Barwick	b1e544f962	Enable use of pg_rewind during switchover operations But only if required and --force-rewind required, and pg_rewind can actually be used.	2017-08-09 12:09:37 +09:00
Ian Barwick	f2cf46bba3	Check replication lag before attempting switchover	2017-08-08 10:16:47 +09:00
Ian Barwick	2499b42ef8	switchover: check for pending archive files on the demotion candidate If the current primary (demotion candidate) still has any files to archive, it will delay the shutdown until all files are archived. If there is a substantial number of files, and/or the archive command executes slowly, this will probably lead to an unwelcome delay in the switchover process.	2017-08-08 00:37:20 +09:00
Ian Barwick	82639b6903	Refactor slot name handling Better to work with the slot name in a node record, rather than creating a global variable.	2017-08-04 11:56:11 +09:00
Ian Barwick	112ca6321a	Initial switchover implementation The repmgr3 implementation required the promotion candidate (standby) to directly work with the demotion candidate's data directory, directly execute server control commands etc. Here we delegated a lot more of that work to the repmgr on the demotion candidate, which reduces the amount of back-and-forth over SSH and generally makes things cleaner and smoother. In particular the repmgr on the demotion candidate will carry out a thorough check that the node is shut down and report the last checkpoint LSN to the promotion candidate; this can then be used to determine whether pg_rewind needs to be executed on the demoted primary before reintegrating it back into the cluster (todo). Also implement "--dry-run" for this action, which will sanity-check the nodes as far as possible without executing the switchover. Additionally some of the new repmgr node commands (or command options) introduced for this can be also executed by the user to obtain additional information about the status of each node.	2017-08-03 16:38:37 +09:00
Ian Barwick	c67aa15581	Make "pgdata" a mandatory configuration file setting There are some circumstances, e.g. during switchover operations, where repmgr may need to operate on a data directory while the server isn't running, in which case there's no way to retrieve that information.	2017-08-02 23:04:24 +09:00
Ian Barwick	83cda89362	Get data directory for server commands if needed Also add configuration file option "pgdata" for hard-coding the node's data directory - if the "repmgr" DB user isn't a superuser or doesn't have permission to extract the data directory, we'll need another way of finding out.	2017-08-02 13:16:16 +09:00
Ian Barwick	aa528dfdfb	Consolidate generation of various server control commands This is needed for better switchover control, so we can instruct the remote repmgr to issue the appropriate server command rather than trying to work out what it should be from the local node.	2017-08-02 12:01:20 +09:00
Ian Barwick	e5d50bbfd5	Separate configuration file queries into a discrete function Simplifies main application code and makes it easier to reuse the queries.	2017-08-02 00:04:20 +09:00
Ian Barwick	a1ad62d04e	Add "repmgr node restore-config"	2017-08-01 22:13:32 +09:00
Ian Barwick	f023b9c90c	Add "repmgr node archive-config"	2017-08-01 17:38:54 +09:00
Ian Barwick	dc24d62009	repmgrd: improve BDR recovery handling	2017-07-27 11:53:55 +09:00
Ian Barwick	d8a1799215	Update -?/--help output	2017-07-27 10:08:32 +09:00
Ian Barwick	a9b0c16b3c	Add "cluster matrix" and "cluster crosscheck" actions	2017-07-26 11:24:33 +09:00
Ian Barwick	56b2e9bb84	Rename/add configuration file options In previous versions of repmgr, some options had ambiguous meanings, and/or were used for slightly different purposes. This way we end up with a couple more options (most of which probably won't need adjusting) but greater clarity and flexibility. Removed: master_reponse_timeout: renamed to "async_query_timeout", as this was its main usage retry_promote_interval_secs: replaced by "primary_notification_timeout" Added: async_query_timeout: timeout (in seconds) when executing asynchronous queries primary_notification_timeout: number of seconds to wait for notification from the new primary after a failover primary_follow_timeout: number of seconds to wait for the new primary to become available when executing "repmgr standby follow"	2017-07-25 11:13:32 +09:00
Ian Barwick	8a2e4db1bc	Add "repmgr node status" Outputs an overview of a node's status, and emits warnings if any issues detected.	2017-07-25 00:39:04 +09:00
Ian Barwick	b99443b0c8	Improvements to `repmgr cluster show` Add documentation; show recovery status in --csv mode.	2017-07-20 10:25:13 +09:00

1 2 3

126 Commits