repmgr

mirror of https://github.com/EnterpriseDB/repmgr.git synced 2026-07-16 14:29:05 +00:00

Author	SHA1	Message	Date
Ian Barwick	0dce03a5f8	standby clone: don't query upstream's data directory In early repmgr versions, this used to be a requirement for cloning via rsync, and/or as a fallback location if the user didn't supply a data directory to clone into. However as rsync cloning has been deprecated, and the data directory must be specified in repmgr.conf, this is no longer required, and removing it simplifies user privilege requirements. Note that it is still possible to explicitly provide a target data directory with -D/--pgdata, though this is primarily useful for the niche use case where repmgr is used as a convenience tool to clone a node which is not intended to become part of a repmgr cluster. This is part of the implementation of GitHub #536 for the minimizing of user privilege requirements.	2019-10-16 13:21:29 +09:00
Ian Barwick	a0591afb1e	doc: add repmgr 5.0 release date	2019-10-15 10:29:52 +09:00
Ian Barwick	2304584679	Fix handling of upstream node change check repmgrd has a check to see if the upstream node has unexpectedly changed, e.g. if the repmgrd service is paused and the PostgreSQL instance has been pointed to another node. However this check was relying on the node record on the local node being up-to-date, which may not be the case immediately after a failover, when the node is still replaying records updated prior to the node's own record being updated. In this case it will mistakenly assume the node is following the original primary and attempt to restart monitoring, which will fail as the original primary is no longer available. To prevent this, we check against the node's record on the upstream node. Addresses issue noted in GitHub #587 and #588.	2019-10-14 12:28:04 +09:00
Ian Barwick	10f00b8822	repmgr: pass explicitly provided log level when executing repmgr remotely This makes it possible to return log output when executing repmgr remotely at a different level to the one defined in the remote repmgr's repmgr.conf. This is particularly useful when DEBUG output is required.	2019-09-17 15:38:43 +09:00
Ian Barwick	56aae22b6c	doc: update release notes	2019-09-17 11:00:04 +09:00
Ian Barwick	931da14df1	Rename some "repmgr daemon ..." commands to "repmgr service ..." "repmgr daemon" can be interpreted to mean the commands affect the local daemon process only. Rename the commands which affect the entire cluster to "repmgr service ...". The "repmgr daemon ..." form of the affected commands is retained for backwards compatibility.	2019-08-28 14:58:11 +09:00
Ian Barwick	ffc7b7817b	doc: update HISTORY Note PostgreSQL support.	2019-08-22 15:42:29 +09:00
Ian Barwick	fb6352735a	The next major release will be 5.0. 4.5 was a placeholder release number in case a major release was required prior to the release of Pg12.	2019-08-22 15:15:56 +09:00
Ian Barwick	666c6f5140	"standby clone": improve error messages related to extension status Previously repmgr would emit the "repmgr extension not found on source node" which depending on context is somewhat misleading, as it may exist but not be installed, or the user may be attempting to clone from the wrong database.	2019-08-07 16:41:27 +09:00
Ian Barwick	38b373e6df	"node check": check role membership when trying to read pg_settings From PostgreSQL 10, a member of the default roles "pg_monitor" and/or "pg_read_all_settings" can read pg_settings without requiring superuser privileges. Previously, a hint was being emitted about making the repmgr user a member of one of those groups, but no check for membership was being made, meaning the check could only be run by a superuser.	2019-08-07 14:26:48 +09:00
Ian Barwick	8d55cab25e	Convert configuration file parsing to use flex Previously, repmgr was using a very simple ad-hoc string-based parser, which had various limitations and allowed configuration files to be created in a way which could cause confusion and/or unexpected behaviour. For example, it accepted strings enclosed in single quotes, but treated strings enclosed in double quotes literally. A node_name defined thusly: node_name="somenode" would result in the literal value '"somenode"' being used, which could lead to unobvious errors along the lines of: no record found for ""somenode"" The configuration file parser has been adapted from the one used by PostgreSQL itself, so behaves more-or-less identically (though some functions such as file inclusion are not supported in repmgr). This makes configuration parsing more robust and consistent; additionally, error reporting will be more precise. Note this does mean that some repmgr.conf items previously accepted as valid by repmgr will now be rejected; in particular this includes strings containing spaces which are not enclosed in single quotes.	2019-08-01 10:17:20 +09:00
Ian Barwick	5bf9605286	Revert "Convert configuration file parsing to use flex" This reverts commit `c6ca183247`. Backing out this patch for now as the Debian build system doesn't seem to like it, even though it builds just fine on Debian itself.	2019-07-18 10:19:18 +09:00
Ian Barwick	c6ca183247	Convert configuration file parsing to use flex Previously, repmgr was using a very simple ad-hoc string-based parser, which had various limitations and allowed configuration files to be created in a way which could cause confusion and/or unexpected behaviour. For example, it accepted strings enclosed in single quotes, but treated strings enclosed in double quotes literally. A node_name defined thusly: node_name="somenode" would result in the literal value '"somenode"' being used, which could lead to unobvious errors along the lines of: no record found for ""somenode"" The configuration file parser has been adapted from the one used by PostgreSQL itself, so behaves more-or-less identically (though some functions such as file inclusion are not supported in repmgr). This makes configuration parsing more robust and consistent; additionally, error reporting will be more precise. Note this does mean that some repmgr.conf items previously accepted as valid by repmgr will now be rejected; in particular this includes strings containing spaces which are not enclosed in single quotes.	2019-07-03 12:18:01 +09:00
Ian Barwick	b125628f7b	doc: update release notes Finalize release date.	2019-06-26 15:57:42 +09:00
Ian Barwick	09979eaa91	note that "standby follow" requires a primary to be available While it's technically possible to have a standby follow another standby while the primary is not available, repmgr will not be able to update its metadata, which will cause Confusion and Chaos. Update the documentation to make this clear, and provide a more helpful error message if this situation occurs. The operation previously failed anyway, but with an unhelpful message about not being able to find a node record.	2019-06-11 15:14:17 +09:00
Ian Barwick	7180e2bed7	Canonicalize the data directory path when parsing the configuration file This ensures the provided path matches the path PostgreSQL reports as its data directory.	2019-06-07 09:48:01 +09:00
Ian Barwick	f5d29f6591	doc: update release notes	2019-06-06 11:30:30 +09:00
Ian Barwick	d4df0055c9	repmgr: use --compact (not --terse) in "cluster events" to hide details column This is consistent with usage elsewhere. "--terse" is intended to reduce logging noise.	2019-05-30 14:19:37 +09:00
Ian Barwick	9085ca46a8	doc: update release notes	2019-05-28 15:38:19 +09:00
Ian Barwick	c560dfbbce	cluster show: display timeline ID This helps provide a better picture of the state of the cluster, i.e. making it more obvious whether there's been a timeline divergence. This also provides infrastructure for further improvements in cluster status display and diagnosis. Note this is only available in PostgreSQL 9.6 and later as it relies on the SQL functions for interrogating pg_control, which can be executed remotely. As PostgreSQL 9.5 will shortly be the only community-supported version without these functions, it's not worth the effort of trying to duplicate their functionality.	2019-05-27 09:39:19 +09:00
Ian Barwick	2bce1b371c	doc: fold putative 4.3.1 release notes into 4.4	2019-05-23 09:03:18 +09:00
Ian Barwick	c9e85996f5	repmgr: prevent a standby being cloned from a witness server Previously repmgr would happily clone from whatever server it found at the provided source server address. We should ensure that a standby can only be cloned from a node which is part of the main replication cluster. This check fetches a list of nodes from the source server, connects to the first non-witness server it finds, and compares the system identifiers of the source node and the node it has connected to. If there is a mismatch, then the source server is clearly not part of the main replication cluster, and is most likely the witness server.	2019-05-22 16:52:25 +09:00
Ian Barwick	f03e012c99	cluster show/daemon status: report if node not attached to advertised upstream	2019-05-14 16:15:03 +09:00
Ian Barwick	fca033fb9d	cluster show/daemon status: report upstream node mismatches When showing node information, check if the node's copy of its record shows a different upstream to the one expected according to the node where the command is executed. This helps visualise situations where the cluster is in an unexpected state, and provide a better idea of the actual state. For example, if a cluster has divided somehow and a set of nodes are following a new primary, when running "cluster show" etc., repmgr will now show the name of the primary those nodes are actually following, rather than the now outdated node name recorded on the other side of the split. A warning will also be issued about the situation.	2019-05-14 13:11:31 +09:00
Ian Barwick	d8e4c54ea4	"standby switchover": add "--repmgrd-force-unpause" Implements GitHub #559.	2019-05-10 16:04:07 +09:00
Ian Barwick	d43b40c5c6	doc: enable creation of PDF files	2019-05-10 10:50:49 +09:00
Ian Barwick	5e03627e6c	doc: update release notes	2019-05-07 15:29:56 +09:00
Ian Barwick	8da355eb3f	doc: update release notes	2019-05-02 14:00:07 +09:00
Ian Barwick	dbbf35ded1	Update HISTORY	2019-04-25 14:59:33 +09:00
Ian Barwick	9fe2fa2daf	daemon status: make output more like that of "cluster show" In particular make any issues with unexpected server state more obvious.	2019-04-25 14:45:41 +09:00
Ian Barwick	ef47589c6b	standby clone: always ensure directory is created with correct permissions In Barman mode, if there is an existing, populated data directory, and the "--force" option is provided, the entire directory was being deleted, and later recreated as part of the rsync process, but with the default permissions. Fix this by recreating the data directory with the correct permissions after deleting it.	2019-04-09 10:58:27 +09:00
Ian Barwick	77b9887d61	standby clone: improve --dry-run behaviour in barman mode - emit additional informational output - ensure that provision of --force does not result in an existing data directory being modified in any way	2019-04-08 15:12:22 +09:00
Ian Barwick	7631c60933	doc: update release notes	2019-04-08 11:27:25 +09:00
Ian Barwick	602e06a8f4	doc: finalize 4.3 release notes	2019-04-02 14:42:06 +09:00
Ian Barwick	73ad689390	standby register: fail if --upstream-node-id is the local node ID	2019-03-27 14:22:55 +09:00
Ian Barwick	ec873b0119	doc: update release notes	2019-03-22 15:43:49 +09:00
Ian Barwick	dfb92df05f	doc: miscellaenous cleanup	2019-03-15 14:39:37 +09:00
Ian Barwick	63f7ad546e	repmgrd: add option "connection_check_type" This enable selection of the method repmgrd uses to check whether the upstream node is available. Possible values are: - "ping" (default): uses PQping() to check server availability - "connection": executes a query on the connection to check server availability (similar to repmgr3.x).	2019-03-06 12:09:54 +09:00
Ian Barwick	0578053875	standby clone: check upstream connections after data copy operation With long-running copy operations, it's possible the connection(s) to the primary/source server may go away for some reason, so recheck their availability before attempting to reuse.	2019-02-26 14:37:05 +09:00
Ian Barwick	07097575b1	daemon status: add column "upstream last seen" This displays the interval (in seconds) since the repmgrd instance on each node last confirmed its upstream node is available.	2019-02-23 13:03:16 +09:00
Ian Barwick	71d151ca87	Don't check status of logical replication slots We only want to check the status of physical replication slots to determine whether a streaming replication standby has become detached and there is therefore a risk of uncontrolled WAL buildup on the local node. It's not feasible to second-guess the state of logical replication slots.	2019-02-23 10:09:43 +09:00
Ian Barwick	629c552348	primary unregister: ensure correct behaviour when executed on a witness Fixes GitHub #548.	2019-02-15 19:49:17 +09:00
Ian Barwick	3a5a4388c7	cluster show: differentiate unreachable status Differentiate between unreachable nodes and nodes which are running but rejecting connections.	2019-02-15 16:01:55 +09:00
Ian Barwick	f1667a7e98	repmgrd: don't consider nodes where repmgrd is not running If, for whatever reason, repmgrd is not running on a node, but that node qualifies as promotion candidate, failover will not take place as that node will never promote itself. We therefore discount nodes where repmgrd is running as promotion candidates, which will ensure one node is always promoted. There is a slight risk here that the node(s) where repmgrd is not running are further ahead, leading to a timeline fork. It might be possible to mitigate that by having the "election" leader perform the promote (or follow) operation.	2019-02-07 17:07:13 +09:00
Ian Barwick	2c9700586c	repmgr: "witness register" - check connection is to primary node Previously, if the witness server connection details were provided to "repmgr witness register" rather than those of the primary server, repmgr a) write the node record to the witness server rather than the primary, and b) would loop indefinitely trying to copy the node table to itself. Addresses GitHub #538.	2019-02-04 14:45:32 +09:00
Ian Barwick	59ed86c01a	"cluster show": fix formatting with multiple digit node IDs	2019-02-02 14:07:49 +09:00
Ian Barwick	48381a5b4e	Use --compact option for abbreviated display output --terse is meant for reducing log chatter.	2019-02-02 13:06:59 +09:00
Ian Barwick	b9ba97a36d	"standby switchover": check replication connection to upstream Ensure repmgr checks the standby (promotion candidate) is currently attached to the primary (demotion candidate). Addresses issue reported in GitHub #519.	2019-02-01 15:28:06 +09:00
Ian Barwick	9273e7af73	"standby switchover": avoid potential race condition with WAL location check Immediately after the demotion candidate (primary) has shut down, we can't be absolutely sure that the walreceiver has flushed all WAL to disk, so checking pg_last_wal_receive_lsn() at that point might not reflect the actual last available WAL location. To handle this, we'll loop for a while (timeout controlled by configuration parameter "wal_receive_check_timeout") before finally deciding whether the standby is still behind the shut-down primary. Addresses issue raised in GitHub #518.	2019-02-01 12:06:22 +09:00
Ian Barwick	7654dd615b	Finalize "daemon (start\|stop)" commands Implements GitHub #528.	2019-01-29 13:16:11 +09:00

1 2 3

134 Commits