repmgr

mirror of https://github.com/EnterpriseDB/repmgr.git synced 2026-03-22 22:56:29 +00:00

Author	SHA1	Message	Date
Ian Barwick	e561ddc8d3	node check: accept -S/--superuser option This is mainly useful for the --data-directory-config option, which requires permission to read pg_settings to verify that the data directory configured in "repmgr.conf" matches the data directory actually in use. If pg_settings read permission is not available, repmgr will fall back to a simple check that the data directory configured in "repmgr.conf" is a valid PostgreSQL directory. This is not entirely foolproof, as it's possible PostgreSQL could be using a different data directory.	2020-03-23 17:14:04 +09:00
Ian Barwick	12adb5e0d1	Add warning if --superuser option provided when it won't be used Currently the only place this option is relevant is "standby clone".	2020-03-23 15:28:22 +09:00
Ian Barwick	9de31428f1	Consolidate replication connection code In a few places, replication connections are generated from the parameters used by existing connections. This has resulted in a number of similar blocks of code which do more-or-less the same thing almost but not quite identically. In two cases, the code omitted to set "dbname=replication", which can cause problems in some contexts. These code blocks have now been consolidated into standardized functions. This also resolves the issue addressed by GitHub #619.	2020-03-05 17:21:37 +09:00
Ian Barwick	8f6058c676	standby switchover: check replication configuration file ownership Within a PostgreSQL data directory, all files should have the same ownership as the data directory itself. PostgreSQL itself expects this, and ownership of files by another user is likely to cause problems. In PostgreSQL 11 or earlier, if "recovery.conf" cannot be moved by PostgreSQL (because e.g. it is owned by root), it will not be possible to promote the standby to primary. In PostgreSQL 12 and later, if "postgresql.auto.conf" on the demotion candidate (current primary) has incorrect ownership (e.g. owned by root), repmgr will very likely not be able to modify this file and write the replication configuration required for the node to rejoin the cluster as a standby. Checks added to catch both cases before a switchover is executed.	2020-03-04 17:21:22 +09:00
Ian Barwick	194b6d0948	Minor code simplification	2020-03-03 15:27:45 +09:00
Ian Barwick	6ef722956b	cluster show: show unreachable node's upstream name as uncertain	2020-02-25 16:50:45 +09:00
Ian Barwick	b4af80fdec	Add optional check for unsupported future PostgreSQL releases This is for backbranches to prevent them running against newer PostgreSQL versions with which they are not compatible, for example 4.4.x with PostgreSQL 12 and later.	2020-02-14 10:43:19 +09:00
Ian Barwick	7ed0a99d70	Make code to check standby join status available globally This makes it possible to check the standby join status from another node, e.g. the promotion candidate during a switchover operation.	2020-02-04 12:52:55 +09:00
Ian Barwick	cd7f36a6fd	Add general check function "check_replication_slots_available()" Make the code previously only used by "standby follow" generally available - we'll want to use this from "node rejoin" as well. While we're at it, when reporting failure due to lack of free replication slots, report the current value of "max_replication_slots".	2020-02-03 16:43:55 +09:00
Ian Barwick	84b824d86a	Add missing values to action_name()	2020-01-29 15:32:40 +09:00
Ian Barwick	4d4ed3bcd6	Remove BDR 2.x support The BDR 2.x support was conceptual only and was never used in production. As BDR 2.x will be EOL'd shortly, there is no risk it will be needed.	2020-01-16 09:52:42 +09:00
Ian Barwick	7fdf2f1778	Update copyright notices to 2020	2020-01-13 14:06:20 +09:00
Ian Barwick	f158e35c13	Make variable local to code block	2019-11-20 10:13:55 +09:00
Ian Barwick	25fb24eee4	Minor cleanup in repmgr-client.c	2019-10-30 16:58:30 +09:00
Ian Barwick	220ec7fc96	Minimize user permissions requirements for replication slots Enable operations which create or drop replication slots to be carried out with the minimum necessary user permissions, i.e. a user with the REPLICATION attribute. This can be the repmgr user, or a dedicated replication user. In the latter case, if the dedicated replication user is only permitted to make replication connections, the streaming replication protocol is used to create/drop slots. Implements part of GitHub #536.	2019-10-30 15:51:15 +09:00
Ian Barwick	dc11330d58	Rename replication slot create/drop functions Append "_sql" to the respective function names, as we'll later be creating equivalent functions which use the replication protocol so need a way to distinguish between them.	2019-10-23 13:43:09 +09:00
Ian Barwick	b74f965f54	standby clone: rename --recovery-conf-only to --replication-conf-only A more generic option name to cover pre- and post-Pg12 replication configuration methods. --recovery-conf-only is retained as an alias for backwards compatibility.	2019-10-18 14:44:57 +09:00
Ian Barwick	a502b2cf96	Move function parse_repmgr_version() to a more appropriate location	2019-09-24 13:14:03 +09:00
Ian Barwick	10f00b8822	repmgr: pass explicitly provided log level when executing repmgr remotely This makes it possible to return log output when executing repmgr remotely at a different level to the one defined in the remote repmgr's repmgr.conf. This is particularly useful when DEBUG output is required.	2019-09-17 15:38:43 +09:00
Ian Barwick	677a94513e	repmgr: note that --dry-run is not effective with "repmgr service status"	2019-08-28 15:14:35 +09:00
Ian Barwick	931da14df1	Rename some "repmgr daemon ..." commands to "repmgr service ..." "repmgr daemon" can be interpreted to mean the commands affect the local daemon process only. Rename the commands which affect the entire cluster to "repmgr service ...". The "repmgr daemon ..." form of the affected commands is retained for backwards compatibility.	2019-08-28 14:58:11 +09:00
Ian Barwick	f5044465cb	Add function to safely modify postgresql.auto.conf This is required for PostgreSQL 12 and later.	2019-08-14 16:57:42 +09:00
Ian Barwick	a1775237d4	Update comment Deprecated command line option --data-dir was removed in commit `5ca0b57`, but a comment still referred to it.	2019-08-14 14:12:09 +09:00
Ian Barwick	94ba635811	Define our own PG_AUTOCONF_FILENAME	2019-08-13 16:48:44 +09:00
Ian Barwick	c0f3990973	Use appendPQExpBufferStr where appropriate	2019-08-13 16:32:40 +09:00
Ian Barwick	5ca0b57d0c	Remove command-line options deprecated since repmgr 3.3 The following options have long since been deprecated, and any attempt to use them results only in a warning that they are no longer valid: --data-dir --no-conninfo-password --recovery-min-apply-delay	2019-08-05 16:26:12 +09:00
Ian Barwick	7d20aea606	Fix typo in comment	2019-08-01 15:20:44 +09:00
Ian Barwick	d4df0055c9	repmgr: use --compact (not --terse) in "cluster events" to hide details column This is consistent with usage elsewhere. "--terse" is intended to reduce logging noise.	2019-05-30 14:19:37 +09:00
Ian Barwick	e6195edbca	cluster show: warn if unable to connect to witness's upstream Fix also applies to "daemon status".	2019-05-21 12:35:49 +09:00
Ian Barwick	2326c384c0	cluster show: fix upstream check for witnesses Fix also applies to "daemon status"	2019-05-21 12:28:32 +09:00
Ian Barwick	f03e012c99	cluster show/daemon status: report if node not attached to advertised upstream	2019-05-14 16:15:03 +09:00
Ian Barwick	8587539adb	Fix command line sanity check	2019-05-14 13:27:00 +09:00
Ian Barwick	fca033fb9d	cluster show/daemon status: report upstream node mismatches When showing node information, check if the node's copy of its record shows a different upstream to the one expected according to the node where the command is executed. This helps visualise situations where the cluster is in an unexpected state, and provide a better idea of the actual state. For example, if a cluster has divided somehow and a set of nodes are following a new primary, when running "cluster show" etc., repmgr will now show the name of the primary those nodes are actually following, rather than the now outdated node name recorded on the other side of the split. A warning will also be issued about the situation.	2019-05-14 13:11:31 +09:00
Ian Barwick	d8e4c54ea4	"standby switchover": add "--repmgrd-force-unpause" Implements GitHub #559.	2019-05-10 16:04:07 +09:00
Ian Barwick	b9f07f6a91	standby promote: use variable name "local_conn" for the local connection handle This is consistent with usage in other functions, and makes it easier to differentiate between the local node connection and the primary connection.	2019-05-02 12:04:26 +09:00
Ian Barwick	89a7261483	Always quote node names in log messages	2019-04-30 15:52:56 +09:00
Ian Barwick	5f10e68f31	emit warning if "--siblings-follow" provided out-of-context	2019-04-29 14:12:22 +09:00
Ian Barwick	2082a8d3f3	Consolidate some code	2019-04-25 16:04:40 +09:00
Ian Barwick	9fe2fa2daf	daemon status: make output more like that of "cluster show" In particular make any issues with unexpected server state more obvious.	2019-04-25 14:45:41 +09:00
Ian Barwick	5a9175c740	Clarify hints about updating the repmgr extension	2019-04-24 11:37:31 +09:00
Ian Barwick	a9b56d9833	Fix hint message s/UPGRADE/UPDATE	2019-04-10 12:08:26 +09:00
Ian Barwick	5e9f202c9a	Add missing break	2019-03-28 12:44:50 +09:00
Ian Barwick	9d5afeebbc	Remove logically dead code	2019-03-28 12:35:41 +09:00
Ian Barwick	ba1f05ece9	Restrict "node_name" to maximum 63 characters In "recovery.conf", the configuration parameter "node_name" is used as the "application_name" value, which will be truncated by PostgreSQL to 63 characters (NAMEDATALEN - 1). repmgr sometimes needs to be able to extract the application name from pg_stat_replication to determine if a node is connected (e.g. when executing "repmgr standby register"), so the comparison will fail if "node_name" exceeds 63 characters.	2019-03-28 10:37:57 +09:00
Ian Barwick	539861cb58	repmgrd: during failover, check if a node was already promoted Previously, repmgrd assumed that during a failover, there would not already be another primary node. However it's possible a node was promoted manually. While this is not a desirable situation, it's conceivable this could happen in the wild, so we should check for it and react accordingly. Also sanity-check that the follow target can actually be followed. Addresses issue raised in GitHub #420.	2019-03-22 14:06:41 +09:00
Ian Barwick	46efe57cd0	Improve database connection failure logging Log the output of PQerrorStatus() in a couple of places where it was missing. Additionally, always log the output of PQerrorStatus() starting with a blank line, otherwise the first line looks like it was emitted by repmgr, and it's harder to scan the error message. Before: [2019-03-20 11:24:15] [DETAIL] could not connect to server: Connection refused Is the server running on host "localhost" (::1) and accepting TCP/IP connections on port 5501? could not connect to server: Connection refused Is the server running on host "localhost" (127.0.0.1) and accepting TCP/IP connections on port 5501? After: [2019-03-20 11:27:21] [DETAIL] could not connect to server: Connection refused Is the server running on host "localhost" (::1) and accepting TCP/IP connections on port 5501? could not connect to server: Connection refused Is the server running on host "localhost" (127.0.0.1) and accepting TCP/IP connections on port 5501?	2019-03-20 11:47:28 +09:00
Ian Barwick	ae8171e461	Improve logging/sanity checking for "node control" options	2019-03-06 15:54:30 +09:00
Ian Barwick	1615353f48	repmgrd: optionally disconnect WAL receivers during failover This is intended to ensure that all nodes have a constant LSN while making the failover decision. This feature is experimental and needs to be explicitly enabled with the configuration file option "standby_disconnect_on_failover". Note enabling this option will result in a delay in the failover decision until the WAL receiver is disconnected on all nodes.	2019-03-06 15:53:57 +09:00
Ian Barwick	b1875a8d91	Split command execution functions into separate library These may need to be executed by repmgrd.	2019-02-27 14:41:17 +09:00
Ian Barwick	9338a9e233	Improve logging output Avoid emitting blank detail line	2019-02-15 10:49:56 +09:00

265 Commits