repmgr

mirror of https://github.com/EnterpriseDB/repmgr.git synced 2026-03-22 22:56:29 +00:00

Author	SHA1	Message	Date
RealGreenDragon	b92d43d136	Fixed repmgr.conf.sample	2024-10-14 14:46:27 +02:00
RealGreenDragon	94b21ae8ac	Fixed standby_disconnect_on_failover description in repmgr.conf	2024-09-09 15:29:48 +02:00
Felix Dreissig	35c8a53790	Sample config: Update link to docs	2022-04-19 10:20:18 +09:00
Ian Barwick	81f9c0ebd0	doc: update repmgr.conf.sample Minor formatting fix.	2021-12-08 09:50:37 +09:00
Ian Barwick	9f53d45c74	doc: update repmgr.conf.sample Remove bogus -W option in "repmgr standby follow" example invocation for the "follow_command" parameter. The option (which corresponds to "--no-wait") is not used by "repmgr standby follow". Per report from Jimmy Angelakos.	2021-12-08 09:50:19 +09:00
Ian Barwick	79d1f005db	repmgrd: activate inactive node record at startup If a PostgreSQL instance was shut down while repmgrd was running, and repmgrd was subsequently restarted (this chain of events could occur during e.g. a server reboot), the node record will have been set to "inactive". Previously, in this case repmgrd would refuse to start up. However, as we can determine the node is running, it should normally be no problem to automatically set the node record to "active". The old behaviour can be restored by setting the new parameter "repmgrd_exit_on_inactive_node" to "true". RM19604.	2021-07-12 17:46:09 +09:00
Ian Barwick	888e1d7a3b	docs: update repmgr.conf.sample Fix description for connection_check_type='connection'.	2021-03-02 11:44:14 +09:00
Ian Barwick	ce59d92731	doc: update repmgr.conf.sample	2021-01-14 15:27:24 +09:00
Josh Soref	842c67ca18	doc: various spelling fixes Via GitHub #687.	2020-12-22 13:47:56 +09:00
Ian Barwick	4b524c52b6	standby clone: honour --waldir setting when cloning from Barman By setting --waldir in "pg_basebackup_options", standbys cloned using pg_basebackup would have their WAL directory set to the specified location and symlinked from the data directory. This commit causes repmgr to honour that setting even when cloning from Barman.	2020-10-07 15:13:52 +09:00
Ian Barwick	73d2088a85	standby follow: don't restart server (PostgreSQL 13 and later) As of PostgreSQL 13, changes to the fundamental replication configuration can be applied with a simple SIGHUP, no restart required. In case the old behaviour is desired, i.e. a full restart to apply the configuration changes, the new configuration parameter "standby_follow_restart" can be set. This parameter has no effect in PostgreSQL 12 and earlier.	2020-09-29 17:53:51 +09:00
Ian Barwick	ce229beff8	repmgrd: add configuration option "always_promote" In certain corner cases, it's possible repmgrd may end up monitoring a standby which was a former primary, but the node record has not yet been updated. Previously repmgrd would abort the promotion with a cryptic message about being unable to find a node record for node_id -1 (the default value for an unknown node id). This commit addes a new configuration option "always_promote", which determines whether repmgrd should promote the node in this case. The default is "false", to effectively maintain the existing behaviour. Logging output has also been improved to make it clearer what has happened when this situation occurs.	2020-09-29 14:18:00 +09:00
Ian Barwick	271d407c7c	doc: update sample configuration file Clarify parameters for recovery_min_apply_delay	2020-07-16 11:07:49 +09:00
Ian Barwick	9b6fe6858a	doc: update repmgr.conf.sample Was missing "query" option for "connection_check_type".	2020-05-12 17:05:22 +09:00
Ian Barwick	4d4ed3bcd6	Remove BDR 2.x support The BDR 2.x support was conceptual only and was never used in production. As BDR 2.x will be EOL'd shortly, there is no risk it will be needed.	2020-01-16 09:52:42 +09:00
Renaud Fortier	afa88f0514	Update repmgr.conf.sample Add empty single quotes to promote_command and follow_command	2019-12-16 12:27:59 +09:00
Ian Barwick	e2ffeac67d	doc: add missing single quotes in repmgr.conf.sample	2019-11-20 15:13:12 +09:00
Ian Barwick	ce85ba6df5	doc: update repmgr.conf sample Convert recovery.conf references to generic configuration descriptions, and fix spacing.	2019-11-08 11:54:27 +09:00
Ian Barwick	dbd3d34c89	doc; update repmgr.conf.sample Note new PostgreSQL-style parsing and add link to documentation.	2019-10-07 18:28:16 +09:00
Ian Barwick	931da14df1	Rename some "repmgr daemon ..." commands to "repmgr service ..." "repmgr daemon" can be interpreted to mean the commands affect the local daemon process only. Rename the commands which affect the entire cluster to "repmgr service ...". The "repmgr daemon ..." form of the affected commands is retained for backwards compatibility.	2019-08-28 14:58:11 +09:00
Ian Barwick	01852f7e3a	doc: improve repmgr.conf settings documentation	2019-06-07 12:48:36 +09:00
Ian Barwick	36a09a5c4b	doc: improve configuration documentation	2019-06-07 12:16:04 +09:00
Ian Barwick	5a90513878	repmgrd: monitor standbys attached to primary This functionality enables repmgrd (when running on the primary) to monitor connected child nodes. It will log connections and disconnections and generate events. Additionally, repmgrd can execute a custom script if the number of connected child nodes falls below a configurable threshold. This script can be used e.g. to "fence" the primary following a failover situation where a new primary has been promoted and all standbys are now child nodes of that primary.	2019-04-22 16:18:52 +09:00
Ian Barwick	c338bc9c5e	doc: add note about BDR replication type in sample config	2019-04-05 14:37:49 +09:00
Ian Barwick	e23f5afc5f	doc: note valid characters for "node_name" "node_name" will be used as "application_name", so should only contain characters valid for that; see: https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-APPLICATION-NAME Not yet enforced.	2019-03-28 10:53:43 +09:00
Ian Barwick	ba1f05ece9	Restrict "node_name" to maximum 63 characters In "recovery.conf", the configuration parameter "node_name" is used as the "application_name" value, which will be truncated by PostgreSQL to 63 characters (NAMEDATALEN - 1). repmgr sometimes needs to be able to extract the application name from pg_stat_replication to determine if a node is connected (e.g. when executing "repmgr standby register"), so the comparison will fail if "node_name" exceeds 63 characters.	2019-03-28 10:37:57 +09:00
Ian Barwick	fbdf9617fa	doc: update repmgrd example output	2019-03-15 15:43:11 +09:00
Ian Barwick	9dd87dd5ce	doc: add explanation of the configuration file format	2019-03-15 14:02:42 +09:00
Ian Barwick	dd6ece326f	doc: update repmgrd configuration documentation	2019-03-13 13:34:08 +09:00
Ian Barwick	fc397f25f6	repmgrd: enable election rerun If "failover_validation_command" is set, and the command returns an error, rerun the election. There is a pause between reruns to avoid "churn"; the length of this pause is controlled by the configuration parameter "election_rerun_interval".	2019-03-12 17:12:19 +09:00
Ian Barwick	2a8f8d8400	doc: expand repmgrd configuration section	2019-03-11 14:50:33 +09:00
Ian Barwick	33fefd9f52	Add configuration option "primary_visibility_consensus" This determines whether repmgrd should continue with a failover if one or more nodes report they can still see the standby.	2019-03-07 10:41:42 +09:00
Ian Barwick	a3f90d2bba	Add configuration option "sibling_nodes_disconnect_timeout" This controls the maximum length of time in seconds that repmgrd will wait for other standbys to disconnect their WAL receivers in a failover situation. This setting is only used when "standby_disconnect_on_failover" is set to "true".	2019-03-06 15:56:21 +09:00
Ian Barwick	63f7ad546e	repmgrd: add option "connection_check_type" This enable selection of the method repmgrd uses to check whether the upstream node is available. Possible values are: - "ping" (default): uses PQping() to check server availability - "connection": executes a query on the connection to check server availability (similar to repmgr3.x).	2019-03-06 12:09:54 +09:00
Ian Barwick	a41e7bb726	doc: various minor updates	2019-02-01 17:24:32 +09:00
Ian Barwick	9273e7af73	"standby switchover": avoid potential race condition with WAL location check Immediately after the demotion candidate (primary) has shut down, we can't be absolutely sure that the walreceiver has flushed all WAL to disk, so checking pg_last_wal_receive_lsn() at that point might not reflect the actual last available WAL location. To handle this, we'll loop for a while (timeout controlled by configuration parameter "wal_receive_check_timeout") before finally deciding whether the standby is still behind the shut-down primary. Addresses issue raised in GitHub #518.	2019-02-01 12:06:22 +09:00
Ian Barwick	32b81e7d49	"daemon start": initial implementation	2019-01-29 13:01:14 +09:00
Ian Barwick	ba7ef9e643	doc: update PostgreSQL documentation links "/static/" path element no longer required.	2019-01-15 12:45:33 +09:00
Ian Barwick	40e94635b2	doc: fix typo in repmgr.conf.sample	2018-10-08 09:36:28 +09:00
Ian Barwick	11d25e2aef	Add configuration parameter "repmgr_bindir" This is to facilitate remote invocation of repmgr when the repmgr binary is located somewhere other than the PostgreSQL binary directory, as it cannot be assumed all package maintainers will install repmgr there. This parameter is optional; if not set (the default), repmgr will fall back to "pg_bindir" (if set). Addresses GitHub #246.	2018-10-02 09:59:12 +09:00
Ian Barwick	38e3aae053	repmgr: add parameter "shutdown_check_timeout" Previously, "repmgr standby switchover" used the configuration file parameters "reconnect_interval" and "reconnect_attempts" to define a timeout to determine whether the current primary (demotion candidate) has shut down. However, these parameters are intended for primary failure detection and are generally lower in value, while a controlled shutdown may take longer, resulting in the switchover being aborted as repmgr was not waiting long enough. To prevent this happening, parameter "shutdown_check_timeout" has been added. This complements the existing "standby_reconnect_timeout" parameter used by "repmgr standby switchover". Implements GitHub #504.	2018-09-25 11:34:06 +09:00
Ian Barwick	80bef0eb28	doc: minor fixes to "repmgr.conf.sample"	2018-09-25 10:53:24 +09:00
Ian Barwick	f8667c1aac	doc: better explain where pg_bindir won't be applied Basically any setting which can contain a user-defined script must have the full path set, even if it's repmgr being executed. We could potentially apply some heuristics to detect if the first item in the setting is "repmgr" (or more precisely repmgrd's program name), but this will require some careful thought and testing that it works as intended.	2018-08-14 09:54:27 +09:00
Ian Barwick	63242e2277	doc: update documentation of "promote_command" and "service_promote_command" The documentation implied it would override "promote_command", which is not the case. "promote_command" is used by repmgrd to execute "repmgr standby promote" (either directly or via a custom script). "service_promote_command" can be set to specify a package-level service command to promote the local PostgreSQL instance from standby to primary, e.g. Debian's pg_ctlcluster. If set, this will be executed by "repmgr standby promote". Also update code comments to clarify usage. Related to GitHub #473.	2018-07-16 14:43:53 +09:00
Ian Barwick	8b059bc9b0	Change default for "log_level" to INFO Default was previously NOTICE (as in repmgr 3.x) but documentation implied it was INFO, and many of the the documentation examples assume it is. This produces some quite informative log output, without creating excessive log file volume. In particular it's useful to get a better idea of what repmgrd is actually doing. Also add documentation section for the log configuration parameters. GitHub #470, containing change suggested in GitHub #467.	2018-07-12 14:50:48 +09:00
Greg Clough	ff16d3b3bb	Fixed typo in repmgr.conf.sample, "priority" Fixed typo in repmgr.conf.sample, "priority"	2018-06-29 22:00:09 +01:00
Ian Barwick	8d636690bd	repmgrd: create pid file by default Traditionally repmgrd will only write a pidfile if explicitly requested with -p/--pid-file. However it's normally desirable to have a pidfile, and it's preferable to have one used by default to prevent accidentally starting a second repmgrd instance. Following changes made: - add configuration file parameter "repmgrd_pid_file" (initially overridden by -p/--pid-file for backwards compatibility, though eventually we'll want to drop -p/--pid-file altogether) - add command line option --no-pid-file - if neither "repmgrd_pid_file" nor -p/--pid-file is set, create the pid file in a temporary directory Implements GitHub #457.	2018-06-29 14:36:24 +09:00
Ian Barwick	b2081dca52	De-overload configuration file parameter "standby_reconnect_timeout" Currently the (very generic sounding) "standby_reconnect_timeout" configuration file parameter is used in several different contexts and it would be useful to have more granular control over the different timeouts it's used to configure. This patch introduces "node_rejoin_timeout", used in place of "standby_reconnect_timeout" (which wasn't documented) when "repmgr node rejoin" is executed, to determine how long to wait for the node to rejoin the replication cluster. Additionally "repmgrd_standby_startup_timeout" is introduced as a timeout for failover situations, when repmgrd executes "repmgr standby follow" to follow a new primary, and waits for the standby to restart and become available for connections. "standby_reconnect_timeout" is now only relevant for "repmgr standby switchover". Implements GitHub #454.	2018-06-28 18:00:55 +09:00
Ian Barwick	efc388065e	standby follow: check node has connect to new primary After restarting the standby, poll pg_stat_replication on the upstream until the standby connects, and exit with an error if it doesn't by the timeout defined in "standby_follow_timeout". Implments GitHub #444.	2018-06-07 15:04:45 +09:00
Ian Barwick	9c0c1b663e	Minor documentation fixes	2018-05-10 10:25:29 +09:00

1 2

92 Commits