repmgr

mirror of https://github.com/EnterpriseDB/repmgr.git synced 2026-03-23 15:16:29 +00:00

Author	SHA1	Message	Date
Ian Barwick	11d25e2aef	Add configuration parameter "repmgr_bindir" This is to facilitate remote invocation of repmgr when the repmgr binary is located somewhere other than the PostgreSQL binary directory, as it cannot be assumed all package maintainers will install repmgr there. This parameter is optional; if not set (the default), repmgr will fall back to "pg_bindir" (if set). Addresses GitHub #246.	2018-10-02 09:59:12 +09:00
Ian Barwick	38e3aae053	repmgr: add parameter "shutdown_check_timeout" Previously, "repmgr standby switchover" used the configuration file parameters "reconnect_interval" and "reconnect_attempts" to define a timeout to determine whether the current primary (demotion candidate) has shut down. However, these parameters are intended for primary failure detection and are generally lower in value, while a controlled shutdown may take longer, resulting in the switchover being aborted as repmgr was not waiting long enough. To prevent this happening, parameter "shutdown_check_timeout" has been added. This complements the existing "standby_reconnect_timeout" parameter used by "repmgr standby switchover". Implements GitHub #504.	2018-09-25 11:34:06 +09:00
Ian Barwick	80bef0eb28	doc: minor fixes to "repmgr.conf.sample"	2018-09-25 10:53:24 +09:00
Ian Barwick	f8667c1aac	doc: better explain where pg_bindir won't be applied Basically any setting which can contain a user-defined script must have the full path set, even if it's repmgr being executed. We could potentially apply some heuristics to detect if the first item in the setting is "repmgr" (or more precisely repmgrd's program name), but this will require some careful thought and testing that it works as intended.	2018-08-14 09:54:27 +09:00
Ian Barwick	63242e2277	doc: update documentation of "promote_command" and "service_promote_command" The documentation implied it would override "promote_command", which is not the case. "promote_command" is used by repmgrd to execute "repmgr standby promote" (either directly or via a custom script). "service_promote_command" can be set to specify a package-level service command to promote the local PostgreSQL instance from standby to primary, e.g. Debian's pg_ctlcluster. If set, this will be executed by "repmgr standby promote". Also update code comments to clarify usage. Related to GitHub #473.	2018-07-16 14:43:53 +09:00
Ian Barwick	8b059bc9b0	Change default for "log_level" to INFO Default was previously NOTICE (as in repmgr 3.x) but documentation implied it was INFO, and many of the the documentation examples assume it is. This produces some quite informative log output, without creating excessive log file volume. In particular it's useful to get a better idea of what repmgrd is actually doing. Also add documentation section for the log configuration parameters. GitHub #470, containing change suggested in GitHub #467.	2018-07-12 14:50:48 +09:00
Greg Clough	ff16d3b3bb	Fixed typo in repmgr.conf.sample, "priority" Fixed typo in repmgr.conf.sample, "priority"	2018-06-29 22:00:09 +01:00
Ian Barwick	8d636690bd	repmgrd: create pid file by default Traditionally repmgrd will only write a pidfile if explicitly requested with -p/--pid-file. However it's normally desirable to have a pidfile, and it's preferable to have one used by default to prevent accidentally starting a second repmgrd instance. Following changes made: - add configuration file parameter "repmgrd_pid_file" (initially overridden by -p/--pid-file for backwards compatibility, though eventually we'll want to drop -p/--pid-file altogether) - add command line option --no-pid-file - if neither "repmgrd_pid_file" nor -p/--pid-file is set, create the pid file in a temporary directory Implements GitHub #457.	2018-06-29 14:36:24 +09:00
Ian Barwick	b2081dca52	De-overload configuration file parameter "standby_reconnect_timeout" Currently the (very generic sounding) "standby_reconnect_timeout" configuration file parameter is used in several different contexts and it would be useful to have more granular control over the different timeouts it's used to configure. This patch introduces "node_rejoin_timeout", used in place of "standby_reconnect_timeout" (which wasn't documented) when "repmgr node rejoin" is executed, to determine how long to wait for the node to rejoin the replication cluster. Additionally "repmgrd_standby_startup_timeout" is introduced as a timeout for failover situations, when repmgrd executes "repmgr standby follow" to follow a new primary, and waits for the standby to restart and become available for connections. "standby_reconnect_timeout" is now only relevant for "repmgr standby switchover". Implements GitHub #454.	2018-06-28 18:00:55 +09:00
Ian Barwick	efc388065e	standby follow: check node has connect to new primary After restarting the standby, poll pg_stat_replication on the upstream until the standby connects, and exit with an error if it doesn't by the timeout defined in "standby_follow_timeout". Implments GitHub #444.	2018-06-07 15:04:45 +09:00
Ian Barwick	9c0c1b663e	Minor documentation fixes	2018-05-10 10:25:29 +09:00
Ian Barwick	8320179f34	Add configuration file parameter "config_directory" This enables explicit provision of an external configuration file directory, which if set will be passed to "pg_ctl" as the -D parameter. Otherwise "pg_ctl" will default to using the data directory, which will cause some operations to fail if the configuration files are not present there. Note this is implemented primarily for feature completeness and for development/testing purposes. Users who have installed "repmgr" from a package should not rely on "pg_ctl" to stop/start/restart PostgreSQL, instead they should set the appropriate "service_..._command" for their operating system. For more details see: https://repmgr.org/docs/4.0/configuration-service-commands.html Note: in a future release, the presence of "config_directory" in repmgr.conf will be used to implictly set "--copy-external-config-files=samepath" when cloning a standby; this is a behaviour change so will be implemented in the next major realease (repmgr 4.1). Implements GitHub #424.	2018-04-25 11:58:24 +09:00
Ian Barwick	09b8a86605	doc: improve configuration documentation With special attention to setting service commands, and extra special mention of "pg_ctlcluster" for Debian/Ubuntu users.	2018-04-20 10:15:18 +09:00
Ian Barwick	dfdebd6c08	Enable provision of "archive_cleanup_command" in recovery.conf If "archive_cleanup_command" is defined in "repmgr.conf", a corresponding entry will be made in the node's "recovery.conf" file after cloning a standby. Note that we recommend using PgBarman to manage WAL archives, but are providing this facility to help repmgr to be integrated in existing environments. Implements GitHub #416.	2018-04-03 14:10:21 +09:00
Ian Barwick	63a11f8926	"standby promote": make timeout values configurable This introduces following new configuration file parameters, which were previously hard-coded values: - promote_check_timeout - promote_check_interval Implements GitHub #387.	2018-04-03 14:10:14 +09:00
Ian Barwick	55441f2729	repmgrd: add configuration file parameter "standby_reconnect_timeout" This is used for determining a timeout when reconnecting to the standby after executing the "follow_command". This will normally not need to be set explicitly, but maybe useful in cases where the standby's startup phase can last longer than usual.	2018-03-02 11:04:56 +09:00
Ian Barwick	5719a0dfd3	Update repmgr.conf.sample Add missing parameter "monitor_interval_secs"	2018-02-12 11:38:22 +09:00
Ian Barwick	c47f976bde	repmgr.conf.sample: fix command line argument "repmgr node check --archive-ready" is correct, however abbreviated versions will be accepted by getopt_long() if they don't match or partially match any other options. Per report by "chaintng" in GitHub #355.	2017-12-27 09:39:14 +09:00
Martín Marqués	f58954b3be	Switch spaces for tabs in repmgr.conf sample file. This makes comments stay aligned in most cases the conf file is modified, and when indentation changes, it's easy to re-align (by removing or adding a tab) Signed-off-by: Martín Marqués <martin.marques@2ndquadrant.com>	2017-12-14 07:00:05 -03:00
Ian Barwick	8b78b7292d	docs: add note about "service_promote_command" in repmgr.conf.sample It must never contain "repmgr standby promote", as it is intended to enable use of package-level promote commands such as Debian's "pg_ctlcluster promote". Addresses GitHub #336.	2017-11-20 12:29:47 +09:00
Ian Barwick	a6cc4d80f0	Add "witness register" functionality	2017-11-15 13:47:45 +09:00
Ian Barwick	eb14bb58c6	Add configuration file "passfile" This will enable a custom .pgpass to be included in "primary_conninfo" (provided it's supported by the libpq version on the standby).	2017-11-14 19:30:25 +09:00
Ian Barwick	97471626b4	Update repmgr.conf.sample	2017-11-02 17:43:03 +09:00
Ian Barwick	7c3abe28b9	Standardize terminology on "primary" (in place of "master")	2017-10-24 13:42:50 +09:00
Ian Barwick	34ee16899e	doc: add missing entry for "priority" in repmgr.conf.sample Per report from Shaun Thomas.	2017-10-19 13:14:52 +09:00
Ian Barwick	55f203a2fc	Add "-o ConnectTimeout=10" as default in "ssh_options"	2017-09-13 13:23:16 +09:00
Gianni Ciolli	6d63c0f941	Small clarification on sudo-based configuration (#1 ) Now we are more explicit on what we recommend for the various service_X_command settings when using sudo. Signed-off-by: Gianni Ciolli <gianni.ciolli@2ndQuadrant.com>	2017-09-06 20:32:54 +01:00
Ian Barwick	e21a3ef7ec	Fix typo	2017-09-06 09:31:16 +09:00
Ian Barwick	78e6bdeebe	Have repmgrd parse "standby follow --upstream-node-id=%n"	2017-09-04 13:42:50 +09:00
Ian Barwick	1517c06bb1	Document "replication_user" configuration file parameter.	2017-08-31 17:29:09 +09:00
Ian Barwick	0e0b221507	Add configuration file setting "use_primary_conninfo_password" If, for whatever reason, the upstream server password needs to be set in "primary_conninfo", enable it to be extracted from $PGPASSWORD.	2017-08-31 14:57:07 +09:00
Ian Barwick	df827c6518	Update repmgrd documentation	2017-08-29 11:04:30 +09:00
Ian Barwick	5ee1eb6bf7	Convert --recovery-min-apply-delay to configuration file option That way it only needs to be set once, and won't get lost during follow operations etc.	2017-08-25 21:25:15 +09:00
Ian Barwick	6259463007	repmgrd: various fixes for "manual" failover mode	2017-08-23 10:56:55 +09:00
Ian Barwick	b1ba476241	Rename "archiver" check etc. to "archive-ready" Gives a better indication of what's being checked.	2017-08-17 12:23:56 +09:00
Ian Barwick	f2cf46bba3	Check replication lag before attempting switchover	2017-08-08 10:16:47 +09:00
Ian Barwick	fd5dfa2ebc	Document "archiver_lag_*" configuration settings.	2017-08-08 00:50:12 +09:00
Ian Barwick	112ca6321a	Initial switchover implementation The repmgr3 implementation required the promotion candidate (standby) to directly work with the demotion candidate's data directory, directly execute server control commands etc. Here we delegated a lot more of that work to the repmgr on the demotion candidate, which reduces the amount of back-and-forth over SSH and generally makes things cleaner and smoother. In particular the repmgr on the demotion candidate will carry out a thorough check that the node is shut down and report the last checkpoint LSN to the promotion candidate; this can then be used to determine whether pg_rewind needs to be executed on the demoted primary before reintegrating it back into the cluster (todo). Also implement "--dry-run" for this action, which will sanity-check the nodes as far as possible without executing the switchover. Additionally some of the new repmgr node commands (or command options) introduced for this can be also executed by the user to obtain additional information about the status of each node.	2017-08-03 16:38:37 +09:00
Ian Barwick	c67aa15581	Make "pgdata" a mandatory configuration file setting There are some circumstances, e.g. during switchover operations, where repmgr may need to operate on a data directory while the server isn't running, in which case there's no way to retrieve that information.	2017-08-02 23:04:24 +09:00
Ian Barwick	83cda89362	Get data directory for server commands if needed Also add configuration file option "pgdata" for hard-coding the node's data directory - if the "repmgr" DB user isn't a superuser or doesn't have permission to extract the data directory, we'll need another way of finding out.	2017-08-02 13:16:16 +09:00
Ian Barwick	e5d50bbfd5	Separate configuration file queries into a discrete function Simplifies main application code and makes it easier to reuse the queries.	2017-08-02 00:04:20 +09:00
Ian Barwick	4c2ba42000	Update sample configuration file	2017-07-27 18:10:56 +09:00
Ian Barwick	4cf66c33db	repmgrd: more fixes to BDR recovery handling	2017-07-27 16:33:41 +09:00
Ian Barwick	eff26b496c	repmgrd: updates for BDR monitoring	2017-07-27 09:49:53 +09:00
Ian Barwick	56b2e9bb84	Rename/add configuration file options In previous versions of repmgr, some options had ambiguous meanings, and/or were used for slightly different purposes. This way we end up with a couple more options (most of which probably won't need adjusting) but greater clarity and flexibility. Removed: master_reponse_timeout: renamed to "async_query_timeout", as this was its main usage retry_promote_interval_secs: replaced by "primary_notification_timeout" Added: async_query_timeout: timeout (in seconds) when executing asynchronous queries primary_notification_timeout: number of seconds to wait for notification from the new primary after a failover primary_follow_timeout: number of seconds to wait for the new primary to become available when executing "repmgr standby follow"	2017-07-25 11:13:32 +09:00
Ian Barwick	1a45287e76	Misc updates and fixes	2017-07-20 21:15:55 +09:00
Ian Barwick	e3b3fb65f0	repmgrd: restrict BDR monitoring to two node setup It's not safe to have more than two nodes with this kind of "failover", so we don't need to select alternative nodes by priority.	2017-07-14 12:56:11 +09:00
Ian Barwick	5fbcf3e476	Remove witness server references	2017-07-10 09:31:31 +09:00
Ian Barwick	35df85e67d	repmgrd: improve handling of "degraded monitoring" In some cases, the monitored upstream may not be available for a while (e.g. network split), in which case it makes sense to have repmgrd keep running and trying to reconnect. Previously it would just keel over and quit.	2017-07-06 17:19:55 +09:00
Ian Barwick	582d0ef363	Rename "logxxx" configuration file parameters to "log_xxx" This is more consistent with other parameters and conforms to the pattern used by PostgreSQL itself, which uses the prefix "log_" for logging parameters. A warning will be emitted if the old version of the parameter name is detected.	2017-07-04 23:36:47 +09:00

1 2

53 Commits