repmgr

mirror of https://github.com/EnterpriseDB/repmgr.git synced 2026-03-23 07:06:30 +00:00

Author	SHA1	Message	Date
Ian Barwick	931da14df1	Rename some "repmgr daemon ..." commands to "repmgr service ..." "repmgr daemon" can be interpreted to mean the commands affect the local daemon process only. Rename the commands which affect the entire cluster to "repmgr service ...". The "repmgr daemon ..." form of the affected commands is retained for backwards compatibility.	2019-08-28 14:58:11 +09:00
Ian Barwick	01852f7e3a	doc: improve repmgr.conf settings documentation	2019-06-07 12:48:36 +09:00
Ian Barwick	36a09a5c4b	doc: improve configuration documentation	2019-06-07 12:16:04 +09:00
Ian Barwick	5a90513878	repmgrd: monitor standbys attached to primary This functionality enables repmgrd (when running on the primary) to monitor connected child nodes. It will log connections and disconnections and generate events. Additionally, repmgrd can execute a custom script if the number of connected child nodes falls below a configurable threshold. This script can be used e.g. to "fence" the primary following a failover situation where a new primary has been promoted and all standbys are now child nodes of that primary.	2019-04-22 16:18:52 +09:00
Ian Barwick	c338bc9c5e	doc: add note about BDR replication type in sample config	2019-04-05 14:37:49 +09:00
Ian Barwick	e23f5afc5f	doc: note valid characters for "node_name" "node_name" will be used as "application_name", so should only contain characters valid for that; see: https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-APPLICATION-NAME Not yet enforced.	2019-03-28 10:53:43 +09:00
Ian Barwick	ba1f05ece9	Restrict "node_name" to maximum 63 characters In "recovery.conf", the configuration parameter "node_name" is used as the "application_name" value, which will be truncated by PostgreSQL to 63 characters (NAMEDATALEN - 1). repmgr sometimes needs to be able to extract the application name from pg_stat_replication to determine if a node is connected (e.g. when executing "repmgr standby register"), so the comparison will fail if "node_name" exceeds 63 characters.	2019-03-28 10:37:57 +09:00
Ian Barwick	fbdf9617fa	doc: update repmgrd example output	2019-03-15 15:43:11 +09:00
Ian Barwick	9dd87dd5ce	doc: add explanation of the configuration file format	2019-03-15 14:02:42 +09:00
Ian Barwick	dd6ece326f	doc: update repmgrd configuration documentation	2019-03-13 13:34:08 +09:00
Ian Barwick	fc397f25f6	repmgrd: enable election rerun If "failover_validation_command" is set, and the command returns an error, rerun the election. There is a pause between reruns to avoid "churn"; the length of this pause is controlled by the configuration parameter "election_rerun_interval".	2019-03-12 17:12:19 +09:00
Ian Barwick	2a8f8d8400	doc: expand repmgrd configuration section	2019-03-11 14:50:33 +09:00
Ian Barwick	33fefd9f52	Add configuration option "primary_visibility_consensus" This determines whether repmgrd should continue with a failover if one or more nodes report they can still see the primary.	2019-03-07 10:41:42 +09:00
Ian Barwick	a3f90d2bba	Add configuration option "sibling_nodes_disconnect_timeout" This controls the maximum length of time in seconds that repmgrd will wait for other standbys to disconnect their WAL receivers in a failover situation. This setting is only used when "standby_disconnect_on_failover" is set to "true".	2019-03-06 15:56:21 +09:00
Ian Barwick	63f7ad546e	repmgrd: add option "connection_check_type" This enables selection of the method repmgrd uses to check whether the upstream node is available. Possible values are: - "ping" (default): uses PQping() to check server availability - "connection": executes a query on the connection to check server availability (similar to repmgr3.x).	2019-03-06 12:09:54 +09:00
Ian Barwick	a41e7bb726	doc: various minor updates	2019-02-01 17:24:32 +09:00
Ian Barwick	9273e7af73	"standby switchover": avoid potential race condition with WAL location check Immediately after the demotion candidate (primary) has shut down, we can't be absolutely sure that the walreceiver has flushed all WAL to disk, so checking pg_last_wal_receive_lsn() at that point might not reflect the actual last available WAL location. To handle this, we'll loop for a while (timeout controlled by configuration parameter "wal_receive_check_timeout") before finally deciding whether the standby is still behind the shut-down primary. Addresses issue raised in GitHub #518.	2019-02-01 12:06:22 +09:00
Ian Barwick	32b81e7d49	"daemon start": initial implementation	2019-01-29 13:01:14 +09:00
Ian Barwick	ba7ef9e643	doc: update PostgreSQL documentation links "/static/" path element no longer required.	2019-01-15 12:45:33 +09:00
Ian Barwick	40e94635b2	doc: fix typo in repmgr.conf.sample	2018-10-08 09:36:28 +09:00
Ian Barwick	11d25e2aef	Add configuration parameter "repmgr_bindir" This is to facilitate remote invocation of repmgr when the repmgr binary is located somewhere other than the PostgreSQL binary directory, as it cannot be assumed all package maintainers will install repmgr there. This parameter is optional; if not set (the default), repmgr will fall back to "pg_bindir" (if set). Addresses GitHub #246.	2018-10-02 09:59:12 +09:00
Ian Barwick	38e3aae053	repmgr: add parameter "shutdown_check_timeout" Previously, "repmgr standby switchover" used the configuration file parameters "reconnect_interval" and "reconnect_attempts" to define a timeout to determine whether the current primary (demotion candidate) has shut down. However, these parameters are intended for primary failure detection and are generally lower in value, while a controlled shutdown may take longer, resulting in the switchover being aborted as repmgr was not waiting long enough. To prevent this happening, parameter "shutdown_check_timeout" has been added. This complements the existing "standby_reconnect_timeout" parameter used by "repmgr standby switchover". Implements GitHub #504.	2018-09-25 11:34:06 +09:00
Ian Barwick	80bef0eb28	doc: minor fixes to "repmgr.conf.sample"	2018-09-25 10:53:24 +09:00
Ian Barwick	f8667c1aac	doc: better explain where pg_bindir won't be applied Basically any setting which can contain a user-defined script must have the full path set, even if it's repmgr being executed. We could potentially apply some heuristics to detect if the first item in the setting is "repmgr" (or more precisely repmgrd's program name), but this will require some careful thought and testing that it works as intended.	2018-08-14 09:54:27 +09:00
Ian Barwick	63242e2277	doc: update documentation of "promote_command" and "service_promote_command" The documentation implied it would override "promote_command", which is not the case. "promote_command" is used by repmgrd to execute "repmgr standby promote" (either directly or via a custom script). "service_promote_command" can be set to specify a package-level service command to promote the local PostgreSQL instance from standby to primary, e.g. Debian's pg_ctlcluster. If set, this will be executed by "repmgr standby promote". Also update code comments to clarify usage. Related to GitHub #473.	2018-07-16 14:43:53 +09:00
Ian Barwick	8b059bc9b0	Change default for "log_level" to INFO Default was previously NOTICE (as in repmgr 3.x) but documentation implied it was INFO, and many of the documentation examples assume it is. This produces some quite informative log output, without creating excessive log file volume. In particular it's useful to get a better idea of what repmgrd is actually doing. Also add documentation section for the log configuration parameters. GitHub #470, containing change suggested in GitHub #467.	2018-07-12 14:50:48 +09:00
Greg Clough	ff16d3b3bb	Fixed typo in repmgr.conf.sample, "priority"	2018-06-29 22:00:09 +01:00
Ian Barwick	8d636690bd	repmgrd: create pid file by default Traditionally repmgrd will only write a pidfile if explicitly requested with -p/--pid-file. However it's normally desirable to have a pidfile, and it's preferable to have one used by default to prevent accidentally starting a second repmgrd instance. Following changes made: - add configuration file parameter "repmgrd_pid_file" (initially overridden by -p/--pid-file for backwards compatibility, though eventually we'll want to drop -p/--pid-file altogether) - add command line option --no-pid-file - if neither "repmgrd_pid_file" nor -p/--pid-file is set, create the pid file in a temporary directory Implements GitHub #457.	2018-06-29 14:36:24 +09:00
Ian Barwick	b2081dca52	De-overload configuration file parameter "standby_reconnect_timeout" Currently the (very generic sounding) "standby_reconnect_timeout" configuration file parameter is used in several different contexts and it would be useful to have more granular control over the different timeouts it's used to configure. This patch introduces "node_rejoin_timeout", used in place of "standby_reconnect_timeout" (which wasn't documented) when "repmgr node rejoin" is executed, to determine how long to wait for the node to rejoin the replication cluster. Additionally "repmgrd_standby_startup_timeout" is introduced as a timeout for failover situations, when repmgrd executes "repmgr standby follow" to follow a new primary, and waits for the standby to restart and become available for connections. "standby_reconnect_timeout" is now only relevant for "repmgr standby switchover". Implements GitHub #454.	2018-06-28 18:00:55 +09:00
Ian Barwick	efc388065e	standby follow: check node has connected to new primary After restarting the standby, poll pg_stat_replication on the upstream until the standby connects, and exit with an error if it doesn't by the timeout defined in "standby_follow_timeout". Implements GitHub #444.	2018-06-07 15:04:45 +09:00
Ian Barwick	9c0c1b663e	Minor documentation fixes	2018-05-10 10:25:29 +09:00
Ian Barwick	8320179f34	Add configuration file parameter "config_directory" This enables explicit provision of an external configuration file directory, which if set will be passed to "pg_ctl" as the -D parameter. Otherwise "pg_ctl" will default to using the data directory, which will cause some operations to fail if the configuration files are not present there. Note this is implemented primarily for feature completeness and for development/testing purposes. Users who have installed "repmgr" from a package should not rely on "pg_ctl" to stop/start/restart PostgreSQL, instead they should set the appropriate "service_..._command" for their operating system. For more details see: https://repmgr.org/docs/4.0/configuration-service-commands.html Note: in a future release, the presence of "config_directory" in repmgr.conf will be used to implicitly set "--copy-external-config-files=samepath" when cloning a standby; this is a behaviour change so will be implemented in the next major release (repmgr 4.1). Implements GitHub #424.	2018-04-25 11:58:24 +09:00
Ian Barwick	09b8a86605	doc: improve configuration documentation With special attention to setting service commands, and extra special mention of "pg_ctlcluster" for Debian/Ubuntu users.	2018-04-20 10:15:18 +09:00
Ian Barwick	dfdebd6c08	Enable provision of "archive_cleanup_command" in recovery.conf If "archive_cleanup_command" is defined in "repmgr.conf", a corresponding entry will be made in the node's "recovery.conf" file after cloning a standby. Note that we recommend using PgBarman to manage WAL archives, but are providing this facility to help repmgr to be integrated in existing environments. Implements GitHub #416.	2018-04-03 14:10:21 +09:00
Ian Barwick	63a11f8926	"standby promote": make timeout values configurable This introduces following new configuration file parameters, which were previously hard-coded values: - promote_check_timeout - promote_check_interval Implements GitHub #387.	2018-04-03 14:10:14 +09:00
Ian Barwick	55441f2729	repmgrd: add configuration file parameter "standby_reconnect_timeout" This is used for determining a timeout when reconnecting to the standby after executing the "follow_command". This will normally not need to be set explicitly, but may be useful in cases where the standby's startup phase can last longer than usual.	2018-03-02 11:04:56 +09:00
Ian Barwick	5719a0dfd3	Update repmgr.conf.sample Add missing parameter "monitor_interval_secs"	2018-02-12 11:38:22 +09:00
Ian Barwick	c47f976bde	repmgr.conf.sample: fix command line argument "repmgr node check --archive-ready" is correct, however abbreviated versions will be accepted by getopt_long() if they don't match or partially match any other options. Per report by "chaintng" in GitHub #355.	2017-12-27 09:39:14 +09:00
Martín Marqués	f58954b3be	Switch spaces for tabs in repmgr.conf sample file. This makes comments stay aligned in most cases the conf file is modified, and when indentation changes, it's easy to re-align (by removing or adding a tab) Signed-off-by: Martín Marqués <martin.marques@2ndquadrant.com>	2017-12-14 07:00:05 -03:00
Ian Barwick	8b78b7292d	docs: add note about "service_promote_command" in repmgr.conf.sample It must never contain "repmgr standby promote", as it is intended to enable use of package-level promote commands such as Debian's "pg_ctlcluster promote". Addresses GitHub #336.	2017-11-20 12:29:47 +09:00
Ian Barwick	a6cc4d80f0	Add "witness register" functionality	2017-11-15 13:47:45 +09:00
Ian Barwick	eb14bb58c6	Add configuration file "passfile" This will enable a custom .pgpass to be included in "primary_conninfo" (provided it's supported by the libpq version on the standby).	2017-11-14 19:30:25 +09:00
Ian Barwick	97471626b4	Update repmgr.conf.sample	2017-11-02 17:43:03 +09:00
Ian Barwick	7c3abe28b9	Standardize terminology on "primary" (in place of "master")	2017-10-24 13:42:50 +09:00
Ian Barwick	34ee16899e	doc: add missing entry for "priority" in repmgr.conf.sample Per report from Shaun Thomas.	2017-10-19 13:14:52 +09:00
Ian Barwick	55f203a2fc	Add "-o ConnectTimeout=10" as default in "ssh_options"	2017-09-13 13:23:16 +09:00
Gianni Ciolli	6d63c0f941	Small clarification on sudo-based configuration (#1) Now we are more explicit on what we recommend for the various service_X_command settings when using sudo. Signed-off-by: Gianni Ciolli <gianni.ciolli@2ndQuadrant.com>	2017-09-06 20:32:54 +01:00
Ian Barwick	e21a3ef7ec	Fix typo	2017-09-06 09:31:16 +09:00
Ian Barwick	78e6bdeebe	Have repmgrd parse "standby follow --upstream-node-id=%n"	2017-09-04 13:42:50 +09:00
Ian Barwick	1517c06bb1	Document "replication_user" configuration file parameter.	2017-08-31 17:29:09 +09:00
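
Commit ba1f05ece9 above restricts "node_name" to 63 characters because PostgreSQL truncates "application_name" to NAMEDATALEN - 1. A minimal sketch of that check follows. The helper name `node_name_fits` is hypothetical and not part of repmgr. The sketch assumes PostgreSQL's default NAMEDATALEN of 64 and assumes the limit is counted in bytes of the UTF-8 encoded name.

```python
# Hypothetical helper, not part of repmgr: illustrates the limit
# described in commit ba1f05ece9.

NAMEDATALEN = 64                     # assumed: PostgreSQL's default build value
MAX_NODE_NAME_LEN = NAMEDATALEN - 1  # 63, the limit named in the commit


def node_name_fits(node_name: str) -> bool:
    """Return True if node_name is non-empty and within the 63-byte limit.

    Assumption: the length is measured on the UTF-8 encoded name, so
    multi-byte characters count for more than one.
    """
    encoded_length = len(node_name.encode("utf-8"))
    return 0 < encoded_length <= MAX_NODE_NAME_LEN


if __name__ == "__main__":
    for candidate in ("node1", "n" * 63, "n" * 64):
        print(len(candidate), node_name_fits(candidate))
```

A name that fails this check would be truncated when used as "application_name", so a comparison against pg_stat_replication (as in "repmgr standby register") would not match.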

73 Commits