repmgr

mirror of https://github.com/EnterpriseDB/repmgr.git synced 2026-03-23 15:16:29 +00:00

Author	SHA1	Message	Date
Ian Barwick	1e1b4b1a65	"standby register/follow": provide primary node details for event notifications For events generated by these commands, it may be useful to know details of the primary node. This makes following additional parameters available to event notification scripts: - %p: node ID of the primary - %a: node name of the primary - %c: conninfo string for the primary Implements GitHub #375	2018-04-03 14:32:19 +09:00
Ian Barwick	3ccf1cf182	Enable pg_rewind to be used with PostgreSQL 9.3/9.4 pg_rewind is not part of the core distribution for those, but we provided support in repmgr 3.3 so should extend it to repmgr 4. Note that there is no check in place whether the pg_rewind binary exists, so it's up to the user to ensure it's present. Addresses GitHub #413.	2018-04-02 20:54:29 +09:00
Ian Barwick	a403da67bc	Consolidate connection closure calls	2018-03-27 16:43:59 +09:00
Ian Barwick	0219f4c91f	Always set "connect_timeout" when pinging a PostgreSQL instance Insert "connect_timeout=2" into the connection parameters, if not explicitly set by the user. This will prevent excessive wait time for the host operating system to report a connection timeout.	2018-03-21 11:48:57 +09:00
Ian Barwick	d7702b3444	Correctly handle error message pointer when parsing strings. When parsing conninfo strings, ensure the error message pointer is actually returned to the caller. Not a criticial issue, just meant the contents of the error message were not being displayed.	2018-03-10 14:29:12 +09:00
Ian Barwick	9981ede1af	"standby clone": fix --superuser handling get_superuser_connection() was erroneously using the local node record to connect to as a superuser, which works when registering the primary but obviously not when cloning a standby. Addresses GitHub #380.	2018-03-02 16:43:19 +09:00
Ian Barwick	29cb153643	"node status": improve replication slot warnings Addresses GitHub #385	2018-02-23 11:19:33 +09:00
Ian Barwick	c644ddde51	Fix typo in function name	2018-02-22 15:50:57 +09:00
Ian Barwick	22b3a74fa0	repmgrd: improve detection of status change from primary to standby If repmgrd is running in degraded mode on a primary which has been stopped, then manually been brought back online as a standby (e.g. by creating recovery.conf and starting the server), ensure it not only detects the change but automatically updates the node record so it can resume monitoring the node as a standby. Previously, repmgrd was looping waiting for the record to be updated (as is done transparently when executing "repmgr node rejoin") but if the record was not updated within the timeout period (e.g. by "repmgr standby register) it would fail to resume monitoring as a standby. It seems reasonable to have repmgrd automatically update the node record, as this will restore failover capability as quickly as possible. If this is not desired, then the onus is on the user to shut down repmgrd while making the desired changes.	2018-02-22 15:50:45 +09:00
Ian Barwick	6b7f6089ba	"node status": add warning about missing replication slots Implements GitHub #364.	2018-02-12 11:38:27 +09:00
Ian Barwick	927bf038a0	"standby switchover": check demotion candidate can make replication connection Check it's actually possible for the demotion candidate to attach to the promotion candidate before executing the switchover. As with other checks of this nature, there's a faint possibility the situation could change between the time the check is carried out and the demotion candidate is restarted to connect to the promotion candidate, but there's not a lot we can do about that. The main purpose is to be able to catch existing misconfigurations before anything gets changed. Implements GitHub #370.	2018-02-09 10:00:54 +09:00
Ian Barwick	657ed83921	"cluster show": improve handling of database errors In particular, if running "repmgr cluster show" against a database without the repmgr metadata, showing the error (rather than just "no records found" etc.) will provide some clues about the problem.	2018-02-05 10:35:56 +09:00
Ian Barwick	6c81e54f76	"standby follow": check for replication slot availability on target node	2018-02-02 17:18:43 +09:00
Ian Barwick	8fd0c4ad83	repmgr: assume node is actually shutting down if pingable and that's the reported status	2018-01-12 21:53:37 +09:00
Ian Barwick	7ccae6c2b1	repmgr: automatically create slot name if missing It's possible that a node was registered with "use_replication_slots=false" but that was later changed to "use_replication_slots=true". If the node was not subsequently re-registered, the node record will contain an empty slot name, which will cause any slot creation operation during "standby follow" or "node rejoin" to fail. To prevent this happening, check for an empty slot name and automatically set before proceeding. Addresses GitHub #343.	2018-01-11 14:47:50 +09:00
Ian Barwick	61d46172b9	repmgr: catch possible corner case when checking node shutdown status It's conceivable that PQping is returning "no response" but the shutdown hasn't quite completed.	2018-01-10 15:09:21 +09:00
Ian Barwick	810471b2f2	repmgr: during switchover, correctly detect unclean shutdown status	2018-01-10 12:25:16 +09:00
Ian Barwick	5bd8cf958a	repmgr standby switchover: add "%p" event notification parameter This will contain the node ID of the former primary.	2018-01-10 12:25:12 +09:00
Ian Barwick	fcb7e7a29b	"repmgr bdr register": create missing connection replication set if needed Previously the assumption was that the "repmgr" replication set would be set up when the nodes are created, however no checks were implemented and this was not well-documented. Addresses GitHub #347.	2018-01-04 17:46:49 +09:00
Ian Barwick	26e404b1f3	"repmgr bdr register": improve node name check We'll use "bdr.bdr_get_local_node_name()" to check the local BDR node name and the repmgr one match.	2018-01-04 17:46:44 +09:00
Ian Barwick	cad12b1fb7	"repmgr cluster event": move query to dbutils.c	2018-01-04 14:55:46 +09:00
Ian Barwick	26a9e848fd	Update copyright notices to 2018	2018-01-02 10:19:46 +09:00
Ian Barwick	472d703d2e	repmgr: initialise "voting_term" in "repmgr primary register" This previously happened in the extension SQL code, which could potentially cause replay problems if installing on a BDR cluster. As this table is only required for streaming replication failover, move the initialisation to "repmgr primary register". Addresses GitHub #344 .	2017-11-28 11:08:12 +09:00
Ian Barwick	8c422d6084	Remove unneeded functions	2017-11-20 15:18:21 +09:00
Ian Barwick	9165d27f9f	"repmgr node ...": fixes for 9.3 Mainly to account for the lack of replication slots.	2017-11-16 11:25:16 +09:00
Ian Barwick	a6cc4d80f0	Add "witness register" functionality	2017-11-15 13:47:45 +09:00
Ian Barwick	eb14bb58c6	Add configuration file "passfile" This will enable a custom .pgpass to be included in "primary_conninfo" (provided it's supported by the libpq version on the standby).	2017-11-14 19:30:25 +09:00
Ian Barwick	aa089820ab	repmgrd: check shared library is loaded If this isn't the case, "repmgrd" will appear to run but not handle failover correctly. Address GitHub #337.	2017-11-10 14:35:17 +09:00
Ian Barwick	4ca7e6a6bf	repmgrd: remove unneeded functions	2017-11-09 19:31:08 +09:00
Ian Barwick	79d21b516b	repmgrd: fixes to failover handling get_new_primary() returns NULL if no notification for the new primary has been received, but the code was expecting it to return UNKNOWN_NODE_ID, which was causing repmgrd to prematurely drop out of the new primary detection loop if no notification had been received by the time the loop started. Also store the electoral term as a single row, single column table, to ensure that all repmgrds see the same turn. It is then bumped by the winning node after it gets promoted. Various logging improvements.	2017-11-08 14:28:08 +09:00
Ian Barwick	b6b31b15b2	Implement "repmgr cluster cleanup"	2017-09-11 13:48:46 +09:00
Ian Barwick	a9f4a027a7	pgindent run	2017-09-11 11:14:13 +09:00
Ian Barwick	e4f7dc8234	Add copyright notices	2017-09-08 13:27:39 +09:00
Ian Barwick	edee80cc37	Rename option "node check --is-shutdown" to "--is-shutdown-cleanly" As that's what we really want to know. Also return "UNCLEAN_SHUTDOWN" if that's the case, rather than "RUNNING" which is confusing, even though it's a command for internal use.	2017-09-07 11:15:27 +09:00
Ian Barwick	fcd111ac4c	Improve logging output during failover process	2017-08-24 22:44:03 +09:00
Ian Barwick	eee8d65259	Update view "replication_status"	2017-08-24 15:05:13 +09:00
Ian Barwick	a659132ea4	repmgrd: write monitoring statistics	2017-08-24 11:49:44 +09:00
Ian Barwick	4c0d719cdb	Add replication slot check to "repmgr node check"	2017-08-16 11:17:02 +09:00
Ian Barwick	554673e83e	Add "repmgr node check --downstream"	2017-08-15 15:50:46 +09:00
Ian Barwick	10ef30096c	"node check": add server role check	2017-08-14 22:57:09 +09:00
Ian Barwick	3b2158edbf	Initialise variables, where appropriate	2017-08-14 15:11:42 +09:00
Ian Barwick	eabd56f3be	"standby follow": check node system identifiers match	2017-08-14 11:45:08 +09:00
Ian Barwick	0f31756733	General code cleanup	2017-08-14 10:04:53 +09:00
Ian Barwick	b95b3e50e3	Return system identification information with appropriate data types	2017-08-14 08:50:54 +09:00
Ian Barwick	50b82f785e	Add function to execute "IDENTIFY_SYSTEM"	2017-08-11 22:01:02 +09:00
Ian Barwick	8a50a72dc5	Additional "node status" output	2017-08-10 17:18:08 +09:00
Ian Barwick	4f2161bd83	Cleanup various #defines	2017-08-10 15:11:53 +09:00
Ian Barwick	7ca68b7cc8	Standardize "primary_conninfo" generation Previously repmgr would write all the default libpq parameters into "primary_conninfo" on "standby clone", but not for "standby follow", which is inconsistent. For repmgr4 we'll determine that the upstream node's conninfo must be canonical and contain all required connection parameters, even if these are available as defaults or environment variables in the local environment, as those are transient and may not be available in all environments/situations. recovery.conf's "primary_conninfo" will be generated using the upstream's conninfo parameters, except for those specific to the downstream node. These are: - "application_name": this will always be set to the "node_name" of the downstream node - "passfile" and "servicefile": these, must of course reference files on the downstream node so will be extracted from the downstream node's conninfo, if set	2017-08-10 12:37:50 +09:00
Ian Barwick	1d99a07b43	Store configuration file in repmgr.nodes table When executing repmgr on remote nodes, we otherwise end up jumping through hoops as we can't make assumptions about where the configuration file is located, but really need to be able to provide it. From a support point of view it will also make life easier as it will be easy to specify exactly which file to provide.	2017-08-10 08:03:24 +09:00
Ian Barwick	a57fb5b50c	After switchover, enable sibling standbys to follow new primary	2017-08-10 00:06:16 +09:00

1 2 3

121 Commits