repmgr

mirror of https://github.com/EnterpriseDB/repmgr.git synced 2026-03-23 15:16:29 +00:00

Author	SHA1	Message	Date
Ian Barwick	8f62b4c9e6	Update copyright notice to 2016	2016-01-05 15:57:25 +09:00
Ian Barwick	fc75084e42	repmgrd: -v/--verbose option does not require a parameter	2016-01-05 10:50:04 +09:00
Ian Barwick	94579b5f2e	Clean up whitespace and comments	2016-01-04 14:41:15 +09:00
Ian Barwick	e9a25c367a	Prevent invalid replication_lag values being written to the monitoring table A fix for this was introduced with commit `ee9270fe8d` and removed in `4f1c67a1bf`. Refactor the original fix to simply omit attempting to write an invalid entry to the monitoring table.	2016-01-04 14:37:22 +09:00
Martin	ac17033d61	This doesn't really mean the standby s following a new master, so we are removing it. Basically, on startup the standby will start receiving again from the begining of the WAL and so received will be lower then applied. A proper code is needed to make sure the standby is still following the correct master (as per node information)	2016-01-04 14:29:56 +09:00
Martín Marqués	711ad0a76c	Change where we activate back the standby node that was failed. We will do it where we are sending the message that says that the standby has recovered, eliminating some complexity	2016-01-04 14:28:39 +09:00
Martín Marqués	ad988dccce	Fix bug discovered last week which prevents recovered standby from being used in the cluster. Main issue was that if the local repmgrd was not able to connect locally, it would set the local node as failed (active = false). This is fine, because we actually don't know if the node is active (actually, it's not active ATM) so it's best to keep it out of the cluster. The problem is that if the postgres service comes back up, and is able to recover by it self, then we should ack that fact and set it as active. There was another issue related with repmgrd being terminated if the postgres service was downs. This is not the correct thing to do: we should keep trying to connect to the local standby.	2016-01-04 14:28:33 +09:00
Martín Marqués	53fe3c7e5a	Fix bug discovered last week which prevents recovered standby from being used in the cluster. Main issue was that if the local repmgrd was not able to connect locally, it would set the local node as failed (active = false). This is fine, because we actually don't know if the node is active (actually, it's not active ATM) so it's best to keep it out of the cluster. The problem is that if the postgres service comes back up, and is able to recover by it self, then we should ack that fact and set it as active. There was another issue related with repmgrd being terminated if the postgres service was downs. This is not the correct thing to do: we should keep trying to connect to the local standby.	2016-01-04 14:28:26 +09:00
Ian Barwick	ce2d4fb86f	Make t_node_info generally available And have it include all the fields from the repl_nodes table.	2015-11-30 16:18:35 +09:00
Ian Barwick	ce3594d52d	Add /etc/repmgr.conf as a default configuration file location Also refactor configuration file handling while we're at it. Previously a configuration file would be ignored if it couldn't be opened, however that is now treated as an error.	2015-11-30 16:17:23 +09:00
Ian Barwick	f64c42a514	Simplify logger_init() parameters We're passing the t_configuration_options structure anyway, no need to pass items it contains as separate parameters.	2015-11-30 16:17:17 +09:00
Ian Barwick	29842f0e0d	Metadata update also handled by repmgr	2015-11-30 16:16:37 +09:00
Ian Barwick	25db1ba737	When following a new primary, have repmgr (not repmgrd) create the new slot	2015-11-30 16:16:26 +09:00
Ian Barwick	807dcc1038	Repurpose -v/--verbose; add -t/--terse option (repmgr only) repmgr and particularly repmgrd currently produce substantial amounts of log output. Much of this is only useful when troubleshooting or debugging. Previously the -v/--verbose option just forced the log level to INFO. With repmgrd this is pretty pointless - just set the log level in the configuration file. With repmgr the configuration file can be overriden by the new -L/--log-level option. -v/--verbose now provides an additional, chattier/pedantic level of logging ("Opening this logfile", "Executing this query", "running in this loop") which is helpful for understanding repmgr/repmgrd's behaviour, particularly for troubleshooting. What additional verbose logging is generated will of course a also depends on the log level set, so e.g. someone trying to work out which configuration file is actually being opened can use '--log-level=INFO --verbose' without being bothered by an avalanche of extra verbose debugging output. -t/--terse option will silence certain non-essential output, at the moment any HINTs. Note that -v/--verbose and -t/--terse are not mutually exclusive (suggestions for better names welcome).	2015-11-30 16:15:03 +09:00
Ian Barwick	ae84041a4e	Add log_hint() function for logging hints There are a few places where additional hints are written as log output, usually LOG_NOTICE. Create an explicit function to provide hints in a standardized manner; by storing the log level of the previous logger call, we can ensure the hint is only displayed when the log message itself would be. Part of an ongoing effort to better control repmgr's logging output.	2015-11-30 16:14:08 +09:00
Ian Barwick	ea01d1d30b	Always use catalog path when calling system functions Removes any risk of issues due to search path mangling etc.	2015-11-30 16:13:31 +09:00
Ian Barwick	43626892d0	Improve configuration file parsing Related to Github #127. - use the previously introduced repmgr_atoi() function to parse integers better - collate all detected errors and output as a list, rather than failing on the first error.	2015-11-30 16:13:16 +09:00
Ian Barwick	8870b7d7f1	Rename variable 'reconnect_intvl' to 'reconnect_interval' For consistency with the configuration file parameter name	2015-11-30 16:13:08 +09:00
Ian Barwick	f56f70c2a6	Specify relevant node in error message	2015-11-30 16:09:59 +09:00
Ian Barwick	d353fe2a9f	Terminate repmgrd if standby is no longer connected to upstream	2015-11-30 16:09:50 +09:00
Ian Barwick	a59ea243c0	Improve logging and event notifications when following new upstream node	2015-11-30 16:08:30 +09:00
Ian Barwick	0c5025b3d6	Add note about checking replication slots when following upstream node	2015-11-30 16:08:07 +09:00
Ian Barwick	42b79b9b54	Improve log messages when following new primary	2015-11-30 16:07:39 +09:00
Ian Barwick	2e47c6b40b	Minor formatting tweak	2015-11-30 16:07:32 +09:00
Ian Barwick	69c552b8e0	Only log some debug items if verbose flag is set.	2015-11-30 16:03:56 +09:00
Ian Barwick	51967d2bd8	Add missing space	2015-11-30 16:03:50 +09:00
Martín Marqués	fb6781775d	Fix bug which prevents repmgrd from starting when the cluster name has upper case letters.	2015-10-08 19:46:34 -03:00
Ian Barwick	e115825cd6	Fix comment capitalization	2015-09-30 14:58:43 +09:00
Ian Barwick	c3bd02b83d	Standardize if-statement formatting "if(" -> "if ("	2015-09-24 17:45:08 +09:00
Ian Barwick	8e7d110a22	Check for existing master record before deleting it Otherwise repmgr implies it's deleting a record which isn't actually there.	2015-09-24 17:39:39 +09:00
Tomas Vondra	ef6b24551a	call update_node_record_set_upstream() for STANDBY FOLLOW repmgrd correctly updates ID of the upstream node after automatic failover, but repmgr was not doing that for manual failvers. This moves the existing function to dbutils and modifies it so that it does not rely on global variables with configuration (available just in repmgrd). This should fix issue #67 (hopefully, haven't done much testing).	2015-09-23 12:32:47 +09:00
Ian Barwick	30fd111cba	Rework config file handling If no configuration file provided, also check default Postgres sysconfig dir. It would also be useful to check the configuration directory provided by the RPM/DEB packages, not sure if that's programmatically feasible.	2015-09-21 15:55:29 +09:00
Ian Barwick	65e63b062e	Generally tidy up help output	2015-09-21 11:49:06 +09:00
Ian Barwick	053f672caa	Treat -?/--help and -V/--version as normal options Currently repmgr/repmgrd will only accept these as valid when provided as the first command line option, however it's possible a user will want to get the output of those options by adding them to the end of a previously inputted command. Note that after the first of these options is encountered, the program will terminate and not process any other options. This is consistent with psql's behaviour Per GitHub issue #107 from Sébastien Gross.	2015-09-21 09:53:51 +09:00
Ian Barwick	7345ddcf00	Whitespace tweak	2015-09-10 14:27:21 +09:00
Gianni Ciolli	462d446477	Bug #90 fix (autofailover with reconnect_attemps > 1). The main change is that now check_connection requires a conninfo parameter, and the connection object has type (PGconn *) so it can be replaced by check_connection if needed. The bug was caused by the fact that the first failure resulted in conn == NULL, so that subsequent checks of the upstream connection were failing irrespectively of the actual state of the upstream node. Now, when conn == NULL, check_connection will use conninfo to establish a new connection and place it into conn. We introduce a new INTERNAL_ERROR code for the case when they are both NULL. In passing, we also reworded a confusing error message, distinguishing a timeout from the actual elapsed time.	2015-08-10 20:58:43 +02:00
Ian Barwick	1e5792f8df	Remove unused function	2015-04-14 14:29:47 +09:00
Ian Barwick	a01fefa7d0	After standby promotion, ensure metadata is updated by repmgr Previously this was handled by repmgrd but if a standby is promoted directly this will leave the metadata in an incorrect state.	2015-04-14 13:39:48 +09:00
Ian Barwick	07d220cb00	Correct monitoring table column names It would be more consistent to change the "primary" to "master" but that would make the table incompatible with the v2.0 table.	2015-03-31 18:14:32 +09:00
Ian Barwick	4dfeffe087	Add constant NODE_NOT_FOUND Which is what the magic number means in those contexts.	2015-03-31 14:35:16 +09:00
Ian Barwick	18544c82ca	Prevent rempgrd from looping infinitely if node was not registered	2015-03-31 14:25:08 +09:00
Ian Barwick	0f86bdcd05	Fixes for event logging We can't always assume a valid connection to the master	2015-03-31 14:15:29 +09:00
Ian Barwick	3e621f43d1	Use 100 as the default priority; 0 or less means node will never be promoted	2015-03-26 10:38:20 +09:00
Ian Barwick	15a531fed8	Update function description	2015-03-24 19:36:31 +09:00
Ian Barwick	8de0deddf9	Change "primary" to "master" Personally I prefer "primary", but repmgr uses "master" so let's consolidate on one version of the terminology for clarity.	2015-03-24 14:06:39 +09:00
Ian Barwick	bd19a2c868	Improve handling of event logging in rempgrd Provide the master connection if available, and if not enable create_event_record() to skip trying to write to the database, but execute the notification program if defined.	2015-03-24 13:40:39 +09:00
Ian Barwick	2cadb3424d	Don't try and log events when no master connection available	2015-03-24 12:48:02 +09:00
Ian Barwick	172a3d90cf	Terminate rather than destroy	2015-03-19 09:55:20 +09:00
Ian Barwick	7f98bb7aec	Create event record for rempgrd termination Also fix a few incorrect exit codes.	2015-03-17 19:08:59 +09:00
Ian Barwick	9e2736be4c	Remove superfluous configuration check Also add note about configuration parsing failure and event logging.	2015-03-17 18:41:17 +09:00

1 2 3 4 5 ...

275 Commits