pgcat

mirror of https://github.com/postgresml/pgcat.git synced 2026-07-17 01:49:05 +00:00

Author	SHA1	Message	Date
Zain Kabani	ca4431b67e	Add idle client in transaction configuration (#380) * Add idle client in transaction configuration * fmt * Update docs * trigger build * Add tests * Make the config dynamic from reloads * fmt * comments * trigger build * fix config.md * remove error	2023-03-24 08:20:30 -07:00
Lev Kokotov	b4baa86e8a	Extended query protocol sharding (#339) * Prepared stmt sharding s tests * len check * remove python test * latest rust * move that to debug for sure * Add the actual tests * latest image * Update tests/ruby/sharding_spec.rb	2023-03-10 07:55:22 -08:00
Mostafa Abdelraouf	aa89e357e0	PgCat Query Mirroring (#341) This is an implementation of query mirroring in PgCat (outlined in #302). In configs, we match mirror hosts with the servers handling the traffic. A mirror host will receive the same protocol messages as the main server it was matched with. This is done by creating an async task for each mirror server; it communicates with the main server through two channels, one for the protocol messages and one for the exit signal. The mirror server sends the protocol packets to the underlying PostgreSQL server. We receive from the underlying PostgreSQL server as soon as the data is available and immediately discard it. We use bb8 to manage the life cycle of the connection, not for pooling, since each mirror server handler is more or less single-threaded. We don't have any connection pooling in the mirrors. Matching each mirror connection to an actual server connection guarantees that we will not have more connections to any of the mirrors than the parent pool would allow.	2023-03-10 06:23:51 -06:00
Mostafa Abdelraouf	2cc6a09fba	Add Manual host banning to PgCat (#340) Sometimes we want an admin to be able to ban a host for some time to route traffic away from that host for reasons like partial outages, replication lag, and scheduled maintenance. We can achieve this today using a configuration update, but a quicker approach is to send a control command to PgCat that bans the replica for some specified duration. This command does not change the current banning rules, such as: primaries cannot be banned; when all replicas are banned, all replicas are unbanned.	2023-03-06 06:10:59 -06:00
Jose Fernández	8a0da10a87	Dev environment (#338) Add dev env	2023-03-02 12:14:10 -05:00
Mostafa Abdelraouf	75a7d4409a	Fix Back-and-forth RELOAD Bug (#330) We identified a bug where RELOAD fails to update the pools. To reproduce, you need to start at some config state, modify that state a bit, reload, revert the configs back to the original state, and reload. The last reload will fail to update the pool because PgCat "thinks" the pool state didn't change. This is because we use a HashSet to keep track of config hashes but we never remove values from it. Say we start with State A, then modify pool configs to State B and reload. Now the POOL_HASHES struct has State A and State B. Attempting to go back to State A will encounter a hashset hit, which is interpreted by PgCat as "Configs are the same, no need to reload pools". We fix this by attaching a config_hash value to the ConnectionPool object, and we calculate that value when we create the pool. This eliminates the need for a global variable. One shortcoming here is that changing any config under one user in the pool will trigger a reload for the entire pool (which is fine I think).	2023-02-21 21:53:10 -06:00
Nicholas Dujay	37e1c5297a	implement show users (#329) * implement show users * fix compile errors * add basic ruby test * gitignore things	2023-02-21 13:08:43 -08:00
Mostafa Abdelraouf	f9134807d7	More Test coverage + fix some code coverage bugs (#321) Connection to the CI databases is viewed by Postgres as coming from localhost. The pg_hba.conf file generated by the docker image uses trust for these connections, which is why we had no test coverage on the SASL and md5 branches. This PR fixes this issue. There was also an issue with under-reporting code coverage. This should be fixed now.	2023-02-16 23:09:22 -06:00
Mostafa Abdelraouf	bf6efde8cc	Fix code coverage + less flakiness (#318) Code coverage logic was missing coverage from rust tests. This is now fixed. Also, we weren't reaping spawned PgCat processes correctly which left zombie processes.	2023-02-13 15:29:08 -06:00
Mostafa Abdelraouf	f1265a5570	Introduce tcp_keepalives to PgCat (#315) We have encountered a case where PgCat pools were stuck following a database incident. Our best understanding at this point is that the PgCat -> Postgres connections died silently and, because Tokio defaults to disabling keepalives, connections in the pool were marked as busy forever. Only when we deployed PgCat did we see recovery. This PR introduces tcp_keepalives to PgCat. The defaults are keepalives_idle: 5 (seconds), keepalives_interval: 5 (seconds), and keepalives_count: 5 (a count). These settings can detect the death of an idle connection within 30 seconds of its death (5 seconds idle plus 5 probes sent 5 seconds apart). Please note that the connection can remain idle forever (from an application perspective) as long as the keepalive packets are flowing, so disconnection will only occur if the other end is not acknowledging keepalive packets (keepalive packet acks are handled by the OS; the application does not need to do anything). I plan to add tcp_user_timeout in a follow-up PR.	2023-02-08 11:35:38 -06:00
Mostafa Abdelraouf	87a771aecc	Log error messages for network failures (#289) We are seeing some "Error reading message code from socket" error messages. We want more context, so this PR logs the actual error reported.	2023-01-19 05:18:08 -06:00
dependabot[bot]	99a3b9896d	chore(deps): bump activerecord from 7.0.3.1 to 7.0.4.1 in /tests/ruby (#287) Bumps [activerecord](https://github.com/rails/rails) from 7.0.3.1 to 7.0.4.1. - [Release notes](https://github.com/rails/rails/releases) - [Changelog](https://github.com/rails/rails/blob/v7.0.4.1/activerecord/CHANGELOG.md) - [Commits](https://github.com/rails/rails/compare/v7.0.3.1...v7.0.4.1) --- updated-dependencies: - dependency-name: activerecord dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-01-18 16:56:16 -08:00
Mostafa Abdelraouf	7894bba59b	Introduce least-outstanding-connections load balancing (#282) Least-outstanding-connections (LoC) load balancing can improve the load distribution between instances, but for PgCat it may also improve handling of slow replicas that don't go completely down. With LoC, traffic will quickly move away from the slow replica without waiting for the replica to be banned. If all replicas slow down equally (due to a bad query that is hitting all replicas), the algorithm will degenerate to Random Load Balancing (which is what we had in PgCat until today). This may also allow PgCat to accommodate pools with differently-sized replicas.	2023-01-17 06:52:18 -06:00
zainkabani	19f635881a	Don't send discard all when state is changed in transaction (#186) * Don't send discard all when state is changed in transaction * Remove unnecessary clone * spelling * Move transaction check to SET command * Add test for set command in transaction * type * Update comments * Update comments * use moves instead of clones for initial message * don't make message mutable * Update unwrap * but i'm not a wrapper * Add set local test * change continue	2022-10-13 19:33:12 -07:00
Mostafa Abdelraouf	3d33ccf4b0	Fix maxwait metric (#183) Max wait was being reported as 0 after #159. This PR fixes that and adds a test.	2022-10-05 21:41:09 -05:00
Mostafa Abdelraouf	af064ef447	Set client state to idle after error (#179) * Set client state to idle after error * fmt * spelling * clean up	2022-09-24 09:09:15 -07:00
Mostafa Abdelraouf	f7a951745c	Report Query times (#166) * Report avg and total query timing * Report query times * fmt	2022-09-15 02:21:45 -04:00
Mostafa Abdelraouf	4ae1bc8d32	Add SHOW CLIENTS / SHOW SERVERS + Stats refactor and tests (#159) * wip * Main Thread Panic when swarmed with clients * fix * fix * 1024 * fix * remove test * Add SHOW CLIENTS * revert * fmt * Refactor + tests * fmt * add test * Add SHOW SERVERS + Make PR unreviewable * prometheus * add state to clients and servers * fmt * Add application_name to server stats * Add tests for waiting clients * Docs * remove comment * comments * typo * cleanup * CI	2022-09-14 11:20:41 -04:00
Mostafa Abdelraouf	9514b3b2d1	Clean connection state up after protocol named prepared statement (#163) * Clean connection state up after protocol named prepared statement * Avoid cloning + add test * fmt	2022-09-07 20:37:17 -07:00
Mostafa Abdelraouf	23a642f4a4	Send DISCARD ALL even if client is not in transaction (#152) * Send DISCARD ALL even if client is not in transaction * fmt * Added tests + avoided sending extra discard all * Adds set name logic to beginning of handle client * fmt * refactor dead code handling * Refactor reading command tag * remove unnecessary trim * Removing debugging statement * typo * typo{ * documentation * edit text * un-unwrap * run ci * run ci Co-authored-by: Zain Kabani <zain.kabani@instacart.com>	2022-09-01 20:06:55 -07:00
Mostafa Abdelraouf	d48c04a7fb	Ruby integration tests (#147) * Ruby integration tests * forgot a file * refactor * refactoring * more refactoring * remove config helper * try multiple databases * fix * more databases * Use pg stats * ports * speed * Fix tests * preload library * comment	2022-08-30 09:14:53 -07:00
Mostafa Abdelraouf	c054ff068d	Avoid sending `Z` packet in the middle of extended protocol packet sequence if we fail to get connection from pool (#137) * Failing test * maybe * try fail * try * add message * pool size * correct user * more * debug * try fix * see stdout * stick? * fix configs * modify * types * m * maybe * make tests idempotent * hopefully fails * Add client fix * revert pgcat.toml change * Fix tests	2022-08-23 11:02:23 -07:00
Mostafa Abdelraouf	7592339092	Prevent clients from sticking to old pools after config update (#113) * Re-acquire pool at the beginning of Protocol loop * Fix query router + add tests for recycling behavior	2022-08-09 12:18:27 -07:00
Mostafa Abdelraouf	1b648ca00e	Send proper server parameters to clients using admin db (#103) * Send proper server parameters to clients using admin db * clean up * fix python test * build * Add python * missing & * debug ls * fix tests * fix tests * fix * Fix warning * Address comments	2022-07-31 19:52:23 -07:00
Mostafa Abdelraouf	2ae4b438e3	Add support for multi-database / multi-user pools (#96) * Add support for multi-database / multi-user pools * Nothing * cargo fmt * CI * remove test users * rename pool * Update tests to use admin user/pass * more fixes * Revert bad change * Use PGDATABASE env var * send server info in case of admin	2022-07-27 19:47:55 -07:00
dependabot[bot]	eff8e3e229	Bump activerecord from 7.0.2.2 to 7.0.3.1 in /tests/ruby (#94) Bumps [activerecord](https://github.com/rails/rails) from 7.0.2.2 to 7.0.3.1. - [Release notes](https://github.com/rails/rails/releases) - [Changelog](https://github.com/rails/rails/blob/v7.0.3.1/activerecord/CHANGELOG.md) - [Commits](https://github.com/rails/rails/compare/v7.0.2.2...v7.0.3.1) --- updated-dependencies: - dependency-name: activerecord dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2022-07-12 13:24:41 -07:00
Lev Kokotov	37e3a86881	Pass application_name to server (#73) * Pass application_name to server * fmt	2022-06-03 00:15:50 -07:00
Lev Kokotov	ccbca66e7a	Poorly behaved client fix (#65) * Poorly behaved client fix * yes officer * fix tests * no useless rescue * Looks ok	2022-05-09 09:09:22 -07:00
Lev Kokotov	303fec063b	Ruby (#30) * cop * log	2022-02-20 23:33:04 -08:00
Lev Kokotov	a556ec1c43	More query router commands; settings last until changed again; docs (#25) * readme * touch up docs * stuff * refactor query router * remove unused * less verbose * docs * no link * method rename	2022-02-19 08:57:24 -08:00
Lev Kokotov	bbacb9cf01	Explicit shard selection; Rails tests (#24) * Explicit shard selection; Rails tests * try running ruby tests * try without lockfile * aha * ok	2022-02-18 09:43:07 -08:00
Lev Kokotov	6e83556867	Ruby tests	2022-02-03 18:08:51 -08:00
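
The back-and-forth RELOAD bug described in commit 75a7d4409a (#330) can be illustrated with a small, self-contained sketch. This is not PgCat's actual code: `PoolConfig`, `GlobalHashes`, `Pool`, and their fields are hypothetical stand-ins, and only the idea (a grow-only set of config hashes versus a hash stored on the pool itself) is taken from the commit message.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashSet;
use std::hash::{Hash, Hasher};

/// Hypothetical stand-in for a pool's configuration.
#[derive(Hash, Clone, Debug)]
struct PoolConfig {
    pool_size: u32,
    pool_mode: String,
}

/// Hashes a configuration so two states can be compared cheaply.
fn config_hash(config: &PoolConfig) -> u64 {
    let mut hasher = DefaultHasher::new();
    config.hash(&mut hasher);
    hasher.finish()
}

/// Buggy approach: a global set that records every hash ever seen
/// and never removes one.
struct GlobalHashes {
    seen: HashSet<u64>,
}

impl GlobalHashes {
    /// Returns true if the pool would be rebuilt for this config.
    fn reload(&mut self, config: &PoolConfig) -> bool {
        // `insert` returns false when the hash is already in the set,
        // which the buggy logic reads as "nothing changed".
        self.seen.insert(config_hash(config))
    }
}

/// Fixed approach: the pool remembers the hash it was built from.
struct Pool {
    config_hash: u64,
}

impl Pool {
    fn new(config: &PoolConfig) -> Self {
        Pool { config_hash: config_hash(config) }
    }

    /// Returns true if the pool would be rebuilt for this config.
    fn reload(&mut self, config: &PoolConfig) -> bool {
        let new_hash = config_hash(config);
        if new_hash == self.config_hash {
            return false; // same as the config currently in use
        }
        self.config_hash = new_hash;
        true
    }
}

fn main() {
    let state_a = PoolConfig { pool_size: 10, pool_mode: "transaction".to_string() };
    let state_b = PoolConfig { pool_size: 20, ..state_a.clone() };

    // A -> B -> A with the grow-only set: the revert is skipped.
    let mut global = GlobalHashes { seen: HashSet::new() };
    global.reload(&state_a);
    global.reload(&state_b);
    println!("grow-only set rebuilds on revert to A: {}", global.reload(&state_a));

    // A -> B -> A with a per-pool hash: the revert is applied.
    let mut pool = Pool::new(&state_a);
    pool.reload(&state_b);
    println!("per-pool hash rebuilds on revert to A: {}", pool.reload(&state_a));
}
```

In this sketch the grow-only set reports `false` for the revert to State A, while the per-pool hash reports `true`, because it compares only against the configuration currently in use.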

32 Commits
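
The least-outstanding-connections selection described in commit 7894bba59b (#282) can be sketched as follows. This is an illustration under stated assumptions rather than PgCat's implementation: the function name `pick_least_outstanding`, the slice of per-replica counts, and the caller-supplied `tiebreak` value (standing in for a random number) are all hypothetical.

```rust
/// Picks the index of the replica with the fewest outstanding
/// (checked-out) connections. `tiebreak` is any caller-supplied number,
/// for example a random one, used to choose among equally loaded replicas.
/// Returns None when there are no replicas.
fn pick_least_outstanding(outstanding: &[usize], tiebreak: usize) -> Option<usize> {
    let min = *outstanding.iter().min()?;
    // Collect every replica that shares the minimum load.
    let candidates: Vec<usize> = outstanding
        .iter()
        .enumerate()
        .filter_map(|(index, &busy)| if busy == min { Some(index) } else { None })
        .collect();
    Some(candidates[tiebreak % candidates.len()])
}

fn main() {
    // Replica 1 is slow, so its connections stay checked out longer
    // and it stops being chosen without having to be banned.
    let slow_replica = [2, 9, 3];
    println!("{:?}", pick_least_outstanding(&slow_replica, 7));

    // Equal load on all replicas: the choice depends only on the
    // tiebreak value, which is random load balancing if it is random.
    let equal = [4, 4, 4];
    for tiebreak in 0..3 {
        println!("{:?}", pick_least_outstanding(&equal, tiebreak));
    }
}
```

When every replica has the same number of outstanding connections, every index is a candidate, which matches the commit's note that the algorithm degenerates to random load balancing in that case.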