pgcat

mirror of https://github.com/postgresml/pgcat.git synced 2026-05-31 23:19:05 +00:00

Author	SHA1	Message	Date
Voldemarich	7205537b49	[BUG] Fix binding of NULL value parameters in prepared statements (#496 ) Fix binding of NULL value parameters in prepared statements Co-authored-by: anon <anon@non.existent>	2023-07-10 10:35:43 +02:00
Zain Kabani	1ed6e925ed	Fixes the default for round robing in General (#488 )	2023-06-23 09:15:44 -07:00
Lev Kokotov	4b78af9676	Implement Close for prepared statements (#482 ) * Partial support for Close * Close * respect config value * prepared spec * Hmm * Print cache size	2023-06-18 23:02:34 -07:00
Lev Kokotov	73500c0c96	Fix build (#481 )	2023-06-17 09:09:54 -07:00
Lev Kokotov	b167de5aa3	fmt (#480 )	2023-06-17 08:57:33 -07:00
Juraj Bubniak	473bb3d17d	Log not implemented messages as debug in prometheus metrics. (#477 )	2023-06-16 18:48:38 -07:00
Lev Kokotov	c7d6273037	Support for prepared statements (#474 ) * Start prepared statements * parse * Ok * optional * dont rewrite anonymous prepared stmts * Dont rewrite anonymous prep statements * hm? * prep statements * I see! * comment * Print config value * Rewrite bind and add sqlx test * fmt * ok * Fix * Fix stats * its late * clean up PREPARE	2023-06-16 12:57:44 -07:00
Jeff Chen	94c781881f	Report min_pool_size correctly (#471 )	2023-06-12 09:23:56 -07:00
Zain Kabani	aca9738821	Make queue strategy configurable and default to Fifo (#463 ) * Change idle timeout default to 10 minutes * Revert lifo for now while we investigate connection thrashing issues * Make queue strategy configurable * test revert idle time out * Add pgcat start to python test	2023-06-09 11:35:20 -07:00
Zain Kabani	0bc453a771	Change default server lifetime and bump bb8 version to use LIFO correctly (#462 ) Change default server lifetime and idle timeouts and bump bb8 version to use LIFO correctly	2023-05-31 08:25:42 -07:00
Zain Kabani	b67c33b6d0	Use latest bb8 and use Lifo as the queue strategy in the pool (#455 ) * Use git bb8 * Use latest bb8 and change pool is use stack	2023-05-28 19:46:13 -07:00
Mostafa Abdelraouf	a8a30ad43b	Refactor Pool Stats to be based off of Server/Client stats (#445 ) What is wrong Stats reported by SHOW POOLS seem to be leaking. We see lingering cl_idle , cl_waiting, and similarly for sv_idle , sv_active. We confirmed that these are reporting issues not actual lingering clients. This behavior is readily reproducible by running while true; do psql "postgres://sharding_user:sharding_user@localhost:6432/sharded_db" -c "SELECT 1" > /dev/null 2>&1 & done Why it happens I wasn't able to get to figure our the reason for the bug but my best guess is that we have race conditions when updating pool-level stats. So even though individual update operations are atomic, we perform a check then update sequence which is not protected by a guard. https://github.com/postgresml/pgcat/blob/main/src/stats/pool.rs#L174-L179 I am also suspecting that using Relaxed ordering might allow this behavior (I changed all operations to use Ordering::SeqCst but still got lingering clients) How to fix Since SHOW POOLS/SHOW SERVER/SHOW CLIENTS all show the current state of the proxy (as opposed to SHOW STATS which show aggregate values), this PR refactors SHOW POOLS to have it construct the results directly from SHOW SERVER and SHOW CLIENT datasets. This reduces the complexity of stat updates and eliminates the need for having locks when updating pool stats as we only care about updating individual client/server states. This will change the semantics of maxwait, so instead of it holding the maxwait time ever encountered by a client (connected or disconnected), it will only consider connected clients which should be okay given PgCat tends to hold on to client connections more than Pgbouncer.	2023-05-23 08:44:49 -05:00
Lev Kokotov	100778670c	Ensure data makes it to the client (#446 ) * Ensure data makes it to the client * flush all buffers	2023-05-18 16:41:22 -07:00
Lev Kokotov	37e3349c24	Optionally clean up server connections (#444 ) * Optionally clean up server connections * move setting to pool * fix test * Print setting to screen * fmt * Fix pool_settings override in tests	2023-05-18 10:46:55 -07:00
Zain Kabani	7f57a89d75	Fix time based average stats (#442 ) * keep track of current stats and zero them after updating averages * Try tests * typo * remove commented test stuff * Avoid dividing by zero * Fix test * refactor, get rid of iterator. do it manually * trigger build * Fix	2023-05-17 21:38:10 -07:00
Lev Kokotov	0898461c01	Allow to deploy pools without checking (#438 )	2023-05-12 12:48:37 -07:00
Lev Kokotov	52b1b43850	Prewarmer (#435 ) * Prewarmer * hmm * Tests * default * fix test * Correct configuration * Added minimal config example * remove connect_timeout	2023-05-12 09:50:52 -07:00
Zain Kabani	0907f1b77f	Improve logging for connection cleanup (#428 ) * initial commit * fix * fmt	2023-05-11 17:40:10 -07:00
Zain Kabani	73260690b0	Fixes average stats bug (#436 ) * Add test * Fix test * Add fix	2023-05-11 17:37:58 -07:00
Lev Kokotov	571b02e178	Calculate averages correctly and preserve totals like before (#429 ) * Reset totals after avg calculation * like it used to be	2023-05-08 10:06:16 -07:00
Andrew Tanner	159eb89bf0	First try with role reset (#427 ) * First try with role rest * update * extra line * Update src/server.rs Co-authored-by: Lev Kokotov <levkk@users.noreply.github.com> * Update tests/ruby/misc_spec.rb Co-authored-by: Lev Kokotov <levkk@users.noreply.github.com> --------- Co-authored-by: Lev Kokotov <levkk@users.noreply.github.com>	2023-05-05 15:31:27 -07:00
Lev Kokotov	389993bf3e	Accurate log messages (#425 )	2023-05-05 08:27:19 -07:00
Lev Kokotov	ba5243b6dd	Optionally validate config on boot (#423 )	2023-05-03 17:07:23 -07:00
Lev Kokotov	128ef72911	lowercase config query (#422 ) * lowercase config query * remove debug	2023-05-03 16:47:20 -07:00
Lev Kokotov	811885f464	Actually plugins (#421 ) * more plugins * clean up * fix tests * fix flakey test	2023-05-03 16:13:45 -07:00
Lev Kokotov	09e54e1175	Plugins! (#420 ) * Some queries * Plugins!! * cleanup * actual names * the actual plugins * comment * fix tests * Tests * unused errors * Increase reaper rate to actually enforce settings * ok	2023-05-03 09:13:05 -07:00
Jose Fernández	7dfbd993f2	Add dns_cache for server addresses as in pgbouncer (#249 ) * Add dns_cache so server addresses are cached and invalidated when DNS changes. Adds a module to deal with dns_cache feature. It's main struct is CachedResolver, which is a simple thread safe hostname <-> Ips cache with the ability to refresh resolutions every `dns_max_ttl` seconds. This way, a client can check whether its ip address has changed. * Allow reloading dns cached * Add documentation for dns_cached	2023-05-02 10:26:40 +02:00
Lev Kokotov	0d504032b2	Server TLS (#417 ) * Server TLS * Finish up TLS * thats it * diff * remove dead code * maybe? * dirty shutdown * skip flakey test * remove unused error * fetch config once	2023-04-30 09:41:46 -07:00
Lev Kokotov	4a87b4807d	Add more pool settings (#416 ) * Add some pool settings * fmt	2023-04-26 16:33:26 -07:00
Shawn	cb5ff40a59	fix typo (#415 ) chore: typo	2023-04-26 08:28:54 -07:00
Lev Kokotov	3dae3d0777	Separate server and client passwords optionally (#407 ) * Separate server and user passwords * config	2023-04-18 09:57:17 -07:00
Cluas	bae12fca99	feat: set keepalive for pgcat server itself (#402 ) * feat: set keepalive for pgcat server self * docs: note also set for client	2023-04-12 09:29:43 -07:00
Lev Kokotov	421c5d4b64	Load config on client connect (#401 )	2023-04-11 10:32:48 -07:00
Kian-Meng Ang	d568739db9	Fix typos (#398 ) Found via `typos --format brief`	2023-04-10 18:37:16 -07:00
Lev Kokotov	692353c839	A couple things (#397 ) * Format cleanup * fmt * finally	2023-04-10 14:51:01 -07:00
Lev Kokotov	a62f6b0eea	Fix port; add user pool mode (#395 ) * Fix port; add user pool mode * will probably break our session/transaction mode tests	2023-04-05 15:06:19 -07:00
Jose Fernández	6f768a84ce	Auth passthrough (auth_query) (#266 ) * Add a new exec_simple_query method This adds a new `exec_simple_query` method so we can make 'out of band' queries to servers that don't interfere with pools at all. In order to reuse startup code for making these simple queries, we need to set the stats (`Reporter`) optional, so using these simple queries wont interfere with stats. * Add auth passthough (auth_query) Adds a feature that allows setting auth passthrough for md5 auth. It adds 3 new (general and pool) config parameters: - `auth_query`: An string containing a query that will be executed on boot to obtain the hash of a given user. This query have to use a placeholder `$1`, so pgcat can replace it with the user its trying to fetch the hash from. - `auth_query_user`: The user to use for connecting to the server and executing the auth_query. - `auth_query_password`: The password to use for connecting to the server and executing the auth_query. The configuration can be done either on the general config (so pools share them) or in a per-pool basis. The behavior is, at boot time, when validating server connections, a hash is fetched per server and stored in the pool. When new server connections are created, and no cleartext password is specified, the obtained hash is used for creating them, if the hash could not be obtained for whatever reason, it retries it. When client authentication is tried, it uses cleartext passwords if specified, it not, it checks whether we have query_auth set up, if so, it tries to use the obtained hash for making client auth. If there is no hash (we could not obtain one when validating the connection), a new fetch is tried. Once we have a hash, we authenticate using it against whathever the client has sent us, if there is a failure we refetch the hash and retry auth (so password changes can be done). The idea with this 'retrial' mechanism is to make it fault tolerant, so if for whatever reason hash could not be obtained during connection validation, or the password has change, we can still connect later. * Add documentation for Auth passthrough	2023-03-30 13:29:23 -07:00
Jose Fernández	58ce76d9b9	Refactor stats to use atomics (#375 ) * Refactor stats to use atomics When we are dealing with a high number of connections, generated stats cannot be consumed fast enough by the stats collector loop. This makes the stats subsystem inconsistent and a log of warning messages are thrown due to unregistered server/clients. This change refactors the stats subsystem so it uses atomics: - Now counters are handled using U64 atomics - Event system is dropped and averages are calculated using a loop every 15 seconds. - Now, instead of snapshots being generated ever second we keep track of servers/clients that have registered. Each pool/server/client has its own instance of the counter and makes changes directly, instead of adding an event that gets processed later. * Manually mplement Hash/Eq in `config::Address` ignoring stats * Add tests for client connection counters * Allow connecting to dockerized dev pgcat from the host * stats: Decrease cl_idle when idle socket disconnects	2023-03-28 17:19:37 +02:00
Zain Kabani	ca4431b67e	Add idle client in transaction configuration (#380 ) * Add idle client in transaction configuration * fmt * Update docs * trigger build * Add tests * Make the config dynamic from reloads * fmt * comments * trigger build * fix config.md * remove error	2023-03-24 08:20:30 -07:00
Mostafa Abdelraouf	d66b377a8e	Check Slice bounds in read_message to avoid panics (#371 ) When recv is called in the mirroring client, we noticed an occasional panic when reading the message. thread 'tokio-runtime-worker' panicked at 'slice index starts at 5 but ends at 0', src/messages.rs:522:18 We are still debugging the reason why this happens but adding a check for slice bounds seems like a good idea. Instead of panicking, this will return an Err to the caller which will close the connection.	2023-03-17 12:31:43 -05:00
Mostafa Abdelraouf	e5df179ac9	Reduce memory and CPU footprint of mirroring (#369 ) The experimental mirroring feature used a lot of memory and CPU when put under production traffic. This change attempts to reduce memory and CPU usage. Memory footprint is reduced by making the channel smaller. CPU usage is reduced by avoiding allocations if the channel is full or is closed. We might lose more messages this way if the mirror falls behind but that is more acceptable than crashing the entire process when it goes out-of-memory (OOM)	2023-03-15 17:58:45 -05:00
Lev Kokotov	b4baa86e8a	Extended query protocol sharding (#339 ) * Prepared stmt sharding s tests * len check * remove python test * latest rust * move that to debug for sure * Add the actual tests * latest image * Update tests/ruby/sharding_spec.rb	2023-03-10 07:55:22 -08:00
Mostafa Abdelraouf	76e195a8a4	Reorder fields in Shard to avoid ValueAfterTable errors (#349 )	2023-03-10 07:39:42 -06:00
Mostafa Abdelraouf	aa89e357e0	PgCat Query Mirroring (#341 ) This is an implementation of Query mirroring in PgCat (outlined here #302) In configs, we match mirror hosts with the servers handling the traffic. A mirror host will receive the same protocol messages as the main server it was matched with. This is done by creating an async task for each mirror server, it communicates with the main server through two channels, one for the protocol messages and one for the exit signal. The mirror server sends the protocol packets to the underlying PostgreSQL server. We receive from the underlying PostgreSQL server as soon as the data is available and we immediately discard it. We use bb8 to manage the life cycle of the connection, not for pooling since each mirror server handler is more or less single-threaded. We don't have any connection pooling in the mirrors. Matching each mirror connection to an actual server connection guarantees that we will not have more connections to any of the mirrors than the parent pool would allow.	2023-03-10 06:23:51 -06:00
Mostafa Abdelraouf	2cc6a09fba	Add Manual host banning to PgCat (#340 ) Sometimes we want an admin to be able to ban a host for some time to route traffic away from that host for reasons like partial outages, replication lag, and scheduled maintenance. We can achieve this today using a configuration update but a quicker approach is to send a control command to PgCat that bans the replica for some specified duration. This command does not change the current banning rules like Primaries cannot be banned When all replicas are banned, all replicas are unbanned	2023-03-06 06:10:59 -06:00
Lev Kokotov	c3eaf023c7	Automatic sharding for SELECT v2 (#337 ) * More comprehensive read sharding support * A few fixes * fq * comment * wildcard	2023-03-02 00:53:31 -05:00
Jose Fernández	9241df18e2	Allow sending logs to stdout by using STDOUT_LOG env var (#334 ) * Allow sending logs to stdout by using STDOUT_LOG env var * Increase stats buffer size	2023-02-28 13:10:40 -08:00
zainkabani	eb8cfdb1f1	Adds SHUTDOWN command as alternate option to sending SIGINT (#331 ) * Adds SHUTDOWN command to PgCat as alternate option to sending SIGINT * Check if we're already in SHUTDOWN sequence * Send signal directly from shutdown instead of using channel * Add tests * trigger build * Lowercase response and boolean change * Update tests * Fix tests * typo	2023-02-26 22:16:30 -08:00
Mostafa Abdelraouf	75a7d4409a	Fix Back-and-forth RELOAD Bug (#330 ) We identified a bug where RELOAD fails to update the pools. To reproduce you need to start at some config state, modify that state a bit, reload, revert the configs back to the original state, and reload. The last reload will fail to update the pool because PgCat "thinks" the pool state didn't change. This is because we use a HashSet to keep track of config hashes but we never remove values from it. Say we start with State A, we modify pool configs to State B and reload. Now the POOL_HASHES struct has State A and State B. Attempting to go back to State A will encounter a hashset hit which is interpreted by PgCat as "Configs are the same, no need to reload pools" We fix this by attaching a config_hash value to ConnectionPool object and we calculate that value when we create the pool. This eliminates the need for a global variable. One shortcoming here is that changing any config under one user in the pool will trigger a reload for the entire pool (which is fine I think)	2023-02-21 21:53:10 -06:00
Nicholas Dujay	37e1c5297a	implement show users (#329 ) * implement show users * fix compile errors * add basic ruby test * gitignore things	2023-02-21 13:08:43 -08:00

1 2 3 4 5 ...

252 Commits