MatrixSynapse

Commit Graph

Author	SHA1	Message	Date
Patrick Cloke	ab3f1b3b53	Convert simple_select_one_txn and simple_select_one to return tuples. (#16612 )	2023-11-09 11:13:31 -05:00
Patrick Cloke	2bf9341406	Ensure local invited & knocking users leave before purge. (#16559 ) This is mostly useful for federated rooms where some users would get stuck in the invite or knock state when the room was purged from their homeserver.	2023-10-27 12:50:50 -04:00
Patrick Cloke	679c691f6f	Remove more usages of cursor_to_dict. (#16551 ) Mostly to improve type safety.	2023-10-26 15:12:28 -04:00
Patrick Cloke	9407d5ba78	Convert simple_select_list and simple_select_list_txn to return lists of tuples (#16505 ) This should use fewer allocations and improves type hints.	2023-10-26 13:01:36 -04:00
Patrick Cloke	a4904dcb04	Convert simple_select_many_batch, simple_select_many_txn to tuples. (#16444 )	2023-10-11 13:24:56 -04:00
Patrick Cloke	fa907025f4	Remove manys calls to cursor_to_dict (#16431 ) This avoids calling cursor_to_dict and then immediately unpacking the values in the dict for other users. By not creating the intermediate dictionary we can avoid allocating the dictionary and strings for the keys, which should generally be more performant. Additionally this improves type hints by avoid Dict[str, Any] dictionaries coming out of the database layer.	2023-10-05 11:07:38 -04:00
David Robertson	1026776380	mypy plugin to check `@cached` return types (#14911 ) Co-authored-by: David Robertson <davidr@element.io> Co-authored-by: Patrick Cloke <patrickc@matrix.org> Co-authored-by: Erik Johnston <erik@matrix.org> Assert that the return type of callables wrapped in @cached and @cachedList are cachable (aka immutable).	2023-10-02 14:22:36 +00:00
Patrick Cloke	7ec0a141b4	Convert more cached return values to immutable types (#16356 )	2023-09-20 07:48:55 -04:00
Patrick Cloke	d7c89c5908	Return immutable objects for cachedList decorators (#16350 )	2023-09-19 15:26:44 -04:00
Erik Johnston	fc1e534e41	Speed up updating state in large rooms (#15971 ) This should speed up updating state in rooms with lots of state.	2023-07-20 15:51:28 +01:00
Eric Eastwood	e536f02f68	Remove superfluous `room_memberships` join from background update (#15733 ) Spawning from https://github.com/matrix-org/synapse/pull/15731	2023-06-07 11:47:01 -05:00
Eric Eastwood	9d911b0da6	No need for the extra join since `membership` is built-in to `current_state_events` (#15731 ) This helps with the upstream `is_host_joined()` and `is_host_invited()` functions. `membership` was added to `current_state_events` in https://github.com/matrix-org/synapse/pull/5706 and forced in https://github.com/matrix-org/synapse/pull/13745	2023-06-06 22:19:57 -05:00
Patrick Cloke	1f55c04cbc	Improve type hints for cached decorator. (#15658 ) The cached decorators always return a Deferred, which was not properly propagated. It was close enough when wrapping coroutines, but failed if a bare function was wrapped.	2023-05-24 12:59:31 +00:00
Sean Quah	04e79e6a18	Add config option to forget rooms automatically when users leave them (#15224 ) This is largely based off the stats and user directory updater code. Signed-off-by: Sean Quah <seanq@matrix.org>	2023-05-03 12:27:33 +01:00
Erik Johnston	79d2e2e79c	Speed up membership queries for users with forgotten rooms (#15385 )	2023-04-04 14:11:34 +01:00
Sean Quah	d0c713cc85	Return read-only collections from `@cached` methods (#13755 ) It's important that collections returned from `@cached` methods are not modified, otherwise future retrievals from the cache will return the modified collection. This applies to the return values from `@cached` methods and the values inside the dictionaries returned by `@cachedList` methods. It's not necessary for the dictionaries returned by `@cachedList` methods themselves to be read-only. Signed-off-by: Sean Quah <seanq@matrix.org> Co-authored-by: David Robertson <davidr@element.io>	2023-02-10 23:29:00 +00:00
David Robertson	2186ebed6c	Fetch fewer events when getting hosts in room (#14962 )	2023-02-02 16:49:14 +00:00
David Robertson	80d44060c9	Faster joins: omit partial rooms from eager syncs until the resync completes (#14870 ) * Allow `AbstractSet` in `StrCollection` Or else frozensets are excluded. This will be useful in an upcoming commit where I plan to change a function that accepts `List[str]` to accept `StrCollection` instead. * `rooms_to_exclude` -> `rooms_to_exclude_globally` I am about to make use of this exclusion mechanism to exclude rooms for a specific user and a specific sync. This rename helps to clarify the distinction between the global config and the rooms to exclude for a specific sync. * Better function names for internal sync methods * Track a list of excluded rooms on SyncResultBuilder I plan to feed a list of partially stated rooms for this sync to ignore * Exclude partial state rooms during eager sync using the mechanism established in the previous commit * Track un-partial-state stream in sync tokens So that we can work out which rooms have become fully-stated during a given sync period. * Fix mutation of `@cached` return value This was fouling up a complement test added alongside this PR. Excluding a room would mean the set of forgotten rooms in the cache would be extended. This means that room could be erroneously considered forgotten in the future. Introduced in #12310, Synapse 1.57.0. I don't think this had any user-visible side effects (until now). * SyncResultBuilder: track rooms to force as newly joined Similar plan as before. We've omitted rooms from certain sync responses; now we establish the mechanism to reintroduce them into future syncs. * Read new field, to present rooms as newly joined * Force un-partial-stated rooms to be newly-joined for eager incremental syncs only, provided they're still fully stated * Notify user stream listeners to wake up long polling syncs * Changelog * Typo fix Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com> * Unnecessary list cast Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com> * Rephrase comment Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com> * Another comment Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com> * Fixup merge(?) * Poke notifier when receiving un-partial-stated msg over replication * Fixup merge whoops Thanks MV :) Co-authored-by: Mathieu Velen <mathieuv@matrix.org> Co-authored-by: Mathieu Velten <mathieuv@matrix.org> Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com>	2023-01-23 15:44:39 +00:00
David Robertson	1eed795fc5	Include heroes in partial join responses' state (#14442 ) * Pull out hero selection logic * Include heroes in partial join response's state * Changelog * Fixup trial test * Remove TODO	2022-11-15 17:35:19 +00:00
Eric Eastwood	aa70556699	Check appservice user interest against the local users instead of all users (`get_users_in_room` mis-use) (#13958 )	2022-10-27 18:29:23 +00:00
Mathieu Velten	4dc05f3019	Fix presence bug introduced in 1.64 by #13313 (#14243 ) * Fix presence bug introduced in 1.64 by #13313 Signed-off-by: Mathieu Velten <mathieuv@matrix.org> * Add changelog * Add DISTINCT * Apply suggestions from code review Signed-off-by: Mathieu Velten <mathieuv@matrix.org>	2022-10-27 13:16:00 +01:00
dependabot[bot]	0b7830e457	Bump flake8-bugbear from 21.3.2 to 22.9.23 (#14042 ) Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Erik Johnston <erik@matrix.org> Co-authored-by: David Robertson <davidr@element.io>	2022-10-19 19:38:24 +00:00
Nick Mills-Barrett	f9bc5428c4	Batch up calls to `get_rooms_for_users` (#14109 )	2022-10-12 11:36:22 +01:00
Erik Johnston	3dfc4a08dc	Fix performance regression in `get_users_in_room` (#13972 ) Fixes #13942. Introduced in #13575. Basically, let's only get the ordered set of hosts out of the DB if we need an ordered set of hosts. Since we split the function up the caching won't be as good, but I think it will still be fine as e.g. multiple backfill requests for the same room will hit the cache.	2022-09-30 13:15:32 +01:00
Nick Mills-Barrett	a466164647	Optimise get_rooms_for_user (drop with_stream_ordering) (#13787 )	2022-09-29 13:55:12 +00:00
Erik Johnston	e8318a4333	Handle the case of remote users leaving a partial join room for device lists (#13885 )	2022-09-27 13:01:08 +01:00
reivilibre	6302753012	Deduplicate `is_server_notices_room`. (#13780 )	2022-09-14 15:53:18 +00:00
Nick Mills-Barrett	da41a7cd61	Remove check current state membership up to date (#13745 ) * Remove checks for membership column in current_state_events * Add schema script to force through the `current_state_events_membership` background job Contributed by Nick @ Beeper (@fizzadar).	2022-09-12 12:58:33 +01:00
Sean Quah	906cead9ca	Update docstrings to explain the impact of partial state (#13750 ) Update the docstrings for `get_users_in_room` and `get_current_hosts_in_room` to explain the impact of partial state. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-09-08 15:55:29 +01:00
Sean Quah	89e8b98b65	Avoid raising errors due to malformed IDs in `get_current_hosts_in_room` (#13748 ) Handle malformed user IDs with no colons in `get_current_hosts_in_room`. It's not currently possible for a malformed user ID to join a room, so this error would never be hit. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-09-08 15:55:03 +01:00
reivilibre	d3d9ca156e	Cancel the processing of key query requests when they time out. (#13680 )	2022-09-07 12:03:32 +01:00
Nick Mills-Barrett	42b11d5565	Remove cached wrap on `_get_joined_users_from_context` method (#13569 ) The method doesn't actually do any data fetching and the method that does, `_get_joined_profile_from_event_id`, has its own cache. Signed off by Nick @ Beeper (@Fizzadar).	2022-08-31 12:19:39 +01:00
Eric Eastwood	51d732db3b	Optimize how we calculate `likely_domains` during backfill (#13575 ) Optimize how we calculate `likely_domains` during backfill because I've seen this take 17s in production just to `get_current_state` which is used to `get_domains_from_state` (see case [2. Loading tons of events in the `/messages` investigation issue](https://github.com/matrix-org/synapse/issues/13356)). There are 3 ways we currently calculate hosts that are in the room: 1. `get_current_state` -> `get_domains_from_state` - Used in `backfill` to calculate `likely_domains` and `/timestamp_to_event` because it was cargo-culted from `backfill` - This one is being eliminated in favor of `get_current_hosts_in_room` in this PR 🕳 1. `get_current_hosts_in_room` - Used for other federation things like sending read receipts and typing indicators 1. `get_hosts_in_room_at_events` - Used when pushing out events over federation to other servers in the `_process_event_queue_loop` Fix https://github.com/matrix-org/synapse/issues/13626 Part of https://github.com/matrix-org/synapse/issues/13356 Mentioned in [internal doc](https://docs.google.com/document/d/1lvUoVfYUiy6UaHB6Rb4HicjaJAU40-APue9Q4vzuW3c/edit#bookmark=id.2tvwz3yhcafh) ### Query performance #### Before The query from `get_current_state` sucks just because we have to get all 80k events. And we see almost the exact same performance locally trying to get all of these events (16s vs 17s): ``` synapse=# SELECT type, state_key, event_id FROM current_state_events WHERE room_id = '!OGEhHVWSdvArJzumhm:matrix.org'; Time: 16035.612 ms (00:16.036) synapse=# SELECT type, state_key, event_id FROM current_state_events WHERE room_id = '!OGEhHVWSdvArJzumhm:matrix.org'; Time: 4243.237 ms (00:04.243) ``` But what about `get_current_hosts_in_room`: When there is 8M rows in the `current_state_events` table, the previous query in `get_current_hosts_in_room` took 13s from complete freshness (when the events were first added). But takes 930ms after a Postgres restart or 390ms if running back to back to back. ```sh $ psql synapse synapse=# \timing on synapse=# SELECT COUNT(DISTINCT substring(state_key FROM '@[^:]:(.)$')) FROM current_state_events WHERE type = 'm.room.member' AND membership = 'join' AND room_id = '!OGEhHVWSdvArJzumhm:matrix.org'; count ------- 4130 (1 row) Time: 13181.598 ms (00:13.182) synapse=# SELECT COUNT() from current_state_events where room_id = '!OGEhHVWSdvArJzumhm:matrix.org'; count ------- 80814 synapse=# SELECT COUNT() from current_state_events; count --------- 8162847 synapse=# SELECT pg_size_pretty( pg_total_relation_size('current_state_events') ); pg_size_pretty ---------------- 4702 MB ``` #### After I'm not sure how long it takes from complete freshness as I only really get that opportunity once (maybe restarting computer but that's cumbersome) and it's not really relevant to normal operating times. Maybe you get closer to the fresh times the more access variability there is so that Postgres caches aren't as exact. Update: The longest I've seen this run for is 6.4s and 4.5s after a computer restart. After a Postgres restart, it takes 330ms and running back to back takes 260ms. ```sh $ psql synapse synapse=# \timing on Timing is on. synapse=# SELECT substring(c.state_key FROM '@[^:]:(.)$') as host FROM current_state_events c /* Get the depth of the event from the events table */ INNER JOIN events AS e USING (event_id) WHERE c.type = 'm.room.member' AND c.membership = 'join' AND c.room_id = '!OGEhHVWSdvArJzumhm:matrix.org' GROUP BY host ORDER BY min(e.depth) ASC; Time: 333.800 ms ``` #### Going further To improve things further we could add a `limit` parameter to `get_current_hosts_in_room`. Realistically, we don't need 4k domains to choose from because there is no way we're going to query that many before we a) probably get an answer or b) we give up. Another thing we can do is optimize the query to use a index skip scan: - https://wiki.postgresql.org/wiki/Loose_indexscan - Index Skip Scan, https://commitfest.postgresql.org/37/1741/ - https://www.timescale.com/blog/how-we-made-distinct-queries-up-to-8000x-faster-on-postgresql/	2022-08-30 01:38:14 -05:00
Eric Eastwood	d58615c82c	Directly lookup local membership instead of getting all members in a room first (`get_users_in_room` mis-use) (#13608 ) See https://github.com/matrix-org/synapse/pull/13575#discussion_r953023755	2022-08-24 14:13:12 -05:00
Erik Johnston	05c9c7363b	Fix regression caused by #13573 (#13600 ) Broke in #13573.	2022-08-23 14:14:05 +00:00
Nick Mills-Barrett	5e7847dc92	Cache user IDs instead of profile objects (#13573 ) The profile objects are never used and increase cache size significantly.	2022-08-23 09:49:59 +00:00
Dirk Klimpel	d75512d19e	Add forgotten status to Room Details API (#13503 )	2022-08-17 09:42:01 +00:00
reivilibre	c3516e9dec	Faster room joins: make `/joined_members` block whilst the room is partial stated. (#13514 )	2022-08-16 13:16:56 +01:00
Nick Mills-Barrett	41320a0554	Optimise async get event lookups (#13435 ) Still maintains local in memory lookup optimisation, but does any external lookup as part of the deferred that prevents duplicate lookups for the same event at once. This makes the assumption that fetching from an external cache is a non-zero load operation.	2022-08-04 15:49:55 +01:00
Erik Johnston	43adf2521c	Refactor presence so we can prune user in room caches (#13313 ) See #10826 and #10786 for context as to why we had to disable pruning on those caches. Now that `get_users_who_share_room_with_user` is called frequently only for presence, we just need to make calls to it less frequent and then we can remove the various levels of caching that is going on.	2022-07-25 09:21:06 +00:00
Shay	7864f33e28	Increase batch size of `bulk_get_push_rules` and `_get_joined_profiles_from_event_ids`. (#13300 )	2022-07-18 13:15:23 -07:00
Shay	15edf23626	Improve performance of query ` _get_subset_users_in_room_with_profiles` (#13299 )	2022-07-18 12:35:45 -07:00
Nick Mills-Barrett	cc21a431f3	Async get event cache prep (#13242 ) Some experimental prep work to enable external event caching based on #9379 & #12955. Doesn't actually move the cache at all, just lays the groundwork for async implemented caches. Signed off by Nick @ Beeper (@Fizzadar)	2022-07-15 09:30:46 +00:00
Erik Johnston	0ca4172b5d	Don't pull out state in `compute_event_context` for unconflicted state (#13267 )	2022-07-14 13:57:02 +00:00
Erik Johnston	e5716b631c	Don't pull out the full state when calculating push actions (#13078 )	2022-07-11 20:08:39 +00:00
Erik Johnston	44de53bb79	Reduce state pulled from DB due to sending typing and receipts over federation (#12964 ) Reducing the amount of state we pull from the DB is useful as fetching state is expensive in terms of DB, CPU and memory.	2022-06-06 16:46:11 +01:00
Jonathan de Jong	6be4953b99	Mutual rooms: Remove dependency on user directory (#12836 )	2022-05-30 10:05:31 +01:00
David Robertson	5331fb5b47	allow `on_invalidate=None` in `@cached` methods (#12769 )	2022-05-17 16:06:45 +00:00
Dirk Klimpel	6edefef602	Add some type hints to datastore (#12717 )	2022-05-17 15:29:06 +01:00
Sean Quah	800ba87cc8	Refactor and convert `Linearizer` to async (#12357 ) Refactor and convert `Linearizer` to async. This makes a `Linearizer` cancellation bug easier to fix. Also refactor to use an async context manager, which eliminates an unlikely footgun where code that doesn't immediately use the context manager could forget to release the lock. Signed-off-by: Sean Quah <seanq@element.io>	2022-04-05 15:43:52 +01:00

1 2

90 Commits (898655fd1240138600c96cfa763603c3e5ca3e0e)