MatrixSynapse

Commit Graph

Author	SHA1	Message	Date
Sean Quah	f792dd74e1	Remove option to skip locking of tables during emulated upserts (#14469 ) To perform an emulated upsert into a table safely, we must either: * lock the table, * be the only writer upserting into the table * or rely on another unique index being present. When the 2nd or 3rd cases were applicable, we previously avoided locking the table as an optimization. However, as seen in #14406, it is easy to slip up when adding new schema deltas and corrupt the database. The only time we lock when performing emulated upserts is while waiting for background updates on postgres. On sqlite, we do no locking at all. Let's remove the option to skip locking tables, so that we don't shoot ourselves in the foot again. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-28 13:42:06 +00:00
schmop	c2e06c36d4	Fix crash admin media list api when info is None (#14537 ) Fixes https://github.com/matrix-org/synapse/issues/14536	2022-11-24 10:49:04 +00:00
Erik Johnston	f38d7d79c8	Add another index to `device_lists_changes_in_room` (#14534 ) This helps avoid reading unnecessarily large amounts of data from the table when querying with a set of room IDs.	2022-11-23 14:09:00 +00:00
Eric Eastwood	7f78b383ca	Optimize `filter_events_for_client` for faster `/messages` - v2 (#14527 ) Fix #14108	2022-11-22 21:56:28 +00:00
Sean Quah	9cae44f49e	Track unconverted device list outbound pokes using a position instead (#14516 ) When a local device list change is added to `device_lists_changes_in_room`, the `converted_to_destinations` flag is set to `FALSE` and the `_handle_new_device_update_async` background process is started. This background process looks for unconverted rows in `device_lists_changes_in_room`, copies them to `device_lists_outbound_pokes` and updates the flag. To update the `converted_to_destinations` flag, the database performs a `DELETE` and `INSERT` internally, which fragments the table. To avoid this, track unconverted rows using a `(stream ID, room ID)` position instead of the flag. From now on, the `converted_to_destinations` column indicates rows that need converting to outbound pokes, but does not indicate whether the conversion has already taken place. Closes #14037. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-22 16:46:52 +00:00
Patrick Cloke	6d7523ef14	Batch fetch bundled references (#14508 ) Avoid an n+1 query problem and fetch the bundled aggregations for m.reference relations in a single query instead of a query per event. This applies similar logic as was previously done for edits in `8b309adb43` (#11660); threads in `b65acead42` (#11752); and annotations in `1799a54a54` (#14491).	2022-11-22 09:41:09 -05:00
Patrick Cloke	1799a54a54	Batch fetch bundled annotations (#14491 ) Avoid an n+1 query problem and fetch the bundled aggregations for m.annotation relations in a single query instead of a query per event. This applies similar logic as was previously done for edits in `8b309adb43` (#11660) and threads in `b65acead42` (#11752).	2022-11-22 07:26:11 -05:00
Andrew Morgan	e7132c3f81	Fix check to ignore blank lines in incoming TCP replication (#14449 )	2022-11-17 16:09:56 +00:00
David Robertson	115f0eb233	Reintroduce #14376 , with bugfix for monoliths (#14468 ) * Add tests for StreamIdGenerator * Drive-by: annotate all defs * Revert "Revert "Remove slaved id tracker (#14376)" (#14463)" This reverts commit `d63814fd73`, which in turn reverted `36097e88c4`. This restores the latter. * Fix StreamIdGenerator not handling unpersisted IDs Spotted by @erikjohnston. Closes #14456. * Changelog Co-authored-by: Nick Mills-Barrett <nick@fizzadar.com> Co-authored-by: Erik Johnston <erik@matrix.org>	2022-11-16 22:16:46 +00:00
Patrick Cloke	d8cc86eff4	Remove redundant types from comments. (#14412 ) Remove type hints from comments which have been added as Python type hints. This helps avoid drift between comments and reality, as well as removing redundant information. Also adds some missing type hints which were simple to fill in.	2022-11-16 15:25:24 +00:00
Sean Quah	882277008c	Fix background updates failing to add unique indexes on receipts (#14453 ) As part of the database migration to support threaded receipts, there is a possible window in between `73/08thread_receipts_non_null.sql.postgres` removing the original unique constraints on `receipts_linearized` and `receipts_graph` and the `receipts_linearized_unique_index` and `receipts_graph_unique_index` background updates from `72/08thread_receipts.sql` completing where the unique constraints on `receipts_linearized` and `receipts_graph` are missing. Any emulated upserts on these tables must therefore be performed with a lock held, otherwise duplicate rows can end up in the tables when there are concurrent emulated upserts. Fix the missing lock. Note that emulated upserts no longer happen by default on sqlite, since the minimum supported version of sqlite supports native upserts by default now. Finally, clean up any duplicate receipts that may have crept in before trying to create the `receipts_graph_unique_index` and `receipts_linearized_unique_index` unique indexes. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-16 15:01:22 +00:00
Erik Johnston	d63814fd73	Revert "Remove slaved id tracker (#14376 )" (#14463 ) This reverts commit `36097e88c4`.	2022-11-16 13:50:07 +00:00
David Robertson	1eed795fc5	Include heroes in partial join responses' state (#14442 ) * Pull out hero selection logic * Include heroes in partial join response's state * Changelog * Fixup trial test * Remove TODO	2022-11-15 17:35:19 +00:00
reivilibre	634359b083	Update docstring to clarify that `get_partial_state_events_batch` does not just give you completely arbitrary partial-state events. (#14417 )	2022-11-15 10:43:17 +00:00
Nick Mills-Barrett	36097e88c4	Remove slaved id tracker (#14376 ) This matches the multi instance writer ID generator class which can both handle advancing the current token over replication and by calling the database.	2022-11-14 17:31:36 +00:00
Patrick Cloke	fb66fae84b	Clean-up events persistence code (#14411 ) By removing unused variables and making some arguments required which are always provided.	2022-11-14 08:13:11 -05:00
Nick Mills-Barrett	3a4f80f8c6	Merge/remove `Slaved*` stores into `WorkerStores` (#14375 )	2022-11-11 10:51:49 +00:00
Sean Quah	b2c2b03079	Fix PostgreSQL sometimes using table scans for `event_search` (#14409 ) PostgreSQL may underestimate the number of distinct `room_id`s in `event_search`, which can cause it to use table scans for queries for multiple rooms. Fix this by setting `n_distinct` on the column. Resolves #14402. Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-10 19:02:27 +00:00
Patrick Cloke	e9a4343cb2	Drop support for Postgres 10 in full text search code. (#14397 )	2022-11-09 09:55:34 -05:00
Sean Quah	a5fcdea090	Remove support for PostgreSQL 10 (#14392 ) Signed-off-by: Sean Quah <seanq@matrix.org>	2022-11-08 17:17:13 +00:00
Richard van der Hoff	2193513346	Fix background update table-scanning `events` (#14374 ) When this background update did its last batch, it would try to update all the events that had been inserted since the bgupdate started, which could cause a table-scan. Make sure we limit the update correctly.	2022-11-07 14:28:00 +00:00
dependabot[bot]	8bcdd712b8	Bump flake8-bugbear from 22.9.23 to 22.10.27 (#14329 ) Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: GitHub Actions <github-actions[bot]@users.noreply.github.com> Co-authored-by: Olivier Wilkinson (reivilibre) <oliverw@matrix.org>	2022-11-04 18:43:14 +00:00
Brendan Abolivier	86c5a710d8	Implement MSC3912: Relation-based redactions (#14260 ) Co-authored-by: Sean Quah <8349537+squahtx@users.noreply.github.com>	2022-11-03 16:21:31 +00:00
Quentin Gliech	cc3a52b33d	Support OIDC backchannel logouts (#11414 ) If configured, an OIDC IdP can log a user's session out of Synapse when they log out of the identity provider. The IdP sends a request directly to Synapse (and must be configured with an endpoint) when a user logs out.	2022-10-31 13:07:30 -04:00
Andrew Morgan	7911e2835d	Prevent federation user keys query from returning device names if disallowed (#14304 )	2022-10-28 18:06:02 +01:00
Patrick Cloke	81815e0561	Switch search SQL to triple-quote strings. (#14311 ) For ease of reading we switch from concatenated strings to triple quote strings.	2022-10-28 11:44:10 -04:00
Eric Eastwood	aa70556699	Check appservice user interest against the local users instead of all users (`get_users_in_room` mis-use) (#13958 )	2022-10-27 18:29:23 +00:00
Patrick Cloke	67583281e3	Fix tests for change in PostgreSQL 14 behavior change. (#14310 ) PostgreSQL 14 changed the behavior of `websearch_to_tsquery` to improve some behaviour. The tests were hitting those edge-cases about handling of hanging double quotes. This fixes the tests to take into account the PostgreSQL version.	2022-10-27 13:58:12 +00:00
Mathieu Velten	4dc05f3019	Fix presence bug introduced in 1.64 by #13313 (#14243 ) * Fix presence bug introduced in 1.64 by #13313 Signed-off-by: Mathieu Velten <mathieuv@matrix.org> * Add changelog * Add DISTINCT * Apply suggestions from code review Signed-off-by: Mathieu Velten <mathieuv@matrix.org>	2022-10-27 13:16:00 +01:00
Quentin Gliech	8756d5c87e	Save login tokens in database (#13844 ) * Save login tokens in database Signed-off-by: Quentin Gliech <quenting@element.io> * Add upgrade notes * Track login token reuse in a Prometheus metric Signed-off-by: Quentin Gliech <quenting@element.io>	2022-10-26 11:45:41 +01:00
James Salter	d902181de9	Unified search query syntax using the full-text search capabilities of the underlying DB. (#11635 ) Support a unified search query syntax which leverages more of the full-text search of each database supported by Synapse. Supports, with the same syntax across Postgresql 11+ and Sqlite: - quoted "search terms" - `AND`, `OR`, `-` (negation) operators - Matching words based on their stem, e.g. searches for "dog" matches documents containing "dogs". This is achieved by - If on postgresql 11+, pass the user input to `websearch_to_tsquery` - If on sqlite, manually parse the query and transform it into the sqlite-specific query syntax. Note that postgresql 10, which is close to end-of-life, falls back to using `phraseto_tsquery`, which only supports a subset of the features. Multiple terms separated by a space are implicitly ANDed. Note that: 1. There is no escaping of full-text syntax that might be supported by the database; e.g. `NOT`, `NEAR`, `*` in sqlite. This runs the risk that people might discover this as accidental functionality and depend on something we don't guarantee. 2. English text is assumed for stemming. To support other languages, either the target language needs to be known at the time of indexing the message (via room metadata, or otherwise), or a separate index for each language supported could be created. Sqlite docs: https://www.sqlite.org/fts3.html#full_text_index_queries Postgres docs: https://www.postgresql.org/docs/11/textsearch-controls.html	2022-10-25 14:05:22 -04:00
Olivier Wilkinson (reivilibre)	85fcbba595	Merge branch 'release-v1.70' into develop	2022-10-25 15:39:35 +01:00
DeepBlueV7.X	2d0ba3f89a	Implementation for MSC3664: Pushrules for relations (#11804 )	2022-10-25 14:38:01 +01:00
asymmetric	8c94dd3a27	Enable WAL for SQLite (#13897 ) Signed-off-by: Lorenzo Manacorda <lorenzo@mailbox.org>	2022-10-25 10:22:55 +01:00
Patrick Cloke	581b37b5d6	Revert behavior change for bundling edits of non-message events (#14283 )	2022-10-24 17:07:16 +01:00
Richard van der Hoff	1469fed0e3	Add debugging to help diagnose lost device-list-update (#14268 )	2022-10-24 10:45:10 +01:00
Patrick Cloke	4dd7aa371b	Properly update the threads table when thread events are redacted. (#14248 ) When the last event in a thread is redacted we need to update the threads table: * Find the new latest event in the thread and store it into the table; or * Remove the thread from the table if it is no longer a thread (i.e. all events in the thread were redacted).	2022-10-21 09:11:19 -04:00
Tadeusz Sośnierz	1433b5d5b6	Show erasure status when listing users in the Admin API (#14205 ) * Show erasure status when listing users in the Admin API * Use USING when joining erased_users * Add changelog entry * Revert "Use USING when joining erased_users" This reverts commit `30bd2bf106`. * Make the erased check work on postgres * Add a testcase for showing erased user status * Appease the style linter * Explicitly convert `erased` to bool to make SQLite consistent with Postgres This also adds us an easy way in to fix the other accidentally integered columns. * Move erasure status test to UsersListTestCase * Include user erased status when fetching user info via the admin API * Document the erase status in user_admin_api * Appease the linter and mypy * Signpost comments in tests Co-authored-by: Tadeusz Sośnierz <tadeusz@sosnierz.com> Co-authored-by: David Robertson <david.m.robertson1@gmail.com>	2022-10-21 13:52:44 +01:00
dependabot[bot]	0b7830e457	Bump flake8-bugbear from 21.3.2 to 22.9.23 (#14042 ) Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Erik Johnston <erik@matrix.org> Co-authored-by: David Robertson <davidr@element.io>	2022-10-19 19:38:24 +00:00
Eric Eastwood	fa8616e65c	Fix MSC3030 `/timestamp_to_event` returning `outliers` that it has no idea whether are near a gap or not (#14215 ) Fix MSC3030 `/timestamp_to_event` endpoint returning `outliers` that it has no idea whether are near a gap or not (and therefore unable to determine whether it's actually the closest event). The reason Synapse doesn't know whether an `outlier` is next to a gap is because our gap checks rely on entries in the `event_edges`, `event_forward_extremities`, and `event_backward_extremities` tables which is [not the case for `outliers`](`2c63cdcc3f/docs/development/room-dag-concepts.md (outliers)`).

Also fixes MSC3030 Complement `can_paginate_after_getting_remote_event_from_timestamp_to_event_endpoint` test flake. Although this acted flakey in Complement, if `sync_partial_state` raced and beat us before `/timestamp_to_event`, then even if we retried the failing `/context` request it wouldn't work until we made this Synapse change. With this PR, Synapse will never return an `outlier` event so that test will always go and ask over federation.

Fix https://github.com/matrix-org/synapse/issues/13944

### Why did this fail before? Why was it flakey?

Sleuthing the server logs on the [CI failure](https://github.com/matrix-org/synapse/actions/runs/3149623842/jobs/5121449357#step:5:5805), it looks like `hs2:/timestamp_to_event` found `$NP6-oU7mIFVyhtKfGvfrEQX949hQX-T-gvuauG6eurU` as an `outlier` event locally. Then when we went and asked for it via `/context`, since it's an `outlier`, it was filtered out of the results -> `You don't have permission to access that event.` This is reproducible when `sync_partial_state` races and persists `$NP6-oU7mIFVyhtKfGvfrEQX949hQX-T-gvuauG6eurU` as an `outlier` before we evaluate `get_event_for_timestamp(...)`.
To consistently reproduce locally, just add a delay at the [start of `get_event_for_timestamp(...)`](`cb20b885cb/synapse/handlers/room.py (L1470-L1496)`) so it always runs after `sync_partial_state` completes.

```py
from twisted.internet import task as twisted_task

# Wait 3.5 seconds on the homeserver's reactor before continuing.
d = twisted_task.deferLater(self.hs.get_reactor(), 3.5)
await d
```

In a run where it passes, on `hs2`, `get_event_for_timestamp(...)` finds a different event locally which is next to a gap and we request a closer one from `hs1` which gets backfilled. And since the backfilled event is not an `outlier`, it's returned as expected during `/context`. With this PR, Synapse will never return an `outlier` event so that test will always go and ask over federation.	2022-10-18 19:46:25 -05:00
Aaron Raimist	2a76a7369f	Fix hiding devices names over federation (#10015 ) And don't include blank opentracing stuff in device list updates. Signed-off-by: Aaron Raimist <aaron@raim.ist>	2022-10-18 20:54:27 +00:00
Patrick Cloke	dbf18f514e	Update the thread_id right before use (in case the bg update hasn't finished) (#14222 ) This avoids running a forced update of null thread_id rows. An index is added (in the background) to hopefully make this easier in the future.	2022-10-18 14:55:41 +00:00
David Robertson	c3a4780080	When restarting a partial join resync, prioritise the server which actioned a partial join (#14126 )	2022-10-18 12:33:18 +01:00
Andrew Morgan	dc02d9f8c5	Avoid checking the event cache when backfilling events (#14164 )	2022-10-18 10:33:35 +01:00
Andrew Morgan	828b5502cf	Remove `_get_events_cache` check optimisation from `_have_seen_events_dict` (#14161 )	2022-10-18 10:33:21 +01:00
Patrick Cloke	4283bd1cf9	Support filtering the /messages API by relation type (MSC3874). (#14148 ) Gated behind an experimental configuration flag.	2022-10-17 11:32:11 -04:00
Nick Mills-Barrett	2c2c3f8b2c	Invalidate rooms for user caches when receiving membership events (#14155 ) This should fix a race where the event notification comes in over replication before the state replication, leaving a window during which a sync may get an incorrect list of rooms for the user.	2022-10-17 13:27:51 +01:00
Eric Eastwood	40bb37eb27	Stop getting missing `prev_events` after we already know their signature is invalid (#13816 ) While https://github.com/matrix-org/synapse/pull/13635 stops us from doing the slow thing after we've already done it once, this PR stops us from doing one of the slow things in the first place.

Related to
- https://github.com/matrix-org/synapse/issues/13622
- https://github.com/matrix-org/synapse/pull/13635
- https://github.com/matrix-org/synapse/issues/13676

Part of https://github.com/matrix-org/synapse/issues/13356

Follow-up to https://github.com/matrix-org/synapse/pull/13815 which tracks event signature failures.

With this PR, we avoid the call to the costly `_get_state_ids_after_missing_prev_event` because the signature failure will count as an attempt before and we filter events based on the backoff before calling `_get_state_ids_after_missing_prev_event` now. For example, this will save us 156s out of the 185s total that this `matrix.org` `/messages` request takes. If you want to see the full Jaeger trace of this, you can drag and drop this `trace.json` into your own Jaeger, https://gist.github.com/MadLittleMods/4b12d0d0afe88c2f65ffcc907306b761

To explain this exact scenario around `/messages` -> backfill, we call `/backfill` and first check the signatures of the 100 events. We see bad signature for `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` and `$zuOn2Rd2vsC7SUia3Hp3r6JSkSFKcc5j3QTTqW_0jDw` (both member events). Then we process the 98 events remaining that have valid signatures but one of the events references `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` as a `prev_event`. So we have to do the whole `_get_state_ids_after_missing_prev_event` rigmarole which pulls in those same events which fail again because the signatures are still invalid.
- `backfill`
- `outgoing-federation-request` `/backfill`
- `_check_sigs_and_hash_and_fetch`
- `_check_sigs_and_hash_and_fetch_one` for each event received over backfill
- ❗ `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` fails with `Signature on retrieved event was invalid.`: `unable to verify signature for sender domain xxx: 401: Failed to find any key to satisfy: _FetchKeyRequest(...)`
- ❗ `$zuOn2Rd2vsC7SUia3Hp3r6JSkSFKcc5j3QTTqW_0jDw` fails with `Signature on retrieved event was invalid.`: `unable to verify signature for sender domain xxx: 401: Failed to find any key to satisfy: _FetchKeyRequest(...)`
- `_process_pulled_events`
- `_process_pulled_event` for each validated event
- ❗ Event `$Q0iMdqtz3IJYfZQU2Xk2WjB5NDF8Gg8cFSYYyKQgKJ0` references `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` as a `prev_event` which is missing so we try to get it
- `_get_state_ids_after_missing_prev_event`
- `outgoing-federation-request` `/state_ids`
- ❗ `get_pdu` for `$luA4l7QHhf_jadH3mI-AyFqho0U2Q-IXXUbGSMq6h6M` which fails the signature check again
- ❗ `get_pdu` for `$zuOn2Rd2vsC7SUia3Hp3r6JSkSFKcc5j3QTTqW_0jDw` which fails the signature check	2022-10-15 00:36:49 -05:00
Patrick Cloke	bc2bd92b93	Merge remote-tracking branch 'origin/release-v1.69' into develop	2022-10-14 14:11:27 -04:00
Patrick Cloke	d1bdeccb50	Accept threaded receipts for events related to the root event. (#14174 ) The root node of a thread (and events related to it) are considered "part of a thread" when validating receipts. This allows clients which show the root node in both the main timeline and the threaded timeline to easily send receipts in either. Note that threaded notifications are not created for these events; they create notifications on the main timeline.	2022-10-14 18:05:25 +00:00
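
Note on the batch-fetch commits listed above (`6d7523ef14`, `1799a54a54`): both describe replacing one query per event with a single query covering all events. The sketch below illustrates that general idea only. It uses an in-memory SQLite database, and the `relations` table, its columns, and both helper functions are hypothetical names chosen for illustration, not Synapse's actual schema or code.

```python
import sqlite3


def make_db():
    """Build a tiny in-memory table of relations (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE relations ("
        "event_id TEXT, relates_to_id TEXT, relation_type TEXT)"
    )
    conn.executemany(
        "INSERT INTO relations VALUES (?, ?, ?)",
        [
            ("$r1", "$a", "m.reference"),
            ("$r2", "$a", "m.reference"),
            ("$r3", "$b", "m.reference"),
            ("$x1", "$b", "m.annotation"),
        ],
    )
    return conn


def fetch_references_per_event(conn, event_ids):
    """The n+1 pattern: one query for every event ID."""
    result = {}
    for event_id in event_ids:
        rows = conn.execute(
            "SELECT event_id FROM relations "
            "WHERE relates_to_id = ? AND relation_type = 'm.reference' "
            "ORDER BY event_id",
            (event_id,),
        ).fetchall()
        result[event_id] = [row[0] for row in rows]
    return result


def fetch_references_batched(conn, event_ids):
    """The batched pattern: a single query for all event IDs."""
    result = {event_id: [] for event_id in event_ids}
    if not event_ids:
        return result
    # One "?" placeholder per ID keeps the query parameterised.
    placeholders = ", ".join("?" for _ in event_ids)
    rows = conn.execute(
        "SELECT relates_to_id, event_id FROM relations "
        "WHERE relation_type = 'm.reference' "
        f"AND relates_to_id IN ({placeholders}) "
        "ORDER BY event_id",
        list(event_ids),
    )
    # Group the flat result set back into one list per parent event.
    for parent_id, child_id in rows:
        result[parent_id].append(child_id)
    return result
```

Both helpers return the same mapping from parent event ID to referencing event IDs; the batched version issues one query regardless of how many events are requested.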


4734 Commits (d56f48038a07fd76d2ce08220a4061f85006bf3b)