MatrixSynapse

Commit Graph

Author	SHA1	Message	Date
Jonathan de Jong	95e47b2e78	[pyupgrade] `synapse/` (#10348 ) This PR is tantamount to running ``` pyupgrade --py36-plus --keep-percent-format `find synapse/ -type f -name "*.py"` ``` Part of #9744	2021-07-19 15:28:05 +01:00
Jonathan de Jong	bf72d10dbf	Use inline type hints in various other places (in `synapse/`) (#10380 )	2021-07-15 11:02:43 +01:00
Erik Johnston	85d237eba7	Add a distributed lock (#10269 ) This adds a simple best effort locking mechanism that works cross workers.	2021-06-29 19:15:47 +01:00
Richard van der Hoff	107c06081f	Ensure that errors during startup are written to the logs and the console. (#10191 ) * Defer stdio redirection until we are about to start the reactor * Catch and handle exceptions during startup	2021-06-21 11:41:25 +01:00
Brendan Abolivier	1b3e398bea	Standardise the module interface (#10062 ) This PR adds a common configuration section for all modules (see docs). These modules are then loaded at startup by the homeserver. Modules register their hooks and web resources using the new `register_[...]_callbacks` and `register_web_resource` methods of the module API.	2021-06-18 12:15:52 +01:00
Erik Johnston	5eed6348ce	Move some more endpoints off master (#10084 )	2021-05-27 22:45:43 +01:00
Erik Johnston	3e831f24ff	Don't hammer the database for destination retry timings every ~5mins (#10036 )	2021-05-21 17:57:08 +01:00
Erik Johnston	ef889c98a6	Optionally track memory usage of each LruCache (#9881 ) This will double count slightly in the presence of interned strings. It's off by default as it can consume a lot of resources.	2021-05-05 16:54:36 +01:00
Erik Johnston	1fb9a2d0bf	Limit how often GC happens by time. (#9902 ) Synapse can be quite memory intensive, and unless care is taken to tune the GC thresholds it can end up thrashing, causing noticable performance problems for large servers. We fix this by limiting how often we GC a given generation, regardless of current counts/thresholds. This does not help with the reverse problem where the thresholds are set too high, but that should only happen in situations where they've been manually configured. Adds a `gc_min_seconds_between` config option to override the defaults. Fixes #9890.	2021-05-05 16:53:45 +01:00
Richard van der Hoff	3ff2251754	Improved validation for received requests (#9817 ) * Simplify `start_listening` callpath * Correctly check the size of uploaded files	2021-04-23 19:20:44 +01:00
Richard van der Hoff	59d24c5bef	pass a reactor into SynapseSite (#9874 )	2021-04-23 17:06:47 +01:00
Erik Johnston	9d25a0ae65	Split presence out of master (#9820 )	2021-04-23 12:21:55 +01:00
Richard van der Hoff	5a153772c1	remove `HomeServer.get_config` (#9815 ) Every single time I want to access the config object, I have to remember whether or not we use `get_config`. Let's just get rid of it.	2021-04-14 19:09:08 +01:00
Erik Johnston	00a6db9676	Move some replication processing out of generic_worker (#9796 ) Co-authored-by: Richard van der Hoff <1389908+richvdh@users.noreply.github.com>	2021-04-14 17:06:06 +01:00
Jonathan de Jong	4b965c862d	Remove redundant "coding: utf-8" lines (#9786 ) Part of #9744 Removes all redundant `# -- coding: utf-8 --` lines from files, as python 3 automatically reads source code as utf-8 now. `Signed-off-by: Jonathan de Jong <jonathan@automatia.nl>`	2021-04-14 15:34:27 +01:00
Andrew Morgan	04819239ba	Add a Synapse Module for configuring presence update routing (#9491 ) At the moment, if you'd like to share presence between local or remote users, those users must be sharing a room together. This isn't always the most convenient or useful situation though. This PR adds a module to Synapse that will allow deployments to set up extra logic on where presence updates should be routed. The module must implement two methods, `get_users_for_states` and `get_interested_users`. These methods are given presence updates or user IDs and must return information that Synapse will use to grant passing presence updates around. A method is additionally added to `ModuleApi` which allows triggering a set of users to receive the current, online presence information for all users they are considered interested in. This is the equivalent of that user receiving presence information during an initial sync. The goal of this module is to be fairly generic and useful for a variety of applications, with hard requirements being: * Sending state for a specific set or all known users to a defined set of local and remote users. * The ability to trigger an initial sync for specific users, so they receive all current state.	2021-04-06 14:38:30 +01:00
Patrick Cloke	da75d2ea1f	Add type hints for the federation sender. (#9681 ) Includes an abstract base class which both the FederationSender and the FederationRemoteSendQueue must implement.	2021-03-29 11:43:20 -04:00
Richard van der Hoff	7c8402ddb8	Suppress CryptographyDeprecationWarning (#9698 ) This warning is somewhat confusing to users, so let's suppress it	2021-03-26 17:33:55 +00:00
Brendan Abolivier	0b56481caa	Fix lint	2021-03-19 16:11:08 +01:00
Brendan Abolivier	066c703729	Move support for MSC3026 behind an experimental flag	2021-03-18 18:37:19 +01:00
Brendan Abolivier	405aeb0b2c	Implement MSC3026: busy presence state	2021-03-18 16:34:47 +01:00
Richard van der Hoff	4db07f9aef	Set X-Forwarded-Proto header when frontend-proxy proxies a request (#9539 ) Should fix some remaining warnings	2021-03-03 18:49:08 +00:00
Erik Johnston	2927921942	Clean up `ShardedWorkerHandlingConfig` (#9466 ) * Split ShardedWorkerHandlingConfig This is so that we have a type level understanding of when it is safe to call `get_instance(..)` (as opposed to `should_handle(..)`). * Remove special cases in ShardedWorkerHandlingConfig. `ShardedWorkerHandlingConfig` tried to handle the various different ways it was possible to configure federation senders and pushers. This led to special cases that weren't hit during testing. To fix this the handling of the different cases is moved from there and `generic_worker` into the worker config class. This allows us to have the logic in one place and allows the rest of the code to ignore the different cases.	2021-02-24 13:23:18 +00:00
Erik Johnston	66f4949e7f	Fix deleting pushers when using sharded pushers. (#9465 )	2021-02-22 21:14:42 +00:00
Eric Eastwood	0a00b7ff14	Update black, and run auto formatting over the codebase (#9381 ) - Update black version to the latest - Run black auto formatting over the codebase - Run autoformatting according to [`docs/code_style.md `](`80d6dc9783/docs/code_style.md`) - Update `code_style.md` docs around installing black to use the correct version	2021-02-16 22:32:34 +00:00
Richard van der Hoff	9c715a5f19	Fix SSO on workers (#9271 ) Fixes #8966. * Factor out build_synapse_client_resource_tree Start a function which will mount resources common to all workers. * Move sso init into build_synapse_client_resource_tree ... so that we don't have to do it for each worker * Fix SSO-login-via-a-worker Expose the SSO login endpoints on workers, like the documentation says. * Update workers config for new endpoints Add documentation for endpoints recently added (#8942, #9017, #9262) * remove submit_token from workers endpoints list this doesn't work on workers (yet). * changelog * Add a comment about the odd path for SAML2Resource	2021-02-01 15:47:59 +00:00
Erik Johnston	6633a4015a	Allow moving account data and receipts streams off master (#9104 )	2021-01-18 15:47:59 +00:00
Patrick Cloke	d1eb1b96e8	Register the /devices endpoint on workers. (#9092 )	2021-01-13 12:35:40 -05:00
Erik Johnston	c9195744a4	Move more encryption endpoints off master (#9068 )	2021-01-11 18:01:27 +00:00
Richard van der Hoff	671138f658	Clean up exception handling in the startup code (#9059 ) Factor out the exception handling in the startup code to a utility function, and fix the some logging and exit code stuff.	2021-01-11 15:55:05 +00:00
Erik Johnston	b530eaa262	Allow running sendToDevice on workers (#9044 )	2021-01-07 20:19:26 +00:00
Patrick Cloke	68bb26da69	Allow redacting events on workers (#8994 ) Adds the redacts endpoint to workers that have the client listener.	2020-12-29 07:40:12 -05:00
Patrick Cloke	30fba62108	Apply an IP range blacklist to push and key revocation requests. (#8821 ) Replaces the `federation_ip_range_blacklist` configuration setting with an `ip_range_blacklist` setting with wider scope. It now applies to: * Federation * Identity servers * Push notifications * Checking key validitity for third-party invite events The old `federation_ip_range_blacklist` setting is still honored if present, but with reduced scope (it only applies to federation and identity servers).	2020-12-02 11:09:24 -05:00
Erik Johnston	921a3f8a59	Fix not sending events over federation when using sharded event persisters (#8536 ) * Fix outbound federaion with multiple event persisters. We incorrectly notified federation senders that the minimum persisted stream position had advanced when we got an `RDATA` from an event persister. Notifying of federation senders already correctly happens in the notifier, so we just delete the offending line. * Change some interfaces to use RoomStreamToken. By enforcing use of `RoomStreamTokens` we make it less likely that people pass in random ints that they got from somewhere random.	2020-10-14 13:27:51 +01:00
Patrick Cloke	e4f72ddc44	Move additional tasks to the background worker (#8458 )	2020-10-07 11:27:56 -04:00
Patrick Cloke	62894673e6	Allow background tasks to be run on a separate worker. (#8369 )	2020-10-02 08:23:15 -04:00
Patrick Cloke	8a4a4186de	Simplify super() calls to Python 3 syntax. (#8344 ) This converts calls like super(Foo, self) -> super(). Generated with: sed -i "" -Ee 's/super\([^\(]+\)/super()/g' */.py	2020-09-18 09:56:44 -04:00
Patrick Cloke	c619253db8	Stop sub-classing object (#8249 )	2020-09-04 06:54:56 -04:00
Erik Johnston	0f1afbe8dc	Change HomeServer definition to work with typing. Duplicating function signatures between server.py and server.pyi is silly. This commit changes that by changing all `build_` methods to `get_` methods and changing the `_make_dependency_method` to work work as a descriptor that caches the produced value. There are some changes in other files that were made to fix the typing in server.py.	2020-08-11 18:00:17 +01:00
Erik Johnston	7620912d84	Add health check endpoint (#8048 )	2020-08-07 14:21:24 +01:00
Erik Johnston	a7bdf98d01	Rename database classes to make some sense (#8033 )	2020-08-05 21:38:57 +01:00
Olivier Wilkinson (reivilibre)	3aa36b782c	Merge branch 'master' into develop	2020-07-30 15:18:36 +01:00
Patrick Cloke	3950ae51ef	Ensure that remove_pusher is always async (#7981 )	2020-07-30 06:56:55 -04:00
Erik Johnston	2c1b9d6763	Update worker docs with recent enhancements (#7969 )	2020-07-29 23:22:13 +01:00
Erik Johnston	84d099ae11	Fix typing replication not being handled on master (#7959 ) Handling of incoming typing stream updates from replication was not hooked up on master, effecting set ups where typing was handled on a different worker. This is really only a problem if the master process is also handling sync requests, which is unlikely for those that are at the stage of moving typing off. The other observable effect is that if a worker restarts or a replication connect drops then the typing worker will issue a `POSITION typing`, triggering master process to try and stream all typing updates from position 0. Fixes #7907	2020-07-27 14:10:53 +01:00
Patrick Cloke	00e57b755c	Convert synapse.app to async/await. (#7868 )	2020-07-17 07:08:56 -04:00
Erik Johnston	f2e38ca867	Allow moving typing off master (#7869 )	2020-07-16 15:12:54 +01:00
Erik Johnston	f299441cc6	Add ability to shard the federation sender (#7798 )	2020-07-10 18:26:36 +01:00
Patrick Cloke	8fa7fdd4cb	Pass original request headers from workers to the main process. (#7797 )	2020-07-09 07:34:46 -04:00
Richard van der Hoff	03619324fc	Create a ListenerConfig object (#7681 ) This ended up being a bit more invasive than I'd hoped for (not helped by generic_worker duplicating some of the code from homeserver), but hopefully it's an improvement. The idea is that, rather than storing unstructured `dict`s in the config for the listener configurations, we instead parse it into a structured `ListenerConfig` object.	2020-06-16 12:44:07 +01:00
Patrick Cloke	7d2532be36	Discard RDATA from already seen positions. (#7648 )	2020-06-15 08:44:54 -04:00
Erik Johnston	ef3934ec8f	Ensure we persist and ack the same token	2020-05-27 19:45:42 +01:00
Erik Johnston	35c308731d	Speed up processing of federation stream RDATA rows. Instead of storing and sending an ACK for every single row we send synchronously, we instead do it asynchronously while batching up updates.	2020-05-27 19:34:07 +01:00
Richard van der Hoff	04729b86f8	Fix incorrect exception handling in KeyUploadServlet.on_POST (#7563 ) Introduced in #7556	2020-05-26 11:42:22 +01:00
Richard van der Hoff	00db90f409	Fix recording of federation stream token (#7564 ) A couple of changes of significance: * remove the `_last_ack < federation_position` condition, so that updates will still be correctly processed after restart * Correctly wire up send_federation_ack to the right class.	2020-05-26 11:41:38 +01:00
Erik Johnston	e5c67d04db	Add option to move event persistence off master (#7517 )	2020-05-22 16:11:35 +01:00
Patrick Cloke	4429764c9f	Return 200 OK for all OPTIONS requests (#7534 )	2020-05-22 09:30:07 -04:00
Erik Johnston	547e4dd83e	Fix exception reporting due to HTTP request errors. (#7556 ) These are business as usual errors, rather than stuff we want to log at error.	2020-05-22 11:39:20 +01:00
Richard van der Hoff	0bbbd10513	Stub out GET presence requests in the frontend proxy (#7545 ) We don't really make any promises about returning accurate presence data when presence is disabled, so we may as well just return a static response, rather than making the master handle a request.	2020-05-21 14:36:46 +01:00
Erik Johnston	51055c8c44	Allow ReplicationRestResource to be added to workers (#7515 ) This allows workers to talk to each other over HTTP replication.	2020-05-18 12:24:48 +01:00
Erik Johnston	03aff4c75e	Add a worker store for search insertion. (#7516 ) This is required as both event persistence and the background update needs access to this function. It should be perfectly safe for two workers to write to that table at the same time.	2020-05-15 17:22:47 +01:00
Erik Johnston	4734a7bbe4	Move EventStream handling into default ReplicationDataHandler (#7493 ) This is so that the logic can happen on both master and workers when we move event persistence out.	2020-05-14 14:01:39 +01:00
Erik Johnston	1124111a12	Allow censoring of events to happen on workers. (#7492 ) This is safe as we can now write to cache invalidation stream on workers, and is required for when we move event persistence off master.	2020-05-13 17:15:40 +01:00
Erik Johnston	0e719f2398	Thread through instance name to replication client. (#7369 ) For in memory streams when fetching updates on workers we need to query the source of the stream, which currently is hard coded to be master. This PR threads through the source instance we received via `POSITION` through to the update function in each stream, which can then be passed to the replication client for in memory streams.	2020-05-01 17:19:56 +01:00
Erik Johnston	3085cde577	Use `stream.current_token()` and remove `stream_positions()` (#7172 ) We move the processing of typing and federation replication traffic into their handlers so that `Stream.current_token()` points to a valid token. This allows us to remove `get_streams_to_replicate()` and `stream_positions()`.	2020-05-01 15:21:35 +01:00
Patrick Cloke	627b0f5f27	Persist user interactive authentication sessions (#7302 ) By persisting the user interactive authentication sessions to the database, this fixes situations where a user hits different works throughout their auth session and also allows sessions to persist through restarts of Synapse.	2020-04-30 13:47:49 -04:00
Erik Johnston	38919b521e	Run replication streamers on workers (#7146 ) Currently we never write to streams from workers, but that will change soon	2020-04-28 13:34:12 +01:00
Richard van der Hoff	71a1abb8a1	Stop the master relaying USER_SYNC for other workers (#7318 ) Long story short: if we're handling presence on the current worker, we shouldn't be sending USER_SYNC commands over replication. In an attempt to figure out what is going on here, I ended up refactoring some bits of the presencehandler code, so the first 4 commits here are non-functional refactors to move this code slightly closer to sanity. (There's still plenty to do here :/). Suggest reviewing individual commits. Fixes (I hope) #7257.	2020-04-22 22:39:04 +01:00
Richard van der Hoff	2aa5bf13c8	Merge branch 'release-v1.12.4' into develop	2020-04-22 13:09:23 +01:00
Richard van der Hoff	974c0d726a	Support GET account_data requests on a worker (#7311 )	2020-04-21 10:46:30 +01:00
Erik Johnston	5016b162fc	Move client command handling out of TCP protocol (#7185 ) The aim here is to move the command handling out of the TCP protocol classes and to also merge the client and server command handling (so that we can reuse them for redis protocol). This PR simply moves the client paths to the new `ReplicationCommandHandler`, a future PR will move the server paths too.	2020-04-06 09:58:42 +01:00
Richard van der Hoff	bae32740da	Remove some `run_in_background` calls in replication code (#7203 ) By running this stuff with `run_in_background`, it won't be correctly reported against the relevant CPU usage stats. Fixes #7202	2020-04-03 12:29:30 +01:00
Erik Johnston	db098ec994	Fix starting workers when federation sending not split out.	2020-03-31 11:25:21 +01:00
Erik Johnston	4f21c33be3	Remove usage of "conn_id" for presence. (#7128 ) * Remove `conn_id` usage for UserSyncCommand. Each tcp replication connection is assigned a "conn_id", which is used to give an ID to a remotely connected worker. In a redis world, there will no longer be a one to one mapping between connection and instance, so instead we need to replace such usages with an ID generated by the remote instances and included in the replicaiton commands. This really only effects UserSyncCommand. * Add CLEAR_USER_SYNCS command that is sent on shutdown. This should help with the case where a synchrotron gets restarted gracefully, rather than rely on 5 minute timeout.	2020-03-30 16:37:24 +01:00
Erik Johnston	4cff617df1	Move catchup of replication streams to worker. (#7024 ) This changes the replication protocol so that the server does not send down `RDATA` for rows that happened before the client connected. Instead, the server will send a `POSITION` and clients then query the database (or master out of band) to get up to date.	2020-03-25 14:54:01 +00:00
Erik Johnston	b1cfaf08af	Merge pull request #7133 from matrix-org/erikj/fix_worker_startup Fix starting workers when federation sending not split out.	2020-03-25 09:42:39 +00:00
Erik Johnston	c816072d47	Fix starting workers when federation sending not split out.	2020-03-24 10:35:00 +00:00
Richard van der Hoff	a564b92d37	Convert `*StreamRow` classes to inner classes (#7116 ) This just helps keep the rows closer to their streams, so that it's easier to see what the format of each stream is.	2020-03-23 13:59:11 +00:00
Richard van der Hoff	b3cee0ce67	Fix processing of `groups` stream, and use symbolic names for streams (#7117 ) `groups` != `receipts` Introduced in #6964	2020-03-23 11:39:36 +00:00
Erik Johnston	6e6476ef07	Comments from review	2020-03-18 10:13:55 +00:00
Erik Johnston	e53744c737	Fix worker handling	2020-03-02 12:52:28 +00:00
Erik Johnston	9ce4e344a8	Change device list replication to match new semantics. Instead of sending down batches of user ID/host tuples, send down a row per entity (user ID or host).	2020-02-28 11:25:34 +00:00
Erik Johnston	2201bc9795	Don't refuse to start worker if media listener configured. (#7002 ) Instead lets just warn if the worker has a media listener configured but has the media repository disabled. Previously non media repository workers would just ignore the media listener.	2020-02-27 16:33:21 +00:00
Erik Johnston	bbf8886a05	Merge worker apps into one. (#6964 )	2020-02-25 16:56:55 +00:00

1 2 3

134 Commits (99b7b801c31b9428f9503ad6f83f11804fef048a)