tractor

ntorres

Author	SHA1	Message	Date
Tyler Goodlet	b5a27e7864	Ignore drainer-task nursery RTE during context exit	2023-04-14 16:35:25 -04:00
Tyler Goodlet	a7faa26686	Set `Context._scope_nursery` on callee side too Because obviously we probably want to support `allow_overruns` on the remote callee side as well XD Only found the bugs fixed in this patch thanks to writing a much more exhaustive test set for overrun cases B)	2023-04-14 16:35:25 -04:00
Tyler Goodlet	1183276653	Seriously cover all overrun cases This actually caught further runtime bugs so it's gud i tried.. Add overrun-ignore enabled / disabled cases and error catching for all of them. More or less this should cover every possible outcome when it comes to setting `allow_overruns: bool` i hope XD	2023-04-14 16:35:25 -04:00
Tyler Goodlet	45f601a035	Flip allocate log msgs to debug	2023-04-14 16:35:25 -04:00
Tyler Goodlet	e97ed377b0	Remote `Context` cancellation semantics rework B) This adds remote cancellation semantics to our `tractor.Context` machinery to more closely match that of `trio.CancelScope` but with operational differences to handle the nature of parallel tasks interoperating across multiple memory boundaries: - if an actor task cancels some context it has opened via `Context.cancel()`, the remote (scope linked) task will be cancelled using the normal `CancelScope` semantics of `trio` meaning the remote cancel scope surrounding the far side task is cancelled and `trio.Cancelled`s are expected to be raised in that scope as per normal `trio` operation, and in the case where no error is raised in that remote scope, a `ContextCancelled` error is raised inside the runtime machinery and relayed back to the opener/caller side of the context. - if any actor task cancels a full remote actor runtime using `Portal.cancel_actor()` the same semantics as above apply except every other remote actor task which also has an open context with the actor which was cancelled will also be sent a `ContextCancelled` but with the `.canceller` field set to the uid of the original cancel requesting actor. This changeset also includes a more "proper" solution to the issue of "allowing overruns" during streaming without attempting to implement any form of IPC streaming backpressure. Implementing task-granularity backpressure cross-process turns out to be more or less impossible without augmenting our streaming protocol (likely at the cost of performance). Further allowing overruns requires special care since any blocking of the runtime RPC msg loop task effectively can block control msgs such as cancels and stream terminations. The implementation details per abstraction layer are as follows. ._streaming.Context: - add a new constructor factory func `mk_context()` which provides a strictly private init-er whilst allowing us to not have to define an `.__init__()` on the type def.
- add public `.cancel_called` and `.cancel_called_remote` properties. - general rename of what was the internal `._backpressure` var to `._allow_overruns: bool`. - move the old contents of `Actor._push_result()` into a new `._deliver_msg()` allowing for better encapsulation of per-ctx msg handling. - always check for received 'error' msgs and process them with the new `_maybe_cancel_and_set_remote_error()` before any msg delivery to the local task, thus guaranteeing error and cancellation handling despite any overflow handling. - add a new `._drain_overflows()` task-method for use with new `._allow_overruns: bool = True` mode. - add back a `._scope_nursery: trio.Nursery` (allocated in `Portal.open_context()`) whose sole purpose is to spawn a single task which runs the above method; anything else is an error. - augment `._deliver_msg()` to start a task and run the above method when operating in no overrun mode; the task queues overflow msgs and attempts to send them to the underlying mem chan using a blocking `.send()` call. - on context exit, any existing "drainer task" will be cancelled and remaining overflow queued msgs are discarded with a warning. - rename `._error` -> `_remote_error` and set it in a new method `_maybe_cancel_and_set_remote_error()` which is called before processing - adjust `.result()` to always call `._maybe_raise_remote_err()` at its start such that whenever a `ContextCancelled` arrives we do logic for whether or not to immediately raise that error or ignore it due to the current actor being the one who requested the cancel, by checking the error's `.canceller` field. - set the default value of `._result` to be `id(Context())` thus avoiding conflict with any `.result()` actually being `False`.. ._runtime.Actor: - augment `.cancel()` and `._cancel_task()` and `.cancel_rpc_tasks()` to take a `requesting_uid: tuple` indicating the source actor of every cancellation request.
- pass through the new `Context._allow_overruns` through `.get_context()` - call the new `Context._deliver_msg()` from `._push_result()` (since the factoring out that method's contents). ._runtime._invoke: - `TaskStatus.started()` back a `Context` (unless an error is raised) instead of the cancel scope to make it easy to set/get state on that context for the purposes of cancellation and remote error relay. - always raise any remote error via `Context._maybe_raise_remote_err()` before doing any `ContextCancelled` logic. - assign any `Context._cancel_called_remote` set by the `requesting_uid` cancel methods (mentioned above) to the `ContextCancelled.canceller`. ._runtime.process_messages: - always pass a `requesting_uid: tuple` to `Actor.cancel()` and `._cancel_task` so that any corresponding `ContextCancelled.canceller` can be set inside `._invoke()`.	2023-04-14 16:35:25 -04:00
Tyler Goodlet	1ec30577de	Only tuplize `.canceller` if non-`None`	2023-04-14 16:35:25 -04:00
Tyler Goodlet	0e81350a42	Move `NoRuntime` import inside `current_actor()` to avoid cycle	2023-04-14 16:35:25 -04:00
Tyler Goodlet	e16e7ca82a	Add new remote error introspection attrs To handle remote cancellation this adds `ContextCancelled.canceller: tuple` the uid of the cancel requesting actor and is expected to be set by the runtime when servicing any remote cancel request. This makes it possible for `ContextCancelled` receivers to know whether "their actor runtime" is the source of the cancellation. Also add an explicit `RemoteActorError.src_actor_uid` which better formalizes the notion of "which remote actor" the error originated from. Both of these new attrs are expected to be packed in the `.msgdata` when the errors are loaded locally.	2023-04-14 16:35:25 -04:00
Tyler Goodlet	8913829511	Log waiter task cancelling msg as cancel-level	2023-04-14 16:35:25 -04:00
Tyler Goodlet	60bff71cd3	Assign `RemoteActorError` boxed error type for context cancelleds	2023-04-14 16:35:25 -04:00
Tyler Goodlet	29a1171142	Change a bunch of log levels to cancel, including any `ContextCancelled` handling	2023-04-14 16:35:25 -04:00
Tyler Goodlet	b6c7f423f0	Add some log-level method doc-strings	2023-04-14 16:35:25 -04:00
Tyler Goodlet	4db87d3c43	Tweak context doc str	2023-04-14 16:35:25 -04:00
Tyler Goodlet	5f1e83e741	More single doc-strs in discovery mod	2023-04-14 16:35:25 -04:00
Tyler Goodlet	ea2cc9ec75	Enable `Context` backpressure by default; avoid startup race-crashes?	2023-04-14 16:35:25 -04:00
Tyler Goodlet	8637778739	Expose `raise_on_lag: bool` flag through factory	2023-01-30 12:18:23 -05:00
Tyler Goodlet	47166e45f0	Be explicit with passthrough kwargs (there's so few)	2023-01-29 17:31:21 -05:00
Tyler Goodlet	4ce2dcd12b	Switch back to raising `Lagged` by default Makes the broadcast test suite not hang xD, and is our expected default behaviour. Also removes a ton of commented legacy cruft from before the refactor to remove the `.receive()` recursion and fixes some typing. Oh right, and in the case where there's only one subscriber left we warn log about it since in theory we could actually entirely unwind the bcaster back to the original underlying, though not sure if that's sane or works for some use cases (like wanting to have some other subscriber get added dynamically later).	2023-01-29 15:03:34 -05:00
Tyler Goodlet	80f983818f	Ignore monkey patched `.send()` type annot	2023-01-29 15:03:34 -05:00
Tyler Goodlet	6ba29f8d56	Recurse and get the last value when in warn mode	2023-01-29 15:03:34 -05:00
Tyler Goodlet	2707a0e971	Add `._raise_on_lag` flag to disable `Lag` raising	2023-01-29 15:03:34 -05:00
Tyler Goodlet	9f9907271b	Merge `ReceiveMsgStream` and `MsgStream` Since one-way streaming can be accomplished by just not sending on one side (and/or thus wrapping such usage in a more restrictive API), we just drop the recv-only parent type. The only method different was `MsgStream.send()`, now merged in. Further in usage of `.subscribe()` we monkey patch the underlying stream's `.send()` onto the delivered broadcast receiver so that subscriber tasks can two-way stream as though using the stream directly. This allows us to more definitively drop `tractor.open_stream_from()` in the longer run if we so choose as well; note currently this will potentially create an issue if a caller tries to `.send()` on such a one way stream.	2023-01-29 15:03:34 -05:00
Tyler Goodlet	c2367c1c5e	Better `trio`-ize `BroadcastReceiver` internals Driven by a bug found in `piker` where we'd get an inf recursion error due to `BroadcastReceiver.receive()` being called when consumer tasks are awoken but no value is ready to `.receive_nowait()`. This new rework takes an approach closer to the interface and internals of `trio.MemoryReceiveChannel` particularly in terms of, - implementing a `BroadcastReceiver.receive_nowait()` and using it within the async `.receive()`. - failing over to an internal `._receive_from_underlying()` when the `_nowait()` call raises `trio.WouldBlock`. - adding `BroadcastState.statistics()` for debugging and testing dropping recursion from `.receive()`.	2023-01-29 15:03:34 -05:00
Tyler Goodlet	13c9eadc8f	Move result log msg up and drop else block	2023-01-29 14:55:02 -05:00
Tyler Goodlet	aa4871b13d	Call `MsgStream.aclose()` in `Context.open_stream.__aexit__()` We weren't doing this originally I think just because of the path dependent nature of the way the code was developed (originally being mega pedantic about one-way vs. bidirectional streams) but, it doesn't seem like there's any issue just calling the stream's `.aclose()`; also have the benefit of just being less code and logic checks B)	2023-01-29 14:55:02 -05:00
Tyler Goodlet	556f4626db	Tweak warning msg for still-alive-after-cancelled actor	2023-01-29 14:55:02 -05:00
Tyler Goodlet	df01294bb2	Show more functiony syntax in ctx-cancelled log msgs	2023-01-29 14:55:02 -05:00
Tyler Goodlet	ddf3d0d1b3	Show tracebacks for un-shipped/propagated errors	2023-01-29 14:55:02 -05:00
Tyler Goodlet	97d5f7233b	Fix uid2nursery lookup table type annot	2023-01-29 14:55:02 -05:00
Tyler Goodlet	d27c081a15	Ensure arbiter sockaddr type before usage	2023-01-29 14:55:02 -05:00
Tyler Goodlet	a4874a3227	Always set the `parent_exit: trio.Event` on exit	2023-01-29 14:55:02 -05:00
Tyler Goodlet	de04bbb2bb	Don't raise on a broken IPC-context when sending stop msg	2023-01-29 14:55:02 -05:00
Tyler Goodlet	4f977189c0	Handle broken mem chan on `Actor._push_result()` When backpressure is used and a feeder mem chan breaks during msg delivery (usually because the IPC allocating task already terminated) instead of raising we simply warn as we do for the non-backpressure case. Also, add a proper `Actor.is_arbiter` test inside `._invoke()` to avoid doing an arbiter-registry lookup if the current actor is the registrar.	2023-01-29 14:55:02 -05:00
Tyler Goodlet	121a8cc891	Drop `Optional` usage from root mod	2023-01-26 16:00:08 -05:00
Tyler Goodlet	c54b8ca4ba	Begin deprecation of `arbiter_addr` -> `registry_addr`	2023-01-26 16:00:08 -05:00
Tyler Goodlet	5b8a87d0f6	Slightly better `xonsh` check hack, fix typing	2023-01-26 15:48:15 -05:00
Tyler Goodlet	2e278ceb74	Add a super hacky check for `xonsh`, smh..	2023-01-26 15:26:43 -05:00
Tyler Goodlet	dba8118553	Always attempt prompt redraw on ctl-c in REPL The stdlib has all sorts of muckery with ignoring SIGINT in the `Pdb._cmdloop()` but here we just override all that since we don't trust their decisions about cancellation handling whatsoever. Adds a `Lock.repl: MultiActorPdb` attr which is set by any task which acquires root TTY lock indicating (via actor global state) that the current actor is using the debugger REPL and can be expected to re-draw the prompt on SIGINT. Further we mask out log messages from any actor who also has the `shield_sigint_handler()` enabled to avoid logging noise when debugging.	2023-01-26 12:44:13 -05:00
Tyler Goodlet	fca2e7c10e	Simplify closed abruptly log msg	2023-01-26 12:44:13 -05:00
Tyler Goodlet	5ed62c5c54	Add note about intermediary-actor in debug issue	2023-01-26 12:44:13 -05:00
Tyler Goodlet	6c8cacc9d1	Adjust all default is `None` annots (per new `mypy`)	2022-12-12 13:18:22 -05:00
Tyler Goodlet	38326e8c15	Avoid error on context double pops	2022-12-11 23:46:33 -05:00
Tyler Goodlet	b5192cca8e	Always greedily `list`-cast `mngrs` input sequence	2022-12-11 23:20:58 -05:00
Tyler Goodlet	c606be8c64	Passthrough runtime kwargs from `open_actor_cluster()`	2022-12-11 19:56:08 -05:00
Tyler Goodlet	f2641c8964	Avoid "task never called `.started()`" runtime errors when cancelling	2022-10-14 19:42:23 -04:00
Tyler Goodlet	f39414ce12	Drop error-repacking for `.run_in_actor()`s block If we pack the nursery parent task's error into the `errors` table directly in the handler, we don't need to specially handle packing that same error into any exception group raised while handling sub-actor cancellation; drops some ugly indentation ;)	2022-10-14 19:42:23 -04:00
Tyler Goodlet	e298b70edf	Drop added `.pdb()` level msgs used during dev	2022-10-14 19:42:23 -04:00
Tyler Goodlet	38f9d35dee	Fix errors table type annot	2022-10-14 19:42:23 -04:00
Tyler Goodlet	88448f7281	Fix handler type annot	2022-10-14 19:42:23 -04:00
Tyler Goodlet	0956d5f461	Restore the `trio` SIGINT handler, cancel root lock tasks on no-peers Pretty sure this is the final touch to alleviate all our debug lock headaches! Instead of trying to revert to the "last" handler (as `pdb` does internally in the stdlib) we always just revert to the handler `trio` registers during startup. Further this seems to allow cancelling the root-side locking task if it's detected as stale IFF we only do this when the root actor is in a "no more IPC peers" state. Deatz: - (always) set `._debug.Lock._trio_handler` as the `trio` version, not some last used handler to make sure we're getting the ctrl-c handling we want when not in debug mode. - assign the trio handler in `open_root_actor()` `._runtime._async_main()` to be sure it's applied in subactors as well as the root. - only do debug lock blocking and root-side-locking-task cancels when a "no peers" condition is detected in the root actor: i.e. no IPC channels are detected by the root meaning it's impossible any actor has a sane lock-state ongoing for debug mode.	2022-10-14 18:18:01 -04:00
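
The `.canceller` check described in commits e97ed377b0 and e16e7ca82a (ignore a `ContextCancelled` when the current actor is the one who requested the cancel, otherwise raise it) could be sketched roughly as below. This is a stand-alone illustration only, not tractor's code: the `ContextCancelled` class here is a local stand-in, and `maybe_raise_remote_err()`, `our_uid` and the `(name, uuid)` uid shape are hypothetical names assumed for the example.

```python
from typing import Optional, Tuple

# Assumption: an actor uid is a (name, uuid) pair; the commit log only says `tuple`.
Uid = Tuple[str, str]


class ContextCancelled(Exception):
    """Local stand-in for the real error type (NOT tractor's class)."""

    def __init__(self, msg: str, canceller: Optional[Uid] = None) -> None:
        super().__init__(msg)
        # uid of the actor that requested the cancel, if known
        self.canceller = canceller


def maybe_raise_remote_err(err: Exception, our_uid: Uid) -> None:
    """Hypothetical helper: absorb self-requested cancels, raise the rest."""
    if isinstance(err, ContextCancelled) and err.canceller == our_uid:
        # We asked for this cancellation, so it is not an error for us.
        return
    raise err


if __name__ == "__main__":
    me: Uid = ("caller", "1234")
    peer: Uid = ("peer", "abcd")

    # Self-requested cancel: silently ignored.
    maybe_raise_remote_err(ContextCancelled("cancelled", canceller=me), me)

    # Cancel requested by some other actor: propagated to the caller.
    try:
        maybe_raise_remote_err(ContextCancelled("cancelled", canceller=peer), me)
    except ContextCancelled as exc:
        print("raised, canceller =", exc.canceller)
```

Comparing uids rather than tracking a local flag alone is what lets a task distinguish its own `Context.cancel()` from a cancel triggered by a third actor, which is the case the `Portal.cancel_actor()` bullet in e97ed377b0 describes.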

716 Commits (97446b84c04e8e4e95309e650059e04de4c367e3)