A context stream overrun should normally never take place: if
a stream is opened (via ``Context.open_stream()``) backpressure is
applied to the message buffer (unless explicitly disabled via the
``backpressure=False`` flag) such that an overrun on the receiving task
results in blocking the (remote) sender task (eventually, depending
on the underlying ``MsgStream`` transport).
Here we add a special error message that reports if one side never
opened a stream and lets the user know in the overrun error message
that they may be trying to push messages to a task that isn't ready to
receive them.
Further fixes / details:
- pop any `Context` at the end of any `_invoke()` task that creates
one and registers with the runtime.
- ignore but warn about messages received for a context that either
no longer exists or is unknown (guarding against crashes by malicious
packets in the latter case)
Keeping backpressure disabled on context open helps with detecting any
stream connection which was never opened on one side of the task pair.
In that case we can report that there was an overrun **and** a stream
wasn't opened, versus the case where the stream is explicitly configured
not to use backpressure, where we just raise the standard overrun error.
Use `trio.Nursery._closed` to detect "closure" XD since it seems to be
the most reliable way to determine if a spawn call will trigger
a runtime error.
Half of portal API usage requires a single message response (`.run()`,
`.run_in_actor()`) and the streaming APIs should probably be explicitly
enabled for backpressure if desired by the user. This makes more sense
in (pseudo) realtime systems where it's better to notify on a block than
to freeze without notice. Make this the default behaviour with a new
error to be raised: `tractor._exceptions.StreamOverrun` when a sender
overruns a stream by the default buffer size (2**6 for now). The old
behavior can be enabled with `Context.open_stream(backpressure=True)`
but now with warning log messages when there are overruns.
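
Consumer-side opt-in then looks something like this sketch (the
consumer func is hypothetical; only the `backpressure` flag and the
`StreamOverrun` name come from the changes above):

    import trio
    import tractor

    async def slow_consumer(ctx: tractor.Context) -> None:
        # opt back in to the old block-the-sender behaviour; without
        # this flag a sender overrunning the default buffer (2**6
        # msgs) gets a relayed `tractor._exceptions.StreamOverrun`.
        async with ctx.open_stream(backpressure=True) as stream:
            async for msg in stream:
                await trio.sleep(1)  # deliberately slow consumer
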
Add task-linked-context error propagation using a "nursery raising"
technique such that if either end of context linked pair of tasks
errors, that error can be relayed to other side and raised as a form of
interrupt at the receiving task's next `trio` checkpoint. This enables
reliable error relay without expecting the (error) receiving task to
call an API which would raise the remote exception (which it might never
currently if using `tractor.MsgStream` APIs).
Further internal implementation details:
- define the default msg buffer size as `Actor.msg_buffer_size`
- expose a `msg_buffer_size: int` kwarg from `Actor.get_context()`
- maybe raise aforementioned context errors using
`Context._maybe_error_from_remote_msg()` inside `Actor._push_result()`
- support optional backpressure on a stream when pushing messages
in `Actor._push_result()`
- in `_invoke()` handle multierrors raised from a `@tractor.context`
entrypoint as being potentially caused by a relayed error from the
remote caller task; if `Context._error` has been set then raise that
error inside the `RemoteActorError` that will be relayed back to that
caller, more or less proxying the source side error back to its
origin.
In preparation for supporting both backpressure detection (through an
optional error) as well as control over the msg channel buffer size, add
internal configuration flags for both to contexts. Also adjust
`Context._err_on_from_remote_msg()` -> `._maybe..` such that it can be
called and will only raise if a scope nursery has been set. Add
a `Context._error` for stashing the remote task's error that may be
delivered in an `'error'` message.
This more formally declares the runtime's remote task starting API
and uses it throughout all the dependent `Portal` API methods.
Allows dropping `Portal._submit()` and simplifying `.run_in_actor()`
style result waiting to be delegated to the context APIs at remote
task `return` response time. We now also track the remote entrypoint
"type" as `Context._remote_func_type`.
Instead of tracking feeder mem chans per RPC dialog, store `Context`
instances which (now) hold refs to the underlying RPC-task feeder chans
and track them inside an `Actor._contexts` map. This begins a transition
to making the "context" idea the primitive abstraction for representing
messaging dialogs between tasks in different memory domains (i.e.
usually separate processes).
A slew of changes made this possible:
- change `Actor.get_memchans()` -> `.get_context()`.
- Add new `Context._send_chan` and `._recv_chan` vars.
- implicitly create a new context on every `Actor.send_cmd()` call.
- use the context created by `.send_cmd()` in `Portal.open_context()`
instead of manually creating one.
- call `Actor.get_context()` inside tasks run from `._invoke()`
such that feeder chans are implicitly created for callee tasks
thus fixing bug #265.
NB: We might change some of the internal semantics to do with *when* the
feeder chans are actually created to denote whether or not a far end
task is actually *ready to receive* messages. For example, in the cases
where it **never** will be ready to receive messages (one-way streaming,
a context that never opens a stream, etc.) we will likely want some kind
of error or at least warning to the caller that messages can't be sent
(yet).
Previously we were ignoring a race where the callee task in an opened
context could enter `Context.open_stream()` before calling `.started()`.
Disallow this as well as calling `.started()` more than once.
We don't need to any more, presuming you get ideal remote cancellation
conditions where the remote actor should tear down and kill the streams
from its end.
On msg loop termination we now check and see if a channel is associated
with a child-actor registered in some local task's nursery. If so, we
attempt to wait on channel closure initiated from the child side (by
draining the underlying msg stream) so as to avoid closing it too early
resulting in the child not relaying its termination status response. This
means we now support the ideal case of the 2-generals problem where we get back the
ack to the closure request instead of just ignoring it and timing out XD
The main implementation detail is that when `Portal.cancel_actor()`
remotely calls `Actor.cancel()` we actually wait for the RPC response
from that request before allowing the channel shutdown sequence to
engage. The new msg stream draining support enables this.
Also, factor child-to-parent error propagation logic into a helper func
and improve some docs (yeah yeah y'all don't like the ''', i don't
care - it makes my eyes not hurt).
Use a `trio.Event` to enable nursery closure detection such that core
runtime tasks can be notified when a local nursery exits and allow
shutdown protocols to operate without close-before-terminate issues
(such as IPC channel closure during remote peer cancellation).
Enables "draining" the last set of messages after a channel/stream has
been terminated mostly for the purposes of receiving a final ACK to
a remote cancel command. Also, add an internal `Channel._cancel_called`
flag which can be set by `Portal.cancel_actor()`.
It's definitely possible to have a nursery spawn task be cancelled
before a `trio.Process` handle is ever returned; we now handle this
case as a cancelled-during-spawn scenario. Zombie collection logic
is also bypassed in this case.
Thanks to @richardsheridan for pointing out the limitations of using
*any* kind of value as the result-cached-flag and how it might cause
problems for anyone returning pickled blob-data. This changes the
`Portal` internal result value tracking to stash the full message from
which the value can be retrieved by any `Portal.result()` caller.
The internal change is that `Portal._return_once()` now returns a tuple
of the message *and* its value.
Fixes the issue where if the main remote task returns `None`,
`Portal.result()` would erroneously wait again on the underlying feeder
mem chan since `None` was being used as the cache flag. Instead set the
flag as the channel uid and consider the result collected when set to
anything else (since it would be odd to return that value from a remote
task when you already can read it as part of portal/channel apis).
The api we've made here is actually closer to `asyncio.gather()` but
with opening async context managers instead of funcs. Use another event
to allow for graceful teardown of children on non-cancellation exits
and add a doc string.
Since it seems we're building out more and more higher level primitives
in order to support certain parallel style actor trees and messaging
patterns (eg. task broadcast channels), we might as well start a new
sub-package for purely `trio` constructions. We hereby dub this
the realm of `trionics` (like electronics but for trios instead of
electrons).
To kick things off, add an `async_enter_all()` concurrent
exit-stack-like context manager API which will concurrently spawn
a sequence of provided async context managers and deliver their ordered
results but with proper support for `trio` cancellation semantics.
The stdlib's `AsyncExitStack` is not compatible with nurseries nor
`trio` tasks (which are cancelled) since a task will be suspended on
the stack after push and does not ever hit a checkpoint until the stack
is closed.
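
As a sketch of intended usage (the exact signature of
`async_enter_all()` is assumed here, as is the import path):

    from contextlib import asynccontextmanager as acm

    import trio
    from tractor import trionics  # new sub-package

    @acm
    async def worker(n: int):
        # stand-in for any async context manager, eg. a
        # `Portal.open_context()` call
        await trio.sleep(0.1)
        yield n

    async def main():
        # concurrently enter all managers; results are delivered in
        # the order the managers were passed in.
        async with trionics.async_enter_all(
            worker(0), worker(1), worker(2),
        ) as results:
            assert tuple(results) == (0, 1, 2)

    trio.run(main)
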
This is actually surprisingly easy to grok having gone through a lot of
pain understanding edge cases in the zombie lord dev branch. Basically
we just need to make sure actors are managed in a 2 step reap sequence.
In the "soft" reap phase we wait for the process to terminate on its own
concurrently with (maybe) waiting for its portal's final result (if it's
a `.run_in_actor()`). If this path is cancelled or errors, then we do
a "hard" reap where we timeout and send a signal to the proc to
terminate immediately. The only last remaining trick is to tie in the
root-is-debugger-aware logic to yet again avoid tty clobbers.
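
Roughly, the reap sequence reads like this sketch (simplified; the
timeout value and helper shape are illustrative, not the exact
implementation):

    import trio

    async def reap_proc(proc: trio.Process, portal=None, timeout: float = 3):
        try:
            # "soft" reap: wait for the proc to exit on its own while
            # (maybe) also waiting on the portal's final result.
            async with trio.open_nursery() as nursery:
                nursery.start_soon(proc.wait)
                if portal is not None:
                    nursery.start_soon(portal.result)

        except BaseException:
            # "hard" reap: cancelled or errored, so give the proc
            # a grace period then signal it to terminate immediately.
            with trio.move_on_after(timeout) as cs:
                cs.shield = True
                await proc.wait()
            if cs.cancelled_caught:
                proc.kill()
            raise
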
As for `Actor.cancel()` requests, do the same for
`Actor._cancel_task()` but use `_invoke()` to ensure
correct msg transactions with caller. Don't cancel task
cancels on a cancel-all-tasks operation in an attempt at
more determinism.
Now that we're on our way to a (somewhat) serious beta release I think
it's about time to start de-noising the logging emissions. Since we're
trying out this approach of "stack layer oriented" log levels, I figured
this is a good time to move most of the "warnings" to what they should
be: cancellation monitoring status messages. The level is set to 16
which is just above our "runtime" level but just below the traditional
"info" level. I think this will be a decent approach since usually if
you're confused about why your `tractor` app is behaving unlike you
expect, it's 90% of the time going to be to do with cancellation or
error propagation. With this setup a user can specify the 'cancel' level
and see all the msgs pertaining to both actor and task-in-actor
cancellation mechanics.
The stdlib's `logging.LoggingAdapter` doesn't currently pass through
`stacklevel: int` down to its wrapped logger instance. Hack it here
and get our msgs looking like they would if using a built-in level.
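
For reference, wiring in such an intermediate level with the stdlib
looks roughly like the following sketch (the exact stacklevel offset
and adapter shape here are illustrative):

    import logging

    CANCEL = 16  # above "runtime"-ish levels, just below INFO (20)
    logging.addLevelName(CANCEL, 'CANCEL')

    class StackLevelAdapter(logging.LoggerAdapter):
        def cancel(self, msg: str, *args, **kwargs) -> None:
            # pass `stacklevel` through so records point at the
            # caller's frame like the built-in level methods do.
            return self.log(CANCEL, msg, *args, stacklevel=3, **kwargs)

    logging.basicConfig(level=CANCEL)
    log = StackLevelAdapter(logging.getLogger('tractor'), {})
    log.cancel('remote task cancel request sent')
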
In an effort to have some kind of more formal interface around the
transport layer, add a `MsgTransport` protocol type and use with
the channel composition of message streams. Start a little "key map"
of `(<codec>, <protocol>)` to `MsgTransport` types which can be
dynamically loaded. Add a `Channel.from_stream()` constructor thus
cleaning up the mangled logic that was in the constructor based on
inputs. Drop all the "auto reconnect" channel logic for now since
nothing is using it (internally) and it's likely it will need rework
once we bring in a protocol besides TCP.
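
Condensed, the shape is something like this sketch (member names are
indicative of the description above, not exact):

    from typing import Any, Protocol, runtime_checkable

    import trio

    @runtime_checkable
    class MsgTransport(Protocol):
        stream: trio.SocketStream

        async def send(self, msg: Any) -> None:
            ...

        async def recv(self) -> Any:
            ...

    # the "key map": (codec, protocol) -> transport type, which can be
    # dynamically loaded when constructing a `Channel`
    _msg_transports: dict[tuple[str, str], type] = {
        # eg. ('msgpack', 'tcp'): MsgpackTCPStream,
    }
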
`msgspec` sends python lists over the wire
(https://github.com/jcrist/msgspec/issues/30) which is fine and dandy
but we use them as lookup keys so we need to be sure we tuple-cast
first.
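
I.e. along these lines (shown with the modern `msgspec.msgpack` API):

    import msgspec

    wire = msgspec.msgpack.encode({'uid': ('actor_name', 'e4a0...')})
    msg = msgspec.msgpack.decode(wire)

    assert isinstance(msg['uid'], list)  # tuples come back as lists
    key = tuple(msg['uid'])              # so cast before dict-keying
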
This changes some super old (and bad) code from the project's very early
days. For some redic reason i must have thought masking `trio`'s
internal stream / transport errors and a TCP EOF as `StopAsyncIteration`
was somehow a good idea. The reality is you probably
want to know the difference between an unexpected transport error
and a simple EOF lol. This begins to resolve that by adding our own
special `TransportClosed` error to signal the "graceful" termination of
a channel's underlying transport. Oh, and this builds on the `msgspec`
integration which helped shed light on the core issues here B)
Add a `tractor._ipc.MsgspecStream` type which can be swapped in for
`msgspec` serialization transparently. A small msg-length-prefix
framing is implemented as part of the type and we use
`tricycle.BufferedReceiveStream` to handle buffering logic for the
underlying transport (a sketch of the framing idea follows the notes
below).
Notes:
- had to force cast a few more list -> tuple spots due to no native
`tuple`-decode-by-default in `msgspec`: https://github.com/jcrist/msgspec/issues/30
- the framing can be understood by this protobuf walkthrough:
https://eli.thegreenplace.net/2011/08/02/length-prefix-framing-for-protocol-buffers
- `tricycle` becomes a new dependency
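
The framing idea itself boils down to something like this (sketch
only; the exact header size/endianness used internally may differ):

    import struct

    import msgspec

    encode = msgspec.msgpack.Encoder().encode

    def frame(msg) -> bytes:
        # length header followed by the msgpack payload
        payload = encode(msg)
        return struct.pack('<I', len(payload)) + payload

    # receive side: read the header, then exactly that many payload
    # bytes, eg. via `tricycle.BufferedReceiveStream.receive_exactly()`:
    #
    #   header = await recv_stream.receive_exactly(4)
    #   size, = struct.unpack('<I', header)
    #   payload = await recv_stream.receive_exactly(size)
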
Can only really use an encoder currently since there is no streaming
api in `msgspec` as of yet. See jcrist/msgspec#27.
Not sure if any encoding speedups are currently noticeable especially
without any validation going on yet XD.
First experiments toward #196
- drop `shield` input to `MsgStream`
- check for cancel called prior to loading the feeder mem chan
in `Context.open_stream()`
- warn on a timeout when trying to cancel a remote task from
`Context.cancel()`
- drop noop endofchannel handler block
The `collections.deque` takes care of array length truncation of values
for us implicitly but in the future we'll likely want this value exposed
to alternate array implementations. This patch is to provide for that as
well as make `mypy` happy since the `deque.maxlen` can also be `None`.
Get rid of all the (requirements for) clones of the underlying
receivable. We can just use a uuid generated key for each instance
(thinking now this can probably just be `id(self)`). I'm fully convinced
now that channel cloning is only a source of confusion and anti-patterns
when we already have nurseries to define resource lifetimes. There is no
benefit in particular when you allocate subscriptions using a context
manager (not sure why `trio.open_memory_channel()` doesn't enforce
this).
Further refinements:
- add a `._closed` state that will error the receiver on reuse
- drop module script section; it's been moved to a real test
- call the "receiver" duck-type stub a new name
This allows for wrapping an existing stream by re-assigning its receive
method to the allocated broadcaster's `.receive()` so as to avoid
expecting any original consumer(s) of the stream to now know about the
broadcaster; this instead mutates the stream to delegate to the new
receive call behind the scenes any time `.subscribe()` is called.
Add a `typing.Protocol` for so called "cloneable channels" until we
decide/figure out a better keying system for each subscription and
mask all undesired typing failures.
Add `ReceiveMsgStream.subscribe()` which allows allocating a broadcast
receiver around the stream for use by multiple actor-local consumer
tasks. Entering this context manager idempotently mutates the stream's
receive machinery which for now can not be undone. Move `.clone()` to
the receive stream type.
Resolves #204
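
Usage then looks like this sketch, given an already opened
`MsgStream`:

    import trio

    async def fan_out(stream, n_consumers: int = 4):

        async def consume(name: str):
            # each task gets its own broadcast receiver; entering
            # `.subscribe()` (idempotently) swaps in the broadcasting
            # receive machinery on first use.
            async with stream.subscribe() as brx:
                async for value in brx:
                    print(name, value)

        async with trio.open_nursery() as tn:
            for i in range(n_consumers):
                tn.start_soon(consume, f'task_{i}')
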
For every set of broadcast receivers which pull from the same producer,
we need a singleton state for all of,
- subscriptions
- the sender ready event
- the queue
Add a `BroadcastState` dataclass for this and pass it to all
subscriptions. This makes the design much more like the built-in memory
channels which do something very similar with `MemoryChannelState`.
Use a `filter()` on the subs list in the sequence update step, plus some
other commented approaches we can try for speed.
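
I.e. something shaped like the following (field names follow the list
above; exact shapes are indicative only):

    from collections import deque
    from dataclasses import dataclass, field

    import trio

    @dataclass
    class BroadcastState:
        '''Shared state for all receivers pulling from one producer.'''
        queue: deque
        # map of subscription key -> sequence index into the queue
        subs: dict = field(default_factory=dict)
        # the sender-ready wakeup event
        sender_ready: trio.Event = field(default_factory=trio.Event)
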
Using the current task as a subscription key fails horribly as soon as
you hand off a new subscription receiver to another task you've spawned..
Instead use the underlying ``trio.abc.ReceiveChannel.clone()`` as a key
(so i guess we're assuming cloning is supported by the underlying?)
which makes this all work just like default mem chans. As a bonus, now
we can just close the underlying rx (which may be a clone) on
`.aclose()` and everything should just work in terms of the underlying
channels lifetime (i think?).
Change `.subscribe()` to be async since the receive channel type
interface only expects `.aclose()` and it actually ends up being
nicer for 3.9+ style `async with` parentheses style anyway.
Buncha improvements:
- pass in the queue via constructor
- track overall underlying memory channel closure using cloning
- do it like `tokio` and set lagged consumers to the last sequence
before raising
- copy the subs on first receiver wakeup for iteration instead of
iterating the table directly (and being forced to skip the current
tasks sequence increment)
- implement `.aclose()` to close the underlying clone for this task
- make `broadcast_receiver()` just take the recv chan since it doesn't
need anything on the send side.
We're not actually using this but it's for reference if we do end up
needing it.
The std lib's `pdb` internals override SIGINT handling whenever one
enters the debugger repl. Force a handler that kills the tree if SIGINT
is triggered from the root actor, otherwise ignore it since supervised
children should be managed already. This resolves an issue with guest
mode where `pdb` causes SIGINTs to be swallowed resulting in the host
loop never terminating the process tree.
The whole origin was not having an explicit open/close semantic for
streams. We have that now so this internal mechanic isn't needed and
further our streams become more correct by having `.aclose()` be
independent of cancellation.
Finally this makes a cancelled root actor nursery not clobber child
tasks which request and lock the root's tty for the debugger repl.
Using an edge triggered event which is set after all fifo-lock-queued
tasks are complete, we can be sure that no lingering child tasks are
going to get interrupted during pdb use and tty lock acquisition.
Further, even if new tasks do queue up to get the lock, the root will
incrementally send cancel msgs to each sub-actor only once the tty is
not locked by a (set of) child request task(s). Add shielding around all
the critical sections where the child attempts to allocate the lock from
the root such that it won't be disrupted by cancel messages from the
root after the acquire lock transaction has started.
If the root calls `trio.Process.kill()` on immediate child proc teardown
when the child is using pdb, we can get stdstreams clobbering that
results in a pdb++ repl where the user can't see what's been typed. Not
killing such children on cancellation / error seems to resolve this
issue whilst still giving reliable termination. For now, keep that
special-cased path until it becomes a problem for ensuring zombie
reaps.
A context is the natural fit (vs. a receive stream) for locking the root
proc's tty usage via its `.started()` sync point. Simplify the
`_breakpoint()` routine to be a simple async func instead of all this
"returning a coroutine" stuff from before we decided that
`tractor.breakpoint()` must be async. Use `runtime` level for locking
logging making it easier to trace.
Another face palm that was causing serious issues for code that is using
the `.shielded` feature..
Add a bunch more detailed comments for all this subtlety and hopefully
get it right once and for all. Also aggregated the `trio` errors that
should trigger closure inside `.aclose()`, hopefully that's right too.
Revert this change since it really is poking at internals and doesn't
make a lot of sense. If the context is going to be cancelled then the
msg loop will tear down the feeder memory channel when ready, we don't
need to be clobbering it and confusing the runtime machinery lol.
Add clear teardown semantics for `Context` such that the remote side
cancellation propagation happens only on error or if client code
explicitly requests it (either by exit flag to `Portal.open_context()`
or by manually calling `Context.cancel()`). Add `Context.result()`
to wait on and capture the final result from a remote context function;
any lingering msg sequence will be consumed/discarded.
Changes in order to make this possible:
- pass the runtime msg loop's feeder receive channel in to the context
on the calling (portal opening) side such that a final 'return' msg
can be waited upon using `Context.result()` which delivers the final
return value from the callee side `@tractor.context` async function.
- always await a final result from the target context function in
`Portal.open_context()`'s `__aexit__()` if the context has not
been (requested to be) cancelled by client code on block exit.
- add an internal `Context._cancel_called` for context "cancel
requested" tracking (much like `trio`'s cancel scope).
- allow flagging a stream as terminated using an internal
`._eoc` flag which will mark the stream as stopped for iteration.
- drop `StopAsyncIteration` catching in `.receive()`; it does
nothing.
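
In caller code the new semantics then read roughly like this sketch
(the remote `@tractor.context` function is passed in as a parameter
here to keep it generic):

    async def caller(portal, remote_task):
        async with portal.open_context(remote_task) as (ctx, first):

            async with ctx.open_stream() as stream:
                async for msg in stream:
                    ...

            # explicitly wait on the callee's final `return` value;
            # any lingering stream msgs are consumed/discarded.
            result = await ctx.result()

            # alternatively, explicitly request remote cancellation:
            # await ctx.cancel()

        # with no error and no cancel request, `__aexit__()` blocks
        # until that final result arrives anyway.
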
This mostly adds the api described in
https://github.com/goodboy/tractor/issues/53#issuecomment-806258798
The first draft summary:
- formalize bidir streaming using the `trio.Channel` style interface
which we derive as a `MsgStream` type.
- add `Portal.open_context()` which provides a `trio.Nursery.start()`
remote task invocation style for setting up and tearing down tasks
contexts in remote actors.
- add a distinct `'started'` message to the ipc protocol to facilitate
`Context.start()` with a first return value.
- for our `ReceiveMsgStream` type, don't cancel the remote task in
`.aclose()`; this is now done explicitly by the surrounding `Context`
usage: `Context.cancel()`.
- streams in either direction still use a `'yield'` message keeping the
proto mostly symmetric without having to worry about which side is the
caller / portal opener.
- subtlety: only allow sending a `'stop'` message during a 2-way
streaming context from `ReceiveStream.aclose()`, detailed comment
with explanation is included.
Relates to #53
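
Putting the summary above together, a callee-side endpoint then looks
roughly like this sketch:

    import tractor

    @tractor.context
    async def echo(
        ctx: tractor.Context,
        greeting: str,
    ) -> str:
        # the new `'started'` msg delivers a first value back to the
        # caller's `Portal.open_context()` entry, `.start()`-style.
        await ctx.started(greeting)

        # bidirectional `trio.Channel`-style stream; both directions
        # still use `'yield'` msgs under the hood.
        async with ctx.open_stream() as stream:
            async for msg in stream:
                await stream.send(msg)

        # the plain `return` value is what the caller's
        # `Context.result()` eventually delivers.
        return 'done'
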
Since we currently have no real "discovery protocol" between process
trees, the current naive approach is to check via a connect and drop to
see if a TCP server is bound to a particular address during root actor
startup. This was a historical decision and had no real grounding beyond
taking a simple approach to get something working when the project
was first started.
This is obviously problematic from an error handling perspective since
we need to be able to avoid such quick connect-and-drops from cancelling
an "arbiter"'s (registry actor's) channel-msg loop machinery (which
would propagate and cancel the actor).
For now we map this particular TCP error, which gets remapped by `trio`
as a `trio.BrokenResourceError` to our own internal `TransportClosed`
which is swallowed by channel message loop processing and indicates
a graceful teardown of the far end actor.
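
In the transport's receive path this amounts to a remap along these
lines (sketch; the import location for `TransportClosed` is assumed):

    import trio
    from tractor._exceptions import TransportClosed

    async def receive_some(
        stream: trio.SocketStream,
        max_bytes: int = 2**10,
    ) -> bytes:
        try:
            return await stream.receive_some(max_bytes)
        except trio.BrokenResourceError as err:
            # a connect-then-drop (eg. the port-in-use probe at root
            # actor startup) surfaces as this; remap it so the msg
            # loop treats it as a graceful far-end teardown instead
            # of crashing the actor.
            raise TransportClosed(
                'transport closed prior to read',
            ) from err
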
It's clear now that special attention is needed to handle the case where
a spawned `multiprocessing` proc is started but then the parent is
cancelled before the child can connect back; in this case we need to be
sure to kill the near-zombie child asap. This may end up being the
solution to other resiliency issues seen around mp with nested process
trees too. More testing is needed to be sure.
Relates to #84 #89 #134 #146
NB: this is a breaking change removing support for `Portal.run()` being
able to invoke remote streaming functions and instead replacing the
method call with an async context manager api `Portal.open_stream_from()`
This style explicitly defines stream teardown at the call site instead
of expecting the user to handle tricky things correctly themselves: eg.
`async_generator.aclosing()`. Going forward `Portal.run()` can be used
only for invoking async functions.
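
The new call-site style reads like this sketch (the async generator
func is illustrative):

    import trio
    import tractor

    async def counter(limit: int):
        for i in range(limit):
            yield i

    async def main():
        async with tractor.open_nursery() as an:
            portal = await an.start_actor(
                'streamer', enable_modules=[__name__],
            )

            # stream teardown is now explicit at the call site
            async with portal.open_stream_from(counter, limit=10) as stream:
                async for value in stream:
                    print(value)

            # while `Portal.run()` is reserved for plain async funcs
            await portal.cancel_actor()

    if __name__ == '__main__':
        trio.run(main)
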
Move receive stream into streaming modules and rebrand as a "message
stream". Factor out cancellation mechanics in `.aclose()` into the
`Context` type which will soon provide the api for cancelling portal
invocations. Comment-stage a few methods on both types in anticipation
of a new bi-directional streaming api. Add a `MsgStream` bidirectional
channel type which will be the eventual type yielded from
`Context.open_stream()`. Adjust the response/dialog types to be the set
`{'asyncfun', 'asyncgen', 'context'}`. OH, and add async func checking
in `Portal.run()` to catch and error on sync funcs early.
You can always wrap a sync function in an async one and there seems to
be no good reason to support invoking them directly especially since
cancellation won't work without some thread hackery. If it's requested
we'll point users to `trio-parallel`.
Resolves #77
Add a sync method that can be used to cancel the current actor from
a synchronous context. This is useful in debugging situations where
sync debugger code may need to kill the process tree.
Also, make the internal "lifetime stack" a global var; easier to manage
from client code that may want to add callbacks prior to the actor
runtime being fully setup.
Using `None` as the default key for a `@msg.pub` can cause conflicts if
there is more then one "taskless" (no tasks={,} passed) pub offered on
an actor... So instead use the first trio "task name" (usually just the
function name) instead thus avoiding this very hard to debug and
understand problem.
Probably should throw in a test but I'm super lazy today.
This begins the move to dropping support for `tractor.run()` which we
don't really need since the runtime is started (as it always has been)
from a new sub-task / nursery. Instead this introduces starting the
actor tree through a `open_root_actor()` async context manager which
we'll likely implicitly call (from the root) on the first use of an
actor nursery.
Drop `_actor._start_actor()` and factor its contents into this new api.
Make `run()` and `run_daemon()` use `open_root_actor()` until we decide
to remove them.
Relates to #168 and #177
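
Startup then reads like this sketch:

    import trio
    import tractor

    async def main():
        # explicitly start the runtime (instead of `tractor.run()`)
        async with tractor.open_root_actor(
            # registry addr, debug flags etc. can also be passed here
            loglevel='cancel',
        ):
            async with tractor.open_nursery() as an:
                portal = await an.start_actor('worker')
                await portal.cancel_actor()

    if __name__ == '__main__':
        trio.run(main)
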
It turns out in order to maintain our sneaky little "call an `Actor`
method in this remote process" we still need the ability to invoke
functions from a namespace. We're currently using a "self" namespace as
a way to do this for internal inter-process method calling. Either way,
I see no reason not to keep a public method for this invoke style (we
just won't market it) since it is still how the machinery works
underneath.
This resolves and completes #69 allowing all RPC invocation APIs to pass
function references directly instead of explicit `str` names for the
target namespace and function (this is still done implicitly
underneath). This brings us closer to `trio`'s task running API as well
as acknowledges that any inter-host RPC system (and API) will likely
need to be implemented on top of local RPC primitives anyway. Even if
this ends up **not** being true we can always go to "function stubs" as
part of our IAC protocol or, add a new method to do explicit namespace
calls: `.run_from_module()` or whatever everyone votes on.
Resolves #69
Further, this commit drops `Actor.statespace` from the entire system
since a user can easily get this same functionality using module
level variables. Fix docs to match all these changes (luckily mostly
already done since the docs reference the example scripts).
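
Call-sites thus go from `str` names to plain function refs, eg. this
sketch:

    import trio
    import tractor

    async def add(a: int, b: int) -> int:
        return a + b

    async def main():
        async with tractor.open_nursery() as an:
            portal = await an.start_actor(
                'calc', enable_modules=[__name__],
            )

            # the namespace/function-name resolution still happens
            # implicitly underneath.
            assert await portal.run(add, a=1, b=2) == 3

            await portal.cancel_actor()

    if __name__ == '__main__':
        trio.run(main)
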
Add a ``tractor._portal.StreamReceiveChannel.shield_channel()`` context
manager which allows for avoiding the closing of an IPC stream's
underlying channel for the purposes of task re-spawning. Sometimes you
might want to cancel a task consuming a stream but not tear down the IPC
between actors (the default). A common use can might be where the task's
"setup" work might need to be redone but you want to keep the
established portal / channel in tact despite the task restart.
Includes a test.
Turns out this is a lower level issue in terms of the stdlib's default
`pdb.Pdb` settings and how they conflict with `trio`'s cancellation and
KBI handling. The details are hashed out more thoroughly in
python-trio/trio#1155. Maybe we can get a fix in trio so things are
solved under our feet :)
The channel server should be torn down *before* the rpc
task/service nursery. Do this explicitly even in the root's main task
to avoid a strange hang I found in the pubsub tests. Start dropping
the `warnings.warn()` usage.
Add `Actor._cancel_called` and `._cancel_complete` making it possible to
determine whether the actor has started the cancellation sequence and
whether that sequence has fully completed. This allows for blocking in
internal machinery tasks as necessary. Also, always trigger the end of
ongoing rpc tasks even if the last task errors; there's no guarantee
trio's cancellation semantics will leave us with a nice internal "state"
without this.
For reliable remote cancellation we need to "report" `trio.Cancelled`s
(just like any other error) when exhausting a portal such that the
caller can make decisions about cancelling the respective actor if need
be.
Resolves #156
Every subactor in the tree now receives the socket (or whatever the
mailbox type ends up being) during startup and can call the new
`tractor._discovery.get_root()` function to get a portal to the current
root actor in their tree. The main reason for adding this atm is to
support nested child actors gaining access to the root's tty lock for
debugging.
Also, when a channel disconnects from a message loop, might as well kill
all its rpc tasks.
It's not like any of this code is really being used anyway since we
aren't indefinitely blocking for cancelled subactors to terminate (yet).
Drop the `do_hard_kill()` bit for now and just rely on the underlying
process api. Oh, and mark the nursery as cancelled asap.
Seems like the request task cancel scope is actually solving all the
deadlock issues and masking SIGINT isn't changing much behaviour at all.
I think let's keep it unmasked for now in case it does turn out useful
in cancelling from unrecoverable states while in debug.
This is needed in order to avoid the deadlock condition where
a child actor is waiting on the root actor's tty lock but its parent
(possibly the root) is waiting on it to terminate after sending a cancel
request. The solution is simple: create a cancel scope around the
request in the child and always cancel it when a cancel request from the
parent arrives.
There seems to be no good reason not to since our cancellation
machinery/protocol should do this work when the root receives the
signal. This also (hopefully) helps with some debugging race condition
stuff.
This seems to prevent a certain class of bugs to do with the root actor
cancelling local tasks and getting into deadlock while children are
trying to acquire the tty lock. I'm not sure it's the best idea yet
since you're pretty much guaranteed to get "stuck" if a child activates
the debugger after the root has been cancelled (at least "stuck" in
terms of SIGINT being ignored). That kinda race condition seems to still
exist somehow: a child can "beat" the root to activating the tty lock
and the parent is stuck waiting on the child to terminate via its
nursery.
This aids with tearing down resources **after** the crash handling and
debugger have completed. Leaving this internal for now but should
eventually get a public convenience function like
`tractor.context_stack()`.
Keep an actor local (bool) flag which determines if there is already
a running debugger instance for the current process. If another task
tries to enter in this case, simply ignore it since allowing entry may
result in a deadlock where the new task will be sync waiting on the
parent stdio lock (a case that will never arrive due to the current
debugger's active use of it).
In the future we may want to allow FIFO queueing of local tasks where
instead of ignoring re-entrant breakpoints we allow tasks to async wait
for debugger release, though not sure the implications of that since
you'd likely want to support switching the debugger to the new task and
that could cause deadlocks where tasks are inter-dependent. It may be
more sane to just error on multiple breakpoint requests within an actor.
This is the first step in addressing #113 and the initial support
of #130. Basically this allows (sub)processes to engage the `pdbpp`
debug machinery which read/writes the root actor's tty but only in
a FIFO semaphored way such that no two processes are using it
simultaneously. That means you can have multiple actors enter a trace or
crash and run the debugger in a sensible way without clobbering each
other's access to stdio. It required adding some "tear down hooks" to
a custom `pdbpp.Pdb` type such that we release a child's lock on the
parent on debugger exit (in this case when either of the "continue" or
"quit" commands are issued to the debugger console).
There's some code left commented in anticipation of full support for
issue #130 where we'll need to actually capture and feed stdin to the
target (remote) actor which won't necessarily be running on the same
host.
Allow entering and attaching to a `pdb` instance in a child process.
The current hackery is to have the child make an rpc to the parent and
ask it to hijack stdin, once complete the child enters a `pdb` blocking
method. The parent then relays all stdin input to the child thus
controlling the "remote" debugger.
A few things were added to accomplish this:
- tracking the mapping of subactors to their parent nurseries
- in the root actor, cancelling all nurseries under the root `trio` task
on cancellation (i.e. `Actor.cancel()`)
- pass a "runtime vars" map down the actor tree for propagating global state
In an effort to acquire more deterministic actor cancellation,
this adds a clearer and more resilient (whilst possibly a bit
slower) internal nursery structure with explicit semantics for
clarifying the task-scope shutdown sequence.
Namely, on cancellation, the explicit steps are now:
- cancel all currently running rpc tasks and wait
for them to complete
- cancel the channel server and wait for it to complete
- cancel the msg loop for the channel with the immediate parent
- de-register with arbiter if possible
- wait on remaining connections to release
- exit process
To accomplish this add a new nursery called the "service nursery" which
spawns all rpc tasks **instead of using** the "root nursery". The root
is now used solely for async launching the msg loop for the primary
channel with the parent such that it is (nearly) the last thing torn
down on cancellation.
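
Structurally this boils down to roughly the following (heavily
simplified) sketch; the two callables are stand-ins for the actual
runtime machinery:

    import trio

    async def actor_runtime(serve_rpc, run_parent_msg_loop):
        # `serve_rpc`: stand-in for the rpc/channel-server machinery,
        # `run_parent_msg_loop`: stand-in for the parent-channel msg loop.
        async with trio.open_nursery() as root_nursery:
            async with trio.open_nursery() as service_nursery:

                # all rpc tasks (and the channel server) are spawned
                # here so they can be cancelled and awaited *before*
                # the msg loop with the parent.
                service_nursery.start_soon(serve_rpc)

                # launched in the root nursery so it is (nearly) the
                # last thing torn down on cancellation.
                root_nursery.start_soon(run_parent_msg_loop)
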
In the future it should also be possible to have `self.cancel()` return
a result to the parent once the runtime is sure that the rest of the
shutdown is atomic; this would allow for a true unbounded shield in
`Portal.cancel_actor()`. This will likely require that the error
handling blocks in `Actor._async_main()` are moved "inside" the root
nursery block such that the msg loop with the parent truly is the last
thing to terminate.