tractor

jc211

tractor

Author	SHA1	Message	Date
Tyler Goodlet	11e64426f6	Wake all sleeping consumers on bcaster closure	2021-12-16 18:02:03 -05:00
Tyler Goodlet	213447008b	Add draft code for waiting on all nurseries in root	2021-12-16 18:02:03 -05:00
Tyler Goodlet	52627a6326	Rework interface: pass func and kwargs After more extensive testing I realized that keying on the context manager instance id isn't going to work since each entering task is going to create a unique key XD Instead pass the manager function as `acm_func` and optionally allow keying the resource on the passed `kwargs` (if hashable) or the `key:str`. Further, pass the key to the enterer task and avoid a separate keying scheme for the manager versus the value it delivers. Don't bother with checking and releasing the lock in `finally:` block, it should be an error if it's still locked.	2021-12-16 18:02:03 -05:00
Tyler Goodlet	3826bc9972	Don't catch key errors from the yielded to scope	2021-12-16 18:02:03 -05:00
Tyler Goodlet	b210278e2f	Naming change `cache` -> `_Cache`	2021-12-16 18:02:03 -05:00
Tyler Goodlet	ac22b4a875	Fix type annots in resource cacher internals	2021-12-16 18:02:03 -05:00
Tyler Goodlet	5f41dbf34f	Add `maybe_open_context()` an actor wide task-resource cache	2021-12-16 18:02:03 -05:00
Tyler Goodlet	57f2aca18c	Set eoc on closure (again)	2021-12-16 16:19:15 -05:00
Tyler Goodlet	f2ba961e81	Mark stream with EOC when stop message is received	2021-12-16 16:18:58 -05:00
Tyler Goodlet	3deb1b91e6	Wake all broadcast consumers on EOC Without this wakeup you can have tasks which re-enter `.receive()` and get stuck waiting on the wakeup event indefinitely. Whenever a ``trio.EndOfChannel`` arrives we want to make sure all consumers at least know about it and don't block. This previous behaviour was basically a bug. Add some state flags for tracking if the broadcaster was either cancelled or terminated via EOC mostly for testing and debugging purposes though this info might be useful if we decide to offer a `.statistics()` like API in the future.	2021-12-16 16:16:14 -05:00
Tyler Goodlet	61e134dc5d	Wake up consumers on end of channel as well	2021-12-16 16:15:54 -05:00
Tyler Goodlet	6f94ffc304	Re-license code base for distribution under AGPL This commit obviously denotes a re-license of all applicable parts of the code base. Acknowledgement of this change was completed in #274 by the majority of the current set of contributors. From here henceforth all changes will be AGPL licensed and distributed. This is purely an effort to maintain the same copy-left policy whilst closing the (perceived) SaaS loophole the GPL allows for. It is merely for this loophole: to avoid code hiding by any potential "network providers" who are attempting to use the project to make a profit without either compensating the authors or re-distributing their changes. I thought quite a bit about this change and can't see a reason not to close the SaaS loophole in our current license. We still are (hard) copy-left and I plan to keep the code base this way for a couple reasons: - The code base produces income/profit through parent projects and is demonstrably of high value. - I believe firms should not get free lunch for the sake of "contributions from their employees" or "usage as a service" which I have found to be a dubious argument at best. - If a firm who intends to profit from the code base wants to use it they can propose a secondary commercial license to purchase with the proceeds going to the project's authors under some form of well defined contract. - Many successful projects like Qt use this model; I see no reason it can't work in this case until such a time as the authors feel it should be loosened. There has been detailed discussion in #103 on licensing alternatives. The main point of this AGPL change is to protect the code base for the time being from exploitation while it grows and as we move into the next phase of development which will include extension into the multi-host distributed software space.	2021-12-14 23:33:27 -05:00
Tyler Goodlet	a38a983225	Increase debugger poll delay back to prior value If we make it too fast a nursery with debug mode children can cancel too fast and causes some test failures. It's likely not a huge deal anyway since the purpose of this poll/check is for human interaction and the current delay isn't really that noticeable. Decrease log levels in the debug module to avoid console noise when in use. Toss in some more detailed comments around the new debugger lock points.	2021-12-10 11:54:27 -05:00
Tyler Goodlet	9bee513136	Use manual debugger-in-use flag in nursery and spawn task	2021-12-09 17:53:29 -05:00
Tyler Goodlet	5d9e3d1163	Add a manual debug mode kwarg to debugger waiter	2021-12-09 17:52:35 -05:00
Tyler Goodlet	92c6ec1882	`get_loglevel()` always returns a str	2021-12-07 13:17:00 -05:00
Tyler Goodlet	72eef2a4a1	Config debug mode log level after initial setup	2021-12-07 13:16:07 -05:00
Tyler Goodlet	9bd5226e76	Only adjust logging in debug mode if not noisy enough already	2021-12-07 13:13:04 -05:00
Tyler Goodlet	e899cc42bf	Add per actor debug mode toggle	2021-12-07 13:11:06 -05:00
Tyler Goodlet	4856285dee	Add back broken send chan ignore block	2021-12-06 17:04:17 -05:00
Tyler Goodlet	4b40599c48	Fix ignore warning log message	2021-12-06 16:32:23 -05:00
Tyler Goodlet	c9132de7dc	Move maybe-raise-error-msg logic into context A context method handling all this logic makes the most sense since it contains all the state related to whether the error should be raised in a nursery scope or is expected to be raised by a consumer task which reads and processes the msg directly (via a `Portal` API call). This also makes it easy to always process remote errors even when there is no (stream) overrun condition.	2021-12-06 16:32:23 -05:00
Tyler Goodlet	1f8e1cccbb	Only pop contexts on decorated entrypoints	2021-12-06 13:48:19 -05:00
Tyler Goodlet	318027ebd1	Raise stream overruns on one side never opened A context stream overrun should normally never take place since if a stream is opened (via ``Context.open_stream()``) backpressure is applied on the message buffer (unless explicitly disabled by the ``backpressure=False`` flag) such that an overrun on the receiving task should result in blocking the (remote) sender task (eventually depending on the underlying ``MsgStream`` transport). Here we add a special error message that reports if one side never opened a stream and let's the user know in the overrun error message that they may be trying to push messages to a task that isn't ready to receive them. Further fixes / details: - pop any `Context` at the end of any `_invoke()` task that creates one and registers with the runtime. - ignore but warn about messages received for a context that either no longer exists or is unknown (guarding against crashes by malicious packets in the latter case)	2021-12-06 11:54:21 -05:00
Tyler Goodlet	b826ec8103	Better idea, enable backpressure on opened streams Keeping it disabled on context open will help with detecting any stream connection which was never opened on one side of the task pair. In that case we can report that there was an overrun and a stream wasn't opened versus if the stream is explicitly configured not to use bp then we throw the standard overflow. Use `trio.Nursery._closed` to detect "closure" XD since it seems to be the most reliable way to determine if a spawn call will trigger a runtime error.	2021-12-06 11:54:21 -05:00
Tyler Goodlet	4ea5c9b5db	Pop context on `.open_context()` exit	2021-12-06 11:54:21 -05:00
Tyler Goodlet	41a3e6a9ca	Type check fixes	2021-12-05 20:00:58 -05:00
Tyler Goodlet	185dbc7e3f	Disable msg stream backpressure by default Half of portal API usage requires a 1 message response (`.run()`, `.run_in_actor()`) and the streaming APIs should probably be explicitly enabled for backpressure if desired by the user. This makes more sense in (psuedo) realtime systems where it's better to notify on a block then freeze without notice. Make this default behaviour with a new error to be raised: `tractor._exceptions.StreamOverrun` when a sender overruns a stream by the default size (2**6 for now). The old behavior can be enabled with `Context.open_stream(backpressure=True)` but now with warning log messages when there are overruns. Add task-linked-context error propagation using a "nursery raising" technique such that if either end of context linked pair of tasks errors, that error can be relayed to other side and raised as a form of interrupt at the receiving task's next `trio` checkpoint. This enables reliable error relay without expecting the (error) receiving task to call an API which would raise the remote exception (which it might never currently if using `tractor.MsgStream` APIs). Further internal implementation details: - define the default msg buffer size as `Actor.msg_buffer_size` - expose a `msg_buffer_size: int` kwarg from `Actor.get_context()` - maybe raise aforementioned context errors using `Context._maybe_error_from_remote_msg()` inside `Actor._push_result()` - support optional backpressure on a stream when pushing messages in `Actor._push_result()` - in `_invote()` handle multierrors raised from a `@tractor.context` entrypoint as being potentially caused by a relayed error from the remote caller task, if `Context._error` has been set then raise that error inside the `RemoteActorError` that will be relayed back to that caller more or less proxying through the source side error back to its origin.	2021-12-05 19:31:41 -05:00
Tyler Goodlet	2680a9473d	Always set `Context._portal` on the caller task side	2021-12-05 19:28:00 -05:00
Tyler Goodlet	92b540d518	Add internal msg stream backpressure controls In preparation for supporting both backpressure detection (through an optional error) as well as control over the msg channel buffer size, add internal configuration flags for both to contexts. Also adjust `Context._err_on_from_remote_msg()` -> `._maybe..` such that it can be called and will only raise if a scope nursery has been set. Add a `Context._error` for stashing the remote task's error that may be delivered in an `'error'` message.	2021-12-05 19:19:53 -05:00
Tyler Goodlet	6751349987	Add a stream overrun exception	2021-12-05 18:28:02 -05:00
Tyler Goodlet	d307eab118	Rework `Actor.send_cmd()` to `.start_remote_task()` This more formally declares the runtime's remote task startingn API and uses it throughout all the dependent `Portal` API methods. Allows dropping `Portal._submit()` and simplifying `.run_in_actor()` style result waiting to be delegated to the context APIs at remote task `return` response time. We now also track the remote entrypoint "type` as `Context._remote_func_type`.	2021-12-04 18:20:43 -05:00
Tyler Goodlet	c5c3f7e789	Use `tractor.Context` throughout the runtime core Instead of tracking feeder mem chans per RPC dialog, store `Context` instances which (now) hold refs to the underlying RPC-task feeder chans and track them inside a `Actor._contexts` map. This begins a transition to making the "context" idea the primitive abstraction for representing messaging dialogs between tasks in different memory domains (i.e. usually separate processes). A slew of changes made this possible: - change `Actor.get_memchans()` -> `.get_context()`. - Add new `Context._send_chan` and `._recv_chan` vars. - implicitly create a new context on every `Actor.send_cmd()` call. - use the context created by `.send_cmd()` in `Portal.open_context()` instead of manually creating one. - call `Actor.get_context()` inside tasks run from `._invoke()` such that feeder chans are implicitly created for callee tasks thus fixing the bug #265. NB: We might change some of the internal semantics to do with when the feeder chans are actually created to denote whether or not a far end task is actually read to receive messages. For example, in the cases where it never will be ready to receive messages (one-way streaming, a context that never opens a stream, etc.) we will likely want some kind of error or at least warning to the caller that messages can't be sent (yet).	2021-12-03 14:49:55 -05:00
Tyler Goodlet	f4793af2b9	Error on mal-use of `Context.started()` Previously we were ignoring a race where the callee an opened task context could enter `Context.open_stream()` before calling `.started(). Disallow this as well as calling `.started()` more then once.	2021-12-03 10:08:55 -05:00
Tyler Goodlet	08e9593306	Suppress broken resources errors in `Portal.cancel_actor()`	2021-12-02 15:29:04 -05:00
Tyler Goodlet	14f84571fb	Don't cancel receive streams inside `.cancel_actor()` We don't need to any more presuming you get ideal remote cancellation conditions where the remote actor should teardown and kill the streams from its end.	2021-12-02 15:29:04 -05:00
Tyler Goodlet	e561a4908f	Appease mypy	2021-12-02 15:29:04 -05:00
Tyler Goodlet	46070f99de	Factor soft-wait logic into a helper, use with mp	2021-12-02 08:18:04 -05:00
Tyler Goodlet	d81eb1a51e	Finally, deterministic remote cancellation support On msg loop termination we now check and see if a channel is associated with a child-actor registered in some local task's nursery. If so, we attempt to wait on channel closure initiated from the child side (by draining the underlying msg stream) so as to avoid closing it too early resulting in the child not relaying its termination status response. This means we now support the ideal case in 2-general's where we get back the ack to the closure request instead of just ignoring it and timing out XD The main implementation detail is that when `Portal.cancel_actor()` remotely calls `Actor.cancel()` we actually wait for the RPC response from that request before allowing the channel shutdown sequence to engage. The new msg stream draining support enables this. Also, factor child-to-parent error propagation logic into a helper func and improve some docs (yeah yeah y'all don't like the ''', i don't care - it makes my eyes not hurt).	2021-12-02 08:18:04 -05:00
Tyler Goodlet	d817f1a658	Add a nursery "exited" signal Use a `trio.Event` to enable nursery closure detection such that core runtime tasks can be notified when a local nursery exits and allow shutdown protocols to operate without close-before-terminate issues (such as IPC channel closure during remote peer cancellation).	2021-12-02 08:18:04 -05:00
Tyler Goodlet	a23afb0bb8	Set channel cancel called flag on cancel requests	2021-12-02 08:18:04 -05:00
Tyler Goodlet	1976e61d1a	Add `.drain()` support to msg streams Enables "draining" the last set of messages after a channel/stream has been terminated mostly for the purposes of receiving a final ACK to a remote cancel command. Also, add an internal `Channel._cancel_called` flag which can be set by `Portal.cancel_actor()`.	2021-12-02 08:18:04 -05:00
Tyler Goodlet	0ac3397dbb	Only soft-acquire debug lock if a proc was spawned	2021-12-02 08:17:03 -05:00
Tyler Goodlet	62b2867e07	Tweak doc strings	2021-12-02 08:16:49 -05:00
Tyler Goodlet	bf6958cdbe	Handle cancelled-before-proc-created spawn case It's definitely possible to have a nursery spawn task be cancelled before a `trio.Process` handle is ever returned; we now handle this case as a cancelled-during-spawn scenario. Zombie collection logic also is bypassed in this case.	2021-12-02 08:16:05 -05:00
Tyler Goodlet	77fc705b1f	Add nooz	2021-11-29 22:52:19 -05:00
Tyler Goodlet	7eb465a699	Graceful cancel actors before hard reaping	2021-11-29 16:03:23 -05:00
Tyler Goodlet	f6de7e0afd	Factor out msg unwrapping into a func	2021-11-29 08:46:35 -05:00
Tyler Goodlet	0e7234aa68	Cache the return message instead of the value Thanks to @richardsheridan for pointing out the limitations of using any kind of value as the result-cached-flag and how it might cause problems for anyone returning pickled blob-data. This changes the `Portal` internal result value tracking to stash the full message from which the value can be retrieved by any `Portal.result()` caller. The internal change is that `Portal._return_once()` now returns a tuple of the message and its value.	2021-11-29 07:44:44 -05:00
Tyler Goodlet	095c94b1d2	Fix `Portal.run_in_actor()` returns `None` bug Fixes the issue where if the main remote task returns `None`, `Portal.result()` would erroneously wait again on the underlying feeder mem chan since `None` was being used as the cache flag. Instead set the flag as the channel uid and consider the result collected when set to anything else (since it would be odd to return that value from a remote task when you already can read it as part of portal/channel apis).	2021-11-20 13:02:08 -05:00
Tyler Goodlet	6b0366fe04	Guard against TCP server never started on cancel	2021-11-07 23:49:32 -05:00
Tyler Goodlet	dbe5d96d66	Fix missing yield in lock acquirer	2021-11-07 23:48:05 -05:00
Tyler Goodlet	74f460eba7	Make auto generated child names <parent_name>.<name>	2021-11-02 15:40:15 -04:00
Tyler Goodlet	083b73ad4a	Test: don't grab debug lock if not in mode	2021-10-25 10:22:41 -04:00
Tyler Goodlet	d0f5c7a5e2	Change to `gather_contexts()`, use event for graceful exit The api we've made here is actually closer to `asyncio.gather()` but with opening async context managers instead of funcs. Use another event to allow for graceful teardown of children on non-cancellation exits and add a doc string.	2021-10-24 14:00:01 -04:00
overclockworked64	50400359b8	Fix type annotations	2021-10-24 00:47:26 +02:00
overclockworked64	87e3d32992	Get rid of external teardown trigger because #245 resolves the problem	2021-10-23 16:17:30 -04:00
overclockworked64	04895b9d5e	Get rid of dumb random uid and use current actor's uid	2021-10-23 16:17:30 -04:00
overclockworked64	b7a4641674	Allow specifying start_method and hard_kill	2021-10-23 16:17:30 -04:00
overclockworked64	3130a04c61	Rename a variable and fix type annotations	2021-10-23 16:17:29 -04:00
overclockworked64	6f9229cd09	Cancel nursery	2021-10-23 16:17:29 -04:00
overclockworked64	6e6baf250b	Make sure the ID is a str	2021-10-23 16:17:29 -04:00
overclockworked64	73cbb2388a	Avoid RuntimeError by not using current_actor's uid	2021-10-23 16:17:29 -04:00
overclockworked64	2815f1c343	Make 'async_enter_all' take a teardown trigger which '_enter_and_wait' will wait on	2021-10-23 16:17:29 -04:00
overclockworked64	21afc69ac7	Postpone evaluation of annotations	2021-10-23 16:17:29 -04:00
overclockworked64	7d502cef74	Add 'open_actor_cluster' to __all__	2021-10-23 16:17:29 -04:00
Tyler Goodlet	c372367cc2	Fix *args-like type annot	2021-10-23 15:54:40 -04:00
Tyler Goodlet	9ddd75733c	Lul, fix everything for cluster helper	2021-10-23 15:54:40 -04:00
Tyler Goodlet	97006c904c	Expose `Lagged` for broadcasting	2021-10-23 15:54:40 -04:00
Tyler Goodlet	79fb1d0ebc	Fix top level nursery import	2021-10-23 15:54:40 -04:00
Tyler Goodlet	1e917fdb1d	Add an async actor cluster spawner prototype	2021-10-23 15:54:40 -04:00
Tyler Goodlet	4114eb1d25	Move broadcast channel parts into trionics	2021-10-23 15:54:40 -04:00
Tyler Goodlet	680a841282	Start `trionics` sub-pkg with `async_enter_all()` Since it seems we're building out more and more higher level primitives in order to support certain parallel style actor trees and messaging patterns (eg. task broadcast channels), we might as well start a new sub-package for purely `trio` constructions. We hereby dub this the realm of `trionics` (like electronics but for trios instead of electrons). To kick things off, add an `async_enter_all()` concurrent exit-stack-like context manager API which will concurrently spawn a sequence of provided async context managers and deliver their ordered results but with proper support for `trio` cancellation semantics. The stdlib's `AsyncExitStack` is not compatible with nurseries not `trio` tasks (which are cancelled) since as task will be suspended on the stack after push and does not ever hit a checkpoint until the stack is closed.	2021-10-23 15:54:40 -04:00
Tyler Goodlet	340ddba4ae	Rename the nursery module to `_supervise`	2021-10-23 15:54:40 -04:00
Tyler Goodlet	b3c4851ffb	Grab lock if cancelled during spawn before hard kill	2021-10-15 18:26:46 -04:00
Tyler Goodlet	5cfac58873	Don't pop a child entry that was never inserted	2021-10-15 18:16:58 -04:00
Tyler Goodlet	e4ed0fd2b3	Right, only worry about pdb lock when in debug mode	2021-10-15 09:29:25 -04:00
Tyler Goodlet	51259c4809	Pass uid not actor object	2021-10-14 13:46:27 -04:00
Tyler Goodlet	9d83ef82b2	Remove union type for root getter	2021-10-14 13:39:46 -04:00
Tyler Goodlet	fa317d1600	Change lock helper to take an actor uid tuple	2021-10-14 13:39:46 -04:00
Tyler Goodlet	6f5c35dd1b	Fix missing task status type	2021-10-14 13:39:46 -04:00
Tyler Goodlet	daa28ea0e9	Handle depth > 1 nursery owners which use debug mode	2021-10-14 13:39:46 -04:00
Tyler Goodlet	4b2710b8a5	Add tty lock acquire ctx mngr	2021-10-14 13:39:46 -04:00
Tyler Goodlet	d30ce96740	Breakout `wait_for_parent_stdin_hijack()`, increase root pdb checker poll time	2021-10-14 13:39:46 -04:00
Tyler Goodlet	f3a6ab62af	Use debugger helper in nursery and spawn tasks	2021-10-14 13:39:46 -04:00
Tyler Goodlet	62035078ce	Reduce some loglevels, stick in comment about blocking till next tick	2021-10-14 13:39:46 -04:00
Tyler Goodlet	893bad72d5	Add a maybe-open-debugger helper	2021-10-14 13:39:46 -04:00
Tyler Goodlet	77ec29008d	Simplify to soft and hard reap sequences This is actually surprisingly easy to grok having gone through a lot of pain understanding edge cases in the zombie lord dev branch. Basically we just need to make sure actors are managed in a 2 step reap sequence. In the "soft" reap phase we wait for the process to terminate on its own concurrently with (maybe) waiting for its portal's final result (if it's a `.run_in_actor()`). If this path is cancelled or errors, then we do a "hard" reap where we timeout and send a signal to the proc to terminate immediately. The only last remaining trick is to tie in the root-is-debugger-aware logic to yet again avoid tty clobbers.	2021-10-14 13:39:46 -04:00
Tyler Goodlet	46ff558556	Unwind process opening and shield hard reap	2021-10-14 13:39:46 -04:00
Tyler Goodlet	bb9d9c74b1	Do immediate remote task cancels As for `Actor.cancel()` requests, do the same for `Actor._cancel_task()` but use `_invoke()` to ensure correct msg transactions with caller. Don't cancel task cancels on a cancel-all-tasks operation in attempt at more determinism.	2021-10-14 13:39:46 -04:00
Tyler Goodlet	41f0992445	Don't whine about ; it ain't rpc	2021-10-14 13:39:46 -04:00
Tyler Goodlet	7643bbf183	Make actor runtime cancellation immediate	2021-10-14 13:39:46 -04:00
Tyler Goodlet	1f0cc15675	Just set flag for use-after-closed service nursery calls	2021-10-06 17:02:13 -04:00
Tyler Goodlet	10f66e5141	De-noise warnings, add a 'cancel' log level Now that we're on our way to a (somewhat) serious beta release I think it's about time to start de-noising the logging emissions. Since we're trying out this approach of "stack layer oriented" log levels, I figured this is a good time to move most of the "warnings" to what they should be: cancellation monitoring status messages. The level is set to 16 which is just above our "runtime" level but just below the traditional "info" level. I think this will be a decent approach since usually if you're confused about why your `tractor` app is behaving unlike you expect, it's 90% of the time going to be to do with cancellation or error propagation. This this setup a user can specify the 'cancel' level and see all the msgs pertaining to both actor and task-in-actor cancellation mechanics.	2021-10-06 17:02:13 -04:00
Tyler Goodlet	4d5a5c147a	Move core actor runtime logging to, well, "runtime"	2021-10-06 17:02:13 -04:00
Tyler Goodlet	d2f0843041	Make custom log levels report the right stack frame The stdlib's `logging.LoggingAdapter` doesn't currently pass through `stacklevel: int` down to its wrapped logger instance. Hack it here and get our msgs looking like they would if using a built-in level.	2021-10-06 17:02:13 -04:00
Tyler Goodlet	3f6d4d6af4	Don't log.error if it was intentional	2021-10-06 17:02:13 -04:00
Tyler Goodlet	b496e790fe	Use from `.from_stream()` in TCP handler	2021-10-06 15:54:27 -04:00
Tyler Goodlet	c6dc96b08c	Add "message transport" structured sub-typing In an effort to have some kind of more formal interface around the transport layer, add a `MsgTransport` protocol type and use with the channel composition of message streams. Start a little "key map" of `(<codec>, <protocol>)` to `MsgTransport` types which can be dynamically loaded. Add a `Channel.from_stream()` constructor thus cleaning up the mangled logic that was in the constructor based on inputs. Drop all the "auto reconnect" channel logic for now since nothing is using it (internally) and it's likely it will need rework once we bring in a protocol besides TCP.	2021-10-06 15:54:27 -04:00
Tyler Goodlet	135459ca25	Tolerate one decode error; may have been a registry ping	2021-10-05 13:37:17 -04:00
Tyler Goodlet	07e8821cd5	Add a stream type factory	2021-10-05 13:37:17 -04:00
Tyler Goodlet	1382ad653d	Ugh, appease mypy yet again	2021-10-05 13:37:17 -04:00
Tyler Goodlet	076f37c589	Attempt to gracefully handle channel breakage?	2021-10-05 13:37:17 -04:00
Tyler Goodlet	19d6885243	Ensure tuple for passed in arbiter addr	2021-10-05 13:37:17 -04:00
Tyler Goodlet	486e983964	Cast `defaultdict` to `dict` for registry get	2021-10-05 13:37:17 -04:00
Tyler Goodlet	1ab495a64d	Map broken stream errs to transport closed; msgspec seems to be racy	2021-10-05 13:37:17 -04:00
Tyler Goodlet	562419c907	Convert actor UIDs to hashable tuples `msgspec` sends python lists over the wire (https://github.com/jcrist/msgspec/issues/30) which is fine and dandy but we use them as lookup keys so we need to be sure we tuple-cast first.	2021-10-05 13:37:17 -04:00
Tyler Goodlet	3facfb6d4c	Fix log levels	2021-10-05 13:37:17 -04:00
Tyler Goodlet	aa080543d0	Mypy fixes to enforce uid tuple	2021-10-05 13:37:17 -04:00
Tyler Goodlet	b64396f708	Pkg `msgpec` as optional dep, load transport type if importable	2021-10-05 13:37:17 -04:00
Tyler Goodlet	96b3f94c72	Accept transport closed error during handshake and msg loop	2021-10-05 13:37:17 -04:00
Tyler Goodlet	ecd8c4bc7e	Drop happy eyeballs inf delay	2021-10-05 13:37:17 -04:00
Tyler Goodlet	112117c1fc	Add our own "transport closed" signal This change some super old (and bad) code from the project's very early days. For some redic reason i must have thought masking `trio`'s internal stream / transport errors and a TCP EOF as `StopAsyncIteration` somehow a good idea. The reality is you probably want to know the difference between an unexpected transport error and a simple EOF lol. This begins to resolve that by adding our own special `TransportClosed` error to signal the "graceful" termination of a channel's underlying transport. Oh, and this builds on the `msgspec` integration which helped shed light on the core issues here B)	2021-10-05 13:37:17 -04:00
Tyler Goodlet	95e35f3d60	Add streaming decode support for `msgspec` Add a `tractor._ipc.MsgspecStream` type which can be swapped in for `msgspec` serialization transparently. A small msg-length-prefix framing is implemented as part of the type and we use `tricycle.BufferedReceieveStream` to handle buffering logic for the underlying transport. Notes: - had to force cast a few more list -> tuple spots due to no native `tuple`decode-by-default in `msgspec`: https://github.com/jcrist/msgspec/issues/30 - the framing can be understood by this protobuf walkthrough: https://eli.thegreenplace.net/2011/08/02/length-prefix-framing-for-protocol-buffers - `tricycle` becomes a new dependency	2021-10-05 13:37:17 -04:00
Tyler Goodlet	e39ee3a9cc	Always cast arbiter addr to tuple	2021-10-05 13:37:17 -04:00
Tyler Goodlet	dda0b22870	Try out `msgspec` in our msgpack stream channel Can only really use an encoder currently since there is no streaming api in `msgspec` as of currently. See jcrist/msgspec#27. Not sure if any encoding speedups are currently noticeable especially without any validation going on yet XD. First experiments toward #196	2021-10-05 13:37:17 -04:00
Tyler Goodlet	4079f02acf	Cast to tuples for all uids explicitly	2021-10-05 13:37:17 -04:00
Tyler Goodlet	518a0d5e14	Add todo for log msg filename..	2021-10-04 10:38:44 -04:00
Tyler Goodlet	8b416e6bba	Stream and context api tweaks - drop `shield` input to `MsgStream` - check for cancel called prior to loading the feeder mem chan in `Context.open_stream()` - warn on a timeout when trying to cancel a remote task from `Context.cancel()` - drop noop endofchannel handler block	2021-10-04 10:38:44 -04:00
Tyler Goodlet	bd31f47d5f	Handle kbi in ctx blocks via `BaseException` Fixes prior committed tests by more generally handling `BaseExcepion` in context blocks. Left in the commented concrete list for reference.	2021-10-04 10:38:44 -04:00
Tyler Goodlet	4259738864	Flip to using the `trio` spawner on windows Was able to try it manually on a windows 10 system and the debugger works great!	2021-09-18 14:10:32 -04:00
Tyler Goodlet	1137a9e7ac	Fix 404ed tokio urls	2021-09-02 21:12:54 -04:00
Tyler Goodlet	2745a2b1dc	Solve first-recv-cancelled by recursive `.receive()` on wake	2021-09-02 21:12:54 -04:00
Tyler Goodlet	d9bb52fe7b	Store array `maxlen` in state singleton The `collections.deque` takes care of array length truncation of values for us implicitly but in the future we'll likely want this value exposed to alternate array implementations. This patch is to provide for that as well as make `mypy` happy since the `dequeu.maxlen` can also be `None`.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	9258f79510	Don't wake sibling bcast consumers on a cancelled call	2021-09-02 21:12:54 -04:00
Tyler Goodlet	44ef26bb18	Shorten default feeder mem chan size to 64	2021-09-02 21:12:54 -04:00
Tyler Goodlet	7857a9ac6d	Add `shield: bool` kwarg to `Portal.open_stream_from()`	2021-09-02 21:12:54 -04:00
Tyler Goodlet	63ec740e27	Add some bcaster ref sanity asserts around subscriptions	2021-09-02 21:12:54 -04:00
Tyler Goodlet	093e7d921c	Instance ids are ints	2021-09-02 21:12:54 -04:00
Tyler Goodlet	bec3f5999d	Drop uuid4 keys, raise closed error on subscription after close	2021-09-02 21:12:54 -04:00
Tyler Goodlet	a4cb0ef21f	Fix `.receive()` re-assignment, drop `.clone()`	2021-09-02 21:12:54 -04:00
Tyler Goodlet	346b5d2eda	Blade runner it Get rid of all the (requirements for) clones of the underlying receivable. We can just use a uuid generated key for each instance (thinking now this can probably just be `id(self)`). I'm fully convinced now that channel cloning is only a source of confusion and anti-patterns when we already have nurseries to define resource lifetimes. There is no benefit in particular when you allocate subscriptions using a context manager (not sure why `trio.open_memory_channel()` doesn't enforce this). Further refinements: - add a `._closed` state that will error the receiver on reuse - drop module script section; it's been moved to a real test - call the "receiver" duck-type stub a new name	2021-09-02 21:12:54 -04:00
Tyler Goodlet	6c17c7367a	Store handle to underlying channel's `.receive()` This allows for wrapping an existing stream by re-assigning its receive method to the allocated broadcaster's `.receive()` so as to avoid expecting any original consumer(s) of the stream to now know about the broadcaster; this instead mutates the stream to delegate to the new receive call behind the scenes any time `.subscribe()` is called. Add a `typing.Protocol` for so called "cloneable channels" until we decide/figure out a better keying system for each subscription and mask all undesired typing failures.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	2d1c24112b	Add subscription support to message streams Add `ReceiveMsgStream.subscribe()` which allows allocating a broadcast receiver around the stream for use by multiple actor-local consumer tasks. Entering this context manager idempotently mutates the stream's receive machinery which for now can not be undone. Move `.clone()` to the receive stream type. Resolves #204	2021-09-02 21:12:54 -04:00
Tyler Goodlet	a12b1fc631	Drop optimization check, binance made its point	2021-09-02 21:12:54 -04:00
Tyler Goodlet	ceed96aa3f	Add common state delegate type for all consumers For every set of broadcast receivers which pull from the same producer, we need a singleton state for all of, - subscriptions - the sender ready event - the queue Add a `BroadcastState` dataclass for this and pass it to all subscriptions. This makes the design much more like the built-in memory channels which do something very similar with `MemoryChannelState`. Use a `filter()` on the subs list in the sequence update step, plus some other commented approaches we can try for speed.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	6e78bcf898	Facepalm: use single `_subs` per clone set	2021-09-02 21:12:54 -04:00
Tyler Goodlet	4ad75a3287	Obviously keying on tasks isn't going to work Using the current task as a subscription key fails horribly as soon as you hand off new subscription receiver to another task you've spawned.. Instead use the underlying ``trio.abc.ReceiveChannel.clone()`` as a key (so i guess we're assuming cloning is supported by the underlying?) which makes this all work just like default mem chans. As a bonus, now we can just close the underlying rx (which may be a clone) on `.aclose()` and everything should just work in terms of the underlying channels lifetime (i think?). Change `.subscribe()` to be async since the receive channel type interface only expects `.aclose()` and it actually ends up being nicer for 3.9+ style `async with` parentheses style anyway.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	64358f6525	Rename to broadcast mod, don't expect mem chan specifically	2021-09-02 21:12:54 -04:00
Tyler Goodlet	1af7dbb732	`Task` is hashable, so key on it	2021-09-02 21:12:54 -04:00
Tyler Goodlet	6a2c3da1bb	Simplify api around receive channel Buncha improvements: - pass in the queue via constructor - tracking over all underlying memory channel closure using cloning - do it like `tokio` and set lagged consumers to the last sequence before raising - copy the subs on first receiver wakeup for iteration instead of iterating the table directly (and being forced to skip the current tasks sequence increment) - implement `.aclose()` to close the underlying clone for this task - make `broadcast_receiver()` just take the recv chan since it doesn't need anything on the send side.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	3817b4fb5e	Ultra naive broadcast channel prototype	2021-09-02 21:12:54 -04:00
Tyler Goodlet	3208b67f57	Drop shielding on root lock acquire; seems to prevent hangs	2021-09-02 16:23:38 -04:00
Tyler Goodlet	61d2307e52	Unlock pdb tty on all possible net faults	2021-09-02 16:23:38 -04:00
Tyler Goodlet	79f0d6fda0	Attempt to avoid pdb lockups on channel breakage Always try to release the root tty lock on broken connection errors.	2021-09-02 16:23:10 -04:00
Tyler Goodlet	4f166500d0	Add return type to debugger factory	2021-09-02 16:22:59 -04:00
Tyler Goodlet	d906c81f14	Export portal type at top level	2021-09-02 16:22:59 -04:00
Tyler Goodlet	68d56d5df0	Try not masking SIGINT in child processes	2021-09-02 16:22:59 -04:00
Tyler Goodlet	497fa72c96	Add a SIGINT handler that kills the process tree We're not actually using this but it's for reference if we do end up needing it. The std lib's `pdb` internals override SIGINT handling whenever one enters the debugger repl. Force a handler that kills the tree if SIGINT is triggered from the root actor, otherwise ignore it since supervised children should be managed already. This resolves an issue with guest mode where `pdb` causes SIGINTs to be swallowed resulting in the host loop never terminating the process tree.	2021-09-02 16:22:02 -04:00
Tyler Goodlet	b4d95e9543	Update docs to new close semantics	2021-09-02 08:24:18 -04:00
Tyler Goodlet	af85d35685	Drop stream shielding; it was from a legacy design The whole origin was not having an explicit open/close semantic for streams. We have that now so this internal mechanic isn't needed and further our streams become more correct by having `.aclose()` be independent of cancellation.	2021-09-02 08:24:18 -04:00
Tyler Goodlet	7431e8ea01	Don't log cancelled inceptions seen by the root	2021-08-02 21:15:42 -04:00
Tyler Goodlet	82999801a6	Drop leftover noisy exception logging..	2021-08-02 16:56:00 -04:00
Tyler Goodlet	674fbbc6b3	Docs and comments tidying	2021-08-01 10:44:13 -04:00
Tyler Goodlet	6006adc0de	Hide `_invoke()` tb, move actor error to exceptions mod	2021-07-31 13:56:26 -04:00
Tyler Goodlet	0afa7f0f8e	Fix lock context manager return type	2021-07-31 12:50:58 -04:00
Tyler Goodlet	b3d28a1ee4	Drop debugger path and duplicate func from rebasing	2021-07-31 12:46:40 -04:00
Tyler Goodlet	09f00a5a00	Go back to only logging tbs on no debugger	2021-07-31 12:46:40 -04:00
Tyler Goodlet	44bfacc0c2	Comment hard-kill-sidestep for now since nursery version covers it?	2021-07-31 12:46:40 -04:00
Tyler Goodlet	551816e80d	Solve the root-cancels-child-in-tty-lock race Finally this makes a cancelled root actor nursery not clobber child tasks which request and lock the root's tty for the debugger repl. Using an edge triggered event which is set after all fifo-lock-queued tasks are complete, we can be sure that no lingering child tasks are going to get interrupted during pdb use and tty lock acquisition. Further, even if new tasks do queue up to get the lock, the root will incrementally send cancel msgs to each sub-actor only once the tty is not locked by a (set of) child request task(s). Add shielding around all the critical sections where the child attempts to allocate the lock from the root such that it won't be disrupted from cancel messages from the root after the acquire lock transaction has started.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	be1fcb2a5b	Distinguish between a local pdb unlock and the tty unlock in root	2021-07-31 12:46:40 -04:00
Tyler Goodlet	ef89ed947a	Fix hard kill in debug mode; only do it when debug lock is empty	2021-07-31 12:46:40 -04:00
Tyler Goodlet	5b3894827f	Move some infos to runtime level	2021-07-31 12:46:40 -04:00
Tyler Goodlet	0fdcfa0ba1	Move debugger wait inside OCA nursery	2021-07-31 12:46:40 -04:00
Tyler Goodlet	37a1897c47	Don't shield debugger status wait; it causes hangs	2021-07-31 12:46:40 -04:00
Tyler Goodlet	0f2a39a311	Catch and delay errors in the root if debugger is active	2021-07-31 12:46:40 -04:00
Tyler Goodlet	23a1622256	Don't kill root's immediate children when in debug If the root calls `trio.Process.kill()` on immediate child proc teardown when the child is using pdb, we can get stdstreams clobbering that results in a pdb++ repl where the user can't see what's been typed. Not killing such children on cancellation / error seems to resolve this issue whilst still giving reliable termination. For now, code that special path until a time it becomes a problem for ensuring zombie reaps.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	49d439b681	Add some brief todo notes on idea of shielded breakpoint	2021-07-31 12:46:40 -04:00
Tyler Goodlet	6f05f5d5e6	Wait for debugger lock task context termination	2021-07-31 12:46:40 -04:00
Tyler Goodlet	b369b91357	Fix up var naming and typing	2021-07-31 12:46:40 -04:00
Tyler Goodlet	969bce3aa4	Use context for remote debugger locking A context is the natural fit (vs. a receive stream) for locking the root proc's tty usage via it's `.started()` sync point. Simplify the `_breakpoin()` routine to be a simple async func instead of all this "returning a coroutine" stuff from before we decided that `tractor.breakpoint()` must be async. Use `runtime` level for locking logging making it easier to trace.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	443ebea165	Use "pdb" level logging in debug mode	2021-07-08 13:02:33 -04:00
Tyler Goodlet	25779d48a8	Define explicit adapter level methods for mypy	2021-07-08 12:51:35 -04:00
Tyler Goodlet	fde52d2464	Mypy fixes	2021-07-08 12:48:34 -04:00
Tyler Goodlet	8c927d708d	Change trace to transport level	2021-07-07 14:31:15 -04:00
Tyler Goodlet	31590e82a3	Flip "trace" level to "transport" level logging	2021-07-07 14:31:03 -04:00
Tyler Goodlet	2513c652c1	Go back to only logging crashes if no pdb gets engaged	2021-07-06 08:23:30 -04:00
Tyler Goodlet	9ddb654783	Avoid mutate on iterate race	2021-07-06 08:23:30 -04:00
Tyler Goodlet	7f86d63e77	Drop trip kwarg	2021-07-06 08:23:30 -04:00
Tyler Goodlet	12f987514d	Don't enter debug on closed resource errors	2021-07-06 08:23:30 -04:00
Tyler Goodlet	98bbf8e0df	Move join event trigger to direct exit path	2021-07-06 08:23:30 -04:00
Tyler Goodlet	b1cd7fdedf	Don't shield on root cancel it can causes hangs	2021-07-06 08:23:30 -04:00
Tyler Goodlet	ef725c5972	Always hard kill sub-procs on teardown Adds a new hard kill routine for the `trio` spawning backend.	2021-07-06 08:23:30 -04:00
Tyler Goodlet	b21e2a6caa	Add pre-stream open error conditions	2021-07-06 08:23:30 -04:00
Tyler Goodlet	c6cdaf9c31	De-densify some code	2021-07-06 08:23:30 -04:00
Tyler Goodlet	91640facbc	Always shield cancel the caller on cancel-causing-errors, add teardown logging	2021-07-06 08:23:30 -04:00
Tyler Goodlet	c2484e88a1	First try: pack cancelled tracebacks and ship to caller	2021-07-06 08:23:30 -04:00
Tyler Goodlet	3423ea4011	Add temp warning msg for context cancel call	2021-07-06 08:23:29 -04:00
Tyler Goodlet	af701c16ee	Consider relaying context error via raised-in-scope-nursery task	2021-07-06 08:23:29 -04:00
Tyler Goodlet	1703171bea	Set stream "end of channel" after shielded check! Another face palm that was causing serious issues for code that is using the `.shielded` feature.. Add a bunch more detailed comments for all this subtlety and hopefully get it right once and for all. Also aggregated the `trio` errors that should trigger closure inside `.aclose()`, hopefully that's right too.	2021-07-06 08:23:29 -04:00
Tyler Goodlet	3d633408fc	Don't clobber msg loop mem chan on rx stream close Revert this change since it really is poking at internals and doesn't make a lot of sense. If the context is going to be cancelled then the msg loop will tear down the feed memory channel when ready, we don't need to be clobbering it and confusing the runtime machinery lol.	2021-07-06 08:23:29 -04:00
Tyler Goodlet	196dea80db	Drop trailing comma	2021-07-06 08:23:29 -04:00
Tyler Goodlet	54916be601	Adjustments for non-frozen context dataclass change	2021-07-06 08:23:29 -04:00
Tyler Goodlet	1a69727b75	Fix exception typing	2021-07-06 08:23:29 -04:00
Tyler Goodlet	348148ff1e	Explicitly formalize context/streaming teardown Add clear teardown semantics for `Context` such that the remote side cancellation propagation happens only on error or if client code explicitly requests it (either by exit flag to `Portal.open_context()` or by manually calling `Context.cancel()`). Add `Context.result()` to wait on and capture the final result from a remote context function; any lingering msg sequence will be consumed/discarded. Changes in order to make this possible: - pass the runtime msg loop's feeder receive channel in to the context on the calling (portal opening) side such that a final 'return' msg can be waited upon using `Context.result()` which delivers the final return value from the callee side `@tractor.context` async function. - always await a final result from the target context function in `Portal.open_context()`'s `__aexit__()` if the context has not been (requested to be) cancelled by client code on block exit. - add an internal `Context._cancel_called` for context "cancel requested" tracking (much like `trio`'s cancel scope). - allow flagging a stream as terminated using an internal `._eoc` flag which will mark the stream as stopped for iteration. - drop `StopAsyncIteration` catching in `.receive()`; it does nothing.	2021-07-06 08:23:29 -04:00
Tyler Goodlet	73302d9d16	Specially raise a `ContextCancelled` for a task-context rpc	2021-07-06 08:23:29 -04:00
Tyler Goodlet	409f7f0d5a	Expose streaming components at top level	2021-07-06 08:23:29 -04:00
Tyler Goodlet	eb3662f981	Add a specially handled `ContextCancelled` error	2021-07-06 08:23:29 -04:00
Tyler Goodlet	39b9896a62	Only close recv chan if we get a ref	2021-07-06 08:23:29 -04:00
Tyler Goodlet	9a4244b9a6	Support no arg to `Context.started()` like trio	2021-07-06 08:23:29 -04:00

... 2 3 4 5 6 ...

658 Commits (callable_key_maybe_open_context)