tractor

Commit Graph

Author	SHA1	Message	Date
Tyler Goodlet	15047341bd	Ignore forserver override attrs with `mypy`	2022-10-09 16:14:11 -04:00
Tyler Goodlet	e609183242	Expose lifetime stack as class attr, add base test suite	2022-09-15 23:50:15 -04:00
Tyler Goodlet	10eeda2d2b	Use built-ins for all data-structure-type annotations	2022-09-15 23:41:28 -04:00
Tyler Goodlet	ad19bf2cf1	Remove `tractor.run()` once and for all It's been deprecated for a while now and all docs and tests have been changed. Closes #183	2022-09-15 23:41:28 -04:00
Tyler Goodlet	9aef03772a	Expose `Actor` at pkg level, adjust debug type annots	2022-09-15 23:41:28 -04:00
Tyler Goodlet	7548dba8f2	Change to new doc string style	2022-09-15 23:41:28 -04:00
Tyler Goodlet	208d56af2c	Make `async_main()` a module func	2022-09-15 23:41:28 -04:00
Tyler Goodlet	a3a5bc267e	Make `process_messages()` a mod func	2022-09-15 23:41:28 -04:00
Tyler Goodlet	d4084b2032	Rename our core module to `_runtime`	2022-09-15 23:41:28 -04:00
Tyler Goodlet	bafd10a260	Make `maybe_open_context()` re-entrant safe, use per factory locks	2022-09-15 19:02:02 -04:00
Tyler Goodlet	5ad540c417	Add debug complete event `None`-guard for when already reset	2022-09-15 19:02:02 -04:00
Tyler Goodlet	8f1fe2376a	Simplify all hooks to a common `Lock.release()`	2022-08-02 18:14:05 -04:00
Tyler Goodlet	650313dfef	Drop legacy handler blocks factored into `_acquire_debug_lock()`	2022-08-02 12:50:27 -04:00
Tyler Goodlet	e4006da6f4	Drop `pdbpp` bug notes, add follow up issue #320 note	2022-08-02 12:48:40 -04:00
Tyler Goodlet	7f6169a050	Drop legacy commented/todo remote debug helper block	2022-08-02 12:43:14 -04:00
Tyler Goodlet	02c3b9a672	Put `pygments` back to default	2022-08-02 12:17:34 -04:00
Tyler Goodlet	c5c7a9027c	Line len lint and drop rpc log msg level again	2022-08-02 12:17:34 -04:00
Tyler Goodlet	937ed99e39	Factor sigint overriding into lock methods	2022-08-02 12:17:28 -04:00
Tyler Goodlet	91f034a136	Move all module vars into a `Lock` type	2022-08-02 12:17:28 -04:00
Tyler Goodlet	6f01c78122	Disable `pygments` highlighting on ctlc tests	2022-08-02 12:17:28 -04:00
Tyler Goodlet	c0cd99e374	Timeout on arbiter ping, avoid TCP SYN hangs in CI?	2022-08-02 12:17:28 -04:00
Tyler Goodlet	b01daa5319	Factor lock-state release logic into helper The common logic to both remove our custom SIGINT handler as well as signal the actor global event that pdb is complete. Call this whenever we exit a post mortem call and thus any time some rpc task get's debugged inside `._actor._invoke()`. Further, we have to manually print the REPL prompt on 3.9 for some wack reason, so stick a version guard in the sigint handler for that..	2022-08-02 12:17:28 -04:00
Tyler Goodlet	bd362a05f0	Run release hook around `next` repl commands as well	2022-08-02 12:17:28 -04:00
Tyler Goodlet	b21f2e16ad	Always consider the debugger when exiting contexts When in an uncertain teardown state and in debug mode a context can be popped from actor runtime before a child finished debugging (the case when the parent is tearing down but the child hasn't closed/completed its tty lock IPC exit phase) and the child sends the "stop" message to unlock the debugger but it's ignored bc the parent has already dropped the ctx. Instead we call `._debug.maybe_wait_for_deugger()` before these context removals to avoid the root getting stuck thinking the lock was never released. Further, add special `Actor._cancel_task()` handling code inside `_invoke()` which continues to execute the method despite the IPC channel to the caller being broken and thus avoiding potential hangs due to a target (child) actor task remaining alive.	2022-08-02 12:17:28 -04:00
Tyler Goodlet	ba7b355d9c	Add note about default behaviour of `fancycompleter`	2022-08-02 12:17:28 -04:00
Tyler Goodlet	ef8dc0204c	Just drop all longlisting for now and leave comments	2022-08-02 12:17:28 -04:00
Tyler Goodlet	a101971027	Go back to original longlist code	2022-08-02 12:17:28 -04:00
Tyler Goodlet	835836123b	Just don't call longlist on 3.10+ for now	2022-08-02 12:17:28 -04:00
Tyler Goodlet	b9eb601265	General typing fixes for `mypy`	2022-08-02 12:17:27 -04:00
Tyler Goodlet	4dcc21234e	Only call `.poll()` if a method on the spawn backend	2022-08-02 12:17:27 -04:00
Tyler Goodlet	8b9f342eef	Port to new `.lowlevel.open_process()` API	2022-08-02 12:17:27 -04:00
Tyler Goodlet	a90ca4b384	Call longlist normally when on py < 3.10	2022-08-02 12:17:06 -04:00
Tyler Goodlet	d0dcd55f47	Only report disconnected actors if proc is still alive?	2022-08-02 12:17:06 -04:00
Tyler Goodlet	519f4c300b	I dunno, seems like `breakpoint()` needs this?	2022-08-02 12:17:06 -04:00
Tyler Goodlet	ff3f5959e9	Always enable debug level logging if mode enabled	2022-08-02 12:16:58 -04:00
Tyler Goodlet	abb00531d3	Add help msg for non `__main__` modules as well	2022-08-02 12:16:58 -04:00
Tyler Goodlet	18c525d2f1	Hack around double long list print issue.. See https://github.com/pdbpp/pdbpp/issues/496	2022-08-02 12:16:58 -04:00
Tyler Goodlet	e2453fd3da	Add spaces before values in log msg	2022-08-02 12:16:58 -04:00
Tyler Goodlet	b29def8b5d	Add runtime level msg around channel draining	2022-08-02 12:16:58 -04:00
Tyler Goodlet	f07e9dbb2f	Always undo SIGINT overrides, cancel detached children Ensure that even when `pdb` resumption methods are called during a crash where `trio`'s runtime has already terminated (eg. `Event.set()` will raise) we always revert our sigint handler to the original. Further inside the handler if we hit a case where a child is in debug and (thinks it) has the global pdb lock, if it has no IPC connection to a parent, simply presume tty sync-coordination is now lost and cancel the child immediately.	2022-08-02 12:16:49 -04:00
Tyler Goodlet	c7035be2fc	Tolerate double `.remove()`s of stream on portal teardowns	2022-07-27 11:40:02 -04:00
Tyler Goodlet	deaca7d6cc	Always propagate SIGINT when no locking peer found A hopefully significant fix here is to always avoid suppressing a SIGINT when the root actor can not detect an active IPC connections (via a connected channel) to the supposed debug lock holding actor. In that case it is most likely that the actor has either terminated or has lost its connection for debugger control and there is no way the root can verify the lock is in use; thus we choose to allow KBI cancellation. Drop the (by comment) `try`-`finally` block in `_hijoack_stdin_for_child()` around the `_acquire_debug_lock()` call since all that logic should now be handled internal to that locking manager. Try to catch a weird error around the `.do_longlist()` method call that seems to sometimes break on py3.10 and latest `pdbpp`.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	d47d0e7c37	Always call pdb hook even if tty locking fails	2022-07-27 11:40:02 -04:00
Tyler Goodlet	0062c96a3c	Log cancels with appropriate level	2022-07-27 11:40:02 -04:00
Tyler Goodlet	4be13b7387	Just warn on IPC breaks	2022-07-27 11:40:02 -04:00
Tyler Goodlet	7bb5addd4c	Only warn on `trio.BrokenResourceError`s from `_invoke()`	2022-07-27 11:40:02 -04:00
Tyler Goodlet	89b44f8163	Pre-declare disconnected flag	2022-07-27 11:40:02 -04:00
Tyler Goodlet	2819b6a5b2	Avoid attr error XD	2022-07-27 11:40:02 -04:00
Tyler Goodlet	f2671ed026	Type annot updates	2022-07-27 11:40:02 -04:00
Tyler Goodlet	41924c86a6	Drop uneeded backframe traceback hide annotation	2022-07-27 11:40:02 -04:00
Tyler Goodlet	206c7c0720	Make `Actor._process_messages()` report disconnects The method now returns a `bool` which flags whether the transport died to the caller and allows for reporting a disconnect in the channel-transport handler task. This is something a user will normally want to know about on the caller side especially after seeing a traceback from the peer (if in tree) on console.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	bf0ac3116c	Only cancel/get-result from a ctx if transport is up There's no point in sending a cancel message to the remote linked task and especially no reason to block waiting on a result from that task if the transport layer is detected to be disconnected. We expect that the transport shouldn't go down at the layer of the message loop (reconnection logic should be handled in the transport layer itself) so if we detect the channel is not connected we don't bother requesting cancels nor waiting on a final result message. Why? - if the connection goes down in error the caller side won't have a way to know "how long" it should block to wait for a cancel ack or result and causes a potential hang that may require an additional ctrl-c from the user especially if using the debugger or if the traceback is not seen on console. - obviously there's no point in waiting for messages when there's no transport to deliver them XD Further, add some more detailed cancel logging detailing the task and actor ids.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	74b819a857	Typing fixes, simplify `_set_trace()`	2022-07-27 11:40:02 -04:00
Tyler Goodlet	8892204c84	Add notes around py3.10 stdlib bug from `pdb++` There's a bug that's triggered in the stdlib without latest `pdb++` installed; add a note for that. Further inside `wait_for_parent_stdin_hijack()` don't `.started()` until the interactor stream has been opened to avoid races when debugging this `._debug.py` module (at the least) since we usually don't want the spawning (parent) task to resume until we know for sure the tty lock has been acquired. Also, drop the random checkpoint we had inside `_breakpoint()`, not sure it was actually adding anything useful since we're (mostly) carefully shielded throughout this func.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	8f4bbf1cbf	Add and use a pdb instance factory	2022-07-27 11:40:02 -04:00
Tyler Goodlet	aea8f63bae	Drop all the `@cm.__exit__()` override attempts.. None of it worked (you still will see `.__exit__()` frames on debugger entry - you'd think this would have been solved by now but, shrug) so instead wrap the debugger entry-point in a `try:` and put the SIGINT handler restoration inside `MultiActorPdb` teardown hooks. This seems to restore the UX as it was prior but with also giving the desired SIGINT override handler behaviour.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	7964a9f6f8	Try overriding `_GeneratorContextManager.__exit__()`; didn't work.. Using either of `@pdb.hideframe` or `__tracebackhide__` on stdlib methods doesn't seem to work either.. This all seems to have something to do with async generator usage I think ?	2022-07-27 11:40:02 -04:00
Tyler Goodlet	e5195264a1	Handle a context cancel? Might be a noop	2022-07-27 11:40:02 -04:00
Tyler Goodlet	345573e602	Make `mypy` happy	2022-07-27 11:40:02 -04:00
Tyler Goodlet	4e60c17375	Refine the handler for child vs. root cases This gets very close to avoiding any possible hangs to do with tty locking and SIGINT handling minus a special case that will be detailed below. Summary of implementation changes: - convert `_mk_pdb()` -> `with _open_pdb() as pdb:` which implicitly handles the `bdb.BdbQuit` case such that debugger teardown hooks are always called. - rename the handler to `shield_sigint()` and handle a variety of new cases: * the root is in debug but hasn't been cancelled -> call `Actor.cancel_soon()` * the root is in debug but has been called (`Actor.cancel_soon()` already called) -> raise KBI * a child is in debug and has a task locking the debugger -> ignore SIGINT in child and the root actor. - if the debugger instance is provided to the handler at acquire time, on SIGINT handling completion re-print the last pdb++ REPL output so that the user realizes they are still actively in debug. - ignore the unlock case where a race condition of "no task" holding the lock causes the `RuntimeError` normally associated with the "wrong task" doing so (not sure if this is a `trio` bug?). - change debug logs to runtime level. Unhandled case(s): - a child is maybe in debug mode but does not itself have any task using the debugger. * ToDo: we need a way to decide what to do with "intermediate" child actors who themselves either are not in `debug_mode=True` but have children who are such that a SIGINT won't cause cancellation of that child-as-parent-of-another-child iff any of their children are in in debug mode.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	6b7b58346f	(facepalm) Reraise `BdbQuit` and discard ownerless lock releases	2022-07-27 11:40:02 -04:00
Tyler Goodlet	3cac323421	Add WIP while-debugger-active SIGINT ignore handler	2022-07-27 11:40:02 -04:00
goodboy	4902e184e9	Merge pull request #318 from goodboy/aio_error_propagation Add context test that opens an inter-task-channel that errors	2022-07-15 12:42:19 -04:00
Tyler Goodlet	05790a20c1	Slight lint fixes	2022-07-15 11:18:48 -04:00
Tyler Goodlet	f0d78e1a6e	Use local task ref, fixes `mypy`	2022-07-15 10:39:49 -04:00
Tyler Goodlet	0906559ed9	Drop manual stack construction, fix attr typo	2022-07-14 20:43:17 -04:00
Tyler Goodlet	38d03858d7	Fix `asyncio`-task-sync and error propagation This fixes an previously undetected bug where if an `.open_channel_from()` spawned task errored the error would not be propagated to the `trio` side and instead would fail silently with a console log error. What was most odd is that it only seems easy to trigger when you put a slight task sleep before the error is raised (:eyeroll:). This patch adds a few things to address this and just in general improve iter-task lifetime syncing: - add `LinkedTaskChannel._trio_exited: bool` a flag set from the `trio` side when the channel block exits. - add a `wait_on_aio_task: bool` flag to `translate_aio_errors` which toggles whether to wait the `asyncio` task termination event on exit. - cancel the `asyncio` task if the trio side has ended, when `._trio_exited == True`. - always close the `trio` mem channel when the task exits such that the `asyncio` side can error on any next `.send()` call.	2022-07-14 16:35:41 -04:00
Tyler Goodlet	41983edc43	Use `str` \| `bytes` union for typing msg dump	2022-07-12 11:59:11 -04:00
Tyler Goodlet	5168700fbf	Tolerate non-decode-able bytes	2022-07-12 11:55:55 -04:00
Tyler Goodlet	673c4a8c66	Decode bytes prior to log msg	2022-07-12 11:55:55 -04:00
Tyler Goodlet	932b841176	Allow up to 4 `msgpsec` decode failures	2022-07-12 11:55:55 -04:00
Tyler Goodlet	f594f1bdda	Handle a connection reset on `msgspec` transport	2022-07-12 11:55:55 -04:00
Tyler Goodlet	4e7ab54452	Appease `mypy`	2022-07-12 11:22:30 -04:00
Tyler Goodlet	f94b7cd991	Drop `msgpack` lib and use `msgspec` for transport	2022-07-12 10:37:13 -04:00
Tyler Goodlet	8901272854	Fix typing	2022-04-13 08:20:53 -04:00
Tyler Goodlet	80897a8f2b	Add `tractor.query_actor()` an addr looker-upper Sometimes it's handy to just have a non-`Portal` yielding way to figure out if a "service" actor is up, so add this discovery helper for that. We'll prolly just leave it undocumented for now until we figure out a longer-term/better discovery system.	2022-04-13 07:50:42 -04:00
Tyler Goodlet	f3606d5bd8	Type fixes	2022-04-12 11:48:32 -04:00
Tyler Goodlet	c322a193f2	Make `LinkedTaskChannel` trio-task-broadcastable with `.subscribe()`	2022-04-12 11:42:44 -04:00
Tyler Goodlet	46963c2e63	Don't handle `GeneratorExit` on `asyncio` tasks	2022-04-12 11:42:44 -04:00
Tyler Goodlet	9b77b8c9ee	Add more explicit `asyncio` task error logging When an `asyncio` side task errors or is cancelled we now explicitly report the traceback and task name if possible as well as the source reason for the error (some come from the `trio` side). Further, properly set any `trio` side exception (after unwrapping it from the `outcome.Error`) on the future that runs the `trio` guest run.	2022-04-12 11:42:44 -04:00
Tyler Goodlet	c30cece37a	Fix one missing import/ref	2022-02-17 13:03:37 -05:00
Tyler Goodlet	509082c935	Port to new `msgspec` error type	2022-02-17 11:55:26 -05:00
Tyler Goodlet	75bb1added	Avoid importing mp for as long as possible	2022-02-17 11:55:26 -05:00
Tyler Goodlet	76a0492028	Fix type annot	2022-02-15 08:52:04 -05:00
Tyler Goodlet	4eab4a0213	Type fix	2022-02-15 08:51:25 -05:00
Tyler Goodlet	0edc6a26bc	Go back to strict map keys	2022-02-15 08:48:43 -05:00
Tyler Goodlet	c5acc3b969	Pack tuple keys as . delim strs in registry tests	2022-02-15 08:48:07 -05:00
Tyler Goodlet	17bfa120cc	Port to msgpec `0.4.0` imports	2022-02-14 14:05:55 -05:00
Tyler Goodlet	77ddc073e8	Use lists by default like `msgspec`	2022-02-09 10:07:33 -05:00
Tyler Goodlet	87de28fd88	Slight doc string update	2022-01-30 12:21:41 -05:00
Tyler Goodlet	56b29c27de	Add msg serialization coding todo resources list	2022-01-30 12:19:21 -05:00
Tyler Goodlet	25a27e780d	Add todo resources for eventual capability-based module filtering	2022-01-30 11:28:10 -05:00
Tyler Goodlet	c265f3f94e	Move namespace path type into `msg` mod	2022-01-30 11:27:34 -05:00
Tyler Goodlet	2900ceb003	Not all objects have a `.__name__`	2022-01-30 11:26:34 -05:00
Tyler Goodlet	b6ae77b5ac	Use `pkgutils.resolve_name()` and a `str` subtype Python 3.9's new object resolver + a `str` is much simpler then mucking with tuples (and easier to serialize). Include a `.to_tuple()` formatter since we still are passing the module namespace and function name separately inside the runtime's message format but in theory we might be able to simplify this depending on how we would change the support for `enable_modules:list[str]` in the spawn API. Thanks to @Fuyukai for pointing `resolve_name()` which I didn't know about before!	2022-01-30 11:26:34 -05:00
Tyler Goodlet	949cb2c9fe	First draft "namespace path" named tuple; probably will discard	2022-01-30 11:26:34 -05:00
Tyler Goodlet	7e004c0688	Add back blank `msg.py`	2022-01-29 14:22:15 -05:00
Tyler Goodlet	ffe88de53b	Better idea: start a `tractor.experimental` subpkg	2022-01-29 14:03:55 -05:00
Tyler Goodlet	d29a915d48	Update mod doc string	2022-01-29 14:02:04 -05:00
Tyler Goodlet	be87caa99b	Move legacy pubsub stuff from `msg.py` to trionics mod	2022-01-29 14:02:04 -05:00
Tyler Goodlet	9650055519	Use `.exitcode` which is poll + error handling	2022-01-21 12:49:26 -05:00
Tyler Goodlet	532974fb90	Drop leftover print	2022-01-21 12:49:26 -05:00
Tyler Goodlet	b1d72b77c9	Patch mp procs with a `.poll()` Not sure why they don't already expose this from the `Popen` backends but, k.	2022-01-21 12:49:26 -05:00
Tyler Goodlet	a2171c7e71	Cancel the `.cancel_actor()` request on proc death Adjust the `soft_wait()` strategy to avoid sending needless cancel requests if it is known that a child process is already terminated or does so before the cancel request times out. This should be no slower and should avoid needless waits on either closure-in-progress or already closed channels. Basic strategy is, - request child actor to cancel - if process termination is detected, cancel the cancel - if the process is still alive after a cancel request timeout warn the user and yield back to the hard reap handling	2022-01-21 12:49:26 -05:00
Tyler Goodlet	9b4cdb00e6	Add agpl header	2021-12-17 09:39:30 -05:00
Tyler Goodlet	24078f2d6e	More doc string style tweaks	2021-12-17 09:38:04 -05:00
Tyler Goodlet	56cc98375e	Return channel type from `_run_asyncio_task()` Better encapsulate all the mem-chan, Queue, sync-primitives inside our linked task channel in order to avoid `mypy`'s complaints about monkey patching. This also sets footing for adding an `asyncio`-side channel API that can be used more like this `trio`-side API.	2021-12-17 09:38:04 -05:00
Tyler Goodlet	b69412a903	Drop cancel scope from linked task channel	2021-12-17 09:38:04 -05:00
Tyler Goodlet	6803891bd7	Collect `asyncio` task exceptions to avoid warning msg	2021-12-17 09:38:04 -05:00
Tyler Goodlet	5f4094691d	Re-wrap and raise `asyncio.CancelledError` For whatever reason `trio` seems to be swallowing this exception when raised in the `trio` task so instead wrap it in our own non-base exception type: `AsyncioCancelled` and raise that when the `asyncio` task cancels itself internally using `raise <err> from <src_err>` style. Further don't bother cancelling the `trio` task (via cancel scope) since we we can just use the recv mem chan closure error as a signal and explicitly lookup any set asyncio error.	2021-12-17 09:38:04 -05:00
Tyler Goodlet	c48c68c0bc	Flip doc strings to my preferred format	2021-12-17 09:38:04 -05:00
Tyler Goodlet	44d0e9fc32	Add a `LinkedTaskChannel` for synced inter-loop-streaming Wraps the pairs of underlying `trio` mem chans and the `asyncio.Queue` with this new composite which will be delivered from `open_channel_from()`. This allows for both sending and receiving values from the `asyncio` task (2 way msg passing) as well controls for cancelling or waiting on the task. Factor `asyncio` translation and re-raising logic into a new closure which is run on both `trio` side error handling as well as on normal termination to avoid missing `asyncio` errors even when `trio` task cancellation is handled first. Only close the `trio` mem chans on `trio` task termination iff the task was spawned using `open_channel_from()`: - on `open_channel_from()` exit, mem chan closure is the desired semantic - on `run_task()` we normally only return a single value or error and if the channel is closed before the error is raised we may propagate a `trio.EndOfChannel` instead of the desired underlying `asyncio` task's error	2021-12-17 09:38:04 -05:00
Tyler Goodlet	9bc94b5ccc	Factor error translation into a ctx mngr Pull the common `asyncio` -> `trio` error translation logic into a common context manager and don't expect a final result to be captured when using `open_channel_from()` since it's a manager interface and it would be clunky to try and deliver some "final result" after exit.	2021-12-17 09:38:04 -05:00
Tyler Goodlet	e6687bcdc4	Serious-ify doc string	2021-12-17 09:38:04 -05:00
Tyler Goodlet	8704664719	Reverse the order for asyncio cancelleds? I dunno why	2021-12-17 09:38:04 -05:00
Tyler Goodlet	1114b6980e	Adjust linked-loop-task tear down sequence Close the mem chan before cancelling the `trio` task in order to ensure we retrieve whatever error is shuttled from `asyncio` before the channel read is potentially cancelled (previously a race?). Handle `asyncio.CancelledError` specially such that we raise it directly (instead of `raise aio_cancelled from other_err`) since it is the source error in the case where the cancellation is `asyncio` internal.	2021-12-17 09:38:04 -05:00
Tyler Goodlet	56357242e9	Add a `Portal.cancel_actor()` test	2021-12-17 09:38:04 -05:00
Tyler Goodlet	0ab5e5cadd	Fill out nursery docstring	2021-12-17 09:38:04 -05:00
Tyler Goodlet	06fa650ed0	Drop runtime logging for asyncio mode	2021-12-17 09:38:04 -05:00
Tyler Goodlet	446feff172	Clean type imports	2021-12-17 09:38:04 -05:00
Tyler Goodlet	41eddffc2c	Drop old (and deluded) "streaming" cruft	2021-12-17 09:38:04 -05:00
Tyler Goodlet	7a65165279	Facepalm, re-raise captured `asyncio` task error	2021-12-17 09:38:04 -05:00
Tyler Goodlet	b376b7cd32	First draft: `.to_asyncio.open_channel_from()`	2021-12-17 09:38:04 -05:00
Tyler Goodlet	c262b1a3e8	Always cancel the asyncio task?	2021-12-17 09:38:04 -05:00
Tyler Goodlet	d9dac3f36c	Drop old implementation cruft	2021-12-17 09:38:04 -05:00
Tyler Goodlet	325c0cdb1b	Fix error propagation on asyncio streaming tasks	2021-12-17 09:38:04 -05:00
Tyler Goodlet	55e210fec6	Drop bad .close() call	2021-12-17 09:38:04 -05:00
Tyler Goodlet	aa24bbc11c	Proxy asyncio cancelleds as well	2021-12-17 09:38:04 -05:00
Tyler Goodlet	793bcfb7d4	Pass `infect_asyncio` flag to mp actors as well	2021-12-17 09:38:04 -05:00
Tyler Goodlet	d80f8d7a39	WIP redo asyncio async gen streaming	2021-12-17 09:38:04 -05:00
Tyler Goodlet	340effae11	Add initial infected asyncio error propagation test	2021-12-17 09:38:01 -05:00
Tyler Goodlet	509ae132ec	Raise any asyncio errors if in trio task on cancel	2021-12-17 09:38:01 -05:00
Tyler Goodlet	80f47dece2	Raise from asyncio error; fixes mypy	2021-12-17 09:38:01 -05:00
Tyler Goodlet	2cf87146a3	Log any asyncio error	2021-12-17 09:38:01 -05:00
Tyler Goodlet	8070b16bd0	Support asyncio actors with the trio spawner backend	2021-12-17 09:38:01 -05:00
Tyler Goodlet	1406ddc5ee	Add `infect_asyncio: bool` flag to nursery methods	2021-12-17 09:37:41 -05:00
Tyler Goodlet	055788cf16	Attempt to make mypy happy..	2021-12-17 09:19:23 -05:00
Tyler Goodlet	1825b21d2c	Wow, fix all the broken async func invoking code.. Clearly this wasn't developed against a task that spawned just an async func in `asyncio`.. Fix all that and remove a bunch of unnecessary func layers. Add provisional support for the target receiving the `to_trio` and `from_trio` channels and for the @tractor.stream marker.	2021-12-17 09:19:23 -05:00
Tyler Goodlet	acd63d0c89	First draft "infected `asyncio` mode" This should mostly maintain top level SC principles for any task spawned using `tractor.to_asyncio.run()`. When the `asyncio` task completes make sure to cancel the pertaining `trio` cancel scope and raise any error that may have resulted. This interface uses `trio`'s "guest-mode" to run `asyncio` loop using a special entrypoint which is handed to Python during process spawn.	2021-12-17 09:17:59 -05:00
Tyler Goodlet	98a830ccba	Drop cancel traceback capture; don't seem to need it?	2021-12-16 19:59:10 -05:00
Tyler Goodlet	8c004c1f36	Add an explicit messaging error for reporting an illegal context transaction	2021-12-16 19:59:10 -05:00
Tyler Goodlet	e2139c2bf0	Don't set `Context._error` to expected `ContextCancelled` If the one side of an inter-actor context cancels the other then that side should always expect back a `ContextCancelled` message. However we should not set this error in this case (where the cancel request was sent and a `ContextCancelled` msg was received back) since it may override some other error that caused the cancellation request to be sent out in the first place. As an example when a context opens another context to a peer and some error happens which causes the second peer context to be cancelled but we want to propagate the original error. Fixes the issue found in https://github.com/pikers/piker/issues/244	2021-12-16 19:59:10 -05:00
Tyler Goodlet	5d424e3703	Hide the key error tb on remote starting errors	2021-12-16 19:59:10 -05:00
Tyler Goodlet	da5e36bf0c	Revert back to avoiding key errors on cancellation	2021-12-16 18:02:03 -05:00
Tyler Goodlet	26394dd8df	Type annot fixes	2021-12-16 18:02:03 -05:00
Tyler Goodlet	11e64426f6	Wake all sleeping consumers on bcaster closure	2021-12-16 18:02:03 -05:00
Tyler Goodlet	213447008b	Add draft code for waiting on all nurseries in root	2021-12-16 18:02:03 -05:00
Tyler Goodlet	52627a6326	Rework interface: pass func and kwargs After more extensive testing I realized that keying on the context manager instance id isn't going to work since each entering task is going to create a unique key XD Instead pass the manager function as `acm_func` and optionally allow keying the resource on the passed `kwargs` (if hashable) or the `key:str`. Further, pass the key to the enterer task and avoid a separate keying scheme for the manager versus the value it delivers. Don't bother with checking and releasing the lock in `finally:` block, it should be an error if it's still locked.	2021-12-16 18:02:03 -05:00
Tyler Goodlet	3826bc9972	Don't catch key errors from the yielded to scope	2021-12-16 18:02:03 -05:00
Tyler Goodlet	b210278e2f	Naming change `cache` -> `_Cache`	2021-12-16 18:02:03 -05:00
Tyler Goodlet	ac22b4a875	Fix type annots in resource cacher internals	2021-12-16 18:02:03 -05:00
Tyler Goodlet	5f41dbf34f	Add `maybe_open_context()` an actor wide task-resource cache	2021-12-16 18:02:03 -05:00
Tyler Goodlet	57f2aca18c	Set eoc on closure (again)	2021-12-16 16:19:15 -05:00
Tyler Goodlet	f2ba961e81	Mark stream with EOC when stop message is received	2021-12-16 16:18:58 -05:00
Tyler Goodlet	3deb1b91e6	Wake all broadcast consumers on EOC Without this wakeup you can have tasks which re-enter `.receive()` and get stuck waiting on the wakeup event indefinitely. Whenever a ``trio.EndOfChannel`` arrives we want to make sure all consumers at least know about it and don't block. This previous behaviour was basically a bug. Add some state flags for tracking if the broadcaster was either cancelled or terminated via EOC mostly for testing and debugging purposes though this info might be useful if we decide to offer a `.statistics()` like API in the future.	2021-12-16 16:16:14 -05:00
Tyler Goodlet	61e134dc5d	Wake up consumers on end of channel as well	2021-12-16 16:15:54 -05:00
Tyler Goodlet	6f94ffc304	Re-license code base for distribution under AGPL This commit obviously denotes a re-license of all applicable parts of the code base. Acknowledgement of this change was completed in #274 by the majority of the current set of contributors. From here henceforth all changes will be AGPL licensed and distributed. This is purely an effort to maintain the same copy-left policy whilst closing the (perceived) SaaS loophole the GPL allows for. It is merely for this loophole: to avoid code hiding by any potential "network providers" who are attempting to use the project to make a profit without either compensating the authors or re-distributing their changes. I thought quite a bit about this change and can't see a reason not to close the SaaS loophole in our current license. We still are (hard) copy-left and I plan to keep the code base this way for a couple reasons: - The code base produces income/profit through parent projects and is demonstrably of high value. - I believe firms should not get free lunch for the sake of "contributions from their employees" or "usage as a service" which I have found to be a dubious argument at best. - If a firm who intends to profit from the code base wants to use it they can propose a secondary commercial license to purchase with the proceeds going to the project's authors under some form of well defined contract. - Many successful projects like Qt use this model; I see no reason it can't work in this case until such a time as the authors feel it should be loosened. There has been detailed discussion in #103 on licensing alternatives. The main point of this AGPL change is to protect the code base for the time being from exploitation while it grows and as we move into the next phase of development which will include extension into the multi-host distributed software space.	2021-12-14 23:33:27 -05:00
Tyler Goodlet	a38a983225	Increase debugger poll delay back to prior value If we make it too fast a nursery with debug mode children can cancel too fast and causes some test failures. It's likely not a huge deal anyway since the purpose of this poll/check is for human interaction and the current delay isn't really that noticeable. Decrease log levels in the debug module to avoid console noise when in use. Toss in some more detailed comments around the new debugger lock points.	2021-12-10 11:54:27 -05:00
Tyler Goodlet	9bee513136	Use manual debugger-in-use flag in nursery and spawn task	2021-12-09 17:53:29 -05:00
Tyler Goodlet	5d9e3d1163	Add a manual debug mode kwarg to debugger waiter	2021-12-09 17:52:35 -05:00
Tyler Goodlet	92c6ec1882	`get_loglevel()` always returns a str	2021-12-07 13:17:00 -05:00
Tyler Goodlet	72eef2a4a1	Config debug mode log level after initial setup	2021-12-07 13:16:07 -05:00
Tyler Goodlet	9bd5226e76	Only adjust logging in debug mode if not noisy enough already	2021-12-07 13:13:04 -05:00
Tyler Goodlet	e899cc42bf	Add per actor debug mode toggle	2021-12-07 13:11:06 -05:00
Tyler Goodlet	4856285dee	Add back broken send chan ignore block	2021-12-06 17:04:17 -05:00
Tyler Goodlet	4b40599c48	Fix ignore warning log message	2021-12-06 16:32:23 -05:00
Tyler Goodlet	c9132de7dc	Move maybe-raise-error-msg logic into context A context method handling all this logic makes the most sense since it contains all the state related to whether the error should be raised in a nursery scope or is expected to be raised by a consumer task which reads and processes the msg directly (via a `Portal` API call). This also makes it easy to always process remote errors even when there is no (stream) overrun condition.	2021-12-06 16:32:23 -05:00
Tyler Goodlet	1f8e1cccbb	Only pop contexts on decorated entrypoints	2021-12-06 13:48:19 -05:00
Tyler Goodlet	318027ebd1	Raise stream overruns on one side never opened A context stream overrun should normally never take place since if a stream is opened (via ``Context.open_stream()``) backpressure is applied on the message buffer (unless explicitly disabled by the ``backpressure=False`` flag) such that an overrun on the receiving task should result in blocking the (remote) sender task (eventually depending on the underlying ``MsgStream`` transport). Here we add a special error message that reports if one side never opened a stream and let's the user know in the overrun error message that they may be trying to push messages to a task that isn't ready to receive them. Further fixes / details: - pop any `Context` at the end of any `_invoke()` task that creates one and registers with the runtime. - ignore but warn about messages received for a context that either no longer exists or is unknown (guarding against crashes by malicious packets in the latter case)	2021-12-06 11:54:21 -05:00
Tyler Goodlet	b826ec8103	Better idea, enable backpressure on opened streams Keeping it disabled on context open will help with detecting any stream connection which was never opened on one side of the task pair. In that case we can report that there was an overrun and a stream wasn't opened versus if the stream is explicitly configured not to use bp then we throw the standard overflow. Use `trio.Nursery._closed` to detect "closure" XD since it seems to be the most reliable way to determine if a spawn call will trigger a runtime error.	2021-12-06 11:54:21 -05:00
Tyler Goodlet	4ea5c9b5db	Pop context on `.open_context()` exit	2021-12-06 11:54:21 -05:00
Tyler Goodlet	41a3e6a9ca	Type check fixes	2021-12-05 20:00:58 -05:00
Tyler Goodlet	185dbc7e3f	Disable msg stream backpressure by default Half of portal API usage requires a 1 message response (`.run()`, `.run_in_actor()`) and the streaming APIs should probably be explicitly enabled for backpressure if desired by the user. This makes more sense in (psuedo) realtime systems where it's better to notify on a block then freeze without notice. Make this default behaviour with a new error to be raised: `tractor._exceptions.StreamOverrun` when a sender overruns a stream by the default size (2**6 for now). The old behavior can be enabled with `Context.open_stream(backpressure=True)` but now with warning log messages when there are overruns. Add task-linked-context error propagation using a "nursery raising" technique such that if either end of context linked pair of tasks errors, that error can be relayed to other side and raised as a form of interrupt at the receiving task's next `trio` checkpoint. This enables reliable error relay without expecting the (error) receiving task to call an API which would raise the remote exception (which it might never currently if using `tractor.MsgStream` APIs). Further internal implementation details: - define the default msg buffer size as `Actor.msg_buffer_size` - expose a `msg_buffer_size: int` kwarg from `Actor.get_context()` - maybe raise aforementioned context errors using `Context._maybe_error_from_remote_msg()` inside `Actor._push_result()` - support optional backpressure on a stream when pushing messages in `Actor._push_result()` - in `_invote()` handle multierrors raised from a `@tractor.context` entrypoint as being potentially caused by a relayed error from the remote caller task, if `Context._error` has been set then raise that error inside the `RemoteActorError` that will be relayed back to that caller more or less proxying through the source side error back to its origin.	2021-12-05 19:31:41 -05:00
Tyler Goodlet	2680a9473d	Always set `Context._portal` on the caller task side	2021-12-05 19:28:00 -05:00
Tyler Goodlet	92b540d518	Add internal msg stream backpressure controls In preparation for supporting both backpressure detection (through an optional error) as well as control over the msg channel buffer size, add internal configuration flags for both to contexts. Also adjust `Context._err_on_from_remote_msg()` -> `._maybe..` such that it can be called and will only raise if a scope nursery has been set. Add a `Context._error` for stashing the remote task's error that may be delivered in an `'error'` message.	2021-12-05 19:19:53 -05:00
Tyler Goodlet	6751349987	Add a stream overrun exception	2021-12-05 18:28:02 -05:00
Tyler Goodlet	d307eab118	Rework `Actor.send_cmd()` to `.start_remote_task()` This more formally declares the runtime's remote task startingn API and uses it throughout all the dependent `Portal` API methods. Allows dropping `Portal._submit()` and simplifying `.run_in_actor()` style result waiting to be delegated to the context APIs at remote task `return` response time. We now also track the remote entrypoint "type` as `Context._remote_func_type`.	2021-12-04 18:20:43 -05:00
Tyler Goodlet	c5c3f7e789	Use `tractor.Context` throughout the runtime core Instead of tracking feeder mem chans per RPC dialog, store `Context` instances which (now) hold refs to the underlying RPC-task feeder chans and track them inside a `Actor._contexts` map. This begins a transition to making the "context" idea the primitive abstraction for representing messaging dialogs between tasks in different memory domains (i.e. usually separate processes). A slew of changes made this possible: - change `Actor.get_memchans()` -> `.get_context()`. - Add new `Context._send_chan` and `._recv_chan` vars. - implicitly create a new context on every `Actor.send_cmd()` call. - use the context created by `.send_cmd()` in `Portal.open_context()` instead of manually creating one. - call `Actor.get_context()` inside tasks run from `._invoke()` such that feeder chans are implicitly created for callee tasks thus fixing the bug #265. NB: We might change some of the internal semantics to do with when the feeder chans are actually created to denote whether or not a far end task is actually read to receive messages. For example, in the cases where it never will be ready to receive messages (one-way streaming, a context that never opens a stream, etc.) we will likely want some kind of error or at least warning to the caller that messages can't be sent (yet).	2021-12-03 14:49:55 -05:00
Tyler Goodlet	f4793af2b9	Error on mal-use of `Context.started()` Previously we were ignoring a race where the callee an opened task context could enter `Context.open_stream()` before calling `.started(). Disallow this as well as calling `.started()` more then once.	2021-12-03 10:08:55 -05:00
Tyler Goodlet	08e9593306	Suppress broken resources errors in `Portal.cancel_actor()`	2021-12-02 15:29:04 -05:00
Tyler Goodlet	14f84571fb	Don't cancel receive streams inside `.cancel_actor()` We don't need to any more presuming you get ideal remote cancellation conditions where the remote actor should teardown and kill the streams from its end.	2021-12-02 15:29:04 -05:00
Tyler Goodlet	e561a4908f	Appease mypy	2021-12-02 15:29:04 -05:00
Tyler Goodlet	46070f99de	Factor soft-wait logic into a helper, use with mp	2021-12-02 08:18:04 -05:00
Tyler Goodlet	d81eb1a51e	Finally, deterministic remote cancellation support On msg loop termination we now check and see if a channel is associated with a child-actor registered in some local task's nursery. If so, we attempt to wait on channel closure initiated from the child side (by draining the underlying msg stream) so as to avoid closing it too early resulting in the child not relaying its termination status response. This means we now support the ideal case in 2-general's where we get back the ack to the closure request instead of just ignoring it and timing out XD The main implementation detail is that when `Portal.cancel_actor()` remotely calls `Actor.cancel()` we actually wait for the RPC response from that request before allowing the channel shutdown sequence to engage. The new msg stream draining support enables this. Also, factor child-to-parent error propagation logic into a helper func and improve some docs (yeah yeah y'all don't like the ''', i don't care - it makes my eyes not hurt).	2021-12-02 08:18:04 -05:00
Tyler Goodlet	d817f1a658	Add a nursery "exited" signal Use a `trio.Event` to enable nursery closure detection such that core runtime tasks can be notified when a local nursery exits and allow shutdown protocols to operate without close-before-terminate issues (such as IPC channel closure during remote peer cancellation).	2021-12-02 08:18:04 -05:00
Tyler Goodlet	a23afb0bb8	Set channel cancel called flag on cancel requests	2021-12-02 08:18:04 -05:00
Tyler Goodlet	1976e61d1a	Add `.drain()` support to msg streams Enables "draining" the last set of messages after a channel/stream has been terminated mostly for the purposes of receiving a final ACK to a remote cancel command. Also, add an internal `Channel._cancel_called` flag which can be set by `Portal.cancel_actor()`.	2021-12-02 08:18:04 -05:00
Tyler Goodlet	0ac3397dbb	Only soft-acquire debug lock if a proc was spawned	2021-12-02 08:17:03 -05:00
Tyler Goodlet	62b2867e07	Tweak doc strings	2021-12-02 08:16:49 -05:00
Tyler Goodlet	bf6958cdbe	Handle cancelled-before-proc-created spawn case It's definitely possible to have a nursery spawn task be cancelled before a `trio.Process` handle is ever returned; we now handle this case as a cancelled-during-spawn scenario. Zombie collection logic also is bypassed in this case.	2021-12-02 08:16:05 -05:00
Tyler Goodlet	77fc705b1f	Add nooz	2021-11-29 22:52:19 -05:00
Tyler Goodlet	7eb465a699	Graceful cancel actors before hard reaping	2021-11-29 16:03:23 -05:00
Tyler Goodlet	f6de7e0afd	Factor out msg unwrapping into a func	2021-11-29 08:46:35 -05:00
Tyler Goodlet	0e7234aa68	Cache the return message instead of the value Thanks to @richardsheridan for pointing out the limitations of using any kind of value as the result-cached-flag and how it might cause problems for anyone returning pickled blob-data. This changes the `Portal` internal result value tracking to stash the full message from which the value can be retrieved by any `Portal.result()` caller. The internal change is that `Portal._return_once()` now returns a tuple of the message and its value.	2021-11-29 07:44:44 -05:00
Tyler Goodlet	095c94b1d2	Fix `Portal.run_in_actor()` returns `None` bug Fixes the issue where if the main remote task returns `None`, `Portal.result()` would erroneously wait again on the underlying feeder mem chan since `None` was being used as the cache flag. Instead set the flag as the channel uid and consider the result collected when set to anything else (since it would be odd to return that value from a remote task when you already can read it as part of portal/channel apis).	2021-11-20 13:02:08 -05:00
Tyler Goodlet	6b0366fe04	Guard against TCP server never started on cancel	2021-11-07 23:49:32 -05:00
Tyler Goodlet	dbe5d96d66	Fix missing yield in lock acquirer	2021-11-07 23:48:05 -05:00
Tyler Goodlet	74f460eba7	Make auto generated child names <parent_name>.<name>	2021-11-02 15:40:15 -04:00
Tyler Goodlet	083b73ad4a	Test: don't grab debug lock if not in mode	2021-10-25 10:22:41 -04:00
Tyler Goodlet	d0f5c7a5e2	Change to `gather_contexts()`, use event for graceful exit The api we've made here is actually closer to `asyncio.gather()` but with opening async context managers instead of funcs. Use another event to allow for graceful teardown of children on non-cancellation exits and add a doc string.	2021-10-24 14:00:01 -04:00
overclockworked64	50400359b8	Fix type annotations	2021-10-24 00:47:26 +02:00
overclockworked64	87e3d32992	Get rid of external teardown trigger because #245 resolves the problem	2021-10-23 16:17:30 -04:00
overclockworked64	04895b9d5e	Get rid of dumb random uid and use current actor's uid	2021-10-23 16:17:30 -04:00
overclockworked64	b7a4641674	Allow specifying start_method and hard_kill	2021-10-23 16:17:30 -04:00
overclockworked64	3130a04c61	Rename a variable and fix type annotations	2021-10-23 16:17:29 -04:00
overclockworked64	6f9229cd09	Cancel nursery	2021-10-23 16:17:29 -04:00
overclockworked64	6e6baf250b	Make sure the ID is a str	2021-10-23 16:17:29 -04:00
overclockworked64	73cbb2388a	Avoid RuntimeError by not using current_actor's uid	2021-10-23 16:17:29 -04:00
overclockworked64	2815f1c343	Make 'async_enter_all' take a teardown trigger which '_enter_and_wait' will wait on	2021-10-23 16:17:29 -04:00
overclockworked64	21afc69ac7	Postpone evaluation of annotations	2021-10-23 16:17:29 -04:00
overclockworked64	7d502cef74	Add 'open_actor_cluster' to __all__	2021-10-23 16:17:29 -04:00
Tyler Goodlet	c372367cc2	Fix *args-like type annot	2021-10-23 15:54:40 -04:00
Tyler Goodlet	9ddd75733c	Lul, fix everything for cluster helper	2021-10-23 15:54:40 -04:00
Tyler Goodlet	97006c904c	Expose `Lagged` for broadcasting	2021-10-23 15:54:40 -04:00
Tyler Goodlet	79fb1d0ebc	Fix top level nursery import	2021-10-23 15:54:40 -04:00
Tyler Goodlet	1e917fdb1d	Add an async actor cluster spawner prototype	2021-10-23 15:54:40 -04:00
Tyler Goodlet	4114eb1d25	Move broadcast channel parts into trionics	2021-10-23 15:54:40 -04:00
Tyler Goodlet	680a841282	Start `trionics` sub-pkg with `async_enter_all()` Since it seems we're building out more and more higher level primitives in order to support certain parallel style actor trees and messaging patterns (eg. task broadcast channels), we might as well start a new sub-package for purely `trio` constructions. We hereby dub this the realm of `trionics` (like electronics but for trios instead of electrons). To kick things off, add an `async_enter_all()` concurrent exit-stack-like context manager API which will concurrently spawn a sequence of provided async context managers and deliver their ordered results but with proper support for `trio` cancellation semantics. The stdlib's `AsyncExitStack` is not compatible with nurseries not `trio` tasks (which are cancelled) since as task will be suspended on the stack after push and does not ever hit a checkpoint until the stack is closed.	2021-10-23 15:54:40 -04:00
Tyler Goodlet	340ddba4ae	Rename the nursery module to `_supervise`	2021-10-23 15:54:40 -04:00
Tyler Goodlet	b3c4851ffb	Grab lock if cancelled during spawn before hard kill	2021-10-15 18:26:46 -04:00
Tyler Goodlet	5cfac58873	Don't pop a child entry that was never inserted	2021-10-15 18:16:58 -04:00
Tyler Goodlet	e4ed0fd2b3	Right, only worry about pdb lock when in debug mode	2021-10-15 09:29:25 -04:00
Tyler Goodlet	51259c4809	Pass uid not actor object	2021-10-14 13:46:27 -04:00
Tyler Goodlet	9d83ef82b2	Remove union type for root getter	2021-10-14 13:39:46 -04:00
Tyler Goodlet	fa317d1600	Change lock helper to take an actor uid tuple	2021-10-14 13:39:46 -04:00
Tyler Goodlet	6f5c35dd1b	Fix missing task status type	2021-10-14 13:39:46 -04:00
Tyler Goodlet	daa28ea0e9	Handle depth > 1 nursery owners which use debug mode	2021-10-14 13:39:46 -04:00
Tyler Goodlet	4b2710b8a5	Add tty lock acquire ctx mngr	2021-10-14 13:39:46 -04:00
Tyler Goodlet	d30ce96740	Breakout `wait_for_parent_stdin_hijack()`, increase root pdb checker poll time	2021-10-14 13:39:46 -04:00
Tyler Goodlet	f3a6ab62af	Use debugger helper in nursery and spawn tasks	2021-10-14 13:39:46 -04:00
Tyler Goodlet	62035078ce	Reduce some loglevels, stick in comment about blocking till next tick	2021-10-14 13:39:46 -04:00
Tyler Goodlet	893bad72d5	Add a maybe-open-debugger helper	2021-10-14 13:39:46 -04:00
Tyler Goodlet	77ec29008d	Simplify to soft and hard reap sequences This is actually surprisingly easy to grok having gone through a lot of pain understanding edge cases in the zombie lord dev branch. Basically we just need to make sure actors are managed in a 2 step reap sequence. In the "soft" reap phase we wait for the process to terminate on its own concurrently with (maybe) waiting for its portal's final result (if it's a `.run_in_actor()`). If this path is cancelled or errors, then we do a "hard" reap where we timeout and send a signal to the proc to terminate immediately. The only last remaining trick is to tie in the root-is-debugger-aware logic to yet again avoid tty clobbers.	2021-10-14 13:39:46 -04:00
Tyler Goodlet	46ff558556	Unwind process opening and shield hard reap	2021-10-14 13:39:46 -04:00
Tyler Goodlet	bb9d9c74b1	Do immediate remote task cancels As for `Actor.cancel()` requests, do the same for `Actor._cancel_task()` but use `_invoke()` to ensure correct msg transactions with caller. Don't cancel task cancels on a cancel-all-tasks operation in attempt at more determinism.	2021-10-14 13:39:46 -04:00
Tyler Goodlet	41f0992445	Don't whine about ; it ain't rpc	2021-10-14 13:39:46 -04:00
Tyler Goodlet	7643bbf183	Make actor runtime cancellation immediate	2021-10-14 13:39:46 -04:00
Tyler Goodlet	1f0cc15675	Just set flag for use-after-closed service nursery calls	2021-10-06 17:02:13 -04:00
Tyler Goodlet	10f66e5141	De-noise warnings, add a 'cancel' log level Now that we're on our way to a (somewhat) serious beta release I think it's about time to start de-noising the logging emissions. Since we're trying out this approach of "stack layer oriented" log levels, I figured this is a good time to move most of the "warnings" to what they should be: cancellation monitoring status messages. The level is set to 16 which is just above our "runtime" level but just below the traditional "info" level. I think this will be a decent approach since usually if you're confused about why your `tractor` app is behaving unlike you expect, it's 90% of the time going to be to do with cancellation or error propagation. This this setup a user can specify the 'cancel' level and see all the msgs pertaining to both actor and task-in-actor cancellation mechanics.	2021-10-06 17:02:13 -04:00
Tyler Goodlet	4d5a5c147a	Move core actor runtime logging to, well, "runtime"	2021-10-06 17:02:13 -04:00
Tyler Goodlet	d2f0843041	Make custom log levels report the right stack frame The stdlib's `logging.LoggingAdapter` doesn't currently pass through `stacklevel: int` down to its wrapped logger instance. Hack it here and get our msgs looking like they would if using a built-in level.	2021-10-06 17:02:13 -04:00
Tyler Goodlet	3f6d4d6af4	Don't log.error if it was intentional	2021-10-06 17:02:13 -04:00
Tyler Goodlet	b496e790fe	Use from `.from_stream()` in TCP handler	2021-10-06 15:54:27 -04:00
Tyler Goodlet	c6dc96b08c	Add "message transport" structured sub-typing In an effort to have some kind of more formal interface around the transport layer, add a `MsgTransport` protocol type and use with the channel composition of message streams. Start a little "key map" of `(<codec>, <protocol>)` to `MsgTransport` types which can be dynamically loaded. Add a `Channel.from_stream()` constructor thus cleaning up the mangled logic that was in the constructor based on inputs. Drop all the "auto reconnect" channel logic for now since nothing is using it (internally) and it's likely it will need rework once we bring in a protocol besides TCP.	2021-10-06 15:54:27 -04:00
Tyler Goodlet	135459ca25	Tolerate one decode error; may have been a registry ping	2021-10-05 13:37:17 -04:00
Tyler Goodlet	07e8821cd5	Add a stream type factory	2021-10-05 13:37:17 -04:00
Tyler Goodlet	1382ad653d	Ugh, appease mypy yet again	2021-10-05 13:37:17 -04:00
Tyler Goodlet	076f37c589	Attempt to gracefully handle channel breakage?	2021-10-05 13:37:17 -04:00
Tyler Goodlet	19d6885243	Ensure tuple for passed in arbiter addr	2021-10-05 13:37:17 -04:00
Tyler Goodlet	486e983964	Cast `defaultdict` to `dict` for registry get	2021-10-05 13:37:17 -04:00
Tyler Goodlet	1ab495a64d	Map broken stream errs to transport closed; msgspec seems to be racy	2021-10-05 13:37:17 -04:00
Tyler Goodlet	562419c907	Convert actor UIDs to hashable tuples `msgspec` sends python lists over the wire (https://github.com/jcrist/msgspec/issues/30) which is fine and dandy but we use them as lookup keys so we need to be sure we tuple-cast first.	2021-10-05 13:37:17 -04:00
Tyler Goodlet	3facfb6d4c	Fix log levels	2021-10-05 13:37:17 -04:00
Tyler Goodlet	aa080543d0	Mypy fixes to enforce uid tuple	2021-10-05 13:37:17 -04:00
Tyler Goodlet	b64396f708	Pkg `msgpec` as optional dep, load transport type if importable	2021-10-05 13:37:17 -04:00
Tyler Goodlet	96b3f94c72	Accept transport closed error during handshake and msg loop	2021-10-05 13:37:17 -04:00
Tyler Goodlet	ecd8c4bc7e	Drop happy eyeballs inf delay	2021-10-05 13:37:17 -04:00
Tyler Goodlet	112117c1fc	Add our own "transport closed" signal This change some super old (and bad) code from the project's very early days. For some redic reason i must have thought masking `trio`'s internal stream / transport errors and a TCP EOF as `StopAsyncIteration` somehow a good idea. The reality is you probably want to know the difference between an unexpected transport error and a simple EOF lol. This begins to resolve that by adding our own special `TransportClosed` error to signal the "graceful" termination of a channel's underlying transport. Oh, and this builds on the `msgspec` integration which helped shed light on the core issues here B)	2021-10-05 13:37:17 -04:00
Tyler Goodlet	95e35f3d60	Add streaming decode support for `msgspec` Add a `tractor._ipc.MsgspecStream` type which can be swapped in for `msgspec` serialization transparently. A small msg-length-prefix framing is implemented as part of the type and we use `tricycle.BufferedReceieveStream` to handle buffering logic for the underlying transport. Notes: - had to force cast a few more list -> tuple spots due to no native `tuple`decode-by-default in `msgspec`: https://github.com/jcrist/msgspec/issues/30 - the framing can be understood by this protobuf walkthrough: https://eli.thegreenplace.net/2011/08/02/length-prefix-framing-for-protocol-buffers - `tricycle` becomes a new dependency	2021-10-05 13:37:17 -04:00
Tyler Goodlet	e39ee3a9cc	Always cast arbiter addr to tuple	2021-10-05 13:37:17 -04:00
Tyler Goodlet	dda0b22870	Try out `msgspec` in our msgpack stream channel Can only really use an encoder currently since there is no streaming api in `msgspec` as of currently. See jcrist/msgspec#27. Not sure if any encoding speedups are currently noticeable especially without any validation going on yet XD. First experiments toward #196	2021-10-05 13:37:17 -04:00
Tyler Goodlet	4079f02acf	Cast to tuples for all uids explicitly	2021-10-05 13:37:17 -04:00
Tyler Goodlet	518a0d5e14	Add todo for log msg filename..	2021-10-04 10:38:44 -04:00
Tyler Goodlet	8b416e6bba	Stream and context api tweaks - drop `shield` input to `MsgStream` - check for cancel called prior to loading the feeder mem chan in `Context.open_stream()` - warn on a timeout when trying to cancel a remote task from `Context.cancel()` - drop noop endofchannel handler block	2021-10-04 10:38:44 -04:00
Tyler Goodlet	bd31f47d5f	Handle kbi in ctx blocks via `BaseException` Fixes prior committed tests by more generally handling `BaseExcepion` in context blocks. Left in the commented concrete list for reference.	2021-10-04 10:38:44 -04:00
Tyler Goodlet	4259738864	Flip to using the `trio` spawner on windows Was able to try it manually on a windows 10 system and the debugger works great!	2021-09-18 14:10:32 -04:00
Tyler Goodlet	1137a9e7ac	Fix 404ed tokio urls	2021-09-02 21:12:54 -04:00
Tyler Goodlet	2745a2b1dc	Solve first-recv-cancelled by recursive `.receive()` on wake	2021-09-02 21:12:54 -04:00
Tyler Goodlet	d9bb52fe7b	Store array `maxlen` in state singleton The `collections.deque` takes care of array length truncation of values for us implicitly but in the future we'll likely want this value exposed to alternate array implementations. This patch is to provide for that as well as make `mypy` happy since the `dequeu.maxlen` can also be `None`.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	9258f79510	Don't wake sibling bcast consumers on a cancelled call	2021-09-02 21:12:54 -04:00
Tyler Goodlet	44ef26bb18	Shorten default feeder mem chan size to 64	2021-09-02 21:12:54 -04:00
Tyler Goodlet	7857a9ac6d	Add `shield: bool` kwarg to `Portal.open_stream_from()`	2021-09-02 21:12:54 -04:00
Tyler Goodlet	63ec740e27	Add some bcaster ref sanity asserts around subscriptions	2021-09-02 21:12:54 -04:00
Tyler Goodlet	093e7d921c	Instance ids are ints	2021-09-02 21:12:54 -04:00
Tyler Goodlet	bec3f5999d	Drop uuid4 keys, raise closed error on subscription after close	2021-09-02 21:12:54 -04:00
Tyler Goodlet	a4cb0ef21f	Fix `.receive()` re-assignment, drop `.clone()`	2021-09-02 21:12:54 -04:00
Tyler Goodlet	346b5d2eda	Blade runner it Get rid of all the (requirements for) clones of the underlying receivable. We can just use a uuid generated key for each instance (thinking now this can probably just be `id(self)`). I'm fully convinced now that channel cloning is only a source of confusion and anti-patterns when we already have nurseries to define resource lifetimes. There is no benefit in particular when you allocate subscriptions using a context manager (not sure why `trio.open_memory_channel()` doesn't enforce this). Further refinements: - add a `._closed` state that will error the receiver on reuse - drop module script section; it's been moved to a real test - call the "receiver" duck-type stub a new name	2021-09-02 21:12:54 -04:00
Tyler Goodlet	6c17c7367a	Store handle to underlying channel's `.receive()` This allows for wrapping an existing stream by re-assigning its receive method to the allocated broadcaster's `.receive()` so as to avoid expecting any original consumer(s) of the stream to now know about the broadcaster; this instead mutates the stream to delegate to the new receive call behind the scenes any time `.subscribe()` is called. Add a `typing.Protocol` for so called "cloneable channels" until we decide/figure out a better keying system for each subscription and mask all undesired typing failures.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	2d1c24112b	Add subscription support to message streams Add `ReceiveMsgStream.subscribe()` which allows allocating a broadcast receiver around the stream for use by multiple actor-local consumer tasks. Entering this context manager idempotently mutates the stream's receive machinery which for now can not be undone. Move `.clone()` to the receive stream type. Resolves #204	2021-09-02 21:12:54 -04:00
Tyler Goodlet	a12b1fc631	Drop optimization check, binance made its point	2021-09-02 21:12:54 -04:00
Tyler Goodlet	ceed96aa3f	Add common state delegate type for all consumers For every set of broadcast receivers which pull from the same producer, we need a singleton state for all of, - subscriptions - the sender ready event - the queue Add a `BroadcastState` dataclass for this and pass it to all subscriptions. This makes the design much more like the built-in memory channels which do something very similar with `MemoryChannelState`. Use a `filter()` on the subs list in the sequence update step, plus some other commented approaches we can try for speed.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	6e78bcf898	Facepalm: use single `_subs` per clone set	2021-09-02 21:12:54 -04:00
Tyler Goodlet	4ad75a3287	Obviously keying on tasks isn't going to work Using the current task as a subscription key fails horribly as soon as you hand off new subscription receiver to another task you've spawned.. Instead use the underlying ``trio.abc.ReceiveChannel.clone()`` as a key (so i guess we're assuming cloning is supported by the underlying?) which makes this all work just like default mem chans. As a bonus, now we can just close the underlying rx (which may be a clone) on `.aclose()` and everything should just work in terms of the underlying channels lifetime (i think?). Change `.subscribe()` to be async since the receive channel type interface only expects `.aclose()` and it actually ends up being nicer for 3.9+ style `async with` parentheses style anyway.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	64358f6525	Rename to broadcast mod, don't expect mem chan specifically	2021-09-02 21:12:54 -04:00
Tyler Goodlet	1af7dbb732	`Task` is hashable, so key on it	2021-09-02 21:12:54 -04:00
Tyler Goodlet	6a2c3da1bb	Simplify api around receive channel Buncha improvements: - pass in the queue via constructor - tracking over all underlying memory channel closure using cloning - do it like `tokio` and set lagged consumers to the last sequence before raising - copy the subs on first receiver wakeup for iteration instead of iterating the table directly (and being forced to skip the current tasks sequence increment) - implement `.aclose()` to close the underlying clone for this task - make `broadcast_receiver()` just take the recv chan since it doesn't need anything on the send side.	2021-09-02 21:12:54 -04:00
Tyler Goodlet	3817b4fb5e	Ultra naive broadcast channel prototype	2021-09-02 21:12:54 -04:00
Tyler Goodlet	3208b67f57	Drop shielding on root lock acquire; seems to prevent hangs	2021-09-02 16:23:38 -04:00
Tyler Goodlet	61d2307e52	Unlock pdb tty on all possible net faults	2021-09-02 16:23:38 -04:00
Tyler Goodlet	79f0d6fda0	Attempt to avoid pdb lockups on channel breakage Always try to release the root tty lock on broken connection errors.	2021-09-02 16:23:10 -04:00
Tyler Goodlet	4f166500d0	Add return type to debugger factory	2021-09-02 16:22:59 -04:00
Tyler Goodlet	d906c81f14	Export portal type at top level	2021-09-02 16:22:59 -04:00
Tyler Goodlet	68d56d5df0	Try not masking SIGINT in child processes	2021-09-02 16:22:59 -04:00
Tyler Goodlet	497fa72c96	Add a SIGINT handler that kills the process tree We're not actually using this but it's for reference if we do end up needing it. The std lib's `pdb` internals override SIGINT handling whenever one enters the debugger repl. Force a handler that kills the tree if SIGINT is triggered from the root actor, otherwise ignore it since supervised children should be managed already. This resolves an issue with guest mode where `pdb` causes SIGINTs to be swallowed resulting in the host loop never terminating the process tree.	2021-09-02 16:22:02 -04:00
Tyler Goodlet	b4d95e9543	Update docs to new close semantics	2021-09-02 08:24:18 -04:00
Tyler Goodlet	af85d35685	Drop stream shielding; it was from a legacy design The whole origin was not having an explicit open/close semantic for streams. We have that now so this internal mechanic isn't needed and further our streams become more correct by having `.aclose()` be independent of cancellation.	2021-09-02 08:24:18 -04:00
Tyler Goodlet	7431e8ea01	Don't log cancelled inceptions seen by the root	2021-08-02 21:15:42 -04:00
Tyler Goodlet	82999801a6	Drop leftover noisy exception logging..	2021-08-02 16:56:00 -04:00
Tyler Goodlet	674fbbc6b3	Docs and comments tidying	2021-08-01 10:44:13 -04:00
Tyler Goodlet	6006adc0de	Hide `_invoke()` tb, move actor error to exceptions mod	2021-07-31 13:56:26 -04:00
Tyler Goodlet	0afa7f0f8e	Fix lock context manager return type	2021-07-31 12:50:58 -04:00
Tyler Goodlet	b3d28a1ee4	Drop debugger path and duplicate func from rebasing	2021-07-31 12:46:40 -04:00
Tyler Goodlet	09f00a5a00	Go back to only logging tbs on no debugger	2021-07-31 12:46:40 -04:00
Tyler Goodlet	44bfacc0c2	Comment hard-kill-sidestep for now since nursery version covers it?	2021-07-31 12:46:40 -04:00
Tyler Goodlet	551816e80d	Solve the root-cancels-child-in-tty-lock race Finally this makes a cancelled root actor nursery not clobber child tasks which request and lock the root's tty for the debugger repl. Using an edge triggered event which is set after all fifo-lock-queued tasks are complete, we can be sure that no lingering child tasks are going to get interrupted during pdb use and tty lock acquisition. Further, even if new tasks do queue up to get the lock, the root will incrementally send cancel msgs to each sub-actor only once the tty is not locked by a (set of) child request task(s). Add shielding around all the critical sections where the child attempts to allocate the lock from the root such that it won't be disrupted from cancel messages from the root after the acquire lock transaction has started.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	be1fcb2a5b	Distinguish between a local pdb unlock and the tty unlock in root	2021-07-31 12:46:40 -04:00
Tyler Goodlet	ef89ed947a	Fix hard kill in debug mode; only do it when debug lock is empty	2021-07-31 12:46:40 -04:00
Tyler Goodlet	5b3894827f	Move some infos to runtime level	2021-07-31 12:46:40 -04:00
Tyler Goodlet	0fdcfa0ba1	Move debugger wait inside OCA nursery	2021-07-31 12:46:40 -04:00
Tyler Goodlet	37a1897c47	Don't shield debugger status wait; it causes hangs	2021-07-31 12:46:40 -04:00
Tyler Goodlet	0f2a39a311	Catch and delay errors in the root if debugger is active	2021-07-31 12:46:40 -04:00
Tyler Goodlet	23a1622256	Don't kill root's immediate children when in debug If the root calls `trio.Process.kill()` on immediate child proc teardown when the child is using pdb, we can get stdstreams clobbering that results in a pdb++ repl where the user can't see what's been typed. Not killing such children on cancellation / error seems to resolve this issue whilst still giving reliable termination. For now, code that special path until a time it becomes a problem for ensuring zombie reaps.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	49d439b681	Add some brief todo notes on idea of shielded breakpoint	2021-07-31 12:46:40 -04:00
Tyler Goodlet	6f05f5d5e6	Wait for debugger lock task context termination	2021-07-31 12:46:40 -04:00
Tyler Goodlet	b369b91357	Fix up var naming and typing	2021-07-31 12:46:40 -04:00
Tyler Goodlet	969bce3aa4	Use context for remote debugger locking A context is the natural fit (vs. a receive stream) for locking the root proc's tty usage via it's `.started()` sync point. Simplify the `_breakpoin()` routine to be a simple async func instead of all this "returning a coroutine" stuff from before we decided that `tractor.breakpoint()` must be async. Use `runtime` level for locking logging making it easier to trace.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	443ebea165	Use "pdb" level logging in debug mode	2021-07-08 13:02:33 -04:00
Tyler Goodlet	25779d48a8	Define explicit adapter level methods for mypy	2021-07-08 12:51:35 -04:00
Tyler Goodlet	fde52d2464	Mypy fixes	2021-07-08 12:48:34 -04:00
Tyler Goodlet	8c927d708d	Change trace to transport level	2021-07-07 14:31:15 -04:00
Tyler Goodlet	31590e82a3	Flip "trace" level to "transport" level logging	2021-07-07 14:31:03 -04:00
Tyler Goodlet	2513c652c1	Go back to only logging crashes if no pdb gets engaged	2021-07-06 08:23:30 -04:00
Tyler Goodlet	9ddb654783	Avoid mutate on iterate race	2021-07-06 08:23:30 -04:00
Tyler Goodlet	7f86d63e77	Drop trip kwarg	2021-07-06 08:23:30 -04:00
Tyler Goodlet	12f987514d	Don't enter debug on closed resource errors	2021-07-06 08:23:30 -04:00
Tyler Goodlet	98bbf8e0df	Move join event trigger to direct exit path	2021-07-06 08:23:30 -04:00
Tyler Goodlet	b1cd7fdedf	Don't shield on root cancel it can causes hangs	2021-07-06 08:23:30 -04:00
Tyler Goodlet	ef725c5972	Always hard kill sub-procs on teardown Adds a new hard kill routine for the `trio` spawning backend.	2021-07-06 08:23:30 -04:00
Tyler Goodlet	b21e2a6caa	Add pre-stream open error conditions	2021-07-06 08:23:30 -04:00
Tyler Goodlet	c6cdaf9c31	De-densify some code	2021-07-06 08:23:30 -04:00
Tyler Goodlet	91640facbc	Always shield cancel the caller on cancel-causing-errors, add teardown logging	2021-07-06 08:23:30 -04:00
Tyler Goodlet	c2484e88a1	First try: pack cancelled tracebacks and ship to caller	2021-07-06 08:23:30 -04:00
Tyler Goodlet	3423ea4011	Add temp warning msg for context cancel call	2021-07-06 08:23:29 -04:00
Tyler Goodlet	af701c16ee	Consider relaying context error via raised-in-scope-nursery task	2021-07-06 08:23:29 -04:00
Tyler Goodlet	1703171bea	Set stream "end of channel" after shielded check! Another face palm that was causing serious issues for code that is using the `.shielded` feature.. Add a bunch more detailed comments for all this subtlety and hopefully get it right once and for all. Also aggregated the `trio` errors that should trigger closure inside `.aclose()`, hopefully that's right too.	2021-07-06 08:23:29 -04:00
Tyler Goodlet	3d633408fc	Don't clobber msg loop mem chan on rx stream close Revert this change since it really is poking at internals and doesn't make a lot of sense. If the context is going to be cancelled then the msg loop will tear down the feed memory channel when ready, we don't need to be clobbering it and confusing the runtime machinery lol.	2021-07-06 08:23:29 -04:00
Tyler Goodlet	196dea80db	Drop trailing comma	2021-07-06 08:23:29 -04:00
Tyler Goodlet	54916be601	Adjustments for non-frozen context dataclass change	2021-07-06 08:23:29 -04:00
Tyler Goodlet	1a69727b75	Fix exception typing	2021-07-06 08:23:29 -04:00
Tyler Goodlet	348148ff1e	Explicitly formalize context/streaming teardown Add clear teardown semantics for `Context` such that the remote side cancellation propagation happens only on error or if client code explicitly requests it (either by exit flag to `Portal.open_context()` or by manually calling `Context.cancel()`). Add `Context.result()` to wait on and capture the final result from a remote context function; any lingering msg sequence will be consumed/discarded. Changes in order to make this possible: - pass the runtime msg loop's feeder receive channel in to the context on the calling (portal opening) side such that a final 'return' msg can be waited upon using `Context.result()` which delivers the final return value from the callee side `@tractor.context` async function. - always await a final result from the target context function in `Portal.open_context()`'s `__aexit__()` if the context has not been (requested to be) cancelled by client code on block exit. - add an internal `Context._cancel_called` for context "cancel requested" tracking (much like `trio`'s cancel scope). - allow flagging a stream as terminated using an internal `._eoc` flag which will mark the stream as stopped for iteration. - drop `StopAsyncIteration` catching in `.receive()`; it does nothing.	2021-07-06 08:23:29 -04:00
Tyler Goodlet	73302d9d16	Specially raise a `ContextCancelled` for a task-context rpc	2021-07-06 08:23:29 -04:00
Tyler Goodlet	409f7f0d5a	Expose streaming components at top level	2021-07-06 08:23:29 -04:00
Tyler Goodlet	eb3662f981	Add a specially handled `ContextCancelled` error	2021-07-06 08:23:29 -04:00
Tyler Goodlet	39b9896a62	Only close recv chan if we get a ref	2021-07-06 08:23:29 -04:00
Tyler Goodlet	9a4244b9a6	Support no arg to `Context.started()` like trio	2021-07-06 08:23:29 -04:00
Tyler Goodlet	a2e2f7e7a8	Only send stop msg if not received from far end	2021-07-06 08:23:29 -04:00
Tyler Goodlet	6559fb72aa	Expose msg stream types at top level	2021-07-06 08:23:29 -04:00
Tyler Goodlet	e311430d25	Be more pedantic with error handling	2021-07-06 08:23:29 -04:00
Tyler Goodlet	08eb6bd019	Fix typing	2021-07-06 08:23:29 -04:00
Tyler Goodlet	1f8966ba64	Support passing `shield` at stream contruction	2021-07-06 08:23:29 -04:00
Tyler Goodlet	14114547e8	Expose `@context` decorator at top level	2021-07-06 08:23:29 -04:00
Tyler Goodlet	e3955bb62b	Add initial bi-directional streaming This mostly adds the api described in https://github.com/goodboy/tractor/issues/53#issuecomment-806258798 The first draft summary: - formalize bidir steaming using the `trio.Channel` style interface which we derive as a `MsgStream` type. - add `Portal.open_context()` which provides a `trio.Nursery.start()` remote task invocation style for setting up and tearing down tasks contexts in remote actors. - add a distinct `'started'` message to the ipc protocol to facilitate `Context.start()` with a first return value. - for our `ReceiveMsgStream` type, don't cancel the remote task in `.aclose()`; this is now done explicitly by the surrounding `Context` usage: `Context.cancel()`. - streams in either direction still use a `'yield'` message keeping the proto mostly symmetric without having to worry about which side is the caller / portal opener. - subtlety: only allow sending a `'stop'` message during a 2-way streaming context from `ReceiveStream.aclose()`, detailed comment with explanation is included. Relates to #53	2021-07-06 08:23:29 -04:00
Tyler Goodlet	6aab16f877	Drop added logging around root cancel	2021-07-04 11:00:08 -04:00
Tyler Goodlet	caa70245e0	Try remapping all broken errs wholesale on windows	2021-07-04 10:47:15 -04:00
Tyler Goodlet	3f75732b02	Remap windows specific connection reset error	2021-07-04 10:25:19 -04:00
Tyler Goodlet	1edf5c2f06	Specially remap TCP 104-connection-reset to `TransportClosed` Since we currently have no real "discovery protocol" between process trees, the current naive approach is to check via a connect and drop to see if a TCP server is bound to a particular address during root actor startup. This was a historical decision and had no real grounding beyond taking a simple approach to get something working when the project was first started. This is obviously problematic from an error handling perspective since we need to be able to avoid such quick connect-and-drops from cancelling an "arbiter"'s (registry actor's) channel-msg loop machinery (which would propagate and cancel the actor). For now we map this particular TCP error, which gets remapped by `trio` as a `trio.BrokenResourceError` to our own internal `TransportClosed` which is swallowed by channel message loop processing and indicates a graceful teardown of the far end actor.	2021-07-03 18:57:54 -04:00
Tyler Goodlet	a2d400583f	Fix tuple type	2021-07-02 18:10:06 -04:00
Tyler Goodlet	32b4ae0603	Accept transport closed error during handshake and msg loop	2021-07-02 11:38:24 -04:00
Tyler Goodlet	80e100f818	Add our own "transport closed" signal This change some super old (and bad) code from the project's very early days. For some redic reason i must have thought masking `trio`'s internal stream / transport errors and a TCP EOF as `StopAsyncIteration` somehow a good idea. The reality is you probably want to know the difference between an unexpected transport error and a simple EOF lol. This begins to resolve that by adding our own special `TransportClosed` error to signal the "graceful" termination of a channel's underlying transport. Oh, and this builds on the `msgspec` integration which helped shed light on the core issues here B)	2021-07-02 11:36:22 -04:00
Tyler Goodlet	73e123bac7	Fix line length	2021-05-07 11:21:40 -04:00
Tyler Goodlet	1584c547cd	Drop run and rpc_module_paths from discovery tests	2021-05-07 11:21:40 -04:00
Tyler Goodlet	87971de1d9	Re-raise any sidestepped `trio.Cancelled`	2021-05-06 12:05:17 -04:00
Tyler Goodlet	9f38406e85	Appease mypy	2021-05-06 12:05:17 -04:00
Tyler Goodlet	c4b42000eb	Shield around root actor cancel	2021-05-06 12:05:17 -04:00
Tyler Goodlet	607c48f1ac	Distinctly separate and harden mp spawning It's clear now that special attention is needed to handle the case where a spawned `multiprocessing` proc is started but then the parent is cancelled before the child can connect back; in this case we need to be sure to kill the near-zombie child asap. This may end up being the solution to other resiliency issues seen around mp with nested process trees too. More testing is needed to be sure. Relates to #84 #89 #134 #146	2021-05-06 12:05:17 -04:00
Tyler Goodlet	fc36e73628	Comment out `MsgStream` for now	2021-04-28 16:40:38 -04:00
Tyler Goodlet	f59346d854	Add func type checking to `.run_in_actor()`	2021-04-28 12:23:08 -04:00
Tyler Goodlet	86fc418050	Error on bad registry pops	2021-04-28 12:23:08 -04:00
Tyler Goodlet	83af295b45	Fix func type checking	2021-04-28 12:23:08 -04:00
Tyler Goodlet	ad9256bcdb	Drop stream exhaustion; no longer needed	2021-04-28 12:23:08 -04:00
Tyler Goodlet	3e19fd311b	Move debugger locking to new stream api	2021-04-28 12:23:08 -04:00
Tyler Goodlet	80c96cab01	Add a warning for soon to be deprecated `ctx` use in `@stream` func	2021-04-28 12:23:08 -04:00
Tyler Goodlet	36251357b3	Add a new one-way stream API NB: this is a breaking change removing support for `Portal.run()` being able to invoke remote streaming functions and instead replacing the method call with an async context manager api `Portal.open_stream_from()` This style explicitly defines stream teardown at the call site instead of expecting the user to handle tricky things correctly themselves: eg. `async_geneartor.aclosing()`. Going forward `Portal.run()` can be used only for invoking async functions.	2021-04-28 12:23:08 -04:00
Tyler Goodlet	81f3558494	Formatting	2021-04-28 12:23:08 -04:00
Tyler Goodlet	897ab79946	Add a no runtime error	2021-04-28 12:23:08 -04:00
Tyler Goodlet	7f38b7225d	Aggregate and organize streaming components Move receive stream into streaming modules and rebrand as a "message stream". Factor out cancellation mechanics in `.aclose()` into the `Context` type which will soon provide the api for for cancelling portal invocations. Comment-stage a few methods on both types in anticipation of a new bi-directional streaming api. Add a `MsgStream` bidirectional channel type which will be the eventual type yielded from `Context.open_stream()`. Adjust the response/dialog types to be the set `{'asyncfun', 'asyncgen', 'context'}`. OH, and add async func checking in `Portal.run()` to catch and error on sync funcs early.	2021-04-28 12:23:08 -04:00
Tyler Goodlet	d0eacc3fd6	Appease mypy	2021-04-27 12:08:30 -04:00
Tyler Goodlet	89ce1a63e4	Only accept asyncfunc response type	2021-04-27 12:08:30 -04:00
Tyler Goodlet	5798ef6796	Enforce async funcs on callee side, convert arbiter methods	2021-04-27 12:08:30 -04:00
Tyler Goodlet	c2a1612bf5	Drop sync function support You can always wrap a sync function in an async one and there seems to be no good reason to support invoking them directly especially since cancellation won't work without some thread hackery. If it's requested we'll point users to `trio-parallel`. Resolves #77	2021-04-27 12:08:30 -04:00
Tyler Goodlet	be22a2526a	Add `Actor.cancel_soon()` for sync self destruct Add a sync method that can be used to cancel the current actor from a synchronous context. This is useful in debugging situations where sync debugger code may need to kill the process tree. Also, make the internal "lifetime stack" a global var; easier to manage from client code that may was to add callbacks prior to the actor runtime being fully setup.	2021-04-27 11:35:28 -04:00
Tyler Goodlet	47565cfbf3	Use root as default name from `tractor.run()`	2021-02-25 08:51:28 -05:00
Tyler Goodlet	cd636b270e	Update debug tests to expect 'root' actor name	2021-02-24 13:38:20 -05:00
Tyler Goodlet	983e66b31b	Add second implicit-runtime-boot branch	2021-02-24 13:13:45 -05:00
Tyler Goodlet	b285db4c58	Factor OCA supervisor into new func	2021-02-24 13:13:38 -05:00
Tyler Goodlet	5ffd2d2ab3	Ignore type checks on stdlib overrides	2021-02-21 14:08:23 -05:00
Tyler Goodlet	7888ef6f01	Fix more stdlib typing issues with latest mypy	2021-02-21 12:48:03 -05:00
Tyler Goodlet	109066dda9	Support sync code breakpointing via built-in Override `breakpoint()` for sync code making it work properly with `trio` as per: https://github.com/python-trio/trio/issues/1155#issuecomment-742964018 Relates to #193	2021-02-21 12:36:00 -05:00
Tyler Goodlet	9f4e497b9c	Don't shield proc waits	2021-01-14 18:21:26 -05:00
Tyler Goodlet	e546ead2ff	Pub sub internals type fixes	2021-01-14 18:20:59 -05:00
Tyler Goodlet	3df001f3a9	Fix msg pub global lock sharing Using `None` as the default key for a `@msg.pub` can cause conflicts if there is more then one "taskless" (no tasks={,} passed) pub offered on an actor... So instead use the first trio "task name" (usually just the function name) instead thus avoiding this very hard to debug and understand problem. Probably should throw in a test but I'm super lazy today.	2021-01-14 18:20:49 -05:00
Tyler Goodlet	5ed5d18ccb	Begin rpc_module_paths deprecation	2021-01-08 22:08:45 -05:00
Tyler Goodlet	32b10681a1	Drop tractor.run() from @tractor_test	2021-01-08 20:56:03 -05:00
Tyler Goodlet	41a4de5af2	Use actual task name lel	2021-01-08 20:55:42 -05:00
Tyler Goodlet	59421d9f3a	Fix some borked tests	2021-01-08 20:55:11 -05:00
Tyler Goodlet	333ddcf93f	Can we ever really appease mypy?	2021-01-03 11:18:31 -05:00
Tyler Goodlet	0bb2163b0c	Implicitly open root actor on first nursery use.	2021-01-02 21:39:30 -05:00
Tyler Goodlet	bd3059f01b	Allow for error bypass	2021-01-02 21:39:30 -05:00
Tyler Goodlet	803152ead5	Use explicit named args	2021-01-02 21:39:30 -05:00
Tyler Goodlet	e6245671b0	Use runtime level on attach	2021-01-02 21:38:55 -05:00
goodboy	bfe500060f	Merge pull request #181 from goodboy/drop_tractor_run Deprecate `tractor.run()`	2020-12-28 12:53:04 -05:00
Tyler Goodlet	723fb17394	Add deprecation warning to run()	2020-12-27 13:29:30 -05:00
Tyler Goodlet	f05534e472	Re-org root actor startup into context manager This begins the move to dropping support for `tractor.run()` which we don't really need since the runtime is started (as it always has been) from a new sub-task / nursery. Instead this introduces starting the actor tree through a `open_root_actor()` async context manager which we'll likely implicitly call (from the root) on the first use of an actor nursery. Drop `_actor._start_actor()` and factor its contents into this new api. Make `run()` and `run_daemon()` use `open_root_actor()` until we decide to remove them. Relates to #168 and #177	2020-12-27 13:29:30 -05:00
Tyler Goodlet	b040cdc0c9	Add null byte guard from mainline	2020-12-27 13:28:54 -05:00
Tyler Goodlet	6b650c0fe6	Add a "runtime" log level	2020-12-26 15:45:45 -05:00
Tyler Goodlet	0d05a727b6	Use error log level by default	2020-12-25 15:28:32 -05:00
Tyler Goodlet	c28ffd8b1c	Don't exception log multi-cancels	2020-12-25 15:23:59 -05:00
Tyler Goodlet	5d7a4e2b12	Denoise some common teardown "errors" to warnings.	2020-12-25 15:10:20 -05:00
Tyler Goodlet	8522f90000	Add type annots to exceptions mod Also add a `is_multi_cancelled()` predicate to test for `trio.MultiError`s that contain entirely cancel signals. Resolves #125	2020-12-25 15:07:36 -05:00
Tyler Goodlet	4bf9b27f57	Drop all .statespace refs; it was a silly idea	2020-12-22 19:33:16 -05:00
Tyler Goodlet	9fd3c42eb1	Port inter-process method calls to `Portal.run_from_ns()`	2020-12-22 10:39:47 -05:00
Tyler Goodlet	7134f35d6e	Add `Portal.run_from_ns()` It turns out in order to maintain our sneaky little "call an `Actor` method in this remote process" we still need the ability to invoke functions from a namespace. We're currently using a "self" namespace as a way to do this for internal inter-process method calling. Either way, I see no reason not to keep a public method for this invoke style (we just won't market it) since it is still how the machinery works underneath.	2020-12-22 10:39:47 -05:00
Tyler Goodlet	a668f714d5	Allow passing function refs to `Portal.run()` This resolves and completes #69 allowing all RPC invocation APIs to pass function references directly instead of explicit `str` names for the target namespace and function (this is still done implicitly underneath). This brings us closer to `trio`'s task running API as well as acknowledges that any inter-host RPC system (and API) will likely need to be implemented on top of local RPC primitives anyway. Even if this ends up not being true we can always go to "function stubs" as part of our IAC protocol or, add a new method to do explicit namespace calls: `.run_from_module()` or whatever everyone votes on. Resolves #69 Further, this commit drops `Actor.statespace` from the entire system since a user can easily get this same functionality using module level variables. Fix docs to match all these changes (luckily mostly already done due to example scripts referencing).	2020-12-21 09:09:55 -05:00
Tyler Goodlet	0d67ce4abc	Fix collections type import for py3.10	2020-12-18 17:58:07 -05:00
Tyler Goodlet	797bcc1df2	Handle early timeouts on last debugger test	2020-12-17 13:35:45 -05:00
Tyler Goodlet	201771a521	'Fix mypy, change interal type name to `ReceiveStream`, settle on `.shield()`'	2020-12-17 12:01:49 -05:00
Tyler Goodlet	15ead6b561	Add a way to shield a stream's underlying channel Add a ``tractor._portal.StreamReceiveChannel.shield_channel()`` context manager which allows for avoiding the closing of an IPC stream's underlying channel for the purposes of task re-spawning. Sometimes you might want to cancel a task consuming a stream but not tear down the IPC between actors (the default). A common use can might be where the task's "setup" work might need to be redone but you want to keep the established portal / channel in tact despite the task restart. Includes a test.	2020-12-16 21:42:28 -05:00
Tyler Goodlet	d497078eb7	Appease 3.8 mypy	2020-12-11 20:04:56 -05:00
Tyler Goodlet	e51c2620e5	End the `pdb` SIGINT handling madness Turns out this is a lower level issue in terms of the stdlib's default `pdb.Pdb` settings and how they conflict with `trio`s cancellation and KBI handling. The details are hashed out more thoroughly in python-trio/trio#1155. Maybe we can get a fix in trio so things are solved under our feet :)	2020-12-11 00:15:09 -05:00
Tyler Goodlet	12f425137c	Drop duplicate project-package name in msg header	2020-11-03 12:15:49 -05:00
Tyler Goodlet	1580cc6fa0	Add explanation to module load error	2020-10-15 23:16:56 -04:00
Tyler Goodlet	5822d38ae4	Set _is_root runtime var in _main()	2020-10-15 23:16:54 -04:00
Tyler Goodlet	3b8684f655	Always call `Actor.cancel()` at end of root's main task It's simpler and the only real logical difference is logging messages. This should also give us an overall consistent tear down sequence.	2020-10-14 13:59:57 -04:00
Tyler Goodlet	02a9cac557	Drop remaining warn()s	2020-10-14 13:48:14 -04:00
Tyler Goodlet	f60321a35a	Always cancel service nursery last The channel server should be torn down before the rpc task/service nursery. Do this explicitly even in the root's main task to avoid a strange hang I found in the pubsub tests. Start dropping the `warnings.warn()` usage.	2020-10-14 13:46:05 -04:00
goodboy	7115d6c3bd	Merge pull request #129 from goodboy/multiproc_debug Wen? Multiprocessing-native debugger now!	2020-10-14 09:14:03 -04:00
Tyler Goodlet	e3c26943ba	Support debug mode only on the trio backend	2020-10-13 14:20:44 -04:00
Tyler Goodlet	08ff989631	Add some comments	2020-10-13 11:59:18 -04:00
Tyler Goodlet	573b8fef73	Add better actor cancellation tracking Add `Actor._cancel_called` and `._cancel_complete` making it possible to determine whether the actor has started the cancellation sequence and whether that sequence has fully completed. This allows for blocking in internal machinery tasks as necessary. Also, always trigger the end of ongoing rpc tasks even if the last task errors; there's no guarantee the trio cancellation semantics will guarantee us a nice internal "state" without this.	2020-10-13 11:48:52 -04:00
Tyler Goodlet	c375a2d028	mypy fixes	2020-10-13 11:03:55 -04:00
Tyler Goodlet	c41e5c8313	Fix missing await	2020-10-13 00:45:29 -04:00
Tyler Goodlet	79c38b04e7	Report `trio.Cancelled` when exhausting portals.. For reliable remote cancellation we need to "report" `trio.Cancelled`s (just like any other error) when exhausting a portal such that the caller can make decisions about cancelling the respective actor if need be. Resolves #156	2020-10-12 23:28:36 -04:00
Tyler Goodlet	07112089d0	Add mention subactor uid during locking	2020-10-07 05:53:26 -04:00
Tyler Goodlet	d43d367153	Facepalm: tty locking from root doesn't require an extra task	2020-10-05 11:58:58 -04:00
Tyler Goodlet	83a45119e9	Add "root mailbox" contact info passing Every subactor in the tree now receives the socket (or whatever the mailbox type ends up being) during startup and can call the new `tractor._discovery.get_root()` function to get a portal to the current root actor in their tree. The main reason for adding this atm is to support nested child actors gaining access to the root's tty lock for debugging. Also, when a channel disconnects from a message loop, might as well kill all its rpc tasks.	2020-10-05 11:58:58 -04:00
Tyler Goodlet	a2151cdd4d	Allow re-entrant breakpoints during pdb stepping	2020-10-05 11:58:58 -04:00
Tyler Goodlet	9067bb2a41	Shorten arbiter contact timeout	2020-10-05 11:58:58 -04:00
Tyler Goodlet	29ed065dc4	Ack our inability to hard kill sub-procs	2020-09-28 13:56:42 -04:00
Tyler Goodlet	fc2cb610b9	Make "hard kill" just a `Process.terminate()` It's not like any of this code is really being used anyway since we aren't indefinitely blocking for cancelled subactors to terminate (yet). Drop the `do_hard_kill()` bit for now and just rely on the underlying process api. Oh, and mark the nursery as cancelled asap.	2020-09-28 13:49:45 -04:00
Tyler Goodlet	5dd2d35fc5	Huh, maybe we don't need to block SIGINT Seems like the request task cancel scope is actually solving all the deadlock issues and masking SIGINT isn't changing much behaviour at all. I think let's keep it unmasked for now in case it does turn out useful in cancelling from unrecoverable states while in debug.	2020-09-28 13:11:22 -04:00
Tyler Goodlet	25e93925b0	Add a cancel scope around child debugger requests This is needed in order to avoid the deadlock condition where a child actor is waiting on the root actor's tty lock but it's parent (possibly the root) is waiting on it to terminate after sending a cancel request. The solution is simple: create a cancel scope around the request in the child and always cancel it when a cancel request from the parent arrives.	2020-09-28 13:02:33 -04:00
Tyler Goodlet	363498b882	Disable SIGINT handling in child processes There seems to be no good reason not too since our cancellation machinery/protocol should do this work when the root receives the signal. This also (hopefully) helps with some debugging race condition stuff.	2020-09-28 09:24:36 -04:00
Tyler Goodlet	f1b242f913	Block SIGINT handling while in the debugger This seems to prevent a certain class of bugs to do with the root actor cancelling local tasks and getting into deadlock while children are trying to acquire the tty lock. I'm not sure it's the best idea yet since you're pretty much guaranteed to get "stuck" if a child activates the debugger after the root has been cancelled (at least "stuck" in terms of SIGINT being ignored). That kinda race condition seems to still exist somehow: a child can "beat" the root to activating the tty lock and the parent is stuck waiting on the child to terminate via its nursery.	2020-09-28 08:54:21 -04:00
Tyler Goodlet	76e1c83161	Add matrix room link	2020-09-24 11:12:45 -04:00
Tyler Goodlet	9e1d9a8ce1	Add an internal context stack This aids with tearing down resources after the crash handling and debugger have completed. Leaving this internal for now but should eventually get a public convenience function like `tractor.context_stack()`.	2020-09-24 10:12:33 -04:00
Tyler Goodlet	09daba4c9c	Explicitly handle `debug_mode` flag correctly	2020-09-24 10:12:33 -04:00
Tyler Goodlet	8b6e9f5530	Port to new debug api, set `_is_root` state flag on startup	2020-09-24 10:12:33 -04:00
Tyler Goodlet	150179bfe4	Support entering post mortem on crashes in root actor	2020-09-24 10:12:33 -04:00
Tyler Goodlet	291ecec070	Maybe not sticky by default	2020-09-24 10:12:33 -04:00
Tyler Goodlet	bd157e05ef	Port to service nursery	2020-09-24 10:12:33 -04:00

... 7 8 9 10 11 ...

1053 Commits (7e93b81a83996472657fa2c01f87979d455a774f)