tractor

jc211

tractor

Author	SHA1	Message	Date
Tyler Goodlet	3208b67f57	Drop shielding on root lock acquire; seems to prevent hangs	2021-09-02 16:23:38 -04:00
Tyler Goodlet	82999801a6	Drop leftover noisy exception logging..	2021-08-02 16:56:00 -04:00
Tyler Goodlet	674fbbc6b3	Docs and comments tidying	2021-08-01 10:44:13 -04:00
Tyler Goodlet	551816e80d	Solve the root-cancels-child-in-tty-lock race Finally this makes a cancelled root actor nursery not clobber child tasks which request and lock the root's tty for the debugger repl. Using an edge triggered event which is set after all fifo-lock-queued tasks are complete, we can be sure that no lingering child tasks are going to get interrupted during pdb use and tty lock acquisition. Further, even if new tasks do queue up to get the lock, the root will incrementally send cancel msgs to each sub-actor only once the tty is not locked by a (set of) child request task(s). Add shielding around all the critical sections where the child attempts to allocate the lock from the root such that it won't be disrupted from cancel messages from the root after the acquire lock transaction has started.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	0fdcfa0ba1	Move debugger wait inside OCA nursery	2021-07-31 12:46:40 -04:00
Tyler Goodlet	37a1897c47	Don't shield debugger status wait; it causes hangs	2021-07-31 12:46:40 -04:00
Tyler Goodlet	0f2a39a311	Catch and delay errors in the root if debugger is active	2021-07-31 12:46:40 -04:00
Tyler Goodlet	98bbf8e0df	Move join event trigger to direct exit path	2021-07-06 08:23:30 -04:00
Tyler Goodlet	e311430d25	Be more pedantic with error handling	2021-07-06 08:23:29 -04:00
Tyler Goodlet	73e123bac7	Fix line length	2021-05-07 11:21:40 -04:00
Tyler Goodlet	1584c547cd	Drop run and rpc_module_paths from discovery tests	2021-05-07 11:21:40 -04:00
Tyler Goodlet	f59346d854	Add func type checking to `.run_in_actor()`	2021-04-28 12:23:08 -04:00
Tyler Goodlet	983e66b31b	Add second implicit-runtime-boot branch	2021-02-24 13:13:45 -05:00
Tyler Goodlet	b285db4c58	Factor OCA supervisor into new func	2021-02-24 13:13:38 -05:00
Tyler Goodlet	5ed5d18ccb	Begin rpc_module_paths deprecation	2021-01-08 22:08:45 -05:00
Tyler Goodlet	59421d9f3a	Fix some borked tests	2021-01-08 20:55:11 -05:00
Tyler Goodlet	0bb2163b0c	Implicitly open root actor on first nursery use.	2021-01-02 21:39:30 -05:00
Tyler Goodlet	5d7a4e2b12	Denoise some common teardown "errors" to warnings.	2020-12-25 15:10:20 -05:00
Tyler Goodlet	4bf9b27f57	Drop all .statespace refs; it was a silly idea	2020-12-22 19:33:16 -05:00
Tyler Goodlet	a668f714d5	Allow passing function refs to `Portal.run()` This resolves and completes #69 allowing all RPC invocation APIs to pass function references directly instead of explicit `str` names for the target namespace and function (this is still done implicitly underneath). This brings us closer to `trio`'s task running API as well as acknowledges that any inter-host RPC system (and API) will likely need to be implemented on top of local RPC primitives anyway. Even if this ends up not being true we can always go to "function stubs" as part of our IAC protocol or, add a new method to do explicit namespace calls: `.run_from_module()` or whatever everyone votes on. Resolves #69 Further, this commit drops `Actor.statespace` from the entire system since a user can easily get this same functionality using module level variables. Fix docs to match all these changes (luckily mostly already done due to example scripts referencing).	2020-12-21 09:09:55 -05:00
Tyler Goodlet	02a9cac557	Drop remaining warn()s	2020-10-14 13:48:14 -04:00
Tyler Goodlet	08ff989631	Add some comments	2020-10-13 11:59:18 -04:00
Tyler Goodlet	d43d367153	Facepalm: tty locking from root doesn't require an extra task	2020-10-05 11:58:58 -04:00
Tyler Goodlet	fc2cb610b9	Make "hard kill" just a `Process.terminate()` It's not like any of this code is really being used anyway since we aren't indefinitely blocking for cancelled subactors to terminate (yet). Drop the `do_hard_kill()` bit for now and just rely on the underlying process api. Oh, and mark the nursery as cancelled asap.	2020-09-28 13:49:45 -04:00
Tyler Goodlet	b11e91375c	Initial attempt at multi-actor debugging Allow entering and attaching to a `pdb` instance in a child process. The current hackery is to have the child make an rpc to the parent and ask it to hijack stdin, once complete the child enters a `pdb` blocking method. The parent then relays all stdin input to the child thus controlling the "remote" debugger. A few things were added to accomplish this: - tracking the mapping of subactors to their parent nurseries - in the root actor, cancelling all nurseries under the root `trio` task on cancellation (i.e. `Actor.cancel()`) - pass a "runtime vars" map down the actor tree for propagating global state	2020-09-24 10:12:10 -04:00
Tyler Goodlet	292513b353	Module define default accept addr	2020-08-08 20:58:04 -04:00
Tyler Goodlet	a24c6bfdd2	Correctly catch cancelled nursery case (purely for logging)	2020-08-03 18:44:50 -04:00
Tyler Goodlet	7c73775474	Force keyword only args in actor spawn methods	2020-07-24 17:06:43 -04:00
Tyler Goodlet	8e32199509	Get entry points reorg without asyncio compat This is an edit to factor out changes needed for the `asyncio` in guest mode integration (which currently isn't tested well) so that later more pertinent changes (which are tested well) can be rebased off of this branch and merged into mainline sooner. The infect_asyncio branch will need to be rebased onto this branch as well before merge to mainline.	2020-07-24 17:02:03 -04:00
Tyler Goodlet	8054bc7c70	Support "infected asyncio" actors This is an initial solution for #120. Allow spawning `asyncio` based actors which run `trio` in guest mode. This enables spawning `tractor` actors on top of the `asyncio` event loop whilst still leveraging the SC focused internal actor supervision machinery. Add a `tractor.to_syncio.run()` api to allow spawning tasks on the `asyncio` loop from an embedded (remote) `trio` task and return or stream results all the way back through the `tractor` IPC system using a very similar api to portals. One outstanding problem is getting SC around calls to `asyncio.create_task()`. Currently a task that crashes isn't able to easily relay the error to the embedded `trio` task without us fully enforcing the portals based message protocol (which seems superfluous given the error ref is in process). Further experiments using `anyio` task groups may alleviate this.	2020-07-24 16:48:06 -04:00
Tyler Goodlet	d64508e1a6	Add more detailed docs around nursery logic The logic in the `ActorNursery` block is critical to cancellation semantics and in particular, understanding how supervisor strategies are invoked. Stick in a bunch of explanatory comments to clear up these details and also prepare to introduce more supervisor strats besides the current one-cancels-all approach.	2020-01-31 09:50:25 -05:00
Tyler Goodlet	2a4307975d	Fix that thing where the first example in your docs is supposed to work Thanks to @salotz for pointing out that the first example in the docs was broken. Though it's somewhat embarrassing this might also explain the problem in #79 and certain issues in #59... The solution here is to import the target RPC module using the its unique basename and absolute filepath in the sub-actor that requires it. Special handling for `__main__` and `__mp_main__` is needed since the spawned subprocess will have no knowledge about these parent- -state-specific module variables. Solution: map the modules name to the respective module file basename in the child process since the module variables will of course have different values in children.	2020-01-29 12:16:14 -05:00
Tyler Goodlet	4b0554b61f	Type checker fixes	2020-01-21 10:28:32 -05:00
Tyler Goodlet	6c45416016	Drop ActorNusery.wait(); it's no longer necessary really	2020-01-21 10:27:53 -05:00
Tyler Goodlet	c074aea030	Support TRIP for process launching This took a ton of tinkering and a rework of the actor nursery tear down logic. The main changes include: - each subprocess is now spawned from inside a trio task from one of two containing nurseries created in the body of `tractor.open_nursery()`: one for `run_in_actor()` processes and one for `start_actor()` "daemons". This is to address the need for `trio-run-in_process.open_in_process()` opening a nursery which must be closed from the same task that opened it. Using this same approach for `multiprocessing` seems to work well. The nurseries are waited in order (rip actors then daemon actors) during tear down which allows for avoiding the recursive re-entry of `ActorNursery.wait()` handled prior. - pull out all the nested functions / closures that were in `ActorNursery.wait()` and move into the `_spawn` module such that that process shutdown logic takes place in each containing task's code path. This allows for vastly simplifying `.wait()` to just contain an event trigger which initiates process waiting / result collection. Likely `.wait()` should just be removed since it can no longer be used to synchronously wait on the actor nursery. - drop `ActorNursery.__aenter__()` / `.__atexit__()` and move this "supervisor" tear down logic into the closing block of `open_nursery()`. This not only cleans makes the code more comprehensible it also makes our nursery implementation look more like the one in `trio`. Resolves #93	2020-01-21 10:27:53 -05:00
Tyler Goodlet	afa640dcab	More trip WIP stuff working.. kinda Get a few more things working: - fail reliably when remote module loading goes awry - do a real hacky job of module loading using `sys.path` stuffsies - we're still totally borked when trying to spin up and quickly cancel a bunch of subactors... It's a small move forward I guess.	2020-01-21 10:27:53 -05:00
Tyler Goodlet	1b7cdfe512	WIP trying out trio_run_in_process	2020-01-21 10:27:53 -05:00
Tyler Goodlet	79c152fe38	Make latest mpypy happy	2019-12-10 00:55:03 -05:00
Tyler Goodlet	f977d37cee	Add nursery self-destruct logic on cancel failure If a nursery fails to cancel (some sub-actors presumably) then hard kill the whole process tree to avoid hangs during a catastrophic failure. This logic may get factored out (and changed) as we introduce custom supervisor strategies.	2019-11-22 17:11:48 -05:00
Tyler Goodlet	4e078368fc	Propagate `tractor.run()` logging level to subactors	2019-03-18 21:32:08 -04:00
Tyler Goodlet	dc5cc040e6	Try to support waiting on Windows processes This pokes around a little in `trio` hazmat but it should work as it piggy backs on the new cross platform subprocess support. Relates to #59	2019-03-06 21:24:23 -05:00
Tyler Goodlet	d75739e9c7	Factor process creation into a separate factory Make a `_spawn` module for encapsulating all the `multiprocessing` "spawn method" stuff and factor current forkserver steps into it.	2019-03-05 18:52:19 -05:00
Tyler Goodlet	78ddd33e3a	Move to `trio.CancelScope`	2019-02-16 14:25:06 -05:00
Tyler Goodlet	b91d13cfea	Use local actor var	2019-02-15 17:11:26 -05:00
Tyler Goodlet	855f959768	Don't log traceback on kb interrupt	2019-01-23 20:00:57 -05:00
Tyler Goodlet	7377598683	Properly respect `rpc_module_paths` in `run_in_actor()`	2019-01-12 17:55:08 -05:00
Tyler Goodlet	a482681f9c	Leverage `pytest.raises()` better; fix a bunch of docs	2018-11-22 11:43:04 -05:00
Tyler Goodlet	82fcf025cc	Fix: MultiError isn't an Exception...	2018-11-19 14:16:09 -05:00
Tyler Goodlet	9bb8a062eb	mypy fixes	2018-11-19 08:47:42 -05:00
Tyler Goodlet	835d1fa07a	Vastly improve error triggered cancellation At the expense of a bit more complexity in `ActorNursery.wait()` (which I commented the heck out of fwiw) this adds far superior and correct cancellation semantics for when a nursery is cancelled due to (remote) errors in subactors. This includes: - `wait()` will now raise a `trio.MultiError` if multiple subactors error with the same semantics as in `trio`. - in `wait()` portals which are paired with `run_in_actor()` spawned subactors (versus `start_actor()`) are waited on separately and if the nursery hasn't been cancelled but there are errors those are raised immediately before waiting on `start_actor()` subactors which will block indefinitely if they haven't been explicitly cancelled. - if `wait()` does raise when the nursery hasn't yet been cancelled it's expected that it will be called again depending on the actor supervision strategy (i.e. right now we operate with a one-cancels-all strategy, the same as `trio`, so `ActorNursery.__aexit__() calls `cancel()` if any error is raised by `wait()`). Oh and I added `is_main_process()` helper; can't remember why..	2018-11-19 08:44:19 -05:00

1 2

62 Commits (3facfb6d4c2f82183bd480aa1d2490a48f8ddfea)