tractor

Commit Graph

Author	SHA1	Message	Date
Tyler Goodlet	8eb9a742dd	Add multi-process debugging support using `pdbpp` This is the first step in addressing #113 and the initial support of #130. Basically this allows (sub)processes to engage the `pdbpp` debug machinery which read/writes the root actor's tty but only in a FIFO semaphored way such that no two processes are using it simultaneously. That means you can have multiple actors enter a trace or crash and run the debugger in a sensible way without clobbering each other's access to stdio. It required adding some "tear down hooks" to a custom `pdbpp.Pdb` type such that we release a child's lock on the parent on debugger exit (in this case when either of the "continue" or "quit" commands are issued to the debugger console). There's some code left commented in anticipation of full support for issue #130 where we're need to actually capture and feed stdin to the target (remote) actor which won't necessarily being running on the same host.	2020-09-24 10:12:10 -04:00
Tyler Goodlet	b11e91375c	Initial attempt at multi-actor debugging Allow entering and attaching to a `pdb` instance in a child process. The current hackery is to have the child make an rpc to the parent and ask it to hijack stdin, once complete the child enters a `pdb` blocking method. The parent then relays all stdin input to the child thus controlling the "remote" debugger. A few things were added to accomplish this: - tracking the mapping of subactors to their parent nurseries - in the root actor, cancelling all nurseries under the root `trio` task on cancellation (i.e. `Actor.cancel()`) - pass a "runtime vars" map down the actor tree for propagating global state	2020-09-24 10:12:10 -04:00
Tyler Goodlet	ec5d443ee5	Always log actor errors	2020-08-13 11:55:22 -04:00
Tyler Goodlet	b3eba00c3a	Appease the great mypy	2020-08-08 20:57:43 -04:00
Tyler Goodlet	42be410076	Handle mp accept_addr	2020-08-08 20:27:43 -04:00
Tyler Goodlet	8477d21499	Restructure actor runtime nursery scoping In an effort acquire more deterministic actor cancellation, this adds a clearer and more resilient (whilst possibly a bit slower) internal nursery structure with explicit semantics for clarifying the task-scope shutdown sequence. Namely, on cancellation, the explicit steps are now: - cancel all currently running rpc tasks and wait for them to complete - cancel the channel server and wait for it to complete - cancel the msg loop for the channel with the immediate parent - de-register with arbiter if possible - wait on remaining connections to release - exit process To accomplish this add a new nursery called the "service nursery" which spawns all rpc tasks instead of using the "root nursery". The root is now used solely for async launching the msg loop for the primary channel with the parent such that it is (nearly) the last thing torn down on cancellation. In the future it should also be possible to have `self.cancel()` return a result to the parent once the runtime is sure that the rest of the shutdown is atomic; this would allow for a true unbounded shield in `Portal.cancel_actor()`. This will likely require that the error handling blocks in `Actor._async_main()` are moved "inside" the root nursery block such that the msg loop with the parent truly is the last thing to terminate.	2020-08-08 14:55:41 -04:00
Tyler Goodlet	ae8488a578	Always shield de-register step with arbiter	2020-08-07 11:36:26 -04:00
Tyler Goodlet	56b81f07e5	Return `Dict[Tuple, Tuple]` from `.get_registry()`	2020-08-03 18:42:23 -04:00
Tyler Goodlet	639299e6eb	Expose a `.get_registry()` method on the arbiter	2020-08-03 15:40:41 -04:00
Tyler Goodlet	9a40291d4a	Repair startup sequence around parent state transfer In order to have reliable subactor startup we need the following sequence to take place: - connect to the parent actor, handshake and receive runtime state - load exposed modules into memory - start the channel server up fully using the provided bind address - finally, start processing new messages from the parent Add a bunch more comments to clarify all this.	2020-07-28 22:25:22 -04:00
Guillermo Rodriguez	0a5691e0a8	Removed arbiter_addr local, and bind_addr is now passed through channel, in early child actor init.	2020-07-28 11:55:11 -03:00
Guillermo Rodriguez	ef053eb070	Added named arguments to child init, and now passing less of them.	2020-07-27 21:05:00 -03:00
Guillermo Rodriguez	e5dbf14ec3	Onlt await params in trio mode	2020-07-27 15:20:55 -03:00
Guillermo Rodriguez	2a407be532	Now passing additional initialization parameters through channel early after handshake.	2020-07-27 14:55:37 -03:00
Tyler Goodlet	7c3928f0bf	Oh mypy..	2020-07-24 17:31:24 -04:00
Tyler Goodlet	0b305fd78a	Change spawn method name in `Actor.load_modules()`	2020-07-24 17:08:52 -04:00
Tyler Goodlet	8fbdfd6a3a	Add an obnoxious error message on internal failures	2020-07-24 17:06:23 -04:00
Tyler Goodlet	1706791313	Drop entrypoints from `Actor`	2020-07-24 17:04:22 -04:00
Tyler Goodlet	596aca8097	Alias __mp_main__ at import time	2020-02-09 01:07:14 -05:00
Tyler Goodlet	8264b7d136	Drop old module loading from abspath cruft	2020-01-31 12:04:46 -05:00
Tyler Goodlet	6348121d23	Do __main__ fixups like ``mulitprocessing does`` Instead of hackery trying to map modules manually from the filesystem let Python do all the work by simply copying what ``multiprocessing`` does to "fixup the __main__ module" in spawned subprocesses. The new private module ``_mp_fixup_main.py`` is simply cherry picked code from ``multiprocessing.spawn`` which does just that. We only need these "fixups" when using a backend other then ``multiprocessing``; for now just when using ``trio_run_in_process``.	2020-01-29 21:14:48 -05:00
Tyler Goodlet	2a4307975d	Fix that thing where the first example in your docs is supposed to work Thanks to @salotz for pointing out that the first example in the docs was broken. Though it's somewhat embarrassing this might also explain the problem in #79 and certain issues in #59... The solution here is to import the target RPC module using the its unique basename and absolute filepath in the sub-actor that requires it. Special handling for `__main__` and `__mp_main__` is needed since the spawned subprocess will have no knowledge about these parent- -state-specific module variables. Solution: map the modules name to the respective module file basename in the child process since the module variables will of course have different values in children.	2020-01-29 12:16:14 -05:00
Tyler Goodlet	27c9760f96	Be explicit about the spawning backend default Set `trio-run-in-process` as the default on *nix systems and `multiprocessing`'s spawn method on Windows. Enable overriding the default choice using `tractor._spawn.try_set_start_method()`. Allows for easy runs of the test suite using a user chosen backend.	2020-01-26 21:13:29 -05:00
Tyler Goodlet	4b0554b61f	Type checker fixes	2020-01-21 10:28:32 -05:00
Tyler Goodlet	91c3716968	Do module abspath loading in actor init	2020-01-21 10:27:53 -05:00
Tyler Goodlet	afa640dcab	More trip WIP stuff working.. kinda Get a few more things working: - fail reliably when remote module loading goes awry - do a real hacky job of module loading using `sys.path` stuffsies - we're still totally borked when trying to spin up and quickly cancel a bunch of subactors... It's a small move forward I guess.	2020-01-21 10:27:53 -05:00
Tyler Goodlet	1b7cdfe512	WIP trying out trio_run_in_process	2020-01-21 10:27:53 -05:00
Tyler Goodlet	698951c515	More mypy apeasement on 3.7	2020-01-15 21:06:13 -05:00
Tyler Goodlet	79c152fe38	Make latest mpypy happy	2019-12-10 00:55:03 -05:00
Tyler Goodlet	d2a01e8b81	Drop use of `trio.Event.clear()` Just spin up new events instead; because apparently they're so cheap (rolls eyes). Resolves #78	2019-11-23 11:29:23 -05:00
Tyler Goodlet	95e8f3d306	Propagate `trio.MultiError`s up the actor tree `trio.MultiError` isn't an `Exception` (derived instead from `BaseException`) so we have to specially catch it in the task invocation machinery and ship it upwards (like regular errors) since nurseries running in sub-actors can raise them.	2019-10-28 00:47:06 -04:00
Tyler Goodlet	f885b02c73	Validate stream functions at decorate time	2019-03-29 19:10:32 -04:00
Tyler Goodlet	e51f84af90	Require explicit marking of non async gen streaming funcs Add `@tractor.stream` which must be used to denote non async generator streaming functions which use the `tractor.Context` API to push values. This enforces a more explicit denotation as well as allows enforcing the declaration of the `ctx` argument in definitions.	2019-03-25 21:36:13 -04:00
Tyler Goodlet	4ee35038fb	Move discovery functions to their own module	2019-03-24 11:37:11 -04:00
Tyler Goodlet	2aa6ffce60	Provide each task's cancel scope to every `Context` This begins moving toward explicitly decorated "streaming functions" instead of checking for a `ctx` arg in the signature. - provide each context with its task's top level `trio.CancelScope` such that tasks can cancel themselves explictly if needed via calling `Context.cancel_scope()` - make `Actor.cancel_task()` a private method (`_cancel_task()`) and handle remote rpc calls specially such that the caller does not need to provide the `chan` argument; non-primitive types can't be passed on the wire and we don't want the client actor be require knowledge of the channel instance the request is associated with. This also ties into how we're tracking tasks right now (`Actor._rpc_tasks` is keyed by the call id, a UUID, plus the channel). - make `_do_handshake` a private actor method - use UUID version 4	2019-03-23 23:31:26 -04:00
Tyler Goodlet	7014a07986	Add "spawn" start method support Add full support for using the "spawn" process starting method as per: https://docs.python.org/3/library/multiprocessing.html#contexts-and-start-methods Add a `spawn_method` argument to `tractor.run()` for specifying the desired method explicitly. By default use the "fastest" method available. On *nix systems this is the original "forkserver" method. This should be the solution to getting windows support! Resolves #60	2019-03-06 00:29:07 -05:00
Tyler Goodlet	d75739e9c7	Factor process creation into a separate factory Make a `_spawn` module for encapsulating all the `multiprocessing` "spawn method" stuff and factor current forkserver steps into it.	2019-03-05 18:52:19 -05:00
Tyler Goodlet	78ddd33e3a	Move to `trio.CancelScope`	2019-02-16 14:25:06 -05:00
Tyler Goodlet	fe1c4dbc4c	mpypy and docs fixups	2019-02-16 14:05:03 -05:00
Tyler Goodlet	616192d853	Don't use async gen functions for the stream API As mentioned in prior commits there's currently a bug in Python that make async gens not task safe. Since this is the core cause of almost all recent problems, instead implement our own async iterator derivative of `trio.abc.ReceiveChannel` by wrapping a `trio._channel.MemoryReceiveChannel`. This fits more natively with the memory channel API in ``trio`` and adds potentially more flexibility for possible bidirectional inter-actor streaming in the future. Huge thanks to @oremanj and of course @njsmith for guidance on this one!	2019-02-15 21:59:42 -05:00
Tyler Goodlet	f44ac4528a	Use mem chan in actor core	2019-02-15 16:23:58 -05:00
Tyler Goodlet	3b19e15306	Don't allow cancelling a cancel_task() task	2019-01-23 20:01:29 -05:00
Tyler Goodlet	03e00886da	Add `Actor.cancel_task()` Enable cancelling specific tasks from a peer actor such that when a actor task or the actor itself is cancelled, remotely spawned tasks can also be cancelled. In much that same way that you'd expect a node (task) in the `trio` task tree to cancel any subtasks, actors should be able to cancel any tasks they spawn in separate processes. To enable this: - track rpc tasks in a flat dict keyed by (chan, cid) - store a `is_complete` event to enable waiting on specific tasks to complete - allow for shielding the msg loop inside an internal cancel scope if requested by the caller; there was an issue with `open_portal()` where the channel would be torn down because the current task was cancelled but we still need messaging to continue until the portal block is exited - throw an error if the arbiter tries to find itself for now	2019-01-21 00:38:07 -05:00
Tyler Goodlet	76f7ae5cf4	Log about the loglevel	2019-01-16 17:09:30 -05:00
Tyler Goodlet	06c908f285	Wrap remote import-time errors just the same	2019-01-12 17:56:22 -05:00
Tyler Goodlet	41f2096e86	Adopt `Context` in the RPC core Instead of chan/cid, whenever a remote function defines a `ctx` argument name deliver a `Context` instance to the function. This allows remote funcs to provide async generator like streaming replies (and maybe more later). Additionally, - load actor modules after establishing a connection to the spawning parent to avoid crashing before the error can be reported upwards - fix a bug to do with unpacking and raising local internal actor errors from received messages	2019-01-12 15:27:38 -05:00
Tyler Goodlet	5fab61412c	Propagate module import and func lookup errors RPC module/function lookups should not cause the target actor to crash. This change instead ships the error back to the calling actor allowing for the remote actor to continue running depending on the caller's error handling logic. Adds a new `ModuleNotExposed` error to accommodate.	2019-01-01 16:12:34 -05:00
Tyler Goodlet	d492236f3a	Handle broken channels more resiliently on teardown	2018-12-15 02:19:47 -05:00
Tyler Goodlet	4dccb44c67	Add support for cancelling remote tasks via a msg	2018-12-10 23:12:46 -05:00
Tyler Goodlet	a588047ad4	Stop channel based async gen streams on exit I'm not sure how this ever worked but when a "fake" async gen (i.e. function with special `chan`, `cid` kwargs) is completed we need to signal the end of the stream just like with normal async gens. Also don't fail when trying to remove tasks that were never tracked. Fixes #46	2018-11-30 01:24:59 -05:00

1 2

75 Commits (8eb9a742dd9448394a2ea27152c627ac599e0d71)