tractor

Commit Graph

Author	SHA1	Message	Date
Tyler Goodlet	0afa7f0f8e	Fix lock context manager return type	2021-07-31 12:50:58 -04:00
Tyler Goodlet	b3d28a1ee4	Drop debugger path and duplicate func from rebasing	2021-07-31 12:46:40 -04:00
Tyler Goodlet	09f00a5a00	Go back to only logging tbs on no debugger	2021-07-31 12:46:40 -04:00
Tyler Goodlet	44bfacc0c2	Comment hard-kill-sidestep for now since nursery version covers it?	2021-07-31 12:46:40 -04:00
Tyler Goodlet	551816e80d	Solve the root-cancels-child-in-tty-lock race Finally this makes a cancelled root actor nursery not clobber child tasks which request and lock the root's tty for the debugger repl. Using an edge triggered event which is set after all fifo-lock-queued tasks are complete, we can be sure that no lingering child tasks are going to get interrupted during pdb use and tty lock acquisition. Further, even if new tasks do queue up to get the lock, the root will incrementally send cancel msgs to each sub-actor only once the tty is not locked by a (set of) child request task(s). Add shielding around all the critical sections where the child attempts to allocate the lock from the root such that it won't be disrupted from cancel messages from the root after the acquire lock transaction has started.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	be1fcb2a5b	Distinguish between a local pdb unlock and the tty unlock in root	2021-07-31 12:46:40 -04:00
Tyler Goodlet	ef89ed947a	Fix hard kill in debug mode; only do it when debug lock is empty	2021-07-31 12:46:40 -04:00
Tyler Goodlet	5b3894827f	Move some infos to runtime level	2021-07-31 12:46:40 -04:00
Tyler Goodlet	0fdcfa0ba1	Move debugger wait inside OCA nursery	2021-07-31 12:46:40 -04:00
Tyler Goodlet	37a1897c47	Don't shield debugger status wait; it causes hangs	2021-07-31 12:46:40 -04:00
Tyler Goodlet	0f2a39a311	Catch and delay errors in the root if debugger is active	2021-07-31 12:46:40 -04:00
Tyler Goodlet	23a1622256	Don't kill root's immediate children when in debug If the root calls `trio.Process.kill()` on immediate child proc teardown when the child is using pdb, we can get stdstreams clobbering that results in a pdb++ repl where the user can't see what's been typed. Not killing such children on cancellation / error seems to resolve this issue whilst still giving reliable termination. For now, code that special path until a time it becomes a problem for ensuring zombie reaps.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	49d439b681	Add some brief todo notes on idea of shielded breakpoint	2021-07-31 12:46:40 -04:00
Tyler Goodlet	6f05f5d5e6	Wait for debugger lock task context termination	2021-07-31 12:46:40 -04:00
Tyler Goodlet	b369b91357	Fix up var naming and typing	2021-07-31 12:46:40 -04:00
Tyler Goodlet	969bce3aa4	Use context for remote debugger locking A context is the natural fit (vs. a receive stream) for locking the root proc's tty usage via it's `.started()` sync point. Simplify the `_breakpoin()` routine to be a simple async func instead of all this "returning a coroutine" stuff from before we decided that `tractor.breakpoint()` must be async. Use `runtime` level for locking logging making it easier to trace.	2021-07-31 12:46:40 -04:00
Tyler Goodlet	443ebea165	Use "pdb" level logging in debug mode	2021-07-08 13:02:33 -04:00
Tyler Goodlet	25779d48a8	Define explicit adapter level methods for mypy	2021-07-08 12:51:35 -04:00
Tyler Goodlet	fde52d2464	Mypy fixes	2021-07-08 12:48:34 -04:00
Tyler Goodlet	8c927d708d	Change trace to transport level	2021-07-07 14:31:15 -04:00
Tyler Goodlet	31590e82a3	Flip "trace" level to "transport" level logging	2021-07-07 14:31:03 -04:00
Tyler Goodlet	2513c652c1	Go back to only logging crashes if no pdb gets engaged	2021-07-06 08:23:30 -04:00
Tyler Goodlet	9ddb654783	Avoid mutate on iterate race	2021-07-06 08:23:30 -04:00
Tyler Goodlet	7f86d63e77	Drop trip kwarg	2021-07-06 08:23:30 -04:00
Tyler Goodlet	12f987514d	Don't enter debug on closed resource errors	2021-07-06 08:23:30 -04:00
Tyler Goodlet	98bbf8e0df	Move join event trigger to direct exit path	2021-07-06 08:23:30 -04:00
Tyler Goodlet	b1cd7fdedf	Don't shield on root cancel it can causes hangs	2021-07-06 08:23:30 -04:00
Tyler Goodlet	ef725c5972	Always hard kill sub-procs on teardown Adds a new hard kill routine for the `trio` spawning backend.	2021-07-06 08:23:30 -04:00
Tyler Goodlet	b21e2a6caa	Add pre-stream open error conditions	2021-07-06 08:23:30 -04:00
Tyler Goodlet	c6cdaf9c31	De-densify some code	2021-07-06 08:23:30 -04:00
Tyler Goodlet	91640facbc	Always shield cancel the caller on cancel-causing-errors, add teardown logging	2021-07-06 08:23:30 -04:00
Tyler Goodlet	c2484e88a1	First try: pack cancelled tracebacks and ship to caller	2021-07-06 08:23:30 -04:00
Tyler Goodlet	3423ea4011	Add temp warning msg for context cancel call	2021-07-06 08:23:29 -04:00
Tyler Goodlet	af701c16ee	Consider relaying context error via raised-in-scope-nursery task	2021-07-06 08:23:29 -04:00
Tyler Goodlet	1703171bea	Set stream "end of channel" after shielded check! Another face palm that was causing serious issues for code that is using the `.shielded` feature.. Add a bunch more detailed comments for all this subtlety and hopefully get it right once and for all. Also aggregated the `trio` errors that should trigger closure inside `.aclose()`, hopefully that's right too.	2021-07-06 08:23:29 -04:00
Tyler Goodlet	3d633408fc	Don't clobber msg loop mem chan on rx stream close Revert this change since it really is poking at internals and doesn't make a lot of sense. If the context is going to be cancelled then the msg loop will tear down the feed memory channel when ready, we don't need to be clobbering it and confusing the runtime machinery lol.	2021-07-06 08:23:29 -04:00
Tyler Goodlet	196dea80db	Drop trailing comma	2021-07-06 08:23:29 -04:00
Tyler Goodlet	54916be601	Adjustments for non-frozen context dataclass change	2021-07-06 08:23:29 -04:00
Tyler Goodlet	1a69727b75	Fix exception typing	2021-07-06 08:23:29 -04:00
Tyler Goodlet	348148ff1e	Explicitly formalize context/streaming teardown Add clear teardown semantics for `Context` such that the remote side cancellation propagation happens only on error or if client code explicitly requests it (either by exit flag to `Portal.open_context()` or by manually calling `Context.cancel()`). Add `Context.result()` to wait on and capture the final result from a remote context function; any lingering msg sequence will be consumed/discarded. Changes in order to make this possible: - pass the runtime msg loop's feeder receive channel in to the context on the calling (portal opening) side such that a final 'return' msg can be waited upon using `Context.result()` which delivers the final return value from the callee side `@tractor.context` async function. - always await a final result from the target context function in `Portal.open_context()`'s `__aexit__()` if the context has not been (requested to be) cancelled by client code on block exit. - add an internal `Context._cancel_called` for context "cancel requested" tracking (much like `trio`'s cancel scope). - allow flagging a stream as terminated using an internal `._eoc` flag which will mark the stream as stopped for iteration. - drop `StopAsyncIteration` catching in `.receive()`; it does nothing.	2021-07-06 08:23:29 -04:00
Tyler Goodlet	73302d9d16	Specially raise a `ContextCancelled` for a task-context rpc	2021-07-06 08:23:29 -04:00
Tyler Goodlet	409f7f0d5a	Expose streaming components at top level	2021-07-06 08:23:29 -04:00
Tyler Goodlet	eb3662f981	Add a specially handled `ContextCancelled` error	2021-07-06 08:23:29 -04:00
Tyler Goodlet	39b9896a62	Only close recv chan if we get a ref	2021-07-06 08:23:29 -04:00
Tyler Goodlet	9a4244b9a6	Support no arg to `Context.started()` like trio	2021-07-06 08:23:29 -04:00
Tyler Goodlet	a2e2f7e7a8	Only send stop msg if not received from far end	2021-07-06 08:23:29 -04:00
Tyler Goodlet	6559fb72aa	Expose msg stream types at top level	2021-07-06 08:23:29 -04:00
Tyler Goodlet	e311430d25	Be more pedantic with error handling	2021-07-06 08:23:29 -04:00
Tyler Goodlet	08eb6bd019	Fix typing	2021-07-06 08:23:29 -04:00
Tyler Goodlet	1f8966ba64	Support passing `shield` at stream contruction	2021-07-06 08:23:29 -04:00
Tyler Goodlet	14114547e8	Expose `@context` decorator at top level	2021-07-06 08:23:29 -04:00
Tyler Goodlet	e3955bb62b	Add initial bi-directional streaming This mostly adds the api described in https://github.com/goodboy/tractor/issues/53#issuecomment-806258798 The first draft summary: - formalize bidir steaming using the `trio.Channel` style interface which we derive as a `MsgStream` type. - add `Portal.open_context()` which provides a `trio.Nursery.start()` remote task invocation style for setting up and tearing down tasks contexts in remote actors. - add a distinct `'started'` message to the ipc protocol to facilitate `Context.start()` with a first return value. - for our `ReceiveMsgStream` type, don't cancel the remote task in `.aclose()`; this is now done explicitly by the surrounding `Context` usage: `Context.cancel()`. - streams in either direction still use a `'yield'` message keeping the proto mostly symmetric without having to worry about which side is the caller / portal opener. - subtlety: only allow sending a `'stop'` message during a 2-way streaming context from `ReceiveStream.aclose()`, detailed comment with explanation is included. Relates to #53	2021-07-06 08:23:29 -04:00
Tyler Goodlet	6aab16f877	Drop added logging around root cancel	2021-07-04 11:00:08 -04:00
Tyler Goodlet	caa70245e0	Try remapping all broken errs wholesale on windows	2021-07-04 10:47:15 -04:00
Tyler Goodlet	3f75732b02	Remap windows specific connection reset error	2021-07-04 10:25:19 -04:00
Tyler Goodlet	1edf5c2f06	Specially remap TCP 104-connection-reset to `TransportClosed` Since we currently have no real "discovery protocol" between process trees, the current naive approach is to check via a connect and drop to see if a TCP server is bound to a particular address during root actor startup. This was a historical decision and had no real grounding beyond taking a simple approach to get something working when the project was first started. This is obviously problematic from an error handling perspective since we need to be able to avoid such quick connect-and-drops from cancelling an "arbiter"'s (registry actor's) channel-msg loop machinery (which would propagate and cancel the actor). For now we map this particular TCP error, which gets remapped by `trio` as a `trio.BrokenResourceError` to our own internal `TransportClosed` which is swallowed by channel message loop processing and indicates a graceful teardown of the far end actor.	2021-07-03 18:57:54 -04:00
Tyler Goodlet	a2d400583f	Fix tuple type	2021-07-02 18:10:06 -04:00
Tyler Goodlet	32b4ae0603	Accept transport closed error during handshake and msg loop	2021-07-02 11:38:24 -04:00
Tyler Goodlet	80e100f818	Add our own "transport closed" signal This change some super old (and bad) code from the project's very early days. For some redic reason i must have thought masking `trio`'s internal stream / transport errors and a TCP EOF as `StopAsyncIteration` somehow a good idea. The reality is you probably want to know the difference between an unexpected transport error and a simple EOF lol. This begins to resolve that by adding our own special `TransportClosed` error to signal the "graceful" termination of a channel's underlying transport. Oh, and this builds on the `msgspec` integration which helped shed light on the core issues here B)	2021-07-02 11:36:22 -04:00
Tyler Goodlet	73e123bac7	Fix line length	2021-05-07 11:21:40 -04:00
Tyler Goodlet	1584c547cd	Drop run and rpc_module_paths from discovery tests	2021-05-07 11:21:40 -04:00
Tyler Goodlet	87971de1d9	Re-raise any sidestepped `trio.Cancelled`	2021-05-06 12:05:17 -04:00
Tyler Goodlet	9f38406e85	Appease mypy	2021-05-06 12:05:17 -04:00
Tyler Goodlet	c4b42000eb	Shield around root actor cancel	2021-05-06 12:05:17 -04:00
Tyler Goodlet	607c48f1ac	Distinctly separate and harden mp spawning It's clear now that special attention is needed to handle the case where a spawned `multiprocessing` proc is started but then the parent is cancelled before the child can connect back; in this case we need to be sure to kill the near-zombie child asap. This may end up being the solution to other resiliency issues seen around mp with nested process trees too. More testing is needed to be sure. Relates to #84 #89 #134 #146	2021-05-06 12:05:17 -04:00
Tyler Goodlet	fc36e73628	Comment out `MsgStream` for now	2021-04-28 16:40:38 -04:00
Tyler Goodlet	f59346d854	Add func type checking to `.run_in_actor()`	2021-04-28 12:23:08 -04:00
Tyler Goodlet	86fc418050	Error on bad registry pops	2021-04-28 12:23:08 -04:00
Tyler Goodlet	83af295b45	Fix func type checking	2021-04-28 12:23:08 -04:00
Tyler Goodlet	ad9256bcdb	Drop stream exhaustion; no longer needed	2021-04-28 12:23:08 -04:00
Tyler Goodlet	3e19fd311b	Move debugger locking to new stream api	2021-04-28 12:23:08 -04:00
Tyler Goodlet	80c96cab01	Add a warning for soon to be deprecated `ctx` use in `@stream` func	2021-04-28 12:23:08 -04:00
Tyler Goodlet	36251357b3	Add a new one-way stream API NB: this is a breaking change removing support for `Portal.run()` being able to invoke remote streaming functions and instead replacing the method call with an async context manager api `Portal.open_stream_from()` This style explicitly defines stream teardown at the call site instead of expecting the user to handle tricky things correctly themselves: eg. `async_geneartor.aclosing()`. Going forward `Portal.run()` can be used only for invoking async functions.	2021-04-28 12:23:08 -04:00
Tyler Goodlet	81f3558494	Formatting	2021-04-28 12:23:08 -04:00
Tyler Goodlet	897ab79946	Add a no runtime error	2021-04-28 12:23:08 -04:00
Tyler Goodlet	7f38b7225d	Aggregate and organize streaming components Move receive stream into streaming modules and rebrand as a "message stream". Factor out cancellation mechanics in `.aclose()` into the `Context` type which will soon provide the api for for cancelling portal invocations. Comment-stage a few methods on both types in anticipation of a new bi-directional streaming api. Add a `MsgStream` bidirectional channel type which will be the eventual type yielded from `Context.open_stream()`. Adjust the response/dialog types to be the set `{'asyncfun', 'asyncgen', 'context'}`. OH, and add async func checking in `Portal.run()` to catch and error on sync funcs early.	2021-04-28 12:23:08 -04:00
Tyler Goodlet	d0eacc3fd6	Appease mypy	2021-04-27 12:08:30 -04:00
Tyler Goodlet	89ce1a63e4	Only accept asyncfunc response type	2021-04-27 12:08:30 -04:00
Tyler Goodlet	5798ef6796	Enforce async funcs on callee side, convert arbiter methods	2021-04-27 12:08:30 -04:00
Tyler Goodlet	c2a1612bf5	Drop sync function support You can always wrap a sync function in an async one and there seems to be no good reason to support invoking them directly especially since cancellation won't work without some thread hackery. If it's requested we'll point users to `trio-parallel`. Resolves #77	2021-04-27 12:08:30 -04:00
Tyler Goodlet	be22a2526a	Add `Actor.cancel_soon()` for sync self destruct Add a sync method that can be used to cancel the current actor from a synchronous context. This is useful in debugging situations where sync debugger code may need to kill the process tree. Also, make the internal "lifetime stack" a global var; easier to manage from client code that may was to add callbacks prior to the actor runtime being fully setup.	2021-04-27 11:35:28 -04:00
Tyler Goodlet	47565cfbf3	Use root as default name from `tractor.run()`	2021-02-25 08:51:28 -05:00
Tyler Goodlet	cd636b270e	Update debug tests to expect 'root' actor name	2021-02-24 13:38:20 -05:00
Tyler Goodlet	983e66b31b	Add second implicit-runtime-boot branch	2021-02-24 13:13:45 -05:00
Tyler Goodlet	b285db4c58	Factor OCA supervisor into new func	2021-02-24 13:13:38 -05:00
Tyler Goodlet	5ffd2d2ab3	Ignore type checks on stdlib overrides	2021-02-21 14:08:23 -05:00
Tyler Goodlet	7888ef6f01	Fix more stdlib typing issues with latest mypy	2021-02-21 12:48:03 -05:00
Tyler Goodlet	109066dda9	Support sync code breakpointing via built-in Override `breakpoint()` for sync code making it work properly with `trio` as per: https://github.com/python-trio/trio/issues/1155#issuecomment-742964018 Relates to #193	2021-02-21 12:36:00 -05:00
Tyler Goodlet	9f4e497b9c	Don't shield proc waits	2021-01-14 18:21:26 -05:00
Tyler Goodlet	e546ead2ff	Pub sub internals type fixes	2021-01-14 18:20:59 -05:00
Tyler Goodlet	3df001f3a9	Fix msg pub global lock sharing Using `None` as the default key for a `@msg.pub` can cause conflicts if there is more then one "taskless" (no tasks={,} passed) pub offered on an actor... So instead use the first trio "task name" (usually just the function name) instead thus avoiding this very hard to debug and understand problem. Probably should throw in a test but I'm super lazy today.	2021-01-14 18:20:49 -05:00
Tyler Goodlet	5ed5d18ccb	Begin rpc_module_paths deprecation	2021-01-08 22:08:45 -05:00
Tyler Goodlet	32b10681a1	Drop tractor.run() from @tractor_test	2021-01-08 20:56:03 -05:00
Tyler Goodlet	41a4de5af2	Use actual task name lel	2021-01-08 20:55:42 -05:00
Tyler Goodlet	59421d9f3a	Fix some borked tests	2021-01-08 20:55:11 -05:00
Tyler Goodlet	333ddcf93f	Can we ever really appease mypy?	2021-01-03 11:18:31 -05:00
Tyler Goodlet	0bb2163b0c	Implicitly open root actor on first nursery use.	2021-01-02 21:39:30 -05:00
Tyler Goodlet	bd3059f01b	Allow for error bypass	2021-01-02 21:39:30 -05:00
Tyler Goodlet	803152ead5	Use explicit named args	2021-01-02 21:39:30 -05:00
Tyler Goodlet	e6245671b0	Use runtime level on attach	2021-01-02 21:38:55 -05:00
goodboy	bfe500060f	Merge pull request #181 from goodboy/drop_tractor_run Deprecate `tractor.run()`	2020-12-28 12:53:04 -05:00
Tyler Goodlet	723fb17394	Add deprecation warning to run()	2020-12-27 13:29:30 -05:00
Tyler Goodlet	f05534e472	Re-org root actor startup into context manager This begins the move to dropping support for `tractor.run()` which we don't really need since the runtime is started (as it always has been) from a new sub-task / nursery. Instead this introduces starting the actor tree through a `open_root_actor()` async context manager which we'll likely implicitly call (from the root) on the first use of an actor nursery. Drop `_actor._start_actor()` and factor its contents into this new api. Make `run()` and `run_daemon()` use `open_root_actor()` until we decide to remove them. Relates to #168 and #177	2020-12-27 13:29:30 -05:00
Tyler Goodlet	b040cdc0c9	Add null byte guard from mainline	2020-12-27 13:28:54 -05:00
Tyler Goodlet	6b650c0fe6	Add a "runtime" log level	2020-12-26 15:45:45 -05:00
Tyler Goodlet	0d05a727b6	Use error log level by default	2020-12-25 15:28:32 -05:00
Tyler Goodlet	c28ffd8b1c	Don't exception log multi-cancels	2020-12-25 15:23:59 -05:00
Tyler Goodlet	5d7a4e2b12	Denoise some common teardown "errors" to warnings.	2020-12-25 15:10:20 -05:00
Tyler Goodlet	8522f90000	Add type annots to exceptions mod Also add a `is_multi_cancelled()` predicate to test for `trio.MultiError`s that contain entirely cancel signals. Resolves #125	2020-12-25 15:07:36 -05:00
Tyler Goodlet	4bf9b27f57	Drop all .statespace refs; it was a silly idea	2020-12-22 19:33:16 -05:00
Tyler Goodlet	9fd3c42eb1	Port inter-process method calls to `Portal.run_from_ns()`	2020-12-22 10:39:47 -05:00
Tyler Goodlet	7134f35d6e	Add `Portal.run_from_ns()` It turns out in order to maintain our sneaky little "call an `Actor` method in this remote process" we still need the ability to invoke functions from a namespace. We're currently using a "self" namespace as a way to do this for internal inter-process method calling. Either way, I see no reason not to keep a public method for this invoke style (we just won't market it) since it is still how the machinery works underneath.	2020-12-22 10:39:47 -05:00
Tyler Goodlet	a668f714d5	Allow passing function refs to `Portal.run()` This resolves and completes #69 allowing all RPC invocation APIs to pass function references directly instead of explicit `str` names for the target namespace and function (this is still done implicitly underneath). This brings us closer to `trio`'s task running API as well as acknowledges that any inter-host RPC system (and API) will likely need to be implemented on top of local RPC primitives anyway. Even if this ends up not being true we can always go to "function stubs" as part of our IAC protocol or, add a new method to do explicit namespace calls: `.run_from_module()` or whatever everyone votes on. Resolves #69 Further, this commit drops `Actor.statespace` from the entire system since a user can easily get this same functionality using module level variables. Fix docs to match all these changes (luckily mostly already done due to example scripts referencing).	2020-12-21 09:09:55 -05:00
Tyler Goodlet	0d67ce4abc	Fix collections type import for py3.10	2020-12-18 17:58:07 -05:00
Tyler Goodlet	797bcc1df2	Handle early timeouts on last debugger test	2020-12-17 13:35:45 -05:00
Tyler Goodlet	201771a521	'Fix mypy, change interal type name to `ReceiveStream`, settle on `.shield()`'	2020-12-17 12:01:49 -05:00
Tyler Goodlet	15ead6b561	Add a way to shield a stream's underlying channel Add a ``tractor._portal.StreamReceiveChannel.shield_channel()`` context manager which allows for avoiding the closing of an IPC stream's underlying channel for the purposes of task re-spawning. Sometimes you might want to cancel a task consuming a stream but not tear down the IPC between actors (the default). A common use can might be where the task's "setup" work might need to be redone but you want to keep the established portal / channel in tact despite the task restart. Includes a test.	2020-12-16 21:42:28 -05:00
Tyler Goodlet	d497078eb7	Appease 3.8 mypy	2020-12-11 20:04:56 -05:00
Tyler Goodlet	e51c2620e5	End the `pdb` SIGINT handling madness Turns out this is a lower level issue in terms of the stdlib's default `pdb.Pdb` settings and how they conflict with `trio`s cancellation and KBI handling. The details are hashed out more thoroughly in python-trio/trio#1155. Maybe we can get a fix in trio so things are solved under our feet :)	2020-12-11 00:15:09 -05:00
Tyler Goodlet	12f425137c	Drop duplicate project-package name in msg header	2020-11-03 12:15:49 -05:00
Tyler Goodlet	1580cc6fa0	Add explanation to module load error	2020-10-15 23:16:56 -04:00
Tyler Goodlet	5822d38ae4	Set _is_root runtime var in _main()	2020-10-15 23:16:54 -04:00
Tyler Goodlet	3b8684f655	Always call `Actor.cancel()` at end of root's main task It's simpler and the only real logical difference is logging messages. This should also give us an overall consistent tear down sequence.	2020-10-14 13:59:57 -04:00
Tyler Goodlet	02a9cac557	Drop remaining warn()s	2020-10-14 13:48:14 -04:00
Tyler Goodlet	f60321a35a	Always cancel service nursery last The channel server should be torn down before the rpc task/service nursery. Do this explicitly even in the root's main task to avoid a strange hang I found in the pubsub tests. Start dropping the `warnings.warn()` usage.	2020-10-14 13:46:05 -04:00
goodboy	7115d6c3bd	Merge pull request #129 from goodboy/multiproc_debug Wen? Multiprocessing-native debugger now!	2020-10-14 09:14:03 -04:00
Tyler Goodlet	e3c26943ba	Support debug mode only on the trio backend	2020-10-13 14:20:44 -04:00
Tyler Goodlet	08ff989631	Add some comments	2020-10-13 11:59:18 -04:00
Tyler Goodlet	573b8fef73	Add better actor cancellation tracking Add `Actor._cancel_called` and `._cancel_complete` making it possible to determine whether the actor has started the cancellation sequence and whether that sequence has fully completed. This allows for blocking in internal machinery tasks as necessary. Also, always trigger the end of ongoing rpc tasks even if the last task errors; there's no guarantee the trio cancellation semantics will guarantee us a nice internal "state" without this.	2020-10-13 11:48:52 -04:00
Tyler Goodlet	c375a2d028	mypy fixes	2020-10-13 11:03:55 -04:00
Tyler Goodlet	c41e5c8313	Fix missing await	2020-10-13 00:45:29 -04:00
Tyler Goodlet	79c38b04e7	Report `trio.Cancelled` when exhausting portals.. For reliable remote cancellation we need to "report" `trio.Cancelled`s (just like any other error) when exhausting a portal such that the caller can make decisions about cancelling the respective actor if need be. Resolves #156	2020-10-12 23:28:36 -04:00
Tyler Goodlet	07112089d0	Add mention subactor uid during locking	2020-10-07 05:53:26 -04:00
Tyler Goodlet	d43d367153	Facepalm: tty locking from root doesn't require an extra task	2020-10-05 11:58:58 -04:00
Tyler Goodlet	83a45119e9	Add "root mailbox" contact info passing Every subactor in the tree now receives the socket (or whatever the mailbox type ends up being) during startup and can call the new `tractor._discovery.get_root()` function to get a portal to the current root actor in their tree. The main reason for adding this atm is to support nested child actors gaining access to the root's tty lock for debugging. Also, when a channel disconnects from a message loop, might as well kill all its rpc tasks.	2020-10-05 11:58:58 -04:00
Tyler Goodlet	a2151cdd4d	Allow re-entrant breakpoints during pdb stepping	2020-10-05 11:58:58 -04:00
Tyler Goodlet	9067bb2a41	Shorten arbiter contact timeout	2020-10-05 11:58:58 -04:00
Tyler Goodlet	29ed065dc4	Ack our inability to hard kill sub-procs	2020-09-28 13:56:42 -04:00
Tyler Goodlet	fc2cb610b9	Make "hard kill" just a `Process.terminate()` It's not like any of this code is really being used anyway since we aren't indefinitely blocking for cancelled subactors to terminate (yet). Drop the `do_hard_kill()` bit for now and just rely on the underlying process api. Oh, and mark the nursery as cancelled asap.	2020-09-28 13:49:45 -04:00
Tyler Goodlet	5dd2d35fc5	Huh, maybe we don't need to block SIGINT Seems like the request task cancel scope is actually solving all the deadlock issues and masking SIGINT isn't changing much behaviour at all. I think let's keep it unmasked for now in case it does turn out useful in cancelling from unrecoverable states while in debug.	2020-09-28 13:11:22 -04:00
Tyler Goodlet	25e93925b0	Add a cancel scope around child debugger requests This is needed in order to avoid the deadlock condition where a child actor is waiting on the root actor's tty lock but it's parent (possibly the root) is waiting on it to terminate after sending a cancel request. The solution is simple: create a cancel scope around the request in the child and always cancel it when a cancel request from the parent arrives.	2020-09-28 13:02:33 -04:00
Tyler Goodlet	363498b882	Disable SIGINT handling in child processes There seems to be no good reason not too since our cancellation machinery/protocol should do this work when the root receives the signal. This also (hopefully) helps with some debugging race condition stuff.	2020-09-28 09:24:36 -04:00
Tyler Goodlet	f1b242f913	Block SIGINT handling while in the debugger This seems to prevent a certain class of bugs to do with the root actor cancelling local tasks and getting into deadlock while children are trying to acquire the tty lock. I'm not sure it's the best idea yet since you're pretty much guaranteed to get "stuck" if a child activates the debugger after the root has been cancelled (at least "stuck" in terms of SIGINT being ignored). That kinda race condition seems to still exist somehow: a child can "beat" the root to activating the tty lock and the parent is stuck waiting on the child to terminate via its nursery.	2020-09-28 08:54:21 -04:00
Tyler Goodlet	76e1c83161	Add matrix room link	2020-09-24 11:12:45 -04:00
Tyler Goodlet	9e1d9a8ce1	Add an internal context stack This aids with tearing down resources after the crash handling and debugger have completed. Leaving this internal for now but should eventually get a public convenience function like `tractor.context_stack()`.	2020-09-24 10:12:33 -04:00
Tyler Goodlet	09daba4c9c	Explicitly handle `debug_mode` flag correctly	2020-09-24 10:12:33 -04:00
Tyler Goodlet	8b6e9f5530	Port to new debug api, set `_is_root` state flag on startup	2020-09-24 10:12:33 -04:00
Tyler Goodlet	150179bfe4	Support entering post mortem on crashes in root actor	2020-09-24 10:12:33 -04:00
Tyler Goodlet	291ecec070	Maybe not sticky by default	2020-09-24 10:12:33 -04:00
Tyler Goodlet	bd157e05ef	Port to service nursery	2020-09-24 10:12:33 -04:00

1 2 3 4 5 ...

453 Commits (f72eabd42aa32426c4db087ac325ace26bf5cde5)