tractor

Commit Graph

Author	SHA1	Message	Date
Tyler Goodlet	ab2664da70	Runtime level log on debug REPL exits	2023-10-16 15:46:21 -04:00
Tyler Goodlet	ae326cbb9a	Ignore kbis in `open_crash_handler()` by default	2023-10-16 15:45:34 -04:00
Tyler Goodlet	07cec02303	Add comments around diff between `C/context` refs	2023-10-16 15:45:02 -04:00
Tyler Goodlet	2fdb8fc25a	Factor non-yield stream msg processing into helper Since both `MsgStream.receive()` and `.receive_nowait()` need the same raising logic when a non-stream msg arrives (so that maybe an appropriate IPC translated error can be raised) move the `KeyError` handler code into a new `._streaming._raise_from_no_yield_msg()` func and call it from both methods to make the error-interface-raising symmetrical across both methods.	2023-10-16 15:35:16 -04:00
Tyler Goodlet	6d951c526a	Comment all `.pause(shield=True)` attempts again, need to solve cancel scope `.__exit__()` frame hiding issue..	2023-10-10 09:55:11 -04:00
Tyler Goodlet	575a24adf1	Always raise remote (cancelled) error if set Previously we weren't raising a remote error if the local scope was cancelled during a call to `Context.result()` which is problematic if the caller WAS NOT the requester for said remote cancellation; in that case we still want a `ContextCancelled` raised with the `.canceller: str` set to the cancelling actor uid. Further fix a naming bug where the (seemingly older) `._remote_err` was being set to such an error instead of `._remote_error` XD	2023-10-10 09:45:49 -04:00
Tyler Goodlet	919e462f88	Write more comprehensive `Portal.cancel_actor()` doc str	2023-10-08 15:57:18 -04:00
Tyler Goodlet	a09b8560bb	Oof, default reg addrs needs to be in `list[tuple]` form..	2023-10-07 18:52:37 -04:00
Tyler Goodlet	d24a9e158f	Msg-ified `ContextCancelled`s sub-error type should always be just, its type..	2023-10-07 18:51:03 -04:00
Tyler Goodlet	18a1634025	Add shielding support to `.pause()` Implement it like you'd expect using simply a wrapping `trio.CancelScope` which is itself shielded by the input `shield: bool` B) There's seemingly still some issues with the frame selection when the REPL engages and not sure how to resolve it yet but at least this does indeed work for practical purposes. Still needs a test obviously!	2023-10-06 15:49:23 -04:00
Tyler Goodlet	4314a59327	Add post-mortem catch around failed transport addr binds to aid with runtime debugging	2023-10-03 10:54:46 -04:00
Tyler Goodlet	e94f1261b5	Move `maybe_open_crash_handler()` CLI `--pdb`-driven wrapper to debug mod	2023-10-02 18:10:34 -04:00
Tyler Goodlet	86da79a854	Rename to `parse_maddr()` and fill out doc strings	2023-09-29 14:49:18 -04:00
Tyler Goodlet	de89e3a9c4	Add libp2p style "multi-address" parser from `piker` Details are in the module docs; this is a first draft with lotsa room for refinement and extension.	2023-09-29 14:11:31 -04:00
Tyler Goodlet	7bed470f5c	Start `.devx.cli` extensions for pop CLI frameworks Starting of with just a `typer` (and thus transitively `click`) `typer.Typer.callback` hook which allows passthrough of the `--ll <loglevel: str>` and `--pdb <debug_mode: bool>` flags for use when building CLIs that use the runtime Bo Still needs lotsa refinement and obviously better docs but, the doc string for `load_runtime_vars()` shows how to use the underlying `.devx._debug.open_crash_handler()` via a wrapper that can be passed the `--pdb` flag and then enable debug mode throughout the entire actor system.	2023-09-28 15:36:24 -04:00
Tyler Goodlet	fa9a9cfb1d	Kick off `.devx` subpkg for our dev tools B) Where `.devx` is "developer experience", a hopefully broad enough subpkg name for all the slick stuff planned to augment working on the actor runtime 💥 Move the `._debug` module into the new subpkg and adjust rest of core code base to reflect import path change. Also add a new `.devx._debug.open_crash_handler()` manager for wrapping any sync code outside a `trio.run()` which is handy for eventual CLI addons for popular frameworks like `click`/`typer`.	2023-09-28 14:14:50 -04:00
Tyler Goodlet	3d0e95513c	Init-support for "multi homed" transports Since we'd like to eventually allow a diverse set of transport (protocol) methods and stacks, and a multi-peer discovery system for distributed actor-tree applications, this reworks all runtime internals to support multi-homing for any given tree on a logical host. In other words any actor can now bind its transport server (currently only unsecured TCP + `msgspec`) to more then one address available in its (linux) network namespace. Further, registry actors (now dubbed "registars" instead of "arbiters") can also similarly bind to multiple network addresses and provide discovery services to remote actors via multiple addresses which can now be provided at runtime startup. Deats: - adjust `._runtime` internals to use a `list[tuple[str, int]]` (and thus pluralized) socket address sequence where applicable for transport server socket binds, now exposed via `Actor.accept_addrs`: - `Actor.__init__()` now takes a `registry_addrs: list`. - `Actor.is_arbiter` -> `.is_registrar`. - `._arb_addr` -> `._reg_addrs: list[tuple]`. - always reg and de-reg from all registrars in `async_main()`. - only set the global runtime var `'_root_mailbox'` to the loopback address since normally all in-tree processes should have access to it, right? - `._serve_forever()` task now takes `listen_sockaddrs: list[tuple]` - make `open_root_actor()` take a `registry_addrs: list[tuple[str, int]]` and defaults when not passed. - change `ActorNursery.start_..()` methods take `bind_addrs: list` and pass down through the spawning layer(s) via the parent-seed-msg. - generalize all `._discovery()` APIs to accept `registry_addrs`-like inputs and move all relevant subsystems to adopt the "registry" style naming instead of "arbiter": - make `find_actor()` support batched concurrent portal queries over all provided input addresses using `.trionics.gather_contexts()` Bo - syntax: move to using `async with <tuples>` 3.9+ style chained @acms. - a general modernization of the code to a python 3.9+ style. - start deprecation and change to "registry" naming / semantics: - `._discovery.get_arbiter()` -> `.get_registry()`	2023-09-27 16:25:21 -04:00
Tyler Goodlet	ee151b00af	Mk `gather_contexts()` support `@acm`s yielding `None` We were using a `all(<yielded values>)` condition which obviously won't work if the batched managers yield any non-truthy value. So instead see the `unwrapped: dict` with the `id(mngrs)` and only unblock once all values have been filled in to be something that is not that value.	2023-09-27 14:05:22 -04:00
Tyler Goodlet	22c14e235e	Expose `Channel` @ pkg level, drop `_debug.pp()` alias	2023-08-18 10:18:25 -04:00
Tyler Goodlet	1102843087	Teensie tidy up on actor doc string	2023-08-18 10:10:36 -04:00
Tyler Goodlet	e03bec5efc	Move `.to_asyncio` to modern optional value type annots	2023-07-21 15:08:46 -04:00
Tyler Goodlet	bee2c36072	Make `NamespacePath` work on object refs Detect if the input ref is a non-func (like an `object` instance) in which case grab its type name using `type()`. Wrap all the name-getting into a new `_mk_fqpn()` static meth: gets the "fully qualified path name" and returns path and name in tuple; port other methds to use it. Refine and update the docs B)	2023-07-12 13:07:30 -04:00
Tyler Goodlet	b36b3d522f	Map `breakpoint()` built-in to new `.pause_from_sync()` ep	2023-07-07 15:35:52 -04:00
Tyler Goodlet	4ace8f6037	Fix frame-selection display on first REPL entry For whatever reason pdb(p), and in general, will show the frame of the next python instruction/LOC on initial entry (at least using `.set_trace()`), as such remove the `try/finally` block in the sync code entrypoint `.pause_from_sync()`, and also since doesn't seem like we really need it anyway. Further, and to this end: - enable hidden frames support in our default config. - fix/drop/mask all the frame ref-ing/mangling we had prior since it's no longer needed as well as manual `Lock` releasing which seems to work already by having the `greenback` spawned task do it's normal thing? - move to no `Union` type annots. - hide all frames that can add "this is the runtime confusion" to traces.	2023-07-07 14:51:44 -04:00
Tyler Goodlet	98a7326c85	._runtime: log level tweaks, use crit for stale debug lock detection	2023-07-07 14:49:23 -04:00
Tyler Goodlet	46972df041	.log: more correct handling for `get_logger(__name__)` usage	2023-07-07 14:48:37 -04:00
Tyler Goodlet	ac695a05bf	Updates from latest `piker.data._sharedmem` changes	2023-06-22 17:16:17 -04:00
Tyler Goodlet	fc56971a2d	First proto: use `greenback` for sync func breakpointing This works now for supporting a new `tractor.pause_from_sync()` `tractor`-aware-replacement for `Pdb.set_trace()` from sync functions which are also scheduled from our runtime. Uses `greenback` to do all the magic of scheduling the bg `tractor._debug._pause()` task and engaging the normal TTY locking machinery triggered by `await tractor.breakpoint()` Further this starts some public API renaming, making a switch to `tractor.pause()` from `.breakpoint()` which IMO much better expresses the semantics of the runtime intervention required to suffice multi-process "breakpointing"; it also is an alternate name for the same in computer science more generally: https://en.wikipedia.org/wiki/Breakpoint It also avoids using the same name as the `breakpoint()` built-in which is important since there is alot more going on when you call our equivalent API. Deats of that: - add deprecation warning for `tractor.breakpoint()` - add `tractor.pause()` and a shorthand, easier-to-type, alias `.pp()` for "pause-point" B) - add `pause_from_sync()` as the new `breakpoint()`-from-sync-function hack which does all the `greenback` stuff for the user. Still TODO: - figure out where in the runtime and when to call `greenback.ensure_portal()`. - fix the frame selection issue where `trio._core._ki._ki_protection_decorator:wrapper` seems to be always shown on REPL start as the selected frame..	2023-06-21 16:08:18 -04:00
Tyler Goodlet	4f442efbd7	Pass `str` dtype for `use_str` case	2023-06-15 12:20:20 -04:00
Tyler Goodlet	f9a84f0732	Allocate size-specced "empty" sequence from default values by type	2023-06-15 12:20:20 -04:00
Tyler Goodlet	e0bf964ff0	Mod define `_USE_POSIX`, add a of of todos	2023-06-15 12:20:20 -04:00
Tyler Goodlet	b52ff270c5	Add `ShmList` slice support in `.__getitem__()`	2023-06-15 12:20:20 -04:00
Tyler Goodlet	1713ecd9f8	Rename token type to `NDToken` in the style of `nptyping`	2023-06-15 12:20:20 -04:00
Tyler Goodlet	edb82fdd78	Don't require runtime (for now), type annot fixing	2023-06-15 12:20:20 -04:00
Tyler Goodlet	71477290fc	Add `ShmList` wrapping the stdlib's `ShareableList` First attempt at getting `multiprocessing.shared_memory.ShareableList` working; we wrap the stdlib type with a readonly attr and a `.key` for cross-actor lookup. Also, rename all `numpy` specific routines to have a `ndarray` suffix in the func names.	2023-06-15 12:20:20 -04:00
Tyler Goodlet	9716d86825	Initial module import from `piker.data._sharemem` More or less a verbatim copy-paste minus some edgy variable naming and internal `piker` module imports. There is a bunch of OHLC related defaults that need to be dropped and we need to adjust to an optional dependence on `numpy` by supporting shared lists as per the mp docs.	2023-06-15 12:20:20 -04:00
Tyler Goodlet	7507e269ec	Just import `mp` top level in `._spawn`	2023-06-14 15:32:15 -04:00
Tyler Goodlet	17ae449160	Tidy up `typing` imports in broadcaster mod	2023-06-14 15:31:52 -04:00
Tyler Goodlet	6495688730	Drop `Optional` style from runtime mod	2023-05-25 16:00:05 -04:00
Tyler Goodlet	a0276f41c2	Remote cancellation runtime-internal vars renames - `Context._cancel_called_remote` -> `._cancelled_remote` since "called" implies the cancellation was "requested" when it could be due to another error and the actor uid is the value - only set once the far end task scope is terminated due to either error or cancel, which has nothing to do with what caused the cancellation. - `Actor._cancel_called_remote` -> `._cancel_called_by_remote` which emphasizes that this variable is only set IFF some remote actor requested that this actor's runtime be cancelled via `Actor.cancel()`.	2023-05-19 14:31:55 -04:00
Tyler Goodlet	ead9e418de	Expose `allow_overruns` to `Portal.open_context()` Turns out you can get a case where you might be opening multiple ctx-streams concurrently and during the context opening phase you block for all contexts to open, but then when you eventually start opening streams some slow to start context has caused the others become in an overrun state.. so we need to let the caller control whether that's an error ;) This also needs a test!	2023-05-15 10:00:45 -04:00
Tyler Goodlet	60791ed546	Oof, fix remaining `Actor.cancel()` in `Actor._from_parent()`	2023-05-15 10:00:45 -04:00
Tyler Goodlet	7293b82bcc	Tweak doc string	2023-05-15 10:00:45 -04:00
Tyler Goodlet	20d75ff934	Move move context code into new `._context` mod	2023-05-15 10:00:45 -04:00
Tyler Goodlet	04e4397a8f	Ignore drainer-task nursery RTE during context exit	2023-05-15 10:00:45 -04:00
Tyler Goodlet	968f13f9ef	Set `Context._scope_nursery` on callee side too Because obviously we probably want to support `allow_overruns` on the remote callee side as well XD Only found the bugs fixed in this patch this thanks to writing a much more exhaustive test set for overrun cases B)	2023-05-15 10:00:45 -04:00
Tyler Goodlet	f9911c22a4	Seriously cover all overrun cases This actually caught further runtime bugs so it's gud i tried.. Add overrun-ignore enabled / disabled cases and error catching for all of them. More or less this should cover every possible outcome when it comes to setting `allow_overruns: bool` i hope XD	2023-05-15 10:00:45 -04:00
Tyler Goodlet	6db656fecf	Flip allocate log msgs to debug	2023-05-15 10:00:45 -04:00
Tyler Goodlet	c72026091e	Remote `Context` cancellation semantics rework B) This adds remote cancellation semantics to our `tractor.Context` machinery to more closely match that of `trio.CancelScope` but with operational differences to handle the nature of parallel tasks interoperating across multiple memory boundaries: - if an actor task cancels some context it has opened via `Context.cancel()`, the remote (scope linked) task will be cancelled using the normal `CancelScope` semantics of `trio` meaning the remote cancel scope surrounding the far side task is cancelled and `trio.Cancelled`s are expected to be raised in that scope as per normal `trio` operation, and in the case where no error is raised in that remote scope, a `ContextCancelled` error is raised inside the runtime machinery and relayed back to the opener/caller side of the context. - if any actor task cancels a full remote actor runtime using `Portal.cancel_actor()` the same semantics as above apply except every other remote actor task which also has an open context with the actor which was cancelled will also be sent a `ContextCancelled` but with the `.canceller` field set to the uid of the original cancel requesting actor. This changeset also includes a more "proper" solution to the issue of "allowing overruns" during streaming without attempting to implement any form of IPC streaming backpressure. Implementing task-granularity backpressure cross-process turns out to be more or less impossible without augmenting out streaming protocol (likely at the cost of performance). Further allowing overruns requires special care since any blocking of the runtime RPC msg loop task effectively can block control msgs such as cancels and stream terminations. The implementation details per abstraction layer are as follows. ._streaming.Context: - add a new contructor factor func `mk_context()` which provides a strictly private init-er whilst allowing us to not have to define an `.__init__()` on the type def. - add public `.cancel_called` and `.cancel_called_remote` properties. - general rename of what was the internal `._backpressure` var to `._allow_overruns: bool`. - move the old contents of `Actor._push_result()` into a new `._deliver_msg()` allowing for better encapsulation of per-ctx msg handling. - always check for received 'error' msgs and process them with the new `_maybe_cancel_and_set_remote_error()` before any msg delivery to the local task, thus guaranteeing error and cancellation handling despite any overflow handling. - add a new `._drain_overflows()` task-method for use with new `._allow_overruns: bool = True` mode. - add back a `._scope_nursery: trio.Nursery` (allocated in `Portal.open_context()`) who's sole purpose is to spawn a single task which runs the above method; anything else is an error. - augment `._deliver_msg()` to start a task and run the above method when operating in no overrun mode; the task queues overflow msgs and attempts to send them to the underlying mem chan using a blocking `.send()` call. - on context exit, any existing "drainer task" will be cancelled and remaining overflow queued msgs are discarded with a warning. - rename `._error` -> `_remote_error` and set it in a new method `_maybe_cancel_and_set_remote_error()` which is called before processing - adjust `.result()` to always call `._maybe_raise_remote_err()` at its start such that whenever a `ContextCancelled` arrives we do logic for whether or not to immediately raise that error or ignore it due to the current actor being the one who requested the cancel, by checking the error's `.canceller` field. - set the default value of `._result` to be `id(Context()` thus avoiding conflict with any `.result()` actually being `False`.. ._runtime.Actor: - augment `.cancel()` and `._cancel_task()` and `.cancel_rpc_tasks()` to take a `requesting_uid: tuple` indicating the source actor of every cancellation request. - pass through the new `Context._allow_overruns` through `.get_context()` - call the new `Context._deliver_msg()` from `._push_result()` (since the factoring out that method's contents). ._runtime._invoke: - `TastStatus.started()` back a `Context` (unless an error is raised) instead of the cancel scope to make it easy to set/get state on that context for the purposes of cancellation and remote error relay. - always raise any remote error via `Context._maybe_raise_remote_err()` before doing any `ContextCancelled` logic. - assign any `Context._cancel_called_remote` set by the `requesting_uid` cancel methods (mentioned above) to the `ContextCancelled.canceller`. ._runtime.process_messages: - always pass a `requesting_uid: tuple` to `Actor.cancel()` and `._cancel_task` to that any corresponding `ContextCancelled.canceller` can be set inside `._invoke()`.	2023-05-15 10:00:45 -04:00
Tyler Goodlet	90e41016b9	Only tuplize `.canceller` if non-`None`	2023-05-15 10:00:45 -04:00
Tyler Goodlet	f54c415060	Move `NoRuntime` import inside `current_actor()` to avoid cycle	2023-05-15 10:00:45 -04:00
Tyler Goodlet	67f82c6ebd	Add new remote error introspection attrs To handle both remote cancellation this adds `ContextCanceled.canceller: tuple` the uid of the cancel requesting actor and is expected to be set by the runtime when servicing any remote cancel request. This makes it possible for `ContextCancelled` receivers to know whether "their actor runtime" is the source of the cancellation. Also add an explicit `RemoteActor.src_actor_uid` which better formalizes the notion of "which remote actor" the error originated from. Both of these new attrs are expected to be packed in the `.msgdata` when the errors are loaded locally.	2023-05-15 10:00:45 -04:00
Tyler Goodlet	220b244508	Log waiter task cancelling msg as cancel-level	2023-05-15 10:00:45 -04:00
Tyler Goodlet	831790377b	Assign `RemoteActorError` boxed error type for context cancelleds	2023-05-15 10:00:45 -04:00
Tyler Goodlet	e80e0a551f	Change a bunch of log levels to cancel, including any `ContextCancelled` handling	2023-05-15 10:00:45 -04:00
Tyler Goodlet	b3f9251eda	Add some log-level method doc-strings	2023-05-15 10:00:45 -04:00
Tyler Goodlet	903537ce04	Tweak context doc str	2023-05-15 10:00:45 -04:00
Tyler Goodlet	d75343106b	More single doc-strs in discovery mod	2023-05-15 10:00:45 -04:00
Tyler Goodlet	cfb2bc0fee	Enable `Context` backpressure by default; avoid startup race-crashes?	2023-05-15 10:00:45 -04:00
Tyler Goodlet	1c3893a383	Drop commented `pdbpp` import logic	2023-05-15 09:01:55 -04:00
Tyler Goodlet	79622bbeea	Restore `breakpoint()` hook after runtime exits Previously we were leaking our (pdb++) override into the Python runtime which would always result in a runtime error whenever `breakpoint()` is called outside our runtime; after exit of the root actor . This explicitly restores any previous hook override (detected during startup) or deletes the hook and restores the environment if none existed prior. Also adds a new WIP debugging example script to ensure breakpointing works as normal after runtime close; this will be added to the test suite.	2023-05-15 00:47:29 -04:00
Tyler Goodlet	95535b2226	Some more 3.10+ optional type sigs	2023-05-15 00:47:29 -04:00
Tyler Goodlet	ae4ff5dc8d	pdbp: adding typing to config settings vars	2023-05-14 22:38:46 -04:00
Tyler Goodlet	705538398f	`pdbp`: turn off line truncating by default, fixes terminal resizing stuff	2023-05-14 22:38:16 -04:00
Tyler Goodlet	86aef5238d	Hide actor nursery exit frame	2023-05-14 21:24:26 -04:00
Tyler Goodlet	cc82447db6	First try: switch debug machinery over to `pdbp` B)	2023-05-14 21:24:26 -04:00
Tyler Goodlet	23cffbd940	Use multiline import for debug mod	2023-05-14 21:24:26 -04:00
Tyler Goodlet	f667d16d66	Copy the now deprecated `trio.Process.aclose()` Move it into our `_spawn.do_hard_kill()` since we do indeed rely on the particular process killing sequence on "soft kill" failure cases.	2023-05-14 19:31:50 -04:00
Tyler Goodlet	24a062341e	Just call `trio.Process.aclose()` directly for now?	2023-04-02 14:34:41 -04:00
Tyler Goodlet	8637778739	Expose `raise_on_lag: bool` flag through factory	2023-01-30 12:18:23 -05:00
Tyler Goodlet	47166e45f0	Be explicit with passthrough kwargs (there's so few)	2023-01-29 17:31:21 -05:00
Tyler Goodlet	4ce2dcd12b	Switch back to raising `Lagged` by default Makes the broadcast test suite not hang xD, and is our expected default behaviour. Also removes a ton of commented legacy cruft from before the refactor to remove the `.receive()` recursion and fixes some typing. Oh right, and in the case where there's only one subscriber left we warn log about it since in theory we could actually entirely unwind the bcaster back to the original underlying, though not sure if that's sane or works for some use cases (like wanting to have some other subscriber get added dynamically later).	2023-01-29 15:03:34 -05:00
Tyler Goodlet	80f983818f	Ignore monkey patched `.send()` type annot	2023-01-29 15:03:34 -05:00
Tyler Goodlet	6ba29f8d56	Recurse and get the last value when in warn mode	2023-01-29 15:03:34 -05:00
Tyler Goodlet	2707a0e971	Add `._raise_on_lag` flag to disable `Lag` raising	2023-01-29 15:03:34 -05:00
Tyler Goodlet	9f9907271b	Merge `ReceiveMsgStream` and `MsgStream` Since one-way streaming can be accomplished by just not sending on one side (and/or thus wrapping such usage in a more restrictive API), we just drop the recv-only parent type. The only method different was `MsgStream.send()`, now merged in. Further in usage of `.subscribe()` we monkey patch the underlying stream's `.send()` onto the delivered broadcast receiver so that subscriber tasks can two-way stream as though using the stream directly. This allows us to more definitively drop `tractor.open_stream_from()` in the longer run if we so choose as well; note currently this will potentially create an issue if a caller tries to `.send()` on such a one way stream.	2023-01-29 15:03:34 -05:00
Tyler Goodlet	c2367c1c5e	Better `trio`-ize `BroadcastReceiver` internals Driven by a bug found in `piker` where we'd get an inf recursion error due to `BroadcastReceiver.receive()` being called when consumer tasks are awoken but no value is ready to `.nowait_receive()`. This new rework takes an approach closer to the interface and internals of `trio.MemoryReceiveChannel` particularly in terms of, - implementing a `BroadcastReceiver.receive_nowait()` and using it within the async `.receive()`. - failing over to an internal `._receive_from_underlying()` when the `_nowait()` call raises `trio.WouldBlock`. - adding `BroadcastState.statistics()` for debugging and testing dropping recursion from `.receive()`.	2023-01-29 15:03:34 -05:00
Tyler Goodlet	13c9eadc8f	Move result log msg up and drop else block	2023-01-29 14:55:02 -05:00
Tyler Goodlet	aa4871b13d	Call `MsgStream.aclose()` in `Context.open_stream.__aexit__()` We weren't doing this originally I think just because of the path dependent nature of the way the code was developed (originally being mega pedantic about one-way vs. bidirectional streams) but, it doesn't seem like there's any issue just calling the stream's `.aclose()`; also have the benefit of just being less code and logic checks B)	2023-01-29 14:55:02 -05:00
Tyler Goodlet	556f4626db	Tweak warning msg for still-alive-after-cancelled actor	2023-01-29 14:55:02 -05:00
Tyler Goodlet	df01294bb2	Show more functiony syntax in ctx-cancelled log msgs	2023-01-29 14:55:02 -05:00
Tyler Goodlet	ddf3d0d1b3	Show tracebacks for un-shipped/propagated errors	2023-01-29 14:55:02 -05:00
Tyler Goodlet	97d5f7233b	Fix uid2nursery lookup table type annot	2023-01-29 14:55:02 -05:00
Tyler Goodlet	d27c081a15	Ensure arbiter sockaddr type before usage	2023-01-29 14:55:02 -05:00
Tyler Goodlet	a4874a3227	Always set the `parent_exit: trio.Event` on exit	2023-01-29 14:55:02 -05:00
Tyler Goodlet	de04bbb2bb	Don't raise on a broken IPC-context when sending stop msg	2023-01-29 14:55:02 -05:00
Tyler Goodlet	4f977189c0	Handle broken mem chan on `Actor._push_result()` When backpressure is used and a feeder mem chan breaks during msg delivery (usually because the IPC allocating task already terminated) instead of raising we simply warn as we do for the non-backpressure case. Also, add a proper `Actor.is_arbiter` test inside `._invoke()` to avoid doing an arbiter-registry lookup if the current actor is the registrar.	2023-01-29 14:55:02 -05:00
Tyler Goodlet	121a8cc891	Drop `Optional` usage from root mod	2023-01-26 16:00:08 -05:00
Tyler Goodlet	c54b8ca4ba	Begin deprecation of `arbiter_addr` -> `registry_addr`	2023-01-26 16:00:08 -05:00
Tyler Goodlet	5b8a87d0f6	Slightly better `xonsh` check hack, fix typing	2023-01-26 15:48:15 -05:00
Tyler Goodlet	2e278ceb74	Add a super hacky check for `xonsh`, smh..	2023-01-26 15:26:43 -05:00
Tyler Goodlet	dba8118553	Always attempt prompt redraw on ctl-c in REPL The stdlib has all sorts of muckery with ignoring SIGINT in the `Pdb._cmdloop()` but here we just override all that since we don't trust their decisions about cancellation handling whatsoever. Adds a `Lock.repl: MultiActorPdb` attr which is set by any task which acquires root TTY lock indicating (via actor global state) that the current actor is using the debugger REPL and can be expected to re-draw the prompt on SIGINT. Further we mask out log messages from any actor who also has the `shield_sigint_handler()` enabled to avoid logging noise when debugging.	2023-01-26 12:44:13 -05:00
Tyler Goodlet	fca2e7c10e	Simplify closed abruptly log msg	2023-01-26 12:44:13 -05:00
Tyler Goodlet	5ed62c5c54	Add note about intermediary-actor in debug issue	2023-01-26 12:44:13 -05:00
Tyler Goodlet	6c8cacc9d1	Adjust all default is `None` annots (per new `mypy`)	2022-12-12 13:18:22 -05:00
Tyler Goodlet	38326e8c15	Avoid error on context double pops	2022-12-11 23:46:33 -05:00
Tyler Goodlet	b5192cca8e	Always greedily `list`-cast`mngrs` input sequence	2022-12-11 23:20:58 -05:00
Tyler Goodlet	c606be8c64	Passthrough runtime kwargs from `open_actor_cluster()`	2022-12-11 19:56:08 -05:00
Tyler Goodlet	f2641c8964	Avoid "task never called `.started()`" runtime erros when cancelling	2022-10-14 19:42:23 -04:00
Tyler Goodlet	f39414ce12	Drop error-repacking for `.run_in_actor()`s block If we pack the nursery parent task's error into the `errors` table directly in the handler, we don't need to specially handle packing that same error into any exception group raised while handling sub-actor cancellation; drops some ugly indentation ;)	2022-10-14 19:42:23 -04:00
Tyler Goodlet	e298b70edf	Drop added `.pdp()` level msgs used duringn dev	2022-10-14 19:42:23 -04:00
Tyler Goodlet	38f9d35dee	Fix errors table type annot	2022-10-14 19:42:23 -04:00
Tyler Goodlet	88448f7281	Fix handler type annot	2022-10-14 19:42:23 -04:00
Tyler Goodlet	0956d5f461	Restore the `trio` SIGINT handler, cancel root lock tasks on no-peers Pretty sure this is the final touch to alleviate all our debug lock headaches! Instead of trying to revert to the "last" handler (as `pdb` does internally in the stdlib) we always just revert to the handler `trio` registers during startup. Further this seems to allow cancelling the root-side locking task if it's detected as stale IFF we only do this when the root actor is in a "no more IPC peers" state. Deatz: - (always) set `._debug.Lock._trio_handler` as the `trio` version, not some last used handler to make sure we're getting the ctrl-c handling we want when not in debug mode. - assign the trio handler in `open_root_actor()` `._runtime._async_main()` to be sure it's applied in subactors as well as the root. - only do debug lock blocking and root-side-locking-task cancels when a "no peers" condition is detected in the root actor: i.e. no IPC channels are detected by the root meaning it's impossible any actor has a sane lock-state ongoing for debug mode.	2022-10-14 18:18:01 -04:00
Tyler Goodlet	33f2234baf	Hide some stack layers the user doesn't really need to see	2022-10-14 18:18:01 -04:00
Tyler Goodlet	7521bded3d	Pack error from the parent task into the actor nursery	2022-10-14 18:16:51 -04:00
Tyler Goodlet	50fe098e06	First pass, swap `MultiError` for `BaseExceptionGroup`	2022-10-14 18:16:51 -04:00
Tyler Goodlet	98056f6ed7	Move logging context map into `log.py` module	2022-10-12 12:46:20 -04:00
Tyler Goodlet	b81b6be98a	Drop extra log msgs, some old commented code	2022-10-12 12:35:35 -04:00
Tyler Goodlet	fb721f36ef	Support debug-lock blocking, use on no-more IPC This is a lingering debugger locking race case we needed to handle: - child crashes acquires TTY lock in root and attaches to `pdb` - child IPC goes down such that all channels to the root are broken / non-functional. - root is stuck thinking the child is still in debug even though it can't be contacted and the child actor machinery hasn't been cancelled by its parent. - root get's stuck in deadlock with child since it won't send a cancel request until the child is finished debugging, but the child can't unlock the debugger bc IPC is down. To avoid this scenario add debug lock blocking list via `._debug.Lock._blocked: set[tuple]` which holds actor uids for any actor that is detected by the root as having no transport channel connections with said root (of which at least one should exist if this sub-actor at some point acquired the debug lock). The root consequently checks this list for any actor that tries to (re)acquire the lock and blocks with a `ContextCancelled`. When a debug condition is tested in `._runtime._invoke` the context's `._enter_debugger_on_cancel` which is set to `False` if the actor is on the block list in which case the post-mortem entry is skipped. Further this adds a root-locking-task side cancel scope to `Lock._root_local_task_cs_in_debug` which can be cancelled by the root runtime when a stale lock is detected after all IPC channels for the actor have been torn down. NOTE: right now we're NOT doing this since it seems to cause test failures likely due because it may cause pre-mature cancellation and maybe needs a bit more experimenting?	2022-10-11 20:00:05 -04:00
Tyler Goodlet	734d8dd663	Move `trio` scope outside first inter-task-chan receive	2022-10-11 20:00:05 -04:00
Tyler Goodlet	1c480e6c92	Add `Context` cancel message and debug toggle flag In the case of a callee-side context cancelling itself it can be handy to let the caller-side task know (even if through logging) that the cancel was due to some known reason. Make `.cancel()` accept such a message on the callee side and have it included in the `._runtime._invoke()` raised `ContextCancelled` emission. Also add a `Context._trigger_debugger_on_cancel: bool` flag which can be set to `False` to avoid the debugger post-mortem crash mode from engaging on cross-context tasks which cancel themselves for a known reason (as is needed for blocked tasks in the debug TTY-lock machinery).	2022-10-11 20:00:05 -04:00
Tyler Goodlet	44b59f3338	Go back to a `global` single-ton nursery per actor Turns out the lifetime mgmt of separate nurseries per delegate manager is tricky; a new nursery can't be naively allocated on cache-misses since it may get closed by some early terminating task instead of by the "last using" consumer task. In theory if we allocate using the same logic as that used for the last-task-triggers-exit then this should work? For now just go back to a single global nursery per `_Cache` which still avoids use of the internal actor service nursery.	2022-10-09 21:27:23 -04:00
Tyler Goodlet	7a719ac2a7	Use one nursery per unique manager (signature) Instead of sticking all `trionics.maybe_open_context()` tasks inside the actor's (root) service nursery, open a unique one per manager function instance (id). Further, accept a callable for the `key` such that a user can have more flexible control on the caching logic and move the `maybe_open_nursery()` helper out of the portal mod and into this trionics "managers" module.	2022-10-09 21:27:23 -04:00
Tyler Goodlet	d24fae8381	'Rename mp spawn methods to have a `'mp_'` prefix'	2022-10-09 17:54:55 -04:00
Tyler Goodlet	5ab98513b7	Move `@tractor_test` into `conftest.py`	2022-10-09 17:14:20 -04:00
Tyler Goodlet	90f4912580	Organize process spawning into lookup table Instead of the logic branching create a table `._spawn._methods` which is used to lookup the desired backend framework (in this case still only one of `multiprocessing` or `trio`) and make the top level `.new_proc()` do the lookup and any common logic. Use a `typing.Literal` to define the lookup table's key set. Repair and ignore a bunch of type-annot related stuff todo with `mypy` updates and backend-specific process typing.	2022-10-09 16:51:21 -04:00
Tyler Goodlet	15047341bd	Ignore forserver override attrs with `mypy`	2022-10-09 16:14:11 -04:00
Tyler Goodlet	e609183242	Expose lifetime stack as class attr, add base test suite	2022-09-15 23:50:15 -04:00
Tyler Goodlet	10eeda2d2b	Use built-ins for all data-structure-type annotations	2022-09-15 23:41:28 -04:00
Tyler Goodlet	ad19bf2cf1	Remove `tractor.run()` once and for all It's been deprecated for a while now and all docs and tests have been changed. Closes #183	2022-09-15 23:41:28 -04:00
Tyler Goodlet	9aef03772a	Expose `Actor` at pkg level, adjust debug type annots	2022-09-15 23:41:28 -04:00
Tyler Goodlet	7548dba8f2	Change to new doc string style	2022-09-15 23:41:28 -04:00
Tyler Goodlet	208d56af2c	Make `async_main()` a module func	2022-09-15 23:41:28 -04:00
Tyler Goodlet	a3a5bc267e	Make `process_messages()` a mod func	2022-09-15 23:41:28 -04:00
Tyler Goodlet	d4084b2032	Rename our core module to `_runtime`	2022-09-15 23:41:28 -04:00
Tyler Goodlet	bafd10a260	Make `maybe_open_context()` re-entrant safe, use per factory locks	2022-09-15 19:02:02 -04:00
Tyler Goodlet	5ad540c417	Add debug complete event `None`-guard for when already reset	2022-09-15 19:02:02 -04:00
Tyler Goodlet	8f1fe2376a	Simplify all hooks to a common `Lock.release()`	2022-08-02 18:14:05 -04:00
Tyler Goodlet	650313dfef	Drop legacy handler blocks factored into `_acquire_debug_lock()`	2022-08-02 12:50:27 -04:00
Tyler Goodlet	e4006da6f4	Drop `pdbpp` bug notes, add follow up issue #320 note	2022-08-02 12:48:40 -04:00
Tyler Goodlet	7f6169a050	Drop legacy commented/todo remote debug helper block	2022-08-02 12:43:14 -04:00
Tyler Goodlet	02c3b9a672	Put `pygments` back to default	2022-08-02 12:17:34 -04:00
Tyler Goodlet	c5c7a9027c	Line len lint and drop rpc log msg level again	2022-08-02 12:17:34 -04:00
Tyler Goodlet	937ed99e39	Factor sigint overriding into lock methods	2022-08-02 12:17:28 -04:00
Tyler Goodlet	91f034a136	Move all module vars into a `Lock` type	2022-08-02 12:17:28 -04:00
Tyler Goodlet	6f01c78122	Disable `pygments` highlighting on ctlc tests	2022-08-02 12:17:28 -04:00
Tyler Goodlet	c0cd99e374	Timeout on arbiter ping, avoid TCP SYN hangs in CI?	2022-08-02 12:17:28 -04:00
Tyler Goodlet	b01daa5319	Factor lock-state release logic into helper The common logic to both remove our custom SIGINT handler as well as signal the actor global event that pdb is complete. Call this whenever we exit a post mortem call and thus any time some rpc task get's debugged inside `._actor._invoke()`. Further, we have to manually print the REPL prompt on 3.9 for some wack reason, so stick a version guard in the sigint handler for that..	2022-08-02 12:17:28 -04:00
Tyler Goodlet	bd362a05f0	Run release hook around `next` repl commands as well	2022-08-02 12:17:28 -04:00
Tyler Goodlet	b21f2e16ad	Always consider the debugger when exiting contexts When in an uncertain teardown state and in debug mode a context can be popped from actor runtime before a child finished debugging (the case when the parent is tearing down but the child hasn't closed/completed its tty lock IPC exit phase) and the child sends the "stop" message to unlock the debugger but it's ignored bc the parent has already dropped the ctx. Instead we call `._debug.maybe_wait_for_deugger()` before these context removals to avoid the root getting stuck thinking the lock was never released. Further, add special `Actor._cancel_task()` handling code inside `_invoke()` which continues to execute the method despite the IPC channel to the caller being broken and thus avoiding potential hangs due to a target (child) actor task remaining alive.	2022-08-02 12:17:28 -04:00
Tyler Goodlet	ba7b355d9c	Add note about default behaviour of `fancycompleter`	2022-08-02 12:17:28 -04:00
Tyler Goodlet	ef8dc0204c	Just drop all longlisting for now and leave comments	2022-08-02 12:17:28 -04:00
Tyler Goodlet	a101971027	Go back to original longlist code	2022-08-02 12:17:28 -04:00
Tyler Goodlet	835836123b	Just don't call longlist on 3.10+ for now	2022-08-02 12:17:28 -04:00
Tyler Goodlet	b9eb601265	General typing fixes for `mypy`	2022-08-02 12:17:27 -04:00
Tyler Goodlet	4dcc21234e	Only call `.poll()` if a method on the spawn backend	2022-08-02 12:17:27 -04:00
Tyler Goodlet	8b9f342eef	Port to new `.lowlevel.open_process()` API	2022-08-02 12:17:27 -04:00
Tyler Goodlet	a90ca4b384	Call longlist normally when on py < 3.10	2022-08-02 12:17:06 -04:00
Tyler Goodlet	d0dcd55f47	Only report disconnected actors if proc is still alive?	2022-08-02 12:17:06 -04:00
Tyler Goodlet	519f4c300b	I dunno, seems like `breakpoint()` needs this?	2022-08-02 12:17:06 -04:00
Tyler Goodlet	ff3f5959e9	Always enable debug level logging if mode enabled	2022-08-02 12:16:58 -04:00
Tyler Goodlet	abb00531d3	Add help msg for non `__main__` modules as well	2022-08-02 12:16:58 -04:00
Tyler Goodlet	18c525d2f1	Hack around double long list print issue.. See https://github.com/pdbpp/pdbpp/issues/496	2022-08-02 12:16:58 -04:00
Tyler Goodlet	e2453fd3da	Add spaces before values in log msg	2022-08-02 12:16:58 -04:00
Tyler Goodlet	b29def8b5d	Add runtime level msg around channel draining	2022-08-02 12:16:58 -04:00
Tyler Goodlet	f07e9dbb2f	Always undo SIGINT overrides, cancel detached children Ensure that even when `pdb` resumption methods are called during a crash where `trio`'s runtime has already terminated (eg. `Event.set()` will raise) we always revert our sigint handler to the original. Further inside the handler if we hit a case where a child is in debug and (thinks it) has the global pdb lock, if it has no IPC connection to a parent, simply presume tty sync-coordination is now lost and cancel the child immediately.	2022-08-02 12:16:49 -04:00
Tyler Goodlet	c7035be2fc	Tolerate double `.remove()`s of stream on portal teardowns	2022-07-27 11:40:02 -04:00
Tyler Goodlet	deaca7d6cc	Always propagate SIGINT when no locking peer found A hopefully significant fix here is to always avoid suppressing a SIGINT when the root actor can not detect an active IPC connections (via a connected channel) to the supposed debug lock holding actor. In that case it is most likely that the actor has either terminated or has lost its connection for debugger control and there is no way the root can verify the lock is in use; thus we choose to allow KBI cancellation. Drop the (by comment) `try`-`finally` block in `_hijoack_stdin_for_child()` around the `_acquire_debug_lock()` call since all that logic should now be handled internal to that locking manager. Try to catch a weird error around the `.do_longlist()` method call that seems to sometimes break on py3.10 and latest `pdbpp`.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	d47d0e7c37	Always call pdb hook even if tty locking fails	2022-07-27 11:40:02 -04:00
Tyler Goodlet	0062c96a3c	Log cancels with appropriate level	2022-07-27 11:40:02 -04:00
Tyler Goodlet	4be13b7387	Just warn on IPC breaks	2022-07-27 11:40:02 -04:00
Tyler Goodlet	7bb5addd4c	Only warn on `trio.BrokenResourceError`s from `_invoke()`	2022-07-27 11:40:02 -04:00
Tyler Goodlet	89b44f8163	Pre-declare disconnected flag	2022-07-27 11:40:02 -04:00
Tyler Goodlet	2819b6a5b2	Avoid attr error XD	2022-07-27 11:40:02 -04:00
Tyler Goodlet	f2671ed026	Type annot updates	2022-07-27 11:40:02 -04:00
Tyler Goodlet	41924c86a6	Drop uneeded backframe traceback hide annotation	2022-07-27 11:40:02 -04:00
Tyler Goodlet	206c7c0720	Make `Actor._process_messages()` report disconnects The method now returns a `bool` which flags whether the transport died to the caller and allows for reporting a disconnect in the channel-transport handler task. This is something a user will normally want to know about on the caller side especially after seeing a traceback from the peer (if in tree) on console.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	bf0ac3116c	Only cancel/get-result from a ctx if transport is up There's no point in sending a cancel message to the remote linked task and especially no reason to block waiting on a result from that task if the transport layer is detected to be disconnected. We expect that the transport shouldn't go down at the layer of the message loop (reconnection logic should be handled in the transport layer itself) so if we detect the channel is not connected we don't bother requesting cancels nor waiting on a final result message. Why? - if the connection goes down in error the caller side won't have a way to know "how long" it should block to wait for a cancel ack or result and causes a potential hang that may require an additional ctrl-c from the user especially if using the debugger or if the traceback is not seen on console. - obviously there's no point in waiting for messages when there's no transport to deliver them XD Further, add some more detailed cancel logging detailing the task and actor ids.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	74b819a857	Typing fixes, simplify `_set_trace()`	2022-07-27 11:40:02 -04:00
Tyler Goodlet	8892204c84	Add notes around py3.10 stdlib bug from `pdb++` There's a bug that's triggered in the stdlib without latest `pdb++` installed; add a note for that. Further inside `wait_for_parent_stdin_hijack()` don't `.started()` until the interactor stream has been opened to avoid races when debugging this `._debug.py` module (at the least) since we usually don't want the spawning (parent) task to resume until we know for sure the tty lock has been acquired. Also, drop the random checkpoint we had inside `_breakpoint()`, not sure it was actually adding anything useful since we're (mostly) carefully shielded throughout this func.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	8f4bbf1cbf	Add and use a pdb instance factory	2022-07-27 11:40:02 -04:00
Tyler Goodlet	aea8f63bae	Drop all the `@cm.__exit__()` override attempts.. None of it worked (you still will see `.__exit__()` frames on debugger entry - you'd think this would have been solved by now but, shrug) so instead wrap the debugger entry-point in a `try:` and put the SIGINT handler restoration inside `MultiActorPdb` teardown hooks. This seems to restore the UX as it was prior but with also giving the desired SIGINT override handler behaviour.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	7964a9f6f8	Try overriding `_GeneratorContextManager.__exit__()`; didn't work.. Using either of `@pdb.hideframe` or `__tracebackhide__` on stdlib methods doesn't seem to work either.. This all seems to have something to do with async generator usage I think ?	2022-07-27 11:40:02 -04:00
Tyler Goodlet	e5195264a1	Handle a context cancel? Might be a noop	2022-07-27 11:40:02 -04:00
Tyler Goodlet	345573e602	Make `mypy` happy	2022-07-27 11:40:02 -04:00
Tyler Goodlet	4e60c17375	Refine the handler for child vs. root cases This gets very close to avoiding any possible hangs to do with tty locking and SIGINT handling minus a special case that will be detailed below. Summary of implementation changes: - convert `_mk_pdb()` -> `with _open_pdb() as pdb:` which implicitly handles the `bdb.BdbQuit` case such that debugger teardown hooks are always called. - rename the handler to `shield_sigint()` and handle a variety of new cases: * the root is in debug but hasn't been cancelled -> call `Actor.cancel_soon()` * the root is in debug but has been called (`Actor.cancel_soon()` already called) -> raise KBI * a child is in debug and has a task locking the debugger -> ignore SIGINT in child and the root actor. - if the debugger instance is provided to the handler at acquire time, on SIGINT handling completion re-print the last pdb++ REPL output so that the user realizes they are still actively in debug. - ignore the unlock case where a race condition of "no task" holding the lock causes the `RuntimeError` normally associated with the "wrong task" doing so (not sure if this is a `trio` bug?). - change debug logs to runtime level. Unhandled case(s): - a child is maybe in debug mode but does not itself have any task using the debugger. * ToDo: we need a way to decide what to do with "intermediate" child actors who themselves either are not in `debug_mode=True` but have children who are such that a SIGINT won't cause cancellation of that child-as-parent-of-another-child iff any of their children are in in debug mode.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	6b7b58346f	(facepalm) Reraise `BdbQuit` and discard ownerless lock releases	2022-07-27 11:40:02 -04:00
Tyler Goodlet	3cac323421	Add WIP while-debugger-active SIGINT ignore handler	2022-07-27 11:40:02 -04:00
goodboy	4902e184e9	Merge pull request #318 from goodboy/aio_error_propagation Add context test that opens an inter-task-channel that errors	2022-07-15 12:42:19 -04:00
Tyler Goodlet	05790a20c1	Slight lint fixes	2022-07-15 11:18:48 -04:00
Tyler Goodlet	f0d78e1a6e	Use local task ref, fixes `mypy`	2022-07-15 10:39:49 -04:00
Tyler Goodlet	0906559ed9	Drop manual stack construction, fix attr typo	2022-07-14 20:43:17 -04:00
Tyler Goodlet	38d03858d7	Fix `asyncio`-task-sync and error propagation This fixes an previously undetected bug where if an `.open_channel_from()` spawned task errored the error would not be propagated to the `trio` side and instead would fail silently with a console log error. What was most odd is that it only seems easy to trigger when you put a slight task sleep before the error is raised (:eyeroll:). This patch adds a few things to address this and just in general improve iter-task lifetime syncing: - add `LinkedTaskChannel._trio_exited: bool` a flag set from the `trio` side when the channel block exits. - add a `wait_on_aio_task: bool` flag to `translate_aio_errors` which toggles whether to wait the `asyncio` task termination event on exit. - cancel the `asyncio` task if the trio side has ended, when `._trio_exited == True`. - always close the `trio` mem channel when the task exits such that the `asyncio` side can error on any next `.send()` call.	2022-07-14 16:35:41 -04:00
Tyler Goodlet	41983edc43	Use `str` \| `bytes` union for typing msg dump	2022-07-12 11:59:11 -04:00
Tyler Goodlet	5168700fbf	Tolerate non-decode-able bytes	2022-07-12 11:55:55 -04:00
Tyler Goodlet	673c4a8c66	Decode bytes prior to log msg	2022-07-12 11:55:55 -04:00
Tyler Goodlet	932b841176	Allow up to 4 `msgpsec` decode failures	2022-07-12 11:55:55 -04:00
Tyler Goodlet	f594f1bdda	Handle a connection reset on `msgspec` transport	2022-07-12 11:55:55 -04:00
Tyler Goodlet	4e7ab54452	Appease `mypy`	2022-07-12 11:22:30 -04:00
Tyler Goodlet	f94b7cd991	Drop `msgpack` lib and use `msgspec` for transport	2022-07-12 10:37:13 -04:00
Tyler Goodlet	8901272854	Fix typing	2022-04-13 08:20:53 -04:00
Tyler Goodlet	80897a8f2b	Add `tractor.query_actor()` an addr looker-upper Sometimes it's handy to just have a non-`Portal` yielding way to figure out if a "service" actor is up, so add this discovery helper for that. We'll prolly just leave it undocumented for now until we figure out a longer-term/better discovery system.	2022-04-13 07:50:42 -04:00
Tyler Goodlet	f3606d5bd8	Type fixes	2022-04-12 11:48:32 -04:00
Tyler Goodlet	c322a193f2	Make `LinkedTaskChannel` trio-task-broadcastable with `.subscribe()`	2022-04-12 11:42:44 -04:00
Tyler Goodlet	46963c2e63	Don't handle `GeneratorExit` on `asyncio` tasks	2022-04-12 11:42:44 -04:00
Tyler Goodlet	9b77b8c9ee	Add more explicit `asyncio` task error logging When an `asyncio` side task errors or is cancelled we now explicitly report the traceback and task name if possible as well as the source reason for the error (some come from the `trio` side). Further, properly set any `trio` side exception (after unwrapping it from the `outcome.Error`) on the future that runs the `trio` guest run.	2022-04-12 11:42:44 -04:00
Tyler Goodlet	c30cece37a	Fix one missing import/ref	2022-02-17 13:03:37 -05:00
Tyler Goodlet	509082c935	Port to new `msgspec` error type	2022-02-17 11:55:26 -05:00
Tyler Goodlet	75bb1added	Avoid importing mp for as long as possible	2022-02-17 11:55:26 -05:00
Tyler Goodlet	76a0492028	Fix type annot	2022-02-15 08:52:04 -05:00
Tyler Goodlet	4eab4a0213	Type fix	2022-02-15 08:51:25 -05:00
Tyler Goodlet	0edc6a26bc	Go back to strict map keys	2022-02-15 08:48:43 -05:00
Tyler Goodlet	c5acc3b969	Pack tuple keys as . delim strs in registry tests	2022-02-15 08:48:07 -05:00
Tyler Goodlet	17bfa120cc	Port to msgpec `0.4.0` imports	2022-02-14 14:05:55 -05:00
Tyler Goodlet	77ddc073e8	Use lists by default like `msgspec`	2022-02-09 10:07:33 -05:00
Tyler Goodlet	87de28fd88	Slight doc string update	2022-01-30 12:21:41 -05:00
Tyler Goodlet	56b29c27de	Add msg serialization coding todo resources list	2022-01-30 12:19:21 -05:00
Tyler Goodlet	25a27e780d	Add todo resources for eventual capability-based module filtering	2022-01-30 11:28:10 -05:00
Tyler Goodlet	c265f3f94e	Move namespace path type into `msg` mod	2022-01-30 11:27:34 -05:00
Tyler Goodlet	2900ceb003	Not all objects have a `.__name__`	2022-01-30 11:26:34 -05:00
Tyler Goodlet	b6ae77b5ac	Use `pkgutils.resolve_name()` and a `str` subtype Python 3.9's new object resolver + a `str` is much simpler then mucking with tuples (and easier to serialize). Include a `.to_tuple()` formatter since we still are passing the module namespace and function name separately inside the runtime's message format but in theory we might be able to simplify this depending on how we would change the support for `enable_modules:list[str]` in the spawn API. Thanks to @Fuyukai for pointing `resolve_name()` which I didn't know about before!	2022-01-30 11:26:34 -05:00
Tyler Goodlet	949cb2c9fe	First draft "namespace path" named tuple; probably will discard	2022-01-30 11:26:34 -05:00
Tyler Goodlet	7e004c0688	Add back blank `msg.py`	2022-01-29 14:22:15 -05:00
Tyler Goodlet	ffe88de53b	Better idea: start a `tractor.experimental` subpkg	2022-01-29 14:03:55 -05:00
Tyler Goodlet	d29a915d48	Update mod doc string	2022-01-29 14:02:04 -05:00
Tyler Goodlet	be87caa99b	Move legacy pubsub stuff from `msg.py` to trionics mod	2022-01-29 14:02:04 -05:00
Tyler Goodlet	9650055519	Use `.exitcode` which is poll + error handling	2022-01-21 12:49:26 -05:00
Tyler Goodlet	532974fb90	Drop leftover print	2022-01-21 12:49:26 -05:00
Tyler Goodlet	b1d72b77c9	Patch mp procs with a `.poll()` Not sure why they don't already expose this from the `Popen` backends but, k.	2022-01-21 12:49:26 -05:00
Tyler Goodlet	a2171c7e71	Cancel the `.cancel_actor()` request on proc death Adjust the `soft_wait()` strategy to avoid sending needless cancel requests if it is known that a child process is already terminated or does so before the cancel request times out. This should be no slower and should avoid needless waits on either closure-in-progress or already closed channels. Basic strategy is, - request child actor to cancel - if process termination is detected, cancel the cancel - if the process is still alive after a cancel request timeout warn the user and yield back to the hard reap handling	2022-01-21 12:49:26 -05:00
Tyler Goodlet	9b4cdb00e6	Add agpl header	2021-12-17 09:39:30 -05:00
Tyler Goodlet	24078f2d6e	More doc string style tweaks	2021-12-17 09:38:04 -05:00
Tyler Goodlet	56cc98375e	Return channel type from `_run_asyncio_task()` Better encapsulate all the mem-chan, Queue, sync-primitives inside our linked task channel in order to avoid `mypy`'s complaints about monkey patching. This also sets footing for adding an `asyncio`-side channel API that can be used more like this `trio`-side API.	2021-12-17 09:38:04 -05:00
Tyler Goodlet	b69412a903	Drop cancel scope from linked task channel	2021-12-17 09:38:04 -05:00
Tyler Goodlet	6803891bd7	Collect `asyncio` task exceptions to avoid warning msg	2021-12-17 09:38:04 -05:00
Tyler Goodlet	5f4094691d	Re-wrap and raise `asyncio.CancelledError` For whatever reason `trio` seems to be swallowing this exception when raised in the `trio` task so instead wrap it in our own non-base exception type: `AsyncioCancelled` and raise that when the `asyncio` task cancels itself internally using `raise <err> from <src_err>` style. Further don't bother cancelling the `trio` task (via cancel scope) since we we can just use the recv mem chan closure error as a signal and explicitly lookup any set asyncio error.	2021-12-17 09:38:04 -05:00
Tyler Goodlet	c48c68c0bc	Flip doc strings to my preferred format	2021-12-17 09:38:04 -05:00
Tyler Goodlet	44d0e9fc32	Add a `LinkedTaskChannel` for synced inter-loop-streaming Wraps the pairs of underlying `trio` mem chans and the `asyncio.Queue` with this new composite which will be delivered from `open_channel_from()`. This allows for both sending and receiving values from the `asyncio` task (2 way msg passing) as well controls for cancelling or waiting on the task. Factor `asyncio` translation and re-raising logic into a new closure which is run on both `trio` side error handling as well as on normal termination to avoid missing `asyncio` errors even when `trio` task cancellation is handled first. Only close the `trio` mem chans on `trio` task termination iff the task was spawned using `open_channel_from()`: - on `open_channel_from()` exit, mem chan closure is the desired semantic - on `run_task()` we normally only return a single value or error and if the channel is closed before the error is raised we may propagate a `trio.EndOfChannel` instead of the desired underlying `asyncio` task's error	2021-12-17 09:38:04 -05:00
Tyler Goodlet	9bc94b5ccc	Factor error translation into a ctx mngr Pull the common `asyncio` -> `trio` error translation logic into a common context manager and don't expect a final result to be captured when using `open_channel_from()` since it's a manager interface and it would be clunky to try and deliver some "final result" after exit.	2021-12-17 09:38:04 -05:00
Tyler Goodlet	e6687bcdc4	Serious-ify doc string	2021-12-17 09:38:04 -05:00
Tyler Goodlet	8704664719	Reverse the order for asyncio cancelleds? I dunno why	2021-12-17 09:38:04 -05:00
Tyler Goodlet	1114b6980e	Adjust linked-loop-task tear down sequence Close the mem chan before cancelling the `trio` task in order to ensure we retrieve whatever error is shuttled from `asyncio` before the channel read is potentially cancelled (previously a race?). Handle `asyncio.CancelledError` specially such that we raise it directly (instead of `raise aio_cancelled from other_err`) since it is the source error in the case where the cancellation is `asyncio` internal.	2021-12-17 09:38:04 -05:00
Tyler Goodlet	56357242e9	Add a `Portal.cancel_actor()` test	2021-12-17 09:38:04 -05:00
Tyler Goodlet	0ab5e5cadd	Fill out nursery docstring	2021-12-17 09:38:04 -05:00
Tyler Goodlet	06fa650ed0	Drop runtime logging for asyncio mode	2021-12-17 09:38:04 -05:00
Tyler Goodlet	446feff172	Clean type imports	2021-12-17 09:38:04 -05:00
Tyler Goodlet	41eddffc2c	Drop old (and deluded) "streaming" cruft	2021-12-17 09:38:04 -05:00
Tyler Goodlet	7a65165279	Facepalm, re-raise captured `asyncio` task error	2021-12-17 09:38:04 -05:00
Tyler Goodlet	b376b7cd32	First draft: `.to_asyncio.open_channel_from()`	2021-12-17 09:38:04 -05:00
Tyler Goodlet	c262b1a3e8	Always cancel the asyncio task?	2021-12-17 09:38:04 -05:00
Tyler Goodlet	d9dac3f36c	Drop old implementation cruft	2021-12-17 09:38:04 -05:00
Tyler Goodlet	325c0cdb1b	Fix error propagation on asyncio streaming tasks	2021-12-17 09:38:04 -05:00
Tyler Goodlet	55e210fec6	Drop bad .close() call	2021-12-17 09:38:04 -05:00
Tyler Goodlet	aa24bbc11c	Proxy asyncio cancelleds as well	2021-12-17 09:38:04 -05:00
Tyler Goodlet	793bcfb7d4	Pass `infect_asyncio` flag to mp actors as well	2021-12-17 09:38:04 -05:00
Tyler Goodlet	d80f8d7a39	WIP redo asyncio async gen streaming	2021-12-17 09:38:04 -05:00
Tyler Goodlet	340effae11	Add initial infected asyncio error propagation test	2021-12-17 09:38:01 -05:00
Tyler Goodlet	509ae132ec	Raise any asyncio errors if in trio task on cancel	2021-12-17 09:38:01 -05:00
Tyler Goodlet	80f47dece2	Raise from asyncio error; fixes mypy	2021-12-17 09:38:01 -05:00
Tyler Goodlet	2cf87146a3	Log any asyncio error	2021-12-17 09:38:01 -05:00
Tyler Goodlet	8070b16bd0	Support asyncio actors with the trio spawner backend	2021-12-17 09:38:01 -05:00
Tyler Goodlet	1406ddc5ee	Add `infect_asyncio: bool` flag to nursery methods	2021-12-17 09:37:41 -05:00
Tyler Goodlet	055788cf16	Attempt to make mypy happy..	2021-12-17 09:19:23 -05:00
Tyler Goodlet	1825b21d2c	Wow, fix all the broken async func invoking code.. Clearly this wasn't developed against a task that spawned just an async func in `asyncio`.. Fix all that and remove a bunch of unnecessary func layers. Add provisional support for the target receiving the `to_trio` and `from_trio` channels and for the @tractor.stream marker.	2021-12-17 09:19:23 -05:00
Tyler Goodlet	acd63d0c89	First draft "infected `asyncio` mode" This should mostly maintain top level SC principles for any task spawned using `tractor.to_asyncio.run()`. When the `asyncio` task completes make sure to cancel the pertaining `trio` cancel scope and raise any error that may have resulted. This interface uses `trio`'s "guest-mode" to run `asyncio` loop using a special entrypoint which is handed to Python during process spawn.	2021-12-17 09:17:59 -05:00
Tyler Goodlet	98a830ccba	Drop cancel traceback capture; don't seem to need it?	2021-12-16 19:59:10 -05:00
Tyler Goodlet	8c004c1f36	Add an explicit messaging error for reporting an illegal context transaction	2021-12-16 19:59:10 -05:00
Tyler Goodlet	e2139c2bf0	Don't set `Context._error` to expected `ContextCancelled` If the one side of an inter-actor context cancels the other then that side should always expect back a `ContextCancelled` message. However we should not set this error in this case (where the cancel request was sent and a `ContextCancelled` msg was received back) since it may override some other error that caused the cancellation request to be sent out in the first place. As an example when a context opens another context to a peer and some error happens which causes the second peer context to be cancelled but we want to propagate the original error. Fixes the issue found in https://github.com/pikers/piker/issues/244	2021-12-16 19:59:10 -05:00
Tyler Goodlet	5d424e3703	Hide the key error tb on remote starting errors	2021-12-16 19:59:10 -05:00
Tyler Goodlet	da5e36bf0c	Revert back to avoiding key errors on cancellation	2021-12-16 18:02:03 -05:00
Tyler Goodlet	26394dd8df	Type annot fixes	2021-12-16 18:02:03 -05:00
Tyler Goodlet	11e64426f6	Wake all sleeping consumers on bcaster closure	2021-12-16 18:02:03 -05:00
Tyler Goodlet	213447008b	Add draft code for waiting on all nurseries in root	2021-12-16 18:02:03 -05:00
Tyler Goodlet	52627a6326	Rework interface: pass func and kwargs After more extensive testing I realized that keying on the context manager instance id isn't going to work since each entering task is going to create a unique key XD Instead pass the manager function as `acm_func` and optionally allow keying the resource on the passed `kwargs` (if hashable) or the `key:str`. Further, pass the key to the enterer task and avoid a separate keying scheme for the manager versus the value it delivers. Don't bother with checking and releasing the lock in `finally:` block, it should be an error if it's still locked.	2021-12-16 18:02:03 -05:00
Tyler Goodlet	3826bc9972	Don't catch key errors from the yielded to scope	2021-12-16 18:02:03 -05:00
Tyler Goodlet	b210278e2f	Naming change `cache` -> `_Cache`	2021-12-16 18:02:03 -05:00
Tyler Goodlet	ac22b4a875	Fix type annots in resource cacher internals	2021-12-16 18:02:03 -05:00
Tyler Goodlet	5f41dbf34f	Add `maybe_open_context()` an actor wide task-resource cache	2021-12-16 18:02:03 -05:00
Tyler Goodlet	57f2aca18c	Set eoc on closure (again)	2021-12-16 16:19:15 -05:00
Tyler Goodlet	f2ba961e81	Mark stream with EOC when stop message is received	2021-12-16 16:18:58 -05:00
Tyler Goodlet	3deb1b91e6	Wake all broadcast consumers on EOC Without this wakeup you can have tasks which re-enter `.receive()` and get stuck waiting on the wakeup event indefinitely. Whenever a ``trio.EndOfChannel`` arrives we want to make sure all consumers at least know about it and don't block. This previous behaviour was basically a bug. Add some state flags for tracking if the broadcaster was either cancelled or terminated via EOC mostly for testing and debugging purposes though this info might be useful if we decide to offer a `.statistics()` like API in the future.	2021-12-16 16:16:14 -05:00
Tyler Goodlet	61e134dc5d	Wake up consumers on end of channel as well	2021-12-16 16:15:54 -05:00
Tyler Goodlet	6f94ffc304	Re-license code base for distribution under AGPL This commit obviously denotes a re-license of all applicable parts of the code base. Acknowledgement of this change was completed in #274 by the majority of the current set of contributors. From here henceforth all changes will be AGPL licensed and distributed. This is purely an effort to maintain the same copy-left policy whilst closing the (perceived) SaaS loophole the GPL allows for. It is merely for this loophole: to avoid code hiding by any potential "network providers" who are attempting to use the project to make a profit without either compensating the authors or re-distributing their changes. I thought quite a bit about this change and can't see a reason not to close the SaaS loophole in our current license. We still are (hard) copy-left and I plan to keep the code base this way for a couple reasons: - The code base produces income/profit through parent projects and is demonstrably of high value. - I believe firms should not get free lunch for the sake of "contributions from their employees" or "usage as a service" which I have found to be a dubious argument at best. - If a firm who intends to profit from the code base wants to use it they can propose a secondary commercial license to purchase with the proceeds going to the project's authors under some form of well defined contract. - Many successful projects like Qt use this model; I see no reason it can't work in this case until such a time as the authors feel it should be loosened. There has been detailed discussion in #103 on licensing alternatives. The main point of this AGPL change is to protect the code base for the time being from exploitation while it grows and as we move into the next phase of development which will include extension into the multi-host distributed software space.	2021-12-14 23:33:27 -05:00
Tyler Goodlet	a38a983225	Increase debugger poll delay back to prior value If we make it too fast a nursery with debug mode children can cancel too fast and causes some test failures. It's likely not a huge deal anyway since the purpose of this poll/check is for human interaction and the current delay isn't really that noticeable. Decrease log levels in the debug module to avoid console noise when in use. Toss in some more detailed comments around the new debugger lock points.	2021-12-10 11:54:27 -05:00
Tyler Goodlet	9bee513136	Use manual debugger-in-use flag in nursery and spawn task	2021-12-09 17:53:29 -05:00
Tyler Goodlet	5d9e3d1163	Add a manual debug mode kwarg to debugger waiter	2021-12-09 17:52:35 -05:00
Tyler Goodlet	92c6ec1882	`get_loglevel()` always returns a str	2021-12-07 13:17:00 -05:00
Tyler Goodlet	72eef2a4a1	Config debug mode log level after initial setup	2021-12-07 13:16:07 -05:00
Tyler Goodlet	9bd5226e76	Only adjust logging in debug mode if not noisy enough already	2021-12-07 13:13:04 -05:00
Tyler Goodlet	e899cc42bf	Add per actor debug mode toggle	2021-12-07 13:11:06 -05:00
Tyler Goodlet	4856285dee	Add back broken send chan ignore block	2021-12-06 17:04:17 -05:00
Tyler Goodlet	4b40599c48	Fix ignore warning log message	2021-12-06 16:32:23 -05:00
Tyler Goodlet	c9132de7dc	Move maybe-raise-error-msg logic into context A context method handling all this logic makes the most sense since it contains all the state related to whether the error should be raised in a nursery scope or is expected to be raised by a consumer task which reads and processes the msg directly (via a `Portal` API call). This also makes it easy to always process remote errors even when there is no (stream) overrun condition.	2021-12-06 16:32:23 -05:00
Tyler Goodlet	1f8e1cccbb	Only pop contexts on decorated entrypoints	2021-12-06 13:48:19 -05:00
Tyler Goodlet	318027ebd1	Raise stream overruns on one side never opened A context stream overrun should normally never take place since if a stream is opened (via ``Context.open_stream()``) backpressure is applied on the message buffer (unless explicitly disabled by the ``backpressure=False`` flag) such that an overrun on the receiving task should result in blocking the (remote) sender task (eventually depending on the underlying ``MsgStream`` transport). Here we add a special error message that reports if one side never opened a stream and let's the user know in the overrun error message that they may be trying to push messages to a task that isn't ready to receive them. Further fixes / details: - pop any `Context` at the end of any `_invoke()` task that creates one and registers with the runtime. - ignore but warn about messages received for a context that either no longer exists or is unknown (guarding against crashes by malicious packets in the latter case)	2021-12-06 11:54:21 -05:00
Tyler Goodlet	b826ec8103	Better idea, enable backpressure on opened streams Keeping it disabled on context open will help with detecting any stream connection which was never opened on one side of the task pair. In that case we can report that there was an overrun and a stream wasn't opened versus if the stream is explicitly configured not to use bp then we throw the standard overflow. Use `trio.Nursery._closed` to detect "closure" XD since it seems to be the most reliable way to determine if a spawn call will trigger a runtime error.	2021-12-06 11:54:21 -05:00
Tyler Goodlet	4ea5c9b5db	Pop context on `.open_context()` exit	2021-12-06 11:54:21 -05:00
Tyler Goodlet	41a3e6a9ca	Type check fixes	2021-12-05 20:00:58 -05:00
Tyler Goodlet	185dbc7e3f	Disable msg stream backpressure by default Half of portal API usage requires a 1 message response (`.run()`, `.run_in_actor()`) and the streaming APIs should probably be explicitly enabled for backpressure if desired by the user. This makes more sense in (psuedo) realtime systems where it's better to notify on a block then freeze without notice. Make this default behaviour with a new error to be raised: `tractor._exceptions.StreamOverrun` when a sender overruns a stream by the default size (2**6 for now). The old behavior can be enabled with `Context.open_stream(backpressure=True)` but now with warning log messages when there are overruns. Add task-linked-context error propagation using a "nursery raising" technique such that if either end of context linked pair of tasks errors, that error can be relayed to other side and raised as a form of interrupt at the receiving task's next `trio` checkpoint. This enables reliable error relay without expecting the (error) receiving task to call an API which would raise the remote exception (which it might never currently if using `tractor.MsgStream` APIs). Further internal implementation details: - define the default msg buffer size as `Actor.msg_buffer_size` - expose a `msg_buffer_size: int` kwarg from `Actor.get_context()` - maybe raise aforementioned context errors using `Context._maybe_error_from_remote_msg()` inside `Actor._push_result()` - support optional backpressure on a stream when pushing messages in `Actor._push_result()` - in `_invote()` handle multierrors raised from a `@tractor.context` entrypoint as being potentially caused by a relayed error from the remote caller task, if `Context._error` has been set then raise that error inside the `RemoteActorError` that will be relayed back to that caller more or less proxying through the source side error back to its origin.	2021-12-05 19:31:41 -05:00
Tyler Goodlet	2680a9473d	Always set `Context._portal` on the caller task side	2021-12-05 19:28:00 -05:00
Tyler Goodlet	92b540d518	Add internal msg stream backpressure controls In preparation for supporting both backpressure detection (through an optional error) as well as control over the msg channel buffer size, add internal configuration flags for both to contexts. Also adjust `Context._err_on_from_remote_msg()` -> `._maybe..` such that it can be called and will only raise if a scope nursery has been set. Add a `Context._error` for stashing the remote task's error that may be delivered in an `'error'` message.	2021-12-05 19:19:53 -05:00
Tyler Goodlet	6751349987	Add a stream overrun exception	2021-12-05 18:28:02 -05:00
Tyler Goodlet	d307eab118	Rework `Actor.send_cmd()` to `.start_remote_task()` This more formally declares the runtime's remote task startingn API and uses it throughout all the dependent `Portal` API methods. Allows dropping `Portal._submit()` and simplifying `.run_in_actor()` style result waiting to be delegated to the context APIs at remote task `return` response time. We now also track the remote entrypoint "type` as `Context._remote_func_type`.	2021-12-04 18:20:43 -05:00
Tyler Goodlet	c5c3f7e789	Use `tractor.Context` throughout the runtime core Instead of tracking feeder mem chans per RPC dialog, store `Context` instances which (now) hold refs to the underlying RPC-task feeder chans and track them inside a `Actor._contexts` map. This begins a transition to making the "context" idea the primitive abstraction for representing messaging dialogs between tasks in different memory domains (i.e. usually separate processes). A slew of changes made this possible: - change `Actor.get_memchans()` -> `.get_context()`. - Add new `Context._send_chan` and `._recv_chan` vars. - implicitly create a new context on every `Actor.send_cmd()` call. - use the context created by `.send_cmd()` in `Portal.open_context()` instead of manually creating one. - call `Actor.get_context()` inside tasks run from `._invoke()` such that feeder chans are implicitly created for callee tasks thus fixing the bug #265. NB: We might change some of the internal semantics to do with when the feeder chans are actually created to denote whether or not a far end task is actually read to receive messages. For example, in the cases where it never will be ready to receive messages (one-way streaming, a context that never opens a stream, etc.) we will likely want some kind of error or at least warning to the caller that messages can't be sent (yet).	2021-12-03 14:49:55 -05:00
Tyler Goodlet	f4793af2b9	Error on mal-use of `Context.started()` Previously we were ignoring a race where the callee an opened task context could enter `Context.open_stream()` before calling `.started(). Disallow this as well as calling `.started()` more then once.	2021-12-03 10:08:55 -05:00
Tyler Goodlet	08e9593306	Suppress broken resources errors in `Portal.cancel_actor()`	2021-12-02 15:29:04 -05:00
Tyler Goodlet	14f84571fb	Don't cancel receive streams inside `.cancel_actor()` We don't need to any more presuming you get ideal remote cancellation conditions where the remote actor should teardown and kill the streams from its end.	2021-12-02 15:29:04 -05:00
Tyler Goodlet	e561a4908f	Appease mypy	2021-12-02 15:29:04 -05:00
Tyler Goodlet	46070f99de	Factor soft-wait logic into a helper, use with mp	2021-12-02 08:18:04 -05:00

... 4 5 6 7 8 ...

1020 Commits (2f854a3e86d898045c3bbf093e0df14e89a2b339)