tractor

Commit Graph

Author	SHA1	Message	Date
Tyler Goodlet	1d7cf7d1dd	Enable `stackscope` render via root in debug mode If `stackscope` is importable and debug_mode is enabled then we by default call and report `.devx.enable_stack_on_sig()` is set B) This makes debugging unexpected (SIGINT ignoring) hangs a cinch!	2024-02-20 13:23:16 -05:00
Tyler Goodlet	54a0a0000d	.log: more multi-line styling	2024-02-20 13:22:44 -05:00
Tyler Goodlet	0268b2ce91	Better subproc supervisor logging, todo for #320 Given i just similarly revamped a buncha `._runtime` log msg formatting, might as well do something similar inside the spawning machinery such that groking teardown sequences of each supervising task is much more sane XD Mostly this includes doing similar `'<field>: <value>\n'` multi-line formatting when reporting various subproc supervision steps as well as showing a detailed `trio.Process.__repr__()` as appropriate. Also adds a detailed #TODO according to the needs of #320 for which we're going to need some internal mechanism for intermediary parent actors to determine if a given debug tty locker (sub-actor) is one of their (transitive) children and thus stall the normal cancellation/teardown sequence until that locker is complete.	2024-02-20 13:12:51 -05:00
Tyler Goodlet	81f8e2d4ac	_supervise: iter nice expanded multi-line `._children` tups with typing	2024-02-20 09:18:22 -05:00
Tyler Goodlet	bf0739c194	Add `stackscope` tree pprinter triggered by SIGUSR1 Can be optionally enabled via a new `enable_stack_on_sig()` which will swap in the SIGUSR1 handler. Much thanks to @oremanj for writing this amazing project, it's thus far helped me fix some very subtle hangs inside our new IPC-context cancellation machinery that would have otherwise taken much more manual pdb-ing and hair pulling XD Full credit for `dump_task_tree()` goes to the original project author with some minor tweaks as was handed to me via the trio-general matrix room B) Slight changes from orig version: - use a `log.pdb()` emission to pprint to console - toss in an ex sh CLI cmd to trigger the dump from another terminal using `kill` + `pgrep`.	2024-02-20 09:05:34 -05:00
Tyler Goodlet	3e1d033708	WIP: solved the modden client hang..	2024-02-19 17:00:46 -05:00
Tyler Goodlet	c35576e196	Baboso! fix `chan.send(None)` indent..	2024-02-19 14:41:03 -05:00
Tyler Goodlet	8ce26d692f	Improved log msg formatting in core As part of solving some final edge cases todo with inter-peer remote cancellation (particularly a remote cancel from a separate actor tree-client hanging on the request side in `modden`..) I needed less dense, more line-delimited log msg formats when understanding ipc channel and context cancels from console logging; this adds a ton of that to: - `._invoke()` which now does, - better formatting of `Context`-task info as multi-line `'<field>: <value>\n'` messages, - use of `trio.Task` (from `.lowlevel.current_task()` for full rpc-func namespace-path info, - better "msg flow annotations" with `<=` for understanding `ContextCancelled` flow. - `Actor._stream_handler()` where in we break down IPC peers reporting better as multi-line `\|_<Channel>` log msgs instead of all jammed on one line.. - `._ipc.Channel.send()` use `pformat()` for repr of packet. Also tweak some optional deps imports for debug mode: - add `maybe_import_gb()` for attempting to import `greenback`. - maybe enable `stackscope` tree pprinter on `SIGUSR1` if installed. Add a further stale-debugger-lock guard before removal: - read the `._debug.Lock.global_actor_in_debug: tuple` uid and possibly `maybe_wait_for_debugger()` when the child-user is known to have a live process in our tree. - only cancel `Lock._root_local_task_cs_in_debug: CancelScope` when the disconnected channel maps to the `Lock.global_actor_in_debug`, though not sure this is correct yet? Started adding missing type annots in sections that were modified.	2024-02-19 14:00:23 -05:00
Tyler Goodlet	7f29fd8dcf	Let `pack_error()` take a msg injected `cid: str\|None`	2024-02-18 17:17:31 -05:00
Tyler Goodlet	7fbada8a15	Add `StreamOverrun.sender: tuple` for better handling Since it's generally useful to know who is the cause of an overrun (say bc you want your system to then adjust the writer side to slow tf down) might as well pack an extra `.sender: tuple[str, str]` actor uid field which can be relayed through `RemoteActorError` boxing. Add an extra case for the exc-type to `unpack_error()` to match B)	2024-02-16 15:23:02 -05:00
Tyler Goodlet	286e75d342	Offer `unpack_error(hid_tb: bool)` for `pdbp` REPL config	2024-02-14 16:13:32 -05:00
Tyler Goodlet	df641d9d31	Bring in pretty-ified `msgspec.Struct` extension Originally designed and used throughout `piker`, the subtype adds some handy pprinting and field diffing extras often handy when viewing struct types in logging or REPL console interfaces B) Obvi this rejigs the `tractor.msg` mod into a sub-pkg and moves the existing namespace obj-pointer stuff into a new `.msg.ptr` sub mod.	2024-01-28 16:33:10 -05:00
Tyler Goodlet	35b0c4bef0	Never mask original `KeyError` in portal-error unwrapper, for now?	2024-01-23 11:14:10 -05:00
Tyler Goodlet	c4496f21fc	Try allowing multi-pops of `_Cache.locks` for now?	2024-01-23 11:13:07 -05:00
Tyler Goodlet	7e0e627921	Use `import <blah> as blah` over `__all__` in `.trionics`	2024-01-23 11:09:38 -05:00
Tyler Goodlet	0294455c5e	`_root`: drop unused `typing` import	2024-01-02 18:43:43 -05:00
Tyler Goodlet	734bc09b67	Move missing-key-in-msg raiser to `._exceptions` Since we use basically the exact same set of logic in `Portal.open_context()` when expecting the first `'started'` msg factor and generalize `._streaming._raise_from_no_yield_msg()` into a new `._exceptions._raise_from_no_key_in_msg()` (as per the lingering todo) which obvi requires a more generalized / optional signature including a caller specific `log` obj. Obvi call the new func from all the other modules X)	2024-01-02 18:34:15 -05:00
Tyler Goodlet	0bcdea28a0	Fmt repr as multi-line style call	2024-01-02 11:28:55 -05:00
Tyler Goodlet	fdf3a1b01b	Only use `greenback` if actor-runtime is up..	2024-01-02 11:28:02 -05:00
Tyler Goodlet	ce7b8a5e18	Drop unused walrus assign of `re`	2024-01-02 11:21:20 -05:00
Tyler Goodlet	00024181cd	`StackLevelAdapter._log(stacklevel: int)` for custom levels.. Apparently (and i don't know if this was always broken [i feel like no?] or is a recent change to stdlib's `logging` stuff) we need increment the `stacklevel` input by one for our custom level methods now? Without this you're going to see the path to the method's-callstack-frame on every emission instead of to the caller's. I first noticed this when debugging the workspace layer spawning in `modden.bigd` and then verified it in other depended projects.. I guess we should add some tests for this as well XD	2024-01-02 10:38:04 -05:00
Tyler Goodlet	814384848d	Use `import <name> as <name>,` style over `__all__` in pkg mod	2024-01-02 10:25:17 -05:00
Tyler Goodlet	bea31f6d19	._child: remove some unused imports..	2024-01-02 10:24:39 -05:00
Tyler Goodlet	250275d98d	Guarding for IPC failures in `._runtime._invoke()` Took me longer then i wanted to figure out the source of a failed-response to a remote-cancellation (in this case in `modden` where a client was cancelling a workspace layer.. but disconnects before receiving the ack msg) that was triggering an IPC error when sending the error msg for the cancellation of a `Actor._cancel_task()`, but since this (non-rpc) `._invoke()` task was trying to send to a now disconnected canceller it was resulting in a `BrokenPipeError` (or similar) error. Now, we except for such IPC errors and only raise them when, 1. the transport `Channel` is for sure up (bc ow what's the point of trying to send an error on the thing that caused it..) 2. it's definitely for handling an RPC task Similarly if the entire main invoke `try:` excepts, - we only hide the call-stack frame from the debugger (with `__tracebackhide__: bool`) if it's an RPC task that has a connected channel since we always want to see the frame when debugging internal task or IPC failures. - we don't bother trying to send errors to the context caller (actor) when it's a non-RPC request since failures on actor-runtime-internal tasks shouldn't really ever be reported remotely, only maybe raised locally. Also some other tidying, - this properly corrects for the self-cancel case where an RPC context is cancelled due to a local (runtime) task calling a method like `Actor.cancel_soon()`. We now set our own `.uid` as the `ContextCancelled.canceller` value so that other-end tasks know that the cancellation was due to a self-cancellation by the actor itself. We still need to properly test for this though! - add a more detailed module doc-str. - more explicit imports for `trio` core types throughout.	2024-01-02 10:23:45 -05:00
Tyler Goodlet	f415fc43ce	`.discovery.get_arbiter()`: add warning around this now deprecated usage	2023-12-11 19:37:45 -05:00
Tyler Goodlet	3f15923537	More thurough hard kill doc strings	2023-12-11 18:17:42 -05:00
Tyler Goodlet	87cd725adb	Add `open_root_actor(ensure_registry: bool)` Allows forcing the opened actor to either obtain the passed registry addrs or raise a runtime error.	2023-11-07 16:45:24 -05:00
Tyler Goodlet	48accbd28f	Fix doc string "its" typo..	2023-11-06 15:44:21 -05:00
Tyler Goodlet	227c9ea173	Test with `any(portals)` since `gather_contexts()` will return `list[None \| tuple]`	2023-11-06 15:43:43 -05:00
Tyler Goodlet	f4e63465de	Tweak `Channel._cancel_called` comment	2023-10-23 17:47:55 -04:00
Tyler Goodlet	df31047ecb	Be ultra-correct in `Portal.open_context()` This took way too long to get right but hopefully will give us grok-able and correct context exit semantics going forward B) The main fixes were: - always shielding the `MsgStream.aclose()` call on teardown to avoid bubbling a `Cancelled`. - properly absorbing any `ContextCancelled` in cases due to "self cancellation" using the new `Context.canceller` in the logic. - capturing any error raised by the `Context.result()` call in the "normal exit, result received" case and setting it as the `Context._local_error` so that self-cancels can be easily measured via `Context.cancelled_caught` in same way as remote-error caused cancellations. - extremely detailed comments around all of the cancellation-error cases to avoid ever getting confused about the control flow in the future XD	2023-10-23 17:34:28 -04:00
Tyler Goodlet	131674eabd	Be mega-pedantic with `ContextCancelled` semantics As part of extremely detailed inter-peer-actor testing, add much more granular `Context` cancellation state tracking via the following (new) fields: - `.canceller: tuple[str, str]` the uuid of the actor responsible for the cancellation condition - always set by `Context._maybe_cancel_and_set_remote_error()` and replaces `._cancelled_remote` and `.cancel_called_remote`. If set, this value should normally always match a value from some `ContextCancelled` raised or caught by one side of the context. - `._local_error` which is always set to the locally raised (and caller or callee task's scope-internal) error which caused any eventual cancellation/error condition and thus any closure of the context's per-task-side-`trio.Nursery`. - `.cancelled_caught: bool` is now always `True` whenever the local task catches (or "silently absorbs") a `ContextCancelled` (a `ctxc`) that indeed originated from one of the context's linked tasks or any other context which raised its own `ctxc` in the current `.open_context()` scope. => whenever there is a case that no `ContextCancelled` was raised in the `.open_context().__aexit__()` (eg. `ctx.result()` called after a call `ctx.cancel()`), we still consider the context's as having "caught a cancellation" since the `ctxc` was indeed silently handled by the cancel requester; all other error cases are already represented by mirroring the state of the `._scope: trio.CancelScope` => IOW there should be no case where an error is not raised in the context's scope and `.cancelled_caught: bool == False`, i.e. no case where `._scope.cancelled_caught == False and ._local_error is not None`! - always raise any `ctxc` from `.open_stream()` if `._cancel_called == True` - if the cancellation request has not already resulted in a `._remote_error: ContextCancelled` we raise a `RuntimeError` to indicate improper usage to the guilty side's task code. - make `._maybe_raise_remote_err()` a sync func and don't raise any `ctxc` which is matched against a `.canceller` determined to be the current actor, aka a "self cancel", and always set the `._local_error` to any such `ctxc`. - `.side: str` taken from inside `.cancel()` and unused as of now since it might be better re-written as a similar `.is_opener() -> bool`? - drop unused `._started_received: bool`.. - TONS and TONS of detailed comments/docs to attempt to explain all the possible cancellation/exit cases and how they should exhibit as either silent closes or raises from the `Context` API! Adjust the `._runtime._invoke()` code to match: - use `ctx._maybe_raise_remote_err()` in `._invoke()`. - adjust to new `.canceller` property. - more type hints. - better `log.cancel()` msging around self-cancels vs. peer-cancels. - always set the `._local_error: BaseException` for the "callee" task just like `Portal.open_context()` now will do B) Prior we were raising any `Context._remote_error` directly and doing (more or less) the same `ContextCancelled` "absorbing" logic (well kinda) in block; instead delegate to the method	2023-10-23 16:24:54 -04:00
Tyler Goodlet	5a94e8fb5b	Raise a `MessagingError` from the src error on msging edge cases	2023-10-23 14:34:12 -04:00
Tyler Goodlet	0518b3ab04	Move `MessagingError` into `._exceptions` set	2023-10-23 14:17:36 -04:00
Tyler Goodlet	2f0bed3018	Ignore `greenback` import error if not installed	2023-10-19 12:41:15 -04:00
Tyler Goodlet	9da3b63644	Change remaining internals to use `Actor.reg_addrs`	2023-10-19 12:40:37 -04:00
Tyler Goodlet	1d6f55543d	Expose per-actor registry addrs via `.reg_addrs` Since it's handy to be able to debug the writing of this instance var (particularly when checking state passed down to a child in `Actor._from_parent()`), rename and wrap the underlying `Actor._reg_addrs` as a settable `@property` and add validation to the `.setter` for sanity - actor discovery is a critical functionality. Other tweaks: - fix `.cancel_soon()` to pass expected argument.. - update internal runtime error message to be simpler and link to GH issues. - use new `Actor.reg_addrs` throughout core.	2023-10-19 12:38:27 -04:00
Tyler Goodlet	42d621bba7	Always dynamically re-read the `._root._default_lo_addrs` value in `find_actor()`	2023-10-18 19:10:04 -04:00
Tyler Goodlet	2e81ccf5b4	Dump `.msgdata` in `RemoteActorError.__repr__()`	2023-10-18 19:09:07 -04:00
Tyler Goodlet	022bf8ce75	Ensure `registry_addrs` is always set to something	2023-10-18 19:08:35 -04:00
Tyler Goodlet	190845ce1d	Add masked super timeout line to `do_hard_kill()` for would-be runtime hackers	2023-10-18 15:29:43 -04:00
Tyler Goodlet	0c74b04c83	Facepalm, `wait_for_actor()` dun take an addr `list`..	2023-10-18 15:22:54 -04:00
Tyler Goodlet	215fec1d41	Change old `._debug._pause()` name, cherry to #362 re `greenback`	2023-10-18 15:01:04 -04:00
Tyler Goodlet	fcc8cee9d3	._root: set a `_default_lo_addrs` and apply it when not provided by caller	2023-10-18 14:12:58 -04:00
Tyler Goodlet	87c1113de4	Always set default reg addr in `find_actor()` if not defined	2023-10-18 13:20:29 -04:00
Tyler Goodlet	43b659dbe4	Tidy/clarify another `._runtime` comment	2023-10-18 13:19:34 -04:00
Tyler Goodlet	63b1488ab6	Get mega-pedantic in `Portal.open_context()` Specifically in the `.__aexit__()` phase to ensure remote, runtime-internal, and locally raised error-during-cancelled-handling exceptions are NEVER masked by a local `ContextCancelled` or any exception group of `trio.Cancelled`s. Also adds a ton of details to doc strings including extreme detail surrounding the `ContextCancelled` raising cases and their processing inside `.open_context()`'s exception handler blocks. Details, details: - internal rename `err`/`_err` stuff to just be `scope_err` since it's effectively the error bubbled up from the context's surrounding (and cross-actor) "scope". - always shield `._recv_chan.aclose()` to avoid any `Cancelled` from masking the `scope_err` with a runtime related `trio.Cancelled`. - explicitly catch the specific set of `scope_err: BaseException` that we can reasonably expect to handle instead of the catch-all parent type including exception groups, cancels and KBIs.	2023-10-18 13:18:29 -04:00
Tyler Goodlet	7eb31f3fea	Runtime import `.get_root()` in stdin hijacker to avoid import cycle	2023-10-17 16:52:31 -04:00
Tyler Goodlet	534e5d150d	Drop `msg` kwarg from `Context.cancel()` Well first off, turns out it's never used and generally speaking doesn't seem to help much with "runtime hacking/debugging"; why would we need to "fabricate" a msg when `.cancel()` is called to self-cancel? Also (and since `._maybe_cancel_and_set_remote_error()` now takes an `error: BaseException` as input and thus expects error-msg unpacking prior to being called), we now manually set `Context._cancel_msg: dict` just prior to any remote error assignment - so any case where we would have fabbed a "cancel msg" near calling `.cancel()`, just do the manual assign. In this vein some other subtle changes: - obviously don't set `._cancel_msg` in `.cancel()` since it's no longer an input. - generally do walrus-style `error := unpack_error()` before applying and setting remote error-msg state. - always raise any `._remote_error` in `.result()` instead of returning the exception instance and check before AND after the underlying mem chan read. - add notes/todos around `raise self._remote_error from None` masking of (runtime) errors in `._maybe_raise_remote_err()` and use it inside `.result()` since we had the inverse duplicate logic there anyway.. Further, this adds and extends a ton of (internal) interface docs and details comments around the `Context` API including many subtleties pertaining to calling `._maybe_cancel_and_set_remote_error()`.	2023-10-17 16:50:52 -04:00
Tyler Goodlet	e4a6223256	`._exceptions`: typing and error unpacking updates Bump type annotations to 3.10+ style throughout module as well as fill out doc strings a bit. Inside `unpack_error()` pop any `error_dict: dict` and, - return `None` early if not found, - versus pass directly as `**error_dict` to the error constructor instead of a double field read.	2023-10-16 16:23:30 -04:00
Tyler Goodlet	ab2664da70	Runtime level log on debug REPL exits	2023-10-16 15:46:21 -04:00
Tyler Goodlet	ae326cbb9a	Ignore kbis in `open_crash_handler()` by default	2023-10-16 15:45:34 -04:00
Tyler Goodlet	07cec02303	Add comments around diff between `C/context` refs	2023-10-16 15:45:02 -04:00
Tyler Goodlet	2fdb8fc25a	Factor non-yield stream msg processing into helper Since both `MsgStream.receive()` and `.receive_nowait()` need the same raising logic when a non-stream msg arrives (so that maybe an appropriate IPC translated error can be raised) move the `KeyError` handler code into a new `._streaming._raise_from_no_yield_msg()` func and call it from both methods to make the error-interface-raising symmetrical across both methods.	2023-10-16 15:35:16 -04:00
Tyler Goodlet	6d951c526a	Comment all `.pause(shield=True)` attempts again, need to solve cancel scope `.__exit__()` frame hiding issue..	2023-10-10 09:55:11 -04:00
Tyler Goodlet	575a24adf1	Always raise remote (cancelled) error if set Previously we weren't raising a remote error if the local scope was cancelled during a call to `Context.result()` which is problematic if the caller WAS NOT the requester for said remote cancellation; in that case we still want a `ContextCancelled` raised with the `.canceller: str` set to the cancelling actor uid. Further fix a naming bug where the (seemingly older) `._remote_err` was being set to such an error instead of `._remote_error` XD	2023-10-10 09:45:49 -04:00
Tyler Goodlet	919e462f88	Write more comprehensive `Portal.cancel_actor()` doc str	2023-10-08 15:57:18 -04:00
Tyler Goodlet	a09b8560bb	Oof, default reg addrs needs to be in `list[tuple]` form..	2023-10-07 18:52:37 -04:00
Tyler Goodlet	d24a9e158f	Msg-ified `ContextCancelled`s sub-error type should always be just, its type..	2023-10-07 18:51:03 -04:00
Tyler Goodlet	18a1634025	Add shielding support to `.pause()` Implement it like you'd expect using simply a wrapping `trio.CancelScope` which is itself shielded by the input `shield: bool` B) There's seemingly still some issues with the frame selection when the REPL engages and not sure how to resolve it yet but at least this does indeed work for practical purposes. Still needs a test obviously!	2023-10-06 15:49:23 -04:00
Tyler Goodlet	4314a59327	Add post-mortem catch around failed transport addr binds to aid with runtime debugging	2023-10-03 10:54:46 -04:00
Tyler Goodlet	e94f1261b5	Move `maybe_open_crash_handler()` CLI `--pdb`-driven wrapper to debug mod	2023-10-02 18:10:34 -04:00
Tyler Goodlet	86da79a854	Rename to `parse_maddr()` and fill out doc strings	2023-09-29 14:49:18 -04:00
Tyler Goodlet	de89e3a9c4	Add libp2p style "multi-address" parser from `piker` Details are in the module docs; this is a first draft with lotsa room for refinement and extension.	2023-09-29 14:11:31 -04:00
Tyler Goodlet	7bed470f5c	Start `.devx.cli` extensions for pop CLI frameworks Starting of with just a `typer` (and thus transitively `click`) `typer.Typer.callback` hook which allows passthrough of the `--ll <loglevel: str>` and `--pdb <debug_mode: bool>` flags for use when building CLIs that use the runtime Bo Still needs lotsa refinement and obviously better docs but, the doc string for `load_runtime_vars()` shows how to use the underlying `.devx._debug.open_crash_handler()` via a wrapper that can be passed the `--pdb` flag and then enable debug mode throughout the entire actor system.	2023-09-28 15:36:24 -04:00
Tyler Goodlet	fa9a9cfb1d	Kick off `.devx` subpkg for our dev tools B) Where `.devx` is "developer experience", a hopefully broad enough subpkg name for all the slick stuff planned to augment working on the actor runtime 💥 Move the `._debug` module into the new subpkg and adjust rest of core code base to reflect import path change. Also add a new `.devx._debug.open_crash_handler()` manager for wrapping any sync code outside a `trio.run()` which is handy for eventual CLI addons for popular frameworks like `click`/`typer`.	2023-09-28 14:14:50 -04:00
Tyler Goodlet	3d0e95513c	Init-support for "multi homed" transports Since we'd like to eventually allow a diverse set of transport (protocol) methods and stacks, and a multi-peer discovery system for distributed actor-tree applications, this reworks all runtime internals to support multi-homing for any given tree on a logical host. In other words any actor can now bind its transport server (currently only unsecured TCP + `msgspec`) to more then one address available in its (linux) network namespace. Further, registry actors (now dubbed "registars" instead of "arbiters") can also similarly bind to multiple network addresses and provide discovery services to remote actors via multiple addresses which can now be provided at runtime startup. Deats: - adjust `._runtime` internals to use a `list[tuple[str, int]]` (and thus pluralized) socket address sequence where applicable for transport server socket binds, now exposed via `Actor.accept_addrs`: - `Actor.__init__()` now takes a `registry_addrs: list`. - `Actor.is_arbiter` -> `.is_registrar`. - `._arb_addr` -> `._reg_addrs: list[tuple]`. - always reg and de-reg from all registrars in `async_main()`. - only set the global runtime var `'_root_mailbox'` to the loopback address since normally all in-tree processes should have access to it, right? - `._serve_forever()` task now takes `listen_sockaddrs: list[tuple]` - make `open_root_actor()` take a `registry_addrs: list[tuple[str, int]]` and defaults when not passed. - change `ActorNursery.start_..()` methods take `bind_addrs: list` and pass down through the spawning layer(s) via the parent-seed-msg. - generalize all `._discovery()` APIs to accept `registry_addrs`-like inputs and move all relevant subsystems to adopt the "registry" style naming instead of "arbiter": - make `find_actor()` support batched concurrent portal queries over all provided input addresses using `.trionics.gather_contexts()` Bo - syntax: move to using `async with <tuples>` 3.9+ style chained @acms. - a general modernization of the code to a python 3.9+ style. - start deprecation and change to "registry" naming / semantics: - `._discovery.get_arbiter()` -> `.get_registry()`	2023-09-27 16:25:21 -04:00
Tyler Goodlet	ee151b00af	Mk `gather_contexts()` support `@acm`s yielding `None` We were using a `all(<yielded values>)` condition which obviously won't work if the batched managers yield any non-truthy value. So instead see the `unwrapped: dict` with the `id(mngrs)` and only unblock once all values have been filled in to be something that is not that value.	2023-09-27 14:05:22 -04:00
Tyler Goodlet	22c14e235e	Expose `Channel` @ pkg level, drop `_debug.pp()` alias	2023-08-18 10:18:25 -04:00
Tyler Goodlet	1102843087	Teensie tidy up on actor doc string	2023-08-18 10:10:36 -04:00
Tyler Goodlet	e03bec5efc	Move `.to_asyncio` to modern optional value type annots	2023-07-21 15:08:46 -04:00
Tyler Goodlet	bee2c36072	Make `NamespacePath` work on object refs Detect if the input ref is a non-func (like an `object` instance) in which case grab its type name using `type()`. Wrap all the name-getting into a new `_mk_fqpn()` static meth: gets the "fully qualified path name" and returns path and name in tuple; port other methds to use it. Refine and update the docs B)	2023-07-12 13:07:30 -04:00
Tyler Goodlet	b36b3d522f	Map `breakpoint()` built-in to new `.pause_from_sync()` ep	2023-07-07 15:35:52 -04:00
Tyler Goodlet	4ace8f6037	Fix frame-selection display on first REPL entry For whatever reason pdb(p), and in general, will show the frame of the next python instruction/LOC on initial entry (at least using `.set_trace()`), as such remove the `try/finally` block in the sync code entrypoint `.pause_from_sync()`, and also since doesn't seem like we really need it anyway. Further, and to this end: - enable hidden frames support in our default config. - fix/drop/mask all the frame ref-ing/mangling we had prior since it's no longer needed as well as manual `Lock` releasing which seems to work already by having the `greenback` spawned task do it's normal thing? - move to no `Union` type annots. - hide all frames that can add "this is the runtime confusion" to traces.	2023-07-07 14:51:44 -04:00
Tyler Goodlet	98a7326c85	._runtime: log level tweaks, use crit for stale debug lock detection	2023-07-07 14:49:23 -04:00
Tyler Goodlet	46972df041	.log: more correct handling for `get_logger(__name__)` usage	2023-07-07 14:48:37 -04:00
Tyler Goodlet	ac695a05bf	Updates from latest `piker.data._sharedmem` changes	2023-06-22 17:16:17 -04:00
Tyler Goodlet	fc56971a2d	First proto: use `greenback` for sync func breakpointing This works now for supporting a new `tractor.pause_from_sync()` `tractor`-aware-replacement for `Pdb.set_trace()` from sync functions which are also scheduled from our runtime. Uses `greenback` to do all the magic of scheduling the bg `tractor._debug._pause()` task and engaging the normal TTY locking machinery triggered by `await tractor.breakpoint()` Further this starts some public API renaming, making a switch to `tractor.pause()` from `.breakpoint()` which IMO much better expresses the semantics of the runtime intervention required to suffice multi-process "breakpointing"; it also is an alternate name for the same in computer science more generally: https://en.wikipedia.org/wiki/Breakpoint It also avoids using the same name as the `breakpoint()` built-in which is important since there is alot more going on when you call our equivalent API. Deats of that: - add deprecation warning for `tractor.breakpoint()` - add `tractor.pause()` and a shorthand, easier-to-type, alias `.pp()` for "pause-point" B) - add `pause_from_sync()` as the new `breakpoint()`-from-sync-function hack which does all the `greenback` stuff for the user. Still TODO: - figure out where in the runtime and when to call `greenback.ensure_portal()`. - fix the frame selection issue where `trio._core._ki._ki_protection_decorator:wrapper` seems to be always shown on REPL start as the selected frame..	2023-06-21 16:08:18 -04:00
Tyler Goodlet	4f442efbd7	Pass `str` dtype for `use_str` case	2023-06-15 12:20:20 -04:00
Tyler Goodlet	f9a84f0732	Allocate size-specced "empty" sequence from default values by type	2023-06-15 12:20:20 -04:00
Tyler Goodlet	e0bf964ff0	Mod define `_USE_POSIX`, add a of of todos	2023-06-15 12:20:20 -04:00
Tyler Goodlet	b52ff270c5	Add `ShmList` slice support in `.__getitem__()`	2023-06-15 12:20:20 -04:00
Tyler Goodlet	1713ecd9f8	Rename token type to `NDToken` in the style of `nptyping`	2023-06-15 12:20:20 -04:00
Tyler Goodlet	edb82fdd78	Don't require runtime (for now), type annot fixing	2023-06-15 12:20:20 -04:00
Tyler Goodlet	71477290fc	Add `ShmList` wrapping the stdlib's `ShareableList` First attempt at getting `multiprocessing.shared_memory.ShareableList` working; we wrap the stdlib type with a readonly attr and a `.key` for cross-actor lookup. Also, rename all `numpy` specific routines to have a `ndarray` suffix in the func names.	2023-06-15 12:20:20 -04:00
Tyler Goodlet	9716d86825	Initial module import from `piker.data._sharemem` More or less a verbatim copy-paste minus some edgy variable naming and internal `piker` module imports. There is a bunch of OHLC related defaults that need to be dropped and we need to adjust to an optional dependence on `numpy` by supporting shared lists as per the mp docs.	2023-06-15 12:20:20 -04:00
Tyler Goodlet	7507e269ec	Just import `mp` top level in `._spawn`	2023-06-14 15:32:15 -04:00
Tyler Goodlet	17ae449160	Tidy up `typing` imports in broadcaster mod	2023-06-14 15:31:52 -04:00
Tyler Goodlet	6495688730	Drop `Optional` style from runtime mod	2023-05-25 16:00:05 -04:00
Tyler Goodlet	a0276f41c2	Remote cancellation runtime-internal vars renames - `Context._cancel_called_remote` -> `._cancelled_remote` since "called" implies the cancellation was "requested" when it could be due to another error and the actor uid is the value - only set once the far end task scope is terminated due to either error or cancel, which has nothing to do with what caused the cancellation. - `Actor._cancel_called_remote` -> `._cancel_called_by_remote` which emphasizes that this variable is only set IFF some remote actor requested that this actor's runtime be cancelled via `Actor.cancel()`.	2023-05-19 14:31:55 -04:00
Tyler Goodlet	ead9e418de	Expose `allow_overruns` to `Portal.open_context()` Turns out you can get a case where you might be opening multiple ctx-streams concurrently and during the context opening phase you block for all contexts to open, but then when you eventually start opening streams some slow to start context has caused the others become in an overrun state.. so we need to let the caller control whether that's an error ;) This also needs a test!	2023-05-15 10:00:45 -04:00
Tyler Goodlet	60791ed546	Oof, fix remaining `Actor.cancel()` in `Actor._from_parent()`	2023-05-15 10:00:45 -04:00
Tyler Goodlet	7293b82bcc	Tweak doc string	2023-05-15 10:00:45 -04:00
Tyler Goodlet	20d75ff934	Move move context code into new `._context` mod	2023-05-15 10:00:45 -04:00
Tyler Goodlet	04e4397a8f	Ignore drainer-task nursery RTE during context exit	2023-05-15 10:00:45 -04:00
Tyler Goodlet	968f13f9ef	Set `Context._scope_nursery` on callee side too Because obviously we probably want to support `allow_overruns` on the remote callee side as well XD Only found the bugs fixed in this patch this thanks to writing a much more exhaustive test set for overrun cases B)	2023-05-15 10:00:45 -04:00
Tyler Goodlet	f9911c22a4	Seriously cover all overrun cases This actually caught further runtime bugs so it's gud i tried.. Add overrun-ignore enabled / disabled cases and error catching for all of them. More or less this should cover every possible outcome when it comes to setting `allow_overruns: bool` i hope XD	2023-05-15 10:00:45 -04:00
Tyler Goodlet	6db656fecf	Flip allocate log msgs to debug	2023-05-15 10:00:45 -04:00
Tyler Goodlet	c72026091e	Remote `Context` cancellation semantics rework B) This adds remote cancellation semantics to our `tractor.Context` machinery to more closely match that of `trio.CancelScope` but with operational differences to handle the nature of parallel tasks interoperating across multiple memory boundaries: - if an actor task cancels some context it has opened via `Context.cancel()`, the remote (scope linked) task will be cancelled using the normal `CancelScope` semantics of `trio` meaning the remote cancel scope surrounding the far side task is cancelled and `trio.Cancelled`s are expected to be raised in that scope as per normal `trio` operation, and in the case where no error is raised in that remote scope, a `ContextCancelled` error is raised inside the runtime machinery and relayed back to the opener/caller side of the context. - if any actor task cancels a full remote actor runtime using `Portal.cancel_actor()` the same semantics as above apply except every other remote actor task which also has an open context with the actor which was cancelled will also be sent a `ContextCancelled` but with the `.canceller` field set to the uid of the original cancel requesting actor. This changeset also includes a more "proper" solution to the issue of "allowing overruns" during streaming without attempting to implement any form of IPC streaming backpressure. Implementing task-granularity backpressure cross-process turns out to be more or less impossible without augmenting out streaming protocol (likely at the cost of performance). Further allowing overruns requires special care since any blocking of the runtime RPC msg loop task effectively can block control msgs such as cancels and stream terminations. The implementation details per abstraction layer are as follows. ._streaming.Context: - add a new contructor factor func `mk_context()` which provides a strictly private init-er whilst allowing us to not have to define an `.__init__()` on the type def. - add public `.cancel_called` and `.cancel_called_remote` properties. - general rename of what was the internal `._backpressure` var to `._allow_overruns: bool`. - move the old contents of `Actor._push_result()` into a new `._deliver_msg()` allowing for better encapsulation of per-ctx msg handling. - always check for received 'error' msgs and process them with the new `_maybe_cancel_and_set_remote_error()` before any msg delivery to the local task, thus guaranteeing error and cancellation handling despite any overflow handling. - add a new `._drain_overflows()` task-method for use with new `._allow_overruns: bool = True` mode. - add back a `._scope_nursery: trio.Nursery` (allocated in `Portal.open_context()`) who's sole purpose is to spawn a single task which runs the above method; anything else is an error. - augment `._deliver_msg()` to start a task and run the above method when operating in no overrun mode; the task queues overflow msgs and attempts to send them to the underlying mem chan using a blocking `.send()` call. - on context exit, any existing "drainer task" will be cancelled and remaining overflow queued msgs are discarded with a warning. - rename `._error` -> `_remote_error` and set it in a new method `_maybe_cancel_and_set_remote_error()` which is called before processing - adjust `.result()` to always call `._maybe_raise_remote_err()` at its start such that whenever a `ContextCancelled` arrives we do logic for whether or not to immediately raise that error or ignore it due to the current actor being the one who requested the cancel, by checking the error's `.canceller` field. - set the default value of `._result` to be `id(Context()` thus avoiding conflict with any `.result()` actually being `False`.. ._runtime.Actor: - augment `.cancel()` and `._cancel_task()` and `.cancel_rpc_tasks()` to take a `requesting_uid: tuple` indicating the source actor of every cancellation request. - pass through the new `Context._allow_overruns` through `.get_context()` - call the new `Context._deliver_msg()` from `._push_result()` (since the factoring out that method's contents). ._runtime._invoke: - `TastStatus.started()` back a `Context` (unless an error is raised) instead of the cancel scope to make it easy to set/get state on that context for the purposes of cancellation and remote error relay. - always raise any remote error via `Context._maybe_raise_remote_err()` before doing any `ContextCancelled` logic. - assign any `Context._cancel_called_remote` set by the `requesting_uid` cancel methods (mentioned above) to the `ContextCancelled.canceller`. ._runtime.process_messages: - always pass a `requesting_uid: tuple` to `Actor.cancel()` and `._cancel_task` to that any corresponding `ContextCancelled.canceller` can be set inside `._invoke()`.	2023-05-15 10:00:45 -04:00
Tyler Goodlet	90e41016b9	Only tuplize `.canceller` if non-`None`	2023-05-15 10:00:45 -04:00
Tyler Goodlet	f54c415060	Move `NoRuntime` import inside `current_actor()` to avoid cycle	2023-05-15 10:00:45 -04:00
Tyler Goodlet	67f82c6ebd	Add new remote error introspection attrs To handle both remote cancellation this adds `ContextCanceled.canceller: tuple` the uid of the cancel requesting actor and is expected to be set by the runtime when servicing any remote cancel request. This makes it possible for `ContextCancelled` receivers to know whether "their actor runtime" is the source of the cancellation. Also add an explicit `RemoteActor.src_actor_uid` which better formalizes the notion of "which remote actor" the error originated from. Both of these new attrs are expected to be packed in the `.msgdata` when the errors are loaded locally.	2023-05-15 10:00:45 -04:00
Tyler Goodlet	220b244508	Log waiter task cancelling msg as cancel-level	2023-05-15 10:00:45 -04:00
Tyler Goodlet	831790377b	Assign `RemoteActorError` boxed error type for context cancelleds	2023-05-15 10:00:45 -04:00
Tyler Goodlet	e80e0a551f	Change a bunch of log levels to cancel, including any `ContextCancelled` handling	2023-05-15 10:00:45 -04:00
Tyler Goodlet	b3f9251eda	Add some log-level method doc-strings	2023-05-15 10:00:45 -04:00
Tyler Goodlet	903537ce04	Tweak context doc str	2023-05-15 10:00:45 -04:00
Tyler Goodlet	d75343106b	More single doc-strs in discovery mod	2023-05-15 10:00:45 -04:00
Tyler Goodlet	cfb2bc0fee	Enable `Context` backpressure by default; avoid startup race-crashes?	2023-05-15 10:00:45 -04:00
Tyler Goodlet	1c3893a383	Drop commented `pdbpp` import logic	2023-05-15 09:01:55 -04:00
Tyler Goodlet	79622bbeea	Restore `breakpoint()` hook after runtime exits Previously we were leaking our (pdb++) override into the Python runtime which would always result in a runtime error whenever `breakpoint()` is called outside our runtime; after exit of the root actor . This explicitly restores any previous hook override (detected during startup) or deletes the hook and restores the environment if none existed prior. Also adds a new WIP debugging example script to ensure breakpointing works as normal after runtime close; this will be added to the test suite.	2023-05-15 00:47:29 -04:00
Tyler Goodlet	95535b2226	Some more 3.10+ optional type sigs	2023-05-15 00:47:29 -04:00
Tyler Goodlet	ae4ff5dc8d	pdbp: adding typing to config settings vars	2023-05-14 22:38:46 -04:00
Tyler Goodlet	705538398f	`pdbp`: turn off line truncating by default, fixes terminal resizing stuff	2023-05-14 22:38:16 -04:00
Tyler Goodlet	86aef5238d	Hide actor nursery exit frame	2023-05-14 21:24:26 -04:00
Tyler Goodlet	cc82447db6	First try: switch debug machinery over to `pdbp` B)	2023-05-14 21:24:26 -04:00
Tyler Goodlet	23cffbd940	Use multiline import for debug mod	2023-05-14 21:24:26 -04:00
Tyler Goodlet	f667d16d66	Copy the now deprecated `trio.Process.aclose()` Move it into our `_spawn.do_hard_kill()` since we do indeed rely on the particular process killing sequence on "soft kill" failure cases.	2023-05-14 19:31:50 -04:00
Tyler Goodlet	24a062341e	Just call `trio.Process.aclose()` directly for now?	2023-04-02 14:34:41 -04:00
Tyler Goodlet	8637778739	Expose `raise_on_lag: bool` flag through factory	2023-01-30 12:18:23 -05:00
Tyler Goodlet	47166e45f0	Be explicit with passthrough kwargs (there's so few)	2023-01-29 17:31:21 -05:00
Tyler Goodlet	4ce2dcd12b	Switch back to raising `Lagged` by default Makes the broadcast test suite not hang xD, and is our expected default behaviour. Also removes a ton of commented legacy cruft from before the refactor to remove the `.receive()` recursion and fixes some typing. Oh right, and in the case where there's only one subscriber left we warn log about it since in theory we could actually entirely unwind the bcaster back to the original underlying, though not sure if that's sane or works for some use cases (like wanting to have some other subscriber get added dynamically later).	2023-01-29 15:03:34 -05:00
Tyler Goodlet	80f983818f	Ignore monkey patched `.send()` type annot	2023-01-29 15:03:34 -05:00
Tyler Goodlet	6ba29f8d56	Recurse and get the last value when in warn mode	2023-01-29 15:03:34 -05:00
Tyler Goodlet	2707a0e971	Add `._raise_on_lag` flag to disable `Lag` raising	2023-01-29 15:03:34 -05:00
Tyler Goodlet	9f9907271b	Merge `ReceiveMsgStream` and `MsgStream` Since one-way streaming can be accomplished by just not sending on one side (and/or thus wrapping such usage in a more restrictive API), we just drop the recv-only parent type. The only method different was `MsgStream.send()`, now merged in. Further in usage of `.subscribe()` we monkey patch the underlying stream's `.send()` onto the delivered broadcast receiver so that subscriber tasks can two-way stream as though using the stream directly. This allows us to more definitively drop `tractor.open_stream_from()` in the longer run if we so choose as well; note currently this will potentially create an issue if a caller tries to `.send()` on such a one way stream.	2023-01-29 15:03:34 -05:00
Tyler Goodlet	c2367c1c5e	Better `trio`-ize `BroadcastReceiver` internals Driven by a bug found in `piker` where we'd get an inf recursion error due to `BroadcastReceiver.receive()` being called when consumer tasks are awoken but no value is ready to `.nowait_receive()`. This new rework takes an approach closer to the interface and internals of `trio.MemoryReceiveChannel` particularly in terms of, - implementing a `BroadcastReceiver.receive_nowait()` and using it within the async `.receive()`. - failing over to an internal `._receive_from_underlying()` when the `_nowait()` call raises `trio.WouldBlock`. - adding `BroadcastState.statistics()` for debugging and testing dropping recursion from `.receive()`.	2023-01-29 15:03:34 -05:00
Tyler Goodlet	13c9eadc8f	Move result log msg up and drop else block	2023-01-29 14:55:02 -05:00
Tyler Goodlet	aa4871b13d	Call `MsgStream.aclose()` in `Context.open_stream.__aexit__()` We weren't doing this originally I think just because of the path dependent nature of the way the code was developed (originally being mega pedantic about one-way vs. bidirectional streams) but, it doesn't seem like there's any issue just calling the stream's `.aclose()`; also have the benefit of just being less code and logic checks B)	2023-01-29 14:55:02 -05:00
Tyler Goodlet	556f4626db	Tweak warning msg for still-alive-after-cancelled actor	2023-01-29 14:55:02 -05:00
Tyler Goodlet	df01294bb2	Show more functiony syntax in ctx-cancelled log msgs	2023-01-29 14:55:02 -05:00
Tyler Goodlet	ddf3d0d1b3	Show tracebacks for un-shipped/propagated errors	2023-01-29 14:55:02 -05:00
Tyler Goodlet	97d5f7233b	Fix uid2nursery lookup table type annot	2023-01-29 14:55:02 -05:00
Tyler Goodlet	d27c081a15	Ensure arbiter sockaddr type before usage	2023-01-29 14:55:02 -05:00
Tyler Goodlet	a4874a3227	Always set the `parent_exit: trio.Event` on exit	2023-01-29 14:55:02 -05:00
Tyler Goodlet	de04bbb2bb	Don't raise on a broken IPC-context when sending stop msg	2023-01-29 14:55:02 -05:00
Tyler Goodlet	4f977189c0	Handle broken mem chan on `Actor._push_result()` When backpressure is used and a feeder mem chan breaks during msg delivery (usually because the IPC allocating task already terminated) instead of raising we simply warn as we do for the non-backpressure case. Also, add a proper `Actor.is_arbiter` test inside `._invoke()` to avoid doing an arbiter-registry lookup if the current actor is the registrar.	2023-01-29 14:55:02 -05:00
Tyler Goodlet	121a8cc891	Drop `Optional` usage from root mod	2023-01-26 16:00:08 -05:00
Tyler Goodlet	c54b8ca4ba	Begin deprecation of `arbiter_addr` -> `registry_addr`	2023-01-26 16:00:08 -05:00
Tyler Goodlet	5b8a87d0f6	Slightly better `xonsh` check hack, fix typing	2023-01-26 15:48:15 -05:00
Tyler Goodlet	2e278ceb74	Add a super hacky check for `xonsh`, smh..	2023-01-26 15:26:43 -05:00
Tyler Goodlet	dba8118553	Always attempt prompt redraw on ctl-c in REPL The stdlib has all sorts of muckery with ignoring SIGINT in the `Pdb._cmdloop()` but here we just override all that since we don't trust their decisions about cancellation handling whatsoever. Adds a `Lock.repl: MultiActorPdb` attr which is set by any task which acquires root TTY lock indicating (via actor global state) that the current actor is using the debugger REPL and can be expected to re-draw the prompt on SIGINT. Further we mask out log messages from any actor who also has the `shield_sigint_handler()` enabled to avoid logging noise when debugging.	2023-01-26 12:44:13 -05:00
Tyler Goodlet	fca2e7c10e	Simplify closed abruptly log msg	2023-01-26 12:44:13 -05:00
Tyler Goodlet	5ed62c5c54	Add note about intermediary-actor in debug issue	2023-01-26 12:44:13 -05:00
Tyler Goodlet	6c8cacc9d1	Adjust all default is `None` annots (per new `mypy`)	2022-12-12 13:18:22 -05:00
Tyler Goodlet	38326e8c15	Avoid error on context double pops	2022-12-11 23:46:33 -05:00
Tyler Goodlet	b5192cca8e	Always greedily `list`-cast`mngrs` input sequence	2022-12-11 23:20:58 -05:00
Tyler Goodlet	c606be8c64	Passthrough runtime kwargs from `open_actor_cluster()`	2022-12-11 19:56:08 -05:00
Tyler Goodlet	f2641c8964	Avoid "task never called `.started()`" runtime erros when cancelling	2022-10-14 19:42:23 -04:00
Tyler Goodlet	f39414ce12	Drop error-repacking for `.run_in_actor()`s block If we pack the nursery parent task's error into the `errors` table directly in the handler, we don't need to specially handle packing that same error into any exception group raised while handling sub-actor cancellation; drops some ugly indentation ;)	2022-10-14 19:42:23 -04:00
Tyler Goodlet	e298b70edf	Drop added `.pdp()` level msgs used duringn dev	2022-10-14 19:42:23 -04:00
Tyler Goodlet	38f9d35dee	Fix errors table type annot	2022-10-14 19:42:23 -04:00
Tyler Goodlet	88448f7281	Fix handler type annot	2022-10-14 19:42:23 -04:00
Tyler Goodlet	0956d5f461	Restore the `trio` SIGINT handler, cancel root lock tasks on no-peers Pretty sure this is the final touch to alleviate all our debug lock headaches! Instead of trying to revert to the "last" handler (as `pdb` does internally in the stdlib) we always just revert to the handler `trio` registers during startup. Further this seems to allow cancelling the root-side locking task if it's detected as stale IFF we only do this when the root actor is in a "no more IPC peers" state. Deatz: - (always) set `._debug.Lock._trio_handler` as the `trio` version, not some last used handler to make sure we're getting the ctrl-c handling we want when not in debug mode. - assign the trio handler in `open_root_actor()` `._runtime._async_main()` to be sure it's applied in subactors as well as the root. - only do debug lock blocking and root-side-locking-task cancels when a "no peers" condition is detected in the root actor: i.e. no IPC channels are detected by the root meaning it's impossible any actor has a sane lock-state ongoing for debug mode.	2022-10-14 18:18:01 -04:00
Tyler Goodlet	33f2234baf	Hide some stack layers the user doesn't really need to see	2022-10-14 18:18:01 -04:00
Tyler Goodlet	7521bded3d	Pack error from the parent task into the actor nursery	2022-10-14 18:16:51 -04:00
Tyler Goodlet	50fe098e06	First pass, swap `MultiError` for `BaseExceptionGroup`	2022-10-14 18:16:51 -04:00
Tyler Goodlet	98056f6ed7	Move logging context map into `log.py` module	2022-10-12 12:46:20 -04:00
Tyler Goodlet	b81b6be98a	Drop extra log msgs, some old commented code	2022-10-12 12:35:35 -04:00
Tyler Goodlet	fb721f36ef	Support debug-lock blocking, use on no-more IPC This is a lingering debugger locking race case we needed to handle: - child crashes acquires TTY lock in root and attaches to `pdb` - child IPC goes down such that all channels to the root are broken / non-functional. - root is stuck thinking the child is still in debug even though it can't be contacted and the child actor machinery hasn't been cancelled by its parent. - root get's stuck in deadlock with child since it won't send a cancel request until the child is finished debugging, but the child can't unlock the debugger bc IPC is down. To avoid this scenario add debug lock blocking list via `._debug.Lock._blocked: set[tuple]` which holds actor uids for any actor that is detected by the root as having no transport channel connections with said root (of which at least one should exist if this sub-actor at some point acquired the debug lock). The root consequently checks this list for any actor that tries to (re)acquire the lock and blocks with a `ContextCancelled`. When a debug condition is tested in `._runtime._invoke` the context's `._enter_debugger_on_cancel` which is set to `False` if the actor is on the block list in which case the post-mortem entry is skipped. Further this adds a root-locking-task side cancel scope to `Lock._root_local_task_cs_in_debug` which can be cancelled by the root runtime when a stale lock is detected after all IPC channels for the actor have been torn down. NOTE: right now we're NOT doing this since it seems to cause test failures likely due because it may cause pre-mature cancellation and maybe needs a bit more experimenting?	2022-10-11 20:00:05 -04:00
Tyler Goodlet	734d8dd663	Move `trio` scope outside first inter-task-chan receive	2022-10-11 20:00:05 -04:00
Tyler Goodlet	1c480e6c92	Add `Context` cancel message and debug toggle flag In the case of a callee-side context cancelling itself it can be handy to let the caller-side task know (even if through logging) that the cancel was due to some known reason. Make `.cancel()` accept such a message on the callee side and have it included in the `._runtime._invoke()` raised `ContextCancelled` emission. Also add a `Context._trigger_debugger_on_cancel: bool` flag which can be set to `False` to avoid the debugger post-mortem crash mode from engaging on cross-context tasks which cancel themselves for a known reason (as is needed for blocked tasks in the debug TTY-lock machinery).	2022-10-11 20:00:05 -04:00
Tyler Goodlet	44b59f3338	Go back to a `global` single-ton nursery per actor Turns out the lifetime mgmt of separate nurseries per delegate manager is tricky; a new nursery can't be naively allocated on cache-misses since it may get closed by some early terminating task instead of by the "last using" consumer task. In theory if we allocate using the same logic as that used for the last-task-triggers-exit then this should work? For now just go back to a single global nursery per `_Cache` which still avoids use of the internal actor service nursery.	2022-10-09 21:27:23 -04:00
Tyler Goodlet	7a719ac2a7	Use one nursery per unique manager (signature) Instead of sticking all `trionics.maybe_open_context()` tasks inside the actor's (root) service nursery, open a unique one per manager function instance (id). Further, accept a callable for the `key` such that a user can have more flexible control on the caching logic and move the `maybe_open_nursery()` helper out of the portal mod and into this trionics "managers" module.	2022-10-09 21:27:23 -04:00
Tyler Goodlet	d24fae8381	'Rename mp spawn methods to have a `'mp_'` prefix'	2022-10-09 17:54:55 -04:00
Tyler Goodlet	5ab98513b7	Move `@tractor_test` into `conftest.py`	2022-10-09 17:14:20 -04:00
Tyler Goodlet	90f4912580	Organize process spawning into lookup table Instead of the logic branching create a table `._spawn._methods` which is used to lookup the desired backend framework (in this case still only one of `multiprocessing` or `trio`) and make the top level `.new_proc()` do the lookup and any common logic. Use a `typing.Literal` to define the lookup table's key set. Repair and ignore a bunch of type-annot related stuff todo with `mypy` updates and backend-specific process typing.	2022-10-09 16:51:21 -04:00
Tyler Goodlet	15047341bd	Ignore forserver override attrs with `mypy`	2022-10-09 16:14:11 -04:00
Tyler Goodlet	e609183242	Expose lifetime stack as class attr, add base test suite	2022-09-15 23:50:15 -04:00
Tyler Goodlet	10eeda2d2b	Use built-ins for all data-structure-type annotations	2022-09-15 23:41:28 -04:00
Tyler Goodlet	ad19bf2cf1	Remove `tractor.run()` once and for all It's been deprecated for a while now and all docs and tests have been changed. Closes #183	2022-09-15 23:41:28 -04:00
Tyler Goodlet	9aef03772a	Expose `Actor` at pkg level, adjust debug type annots	2022-09-15 23:41:28 -04:00
Tyler Goodlet	7548dba8f2	Change to new doc string style	2022-09-15 23:41:28 -04:00
Tyler Goodlet	208d56af2c	Make `async_main()` a module func	2022-09-15 23:41:28 -04:00
Tyler Goodlet	a3a5bc267e	Make `process_messages()` a mod func	2022-09-15 23:41:28 -04:00
Tyler Goodlet	d4084b2032	Rename our core module to `_runtime`	2022-09-15 23:41:28 -04:00
Tyler Goodlet	bafd10a260	Make `maybe_open_context()` re-entrant safe, use per factory locks	2022-09-15 19:02:02 -04:00
Tyler Goodlet	5ad540c417	Add debug complete event `None`-guard for when already reset	2022-09-15 19:02:02 -04:00
Tyler Goodlet	8f1fe2376a	Simplify all hooks to a common `Lock.release()`	2022-08-02 18:14:05 -04:00
Tyler Goodlet	650313dfef	Drop legacy handler blocks factored into `_acquire_debug_lock()`	2022-08-02 12:50:27 -04:00
Tyler Goodlet	e4006da6f4	Drop `pdbpp` bug notes, add follow up issue #320 note	2022-08-02 12:48:40 -04:00
Tyler Goodlet	7f6169a050	Drop legacy commented/todo remote debug helper block	2022-08-02 12:43:14 -04:00
Tyler Goodlet	02c3b9a672	Put `pygments` back to default	2022-08-02 12:17:34 -04:00
Tyler Goodlet	c5c7a9027c	Line len lint and drop rpc log msg level again	2022-08-02 12:17:34 -04:00
Tyler Goodlet	937ed99e39	Factor sigint overriding into lock methods	2022-08-02 12:17:28 -04:00
Tyler Goodlet	91f034a136	Move all module vars into a `Lock` type	2022-08-02 12:17:28 -04:00
Tyler Goodlet	6f01c78122	Disable `pygments` highlighting on ctlc tests	2022-08-02 12:17:28 -04:00
Tyler Goodlet	c0cd99e374	Timeout on arbiter ping, avoid TCP SYN hangs in CI?	2022-08-02 12:17:28 -04:00
Tyler Goodlet	b01daa5319	Factor lock-state release logic into helper The common logic to both remove our custom SIGINT handler as well as signal the actor global event that pdb is complete. Call this whenever we exit a post mortem call and thus any time some rpc task get's debugged inside `._actor._invoke()`. Further, we have to manually print the REPL prompt on 3.9 for some wack reason, so stick a version guard in the sigint handler for that..	2022-08-02 12:17:28 -04:00
Tyler Goodlet	bd362a05f0	Run release hook around `next` repl commands as well	2022-08-02 12:17:28 -04:00
Tyler Goodlet	b21f2e16ad	Always consider the debugger when exiting contexts When in an uncertain teardown state and in debug mode a context can be popped from actor runtime before a child finished debugging (the case when the parent is tearing down but the child hasn't closed/completed its tty lock IPC exit phase) and the child sends the "stop" message to unlock the debugger but it's ignored bc the parent has already dropped the ctx. Instead we call `._debug.maybe_wait_for_deugger()` before these context removals to avoid the root getting stuck thinking the lock was never released. Further, add special `Actor._cancel_task()` handling code inside `_invoke()` which continues to execute the method despite the IPC channel to the caller being broken and thus avoiding potential hangs due to a target (child) actor task remaining alive.	2022-08-02 12:17:28 -04:00
Tyler Goodlet	ba7b355d9c	Add note about default behaviour of `fancycompleter`	2022-08-02 12:17:28 -04:00
Tyler Goodlet	ef8dc0204c	Just drop all longlisting for now and leave comments	2022-08-02 12:17:28 -04:00
Tyler Goodlet	a101971027	Go back to original longlist code	2022-08-02 12:17:28 -04:00
Tyler Goodlet	835836123b	Just don't call longlist on 3.10+ for now	2022-08-02 12:17:28 -04:00
Tyler Goodlet	b9eb601265	General typing fixes for `mypy`	2022-08-02 12:17:27 -04:00
Tyler Goodlet	4dcc21234e	Only call `.poll()` if a method on the spawn backend	2022-08-02 12:17:27 -04:00
Tyler Goodlet	8b9f342eef	Port to new `.lowlevel.open_process()` API	2022-08-02 12:17:27 -04:00
Tyler Goodlet	a90ca4b384	Call longlist normally when on py < 3.10	2022-08-02 12:17:06 -04:00
Tyler Goodlet	d0dcd55f47	Only report disconnected actors if proc is still alive?	2022-08-02 12:17:06 -04:00
Tyler Goodlet	519f4c300b	I dunno, seems like `breakpoint()` needs this?	2022-08-02 12:17:06 -04:00
Tyler Goodlet	ff3f5959e9	Always enable debug level logging if mode enabled	2022-08-02 12:16:58 -04:00
Tyler Goodlet	abb00531d3	Add help msg for non `__main__` modules as well	2022-08-02 12:16:58 -04:00
Tyler Goodlet	18c525d2f1	Hack around double long list print issue.. See https://github.com/pdbpp/pdbpp/issues/496	2022-08-02 12:16:58 -04:00
Tyler Goodlet	e2453fd3da	Add spaces before values in log msg	2022-08-02 12:16:58 -04:00
Tyler Goodlet	b29def8b5d	Add runtime level msg around channel draining	2022-08-02 12:16:58 -04:00
Tyler Goodlet	f07e9dbb2f	Always undo SIGINT overrides, cancel detached children Ensure that even when `pdb` resumption methods are called during a crash where `trio`'s runtime has already terminated (eg. `Event.set()` will raise) we always revert our sigint handler to the original. Further inside the handler if we hit a case where a child is in debug and (thinks it) has the global pdb lock, if it has no IPC connection to a parent, simply presume tty sync-coordination is now lost and cancel the child immediately.	2022-08-02 12:16:49 -04:00
Tyler Goodlet	c7035be2fc	Tolerate double `.remove()`s of stream on portal teardowns	2022-07-27 11:40:02 -04:00
Tyler Goodlet	deaca7d6cc	Always propagate SIGINT when no locking peer found A hopefully significant fix here is to always avoid suppressing a SIGINT when the root actor can not detect an active IPC connections (via a connected channel) to the supposed debug lock holding actor. In that case it is most likely that the actor has either terminated or has lost its connection for debugger control and there is no way the root can verify the lock is in use; thus we choose to allow KBI cancellation. Drop the (by comment) `try`-`finally` block in `_hijoack_stdin_for_child()` around the `_acquire_debug_lock()` call since all that logic should now be handled internal to that locking manager. Try to catch a weird error around the `.do_longlist()` method call that seems to sometimes break on py3.10 and latest `pdbpp`.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	d47d0e7c37	Always call pdb hook even if tty locking fails	2022-07-27 11:40:02 -04:00
Tyler Goodlet	0062c96a3c	Log cancels with appropriate level	2022-07-27 11:40:02 -04:00
Tyler Goodlet	4be13b7387	Just warn on IPC breaks	2022-07-27 11:40:02 -04:00
Tyler Goodlet	7bb5addd4c	Only warn on `trio.BrokenResourceError`s from `_invoke()`	2022-07-27 11:40:02 -04:00
Tyler Goodlet	89b44f8163	Pre-declare disconnected flag	2022-07-27 11:40:02 -04:00
Tyler Goodlet	2819b6a5b2	Avoid attr error XD	2022-07-27 11:40:02 -04:00
Tyler Goodlet	f2671ed026	Type annot updates	2022-07-27 11:40:02 -04:00
Tyler Goodlet	41924c86a6	Drop uneeded backframe traceback hide annotation	2022-07-27 11:40:02 -04:00
Tyler Goodlet	206c7c0720	Make `Actor._process_messages()` report disconnects The method now returns a `bool` which flags whether the transport died to the caller and allows for reporting a disconnect in the channel-transport handler task. This is something a user will normally want to know about on the caller side especially after seeing a traceback from the peer (if in tree) on console.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	bf0ac3116c	Only cancel/get-result from a ctx if transport is up There's no point in sending a cancel message to the remote linked task and especially no reason to block waiting on a result from that task if the transport layer is detected to be disconnected. We expect that the transport shouldn't go down at the layer of the message loop (reconnection logic should be handled in the transport layer itself) so if we detect the channel is not connected we don't bother requesting cancels nor waiting on a final result message. Why? - if the connection goes down in error the caller side won't have a way to know "how long" it should block to wait for a cancel ack or result and causes a potential hang that may require an additional ctrl-c from the user especially if using the debugger or if the traceback is not seen on console. - obviously there's no point in waiting for messages when there's no transport to deliver them XD Further, add some more detailed cancel logging detailing the task and actor ids.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	74b819a857	Typing fixes, simplify `_set_trace()`	2022-07-27 11:40:02 -04:00
Tyler Goodlet	8892204c84	Add notes around py3.10 stdlib bug from `pdb++` There's a bug that's triggered in the stdlib without latest `pdb++` installed; add a note for that. Further inside `wait_for_parent_stdin_hijack()` don't `.started()` until the interactor stream has been opened to avoid races when debugging this `._debug.py` module (at the least) since we usually don't want the spawning (parent) task to resume until we know for sure the tty lock has been acquired. Also, drop the random checkpoint we had inside `_breakpoint()`, not sure it was actually adding anything useful since we're (mostly) carefully shielded throughout this func.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	8f4bbf1cbf	Add and use a pdb instance factory	2022-07-27 11:40:02 -04:00
Tyler Goodlet	aea8f63bae	Drop all the `@cm.__exit__()` override attempts.. None of it worked (you still will see `.__exit__()` frames on debugger entry - you'd think this would have been solved by now but, shrug) so instead wrap the debugger entry-point in a `try:` and put the SIGINT handler restoration inside `MultiActorPdb` teardown hooks. This seems to restore the UX as it was prior but with also giving the desired SIGINT override handler behaviour.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	7964a9f6f8	Try overriding `_GeneratorContextManager.__exit__()`; didn't work.. Using either of `@pdb.hideframe` or `__tracebackhide__` on stdlib methods doesn't seem to work either.. This all seems to have something to do with async generator usage I think ?	2022-07-27 11:40:02 -04:00
Tyler Goodlet	e5195264a1	Handle a context cancel? Might be a noop	2022-07-27 11:40:02 -04:00
Tyler Goodlet	345573e602	Make `mypy` happy	2022-07-27 11:40:02 -04:00
Tyler Goodlet	4e60c17375	Refine the handler for child vs. root cases This gets very close to avoiding any possible hangs to do with tty locking and SIGINT handling minus a special case that will be detailed below. Summary of implementation changes: - convert `_mk_pdb()` -> `with _open_pdb() as pdb:` which implicitly handles the `bdb.BdbQuit` case such that debugger teardown hooks are always called. - rename the handler to `shield_sigint()` and handle a variety of new cases: * the root is in debug but hasn't been cancelled -> call `Actor.cancel_soon()` * the root is in debug but has been called (`Actor.cancel_soon()` already called) -> raise KBI * a child is in debug and has a task locking the debugger -> ignore SIGINT in child and the root actor. - if the debugger instance is provided to the handler at acquire time, on SIGINT handling completion re-print the last pdb++ REPL output so that the user realizes they are still actively in debug. - ignore the unlock case where a race condition of "no task" holding the lock causes the `RuntimeError` normally associated with the "wrong task" doing so (not sure if this is a `trio` bug?). - change debug logs to runtime level. Unhandled case(s): - a child is maybe in debug mode but does not itself have any task using the debugger. * ToDo: we need a way to decide what to do with "intermediate" child actors who themselves either are not in `debug_mode=True` but have children who are such that a SIGINT won't cause cancellation of that child-as-parent-of-another-child iff any of their children are in in debug mode.	2022-07-27 11:40:02 -04:00
Tyler Goodlet	6b7b58346f	(facepalm) Reraise `BdbQuit` and discard ownerless lock releases	2022-07-27 11:40:02 -04:00
Tyler Goodlet	3cac323421	Add WIP while-debugger-active SIGINT ignore handler	2022-07-27 11:40:02 -04:00
goodboy	4902e184e9	Merge pull request #318 from goodboy/aio_error_propagation Add context test that opens an inter-task-channel that errors	2022-07-15 12:42:19 -04:00
Tyler Goodlet	05790a20c1	Slight lint fixes	2022-07-15 11:18:48 -04:00
Tyler Goodlet	f0d78e1a6e	Use local task ref, fixes `mypy`	2022-07-15 10:39:49 -04:00
Tyler Goodlet	0906559ed9	Drop manual stack construction, fix attr typo	2022-07-14 20:43:17 -04:00
Tyler Goodlet	38d03858d7	Fix `asyncio`-task-sync and error propagation This fixes an previously undetected bug where if an `.open_channel_from()` spawned task errored the error would not be propagated to the `trio` side and instead would fail silently with a console log error. What was most odd is that it only seems easy to trigger when you put a slight task sleep before the error is raised (:eyeroll:). This patch adds a few things to address this and just in general improve iter-task lifetime syncing: - add `LinkedTaskChannel._trio_exited: bool` a flag set from the `trio` side when the channel block exits. - add a `wait_on_aio_task: bool` flag to `translate_aio_errors` which toggles whether to wait the `asyncio` task termination event on exit. - cancel the `asyncio` task if the trio side has ended, when `._trio_exited == True`. - always close the `trio` mem channel when the task exits such that the `asyncio` side can error on any next `.send()` call.	2022-07-14 16:35:41 -04:00
Tyler Goodlet	41983edc43	Use `str` \| `bytes` union for typing msg dump	2022-07-12 11:59:11 -04:00
Tyler Goodlet	5168700fbf	Tolerate non-decode-able bytes	2022-07-12 11:55:55 -04:00
Tyler Goodlet	673c4a8c66	Decode bytes prior to log msg	2022-07-12 11:55:55 -04:00
Tyler Goodlet	932b841176	Allow up to 4 `msgpsec` decode failures	2022-07-12 11:55:55 -04:00
Tyler Goodlet	f594f1bdda	Handle a connection reset on `msgspec` transport	2022-07-12 11:55:55 -04:00
Tyler Goodlet	4e7ab54452	Appease `mypy`	2022-07-12 11:22:30 -04:00
Tyler Goodlet	f94b7cd991	Drop `msgpack` lib and use `msgspec` for transport	2022-07-12 10:37:13 -04:00
Tyler Goodlet	8901272854	Fix typing	2022-04-13 08:20:53 -04:00
Tyler Goodlet	80897a8f2b	Add `tractor.query_actor()` an addr looker-upper Sometimes it's handy to just have a non-`Portal` yielding way to figure out if a "service" actor is up, so add this discovery helper for that. We'll prolly just leave it undocumented for now until we figure out a longer-term/better discovery system.	2022-04-13 07:50:42 -04:00
Tyler Goodlet	f3606d5bd8	Type fixes	2022-04-12 11:48:32 -04:00
Tyler Goodlet	c322a193f2	Make `LinkedTaskChannel` trio-task-broadcastable with `.subscribe()`	2022-04-12 11:42:44 -04:00
Tyler Goodlet	46963c2e63	Don't handle `GeneratorExit` on `asyncio` tasks	2022-04-12 11:42:44 -04:00
Tyler Goodlet	9b77b8c9ee	Add more explicit `asyncio` task error logging When an `asyncio` side task errors or is cancelled we now explicitly report the traceback and task name if possible as well as the source reason for the error (some come from the `trio` side). Further, properly set any `trio` side exception (after unwrapping it from the `outcome.Error`) on the future that runs the `trio` guest run.	2022-04-12 11:42:44 -04:00
Tyler Goodlet	c30cece37a	Fix one missing import/ref	2022-02-17 13:03:37 -05:00
Tyler Goodlet	509082c935	Port to new `msgspec` error type	2022-02-17 11:55:26 -05:00
Tyler Goodlet	75bb1added	Avoid importing mp for as long as possible	2022-02-17 11:55:26 -05:00

... 3 4 5 6 7 ...

1020 Commits (2f854a3e86d898045c3bbf093e0df14e89a2b339)