Such that we handle them despite a cancellation condition. This is
almost always the case that `root_tn.cancel_scope.cancel_called` is
set by the time the `debug._maybe_enter_pm()` hits. Previously I guess
we just weren't ever actually REPL-debugging such cases? Still needs
a test obvi.
Such that when that strange (now marked xfail) case from 6348c83 shows
up, we can attempt to REPL it.
Thus far this has led to zero insight as to why the fail-after is not
raising a `TooSlowError` and instead raises the eg with the embedded
assert-error, but likely mucking with the `trio.NurseryManager.__aexit__()`
(prolly by overriding it unfortunately..) will get us somewhere??
Was failing due to the `.fail_after()` timeout being *too short* and
somehow the new interplay of that with strict-exception-groups resulting
in the `TooSlowError` never raising and instead an eg with the embedded
`AssertionError` showing up?? I still don't really get it honestly..
I've written up lengthy notes around the different `delay` settings that
can be used to see the diff outcomes, the failing case being the one
i still don't really grok and which i think is justification for `trio`
to possibly bubble inner `Cancelled`s differently?
For now i've included the original failing case as an `xfail`
parametrization which will hopefully drive a follow-up lowlevel
`trio` test in `test_trioisms`!
Namely `test_empty_mngrs_input_raises()` was failing due to
lazy-iterator use as input to `mngrs` which i guess i added support for
a while back (by it doing a `list(mngrs)` internally)? So just change it
to `gather_contexts(mngrs=())` and also tweak the timeout up to
`trio.fail_after(3)` since it appears the prior 1sec was causing
too-fast-of-a-cancellation (before the cluster fully spawned) and thus
the expected `ValueError` to never show..
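A rough sketch of the fixed test shape (the real suite also spawns the
cluster via `open_actor_cluster()`, elided here; the import path and
assertions are assumptions):

```python
import pytest
import trio
from tractor.trionics import gather_contexts  # import path assumed


def test_empty_mngrs_input_raises():
    async def main():
        # 3s (vs. the prior 1s) so we don't cancel before everything
        # (e.g. the cluster in the real test) has fully spawned.
        with trio.fail_after(3):
            async with gather_contexts(mngrs=()):
                pytest.fail('Should never enter with empty `mngrs`!')

    with pytest.raises(ValueError):
        trio.run(main)
```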
Also, mask the `tractor.trionics.collapse_eg()` usage (again?) in
`open_actor_cluster()` since it seems unnecessary.
Seems that the way the actor-nursery interacts with the
`.trionics.gather_contexts()` API on cancellation makes our
`.trionics.collapse_eg()` not work as intended?
I need to dig into how `ActorNursery.cancel()` and `.__aexit__()` might
be causing this discrepancy..
Consider this a commit-of-my-index type save for rn.
The correct ordering is to de-alloc the surrounding `service_n`
+ `trio.Event` **after** the `mng` teardown, ensuring the
`mng.__aexit__()` can never hit a ref-error if it touches either (like
if a `tn` is passed to `maybe_open_context()`)!
Shift around comments and expressions for better reading, assign
`tpt_closed` for easier introspection from the REPL during debug, oh and fix
the `MsgpackTransport.pformat()` to render '|_peers: 1' .. XD
Similar to what was just changed for `Context.repr_state`, when the
child task is cancelled but by a different "layer" of the runtime (i.e.
a `Portal.cancel_actor()` / `SIGINT`-to-process canceller) we don't
dump a traceback and instead just emit a `log.cancel()`.
Such that if the local task hasn't resolved but is `trio.Cancelled` and
a `.canceller` was set, we report an `'actor-cancelled'` from
`.repr_state: str`. Bit of formatting to avoid needless newlines too!
That is, the `target` can declare a `chan: LinkedTaskChannel` instead of
`to_trio`/`from_aio`.
To support it,
- change `.started()` -> the more appropriate `.started_nowait()` which
can be called sync from the aio child task.
- adjust the `provide_channels` assert to accept either fn sig
declaration (for now).
Still needs test(s) obvi..
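Roughly, the two accepted `target` sigs look like the below (the legacy
param names/types and the `LinkedTaskChannel` import path are
assumptions here, not verbatim from the impl):

```python
import asyncio
import trio
from tractor.to_asyncio import LinkedTaskChannel  # path assumed


# legacy sig: explicit channel ends passed when `provide_channels=True`
async def target_legacy(
    from_trio: asyncio.Queue,
    to_trio: trio.MemorySendChannel,
) -> None:
    to_trio.send_nowait('started')  # the old "started" handshake
    ...


# new sig: a single linked-task channel end
async def target_chan(
    chan: LinkedTaskChannel,
) -> None:
    chan.started_nowait('started')  # sync-callable from the aio child task
    ...
```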
Makes the newly added `test_aio_side_raises_before_started` test pass by
ensuring errors raised by any `.to_asyncio.open_channel_from()` spawned
child-`asyncio.Task` are relayed through any caught `trio.EndOfChannel`
by checking for a new `LinkedTaskChannel._closed_by_aio_task: bool`.
Impl deats,
- obvi add `LinkedTaskChannel._closed_by_aio_task: bool = False`
- in `translate_aio_errors()` always check for the new flag on EOC
conditions and in such cases set `chan._trio_to_raise = aio_err` such
that the `trio`-parent-task always raises the child's exception
directly, OW keep original EoC passthrough in place (sketched below).
- include *very* detailed per-case comments around the extended handler.
- adjust re-raising logic with a new `raise_from` where we only give the
`aio_err` priority if it's not already set as the `trio_to_raise`.
Also,
- hide the `_run_asyncio_task()` frame by def.
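For reference, the extended EoC branch amounts to something like this
standalone (illustration-only) helper; in the actual impl the logic
sits inline in `translate_aio_errors()`:

```python
import trio


def maybe_relay_aio_err(
    chan,  # a `LinkedTaskChannel`
    aio_err: BaseException | None,
    eoc: trio.EndOfChannel,
) -> None:
    '''
    Sketch: decide whether a caught EoC should be swallowed in favour
    of relaying the aio-child-task's actual error to the trio parent.

    '''
    if (
        chan._closed_by_aio_task
        and aio_err is not None
    ):
        # the aio task closed the chan (from an error/early exit):
        # stash its exc so the trio-parent-task raises it directly.
        chan._trio_to_raise = aio_err
        return

    # normal EoC: keep the original passthrough behaviour.
    raise eoc
```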
Verifying that if any exc is raised pre `chan.send_nowait()` (our
currently shite version of a `chan.started()`) then that exc is indeed
raised through on the `trio`-parent task side. This case was reproduced
from a `piker.brokers.ib` issue with a similar embedded
`.trionics.maybe_open_context()` call.
Deats,
- call the suite `test_aio_side_raises_before_started`.
- mk the `@context` simply `maybe_open_context(acm_func=open_channel_from)`
with a `target=raise_before_started` which,
- simply sleeps then immediately raises an RTE.
- expect the RTE from the aio-child-side to propagate all the way up to
the root-actor's task right up through the `trio.run()`.
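The aio-side target is roughly as below (sleep duration and error msg
are made up for illustration):

```python
import asyncio


async def raise_before_started(chan) -> None:
    '''
    Never "starts" the channel (no `.started_nowait()`/`.send_nowait()`)
    before erroring; the RTE should still reach the trio parent.

    '''
    await asyncio.sleep(0.2)  # arbitrary pre-error delay
    raise RuntimeError('Some aio-side error before the started handshake!')
```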
Such that key->value pairs can be defined which should *never be*
unmasked, where,
- the keys are exc-types which might be masked, and
- the values are exc-types which masked the equivalent key.
For example, the default includes:
- KBI->taskc: a kbi should never be unmasked from its masking
`trio.Cancelled`.
For the impl, a new `do_warn: bool` in the fn-body serves as the
primary guard for whether a warning or re-raising is necessary.
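As a sketch, the default table looks something like this (the actual
var name is an assumption):

```python
import trio

# exc-type that might be masked -> exc-type that's allowed to mask it
never_unmask: dict[type[BaseException], type[BaseException]] = {
    # a KBI should never be unmasked from its masking `trio.Cancelled`
    KeyboardInterrupt: trio.Cancelled,
}
```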
Such that the caller can be responsible for their own (nursery) scoping
as needed; for the latter fn's case we add
a `trio.Nursery.CancelStatus.encloses()` check to ensure the `tn` is
a valid parent-ish scope (see the audit sketch below).
Some deats,
- in `gather_contexts()`, mv the `try/finally` outside the nursery block
to ensure we always do the `parent_exit`.
- for `maybe_open_context()` we do a naive task-tree hierarchy audit to
ensure the provided scope is not *too* child-ish (with what APIs `trio`
gives us, see above), OW go with the old approach of using the actor's
private service nursery.
Also,
* better report `trio.Cancelled` around the cache-miss `yield`
cases and ensure we **never** unmask triggering key-errors.
* report on any stale-state with the mutex in the `finally` block.
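The "parent-ish" audit boils down to something like the below; note it
pokes at `trio` internals (`Task._cancel_status`,
`CancelStatus.encloses()`) so the exact attr access here is an
assumption:

```python
import trio


def tn_is_parentish(tn: trio.Nursery) -> bool:
    '''
    Sketch: does the provided `tn`'s cancel-status enclose the current
    task, i.e. is it a valid parent-ish scope to schedule into?

    '''
    task = trio.lowlevel.current_task()
    return tn._cancel_status.encloses(task._cancel_status)
```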
It was originally this way; I forgot to flip it back when discarding the
`except*` handler impl..
Specially handle the `exc.__cause__` case where we raise from any
detected underlying cause and OW `from None` to suppress the eg's tb.
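i.e. roughly (helper wrapping purely for illustration):

```python
def raise_collapsed(exc: BaseException) -> None:
    '''
    Sketch: prefer chaining from any detected underlying cause, OW use
    `from None` to suppress the (collapsed) eg's tb noise.

    '''
    if (cause := exc.__cause__) is not None:
        raise exc from cause
    raise exc from None
```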
Since it's merely a local-file-sys subdirectory and there should be no
reason file creation would conflict with other bind spaces.
Also add 2 test suites to match,
- `tests/ipc/test_each_tpt::test_uds_bindspace_created_implicitly` to
verify the dir gets created when it DNE.
- `..test_uds_double_listen_raises_connerr` to ensure a double bind
raises a `ConnectionError` from the src `OSError`.
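For reference, the behaviour under test is roughly the following
(a standalone sketch, not the actual `tractor.ipc` UDS code):

```python
import socket
from pathlib import Path


def bind_uds(sockpath: Path) -> socket.socket:
    # implicitly create the (local-filesys) bind space dir when it DNE
    sockpath.parent.mkdir(parents=True, exist_ok=True)

    sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    try:
        sock.bind(str(sockpath))
    except OSError as src_err:
        # a second bind/listen on the same path should surface as
        # a `ConnectionError` chained from the src `OSError`.
        raise ConnectionError(
            f'Address already bound?\n{sockpath}'
        ) from src_err

    return sock
```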
Including all caller usage throughout. Moving to a non-`except*` impl
means it's never needed as a signal from the caller - we can just catch
the beg outright (like we should have always been doing)..
Since `maybe_raise_from_masking_exc()` now handles the general case,
instead explicitly pass `unmask_from=(Cancelled,)` (yes i know it's the
current default).
Also add some extra `TransportClosed`-handling for some
IPC-disconnects-during-teardown edge cases,
- in `._invoke()` around the `await chan.send(return_msg)` where we
suppress if the underlying chan already disconnected (sketched below).
- add a disjoint handler in `_errors_relayed_via_ipc()` which just
reports the exc but raises it through (as prior).
* I originally thought it needed to be handled specially (to avoid
being crash handled) but turns out that isn't necessary?
* Hence the masked-out `debug_filter` / guard expression around the
`await debug._maybe_enter_pm()` line.
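The `._invoke()`-side guard is roughly (the `TransportClosed` import
path is an assumption):

```python
from tractor._exceptions import TransportClosed  # path assumed


async def try_send_return(chan, return_msg, log) -> None:
    '''
    Sketch: suppress the final send when the underlying IPC chan
    already disconnected during teardown; there's nothing left to
    relay to.

    '''
    try:
        await chan.send(return_msg)
    except TransportClosed:
        log.warning(
            'IPC chan disconnected before the return msg could be sent?'
        )
```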
That is, from `maybe_raise_from_masking_exc()`; this minimizes us to
a single `except BaseException` block with logic branching for the beg
vs. `unmask_from` exc cases.
Also,
- raise val-err when `unmask_from` is not a `tuple`.
- tweak the exc-note warning format.
- drop all pausing from dev work.
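The overall single-handler shape is roughly as below (the actual
masked-exc detection is elided with `...`):

```python
from contextlib import asynccontextmanager as acm
import trio


@acm
async def maybe_raise_from_masking_exc(
    unmask_from: tuple[type[BaseException], ...] = (trio.Cancelled,),
):
    if not isinstance(unmask_from, tuple):
        raise ValueError(
            '`unmask_from` must be a `tuple[type[BaseException]]`, '
            f'got: {unmask_from!r}'
        )
    try:
        yield
    except BaseException as _exc:
        if isinstance(_exc, BaseExceptionGroup):
            ...  # beg case: scan the group for masked sub-excs
        elif isinstance(_exc, unmask_from):
            ...  # unmask-from case: inspect `.__context__` for a masked exc
        raise
```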
Since it turns out the semantics are basically the inverse of normal
`except` (particularly for re-raising) which is hard to get right, and
bc it's a lot easier to just delegate to what `trio` already has behind
the `strict_exception_groups=False` setting, Bp
I added a rant here which will likely get removed shortly, but i think
going forward recommending against use of `except*` is prudent for
anything low level enough in the runtime (like trying to filter begs).
Dirty deats,
- copy `trio._core._run.collapse_exception_group()` to here with only
a slight mod to remove the notes check and tb concatting for the
collapse case.
- rename `maybe_collapse_eg()` -> `get_collapsed_eg()` and delegate it
directly to the former `trio` fn; return `None` when it returns the
same beg without collapse.
- simplify our own `collapse_eg()` to either raise the collapsed `exc`
or original `beg`.
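In (over-)simplified form the pair looks like this; note the real
`get_collapsed_eg()` delegates to the copied `trio` fn (which also
respects the strict-eg collapse marker), so this is just a sketch of
the shape:

```python
from contextlib import asynccontextmanager as acm


def get_collapsed_eg(beg: BaseExceptionGroup) -> BaseException | None:
    '''
    Sketch: collapse (nested) single-exception groups, returning `None`
    when no collapse is possible.

    '''
    if len(beg.exceptions) == 1:
        exc = beg.exceptions[0]
        if isinstance(exc, BaseExceptionGroup):
            return get_collapsed_eg(exc) or exc
        return exc
    return None


@acm
async def collapse_eg():
    '''
    Sketch: raise the collapsed `exc` (chained from the original `beg`)
    or just re-raise the original `beg` when it can't be collapsed.

    '''
    try:
        yield
    except BaseExceptionGroup as beg:
        if (exc := get_collapsed_eg(beg)) is not None:
            raise exc from beg
        raise
```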
Using `.get_stream_addrs()` such that we always (*can*) assign the peer
end's PID in the `._raddr`.
Also factor common `ConnectionError` re-raising into
a `_reraise_as_connerr()`-@cm.
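The factored helper is presumably along these lines (signature and
message assumed):

```python
from contextlib import contextmanager as cm


@cm
def _reraise_as_connerr(src_excs: tuple[type[Exception], ...] = (OSError,)):
    '''
    Sketch: convert low-level stream/socket errors raised in the block
    into a `ConnectionError` chained from the original src exc.

    '''
    try:
        yield
    except src_excs as src_err:
        raise ConnectionError(
            f'IPC connection failure!?\n{src_err!r}'
        ) from src_err
```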
Call it `test_lock_not_corrupted_on_fast_cancel()` and include
a detailed doc string to explain. Implemented it "cleverly" by having
the target `@acm` cancel its parent nursery after a peer cache-hitting
task is already waiting on the task mutex release.
Since the `await service_n.start()` on key-err can be cancel-masked
(checkpoint interrupted before `_Cache.run_ctx` completes), we need to
always `lock.release()` in a `finally` to avoid lock-owner-state
corruption and/or inf-hangs in peer cache-hitting tasks.
Deats,
- add a `try/except/finally` around the key-err triggered cache-miss
`service_n.start(_Cache.run_ctx, ..)` call, reporting on any taskc
and always `finally` unlocking (sketched below).
- fill out some log msg content and use `.debug()` level.
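A stripped-down rendition of that guard (names per the notes above,
everything else simplified):

```python
import trio


async def run_cache_miss_guarded(
    lock: trio.Lock,
    service_n: trio.Nursery,
    run_ctx,  # the `_Cache.run_ctx`-style task fn
) -> None:
    await lock.acquire()  # (in the real code this happens further up)
    try:
        await service_n.start(run_ctx)
    except trio.Cancelled:
        # the `.start()` checkpoint was interrupted before `run_ctx`
        # completed; report it but let the taskc propagate.
        print('cache-miss task cancelled before `run_ctx` completed!')
        raise
    finally:
        # ALWAYS release, else the lock-owner state is corrupted and
        # peer cache-hitting tasks can hang forever on acquire.
        lock.release()
```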
Here I was thinking the bcaster (usage) maybe required a rework but,
NOPE it's just bc a checkpoint was needed in the parent task owning the
`tn` which spawns `get_sub_and_pull()` tasks to ensure the bg allocated
`an`/portal is eventually cancel-called..
Ah well, at least i started a patch for `MsgStream.subscribe()` to make
it multicast revertible.. XD
Anyway, I tossed in some checks & notes related to all that unnecessary
effort since I do think i'll move forward implementing it:
- for the `cache_hit` case always verify that the `bcast` clone is
unregistered from the common state subs after
`.subscribe().__aexit__()`.
- do a light check that the implicit `MsgStream._broadcaster` is always
the only bcrx instance left-leaked into that state.. that is until
i get the proper de-allocation/reversion from multicast -> unicast
working.
- put in mega detailed note about the required parent-task checkpoint.
Been meaning to do this forever and a recent test hang finally drove me
to it Bp
Like it sounds, adopt on `ActorNursery` the "cancel-status" properties
already used on our `Context` and derived from `trio.CancelScope`:
- add new private `._cancel_called` (set in the head of `.cancel()`)
& `._cancelled_caught` (set in the tail) instance vars with matching
read-only `@properties`.
- drop the instance-var and instead delegate a `.cancelled: bool`
property to `._cancel_called` and add a usage deprecation warning
(since removing it breaks a buncha tests).
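Skeletal form (heavily trimmed, only the new state + props shown):

```python
import warnings


class ActorNursery:
    # new private cancel-status state
    _cancel_called: bool = False
    _cancelled_caught: bool = False

    @property
    def cancel_called(self) -> bool:
        return self._cancel_called

    @property
    def cancelled_caught(self) -> bool:
        return self._cancelled_caught

    # deprecated alias kept around since removing it breaks a buncha tests
    @property
    def cancelled(self) -> bool:
        warnings.warn(
            '`ActorNursery.cancelled` is deprecated, use `.cancel_called` '
            'or `.cancelled_caught` instead!',
            DeprecationWarning,
            stacklevel=2,
        )
        return self._cancel_called

    async def cancel(self) -> None:
        self._cancel_called = True  # set in the head
        # ... actually cancel/wait-on all subactors ...
        self._cancelled_caught = True  # set in the tail
```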
As mentioned in the prior testing commit, it can cause the worst kind of
hangs, the SIGINT ignoring kind.. Pretty sure there was never any reason
outside some esoteric multi-actor debugging case, and pretty sure that
already was solved?
Since I recently discovered a very subtle race-case that can sometimes
cause the suite to hang, seemingly due to the `an: ActorNursery` being
allocated *behind* the `.trionics.maybe_open_context()` usage; this can
result in never cancelling the 'streamer' subactor despite the `main()`
timeout-guard?
This led me to dig in and find that the underlying issue was 2-fold,
- our `BroadcastReceiver` termination-mgmt semantics in
`MsgStream.subscribe()` can result in the first subscribing task
always keeping the `MsgStream._broadcaster` instance allocated; it's
never `.aclose()`ed, which makes it tough to determine (and thus
trace) when all subscriber-tasks are actually complete and
exited-from-`.subscribe()`..
- i was shield waiting `.ipc._server.Server.wait_for_no_more_peers()` in
`._runtime.async_main()`'s shutdown sequence which would then compound
the issue resulting in a SIGINT-shielded hang.. the worst kind XD
Actual changes here are just styling, printing, and some mucking with
passing the `an`-ref up to the parent task in the root-actor where i was
doing a conditional `ActorNursery.cancel()` to mk sure that was actually
the problem. Presuming this is fixed, the `.pause()` i left unmasked
should never hit.
- drop the (never/un)used `.get_chans()`.
- add #TODO for factoring many methods into a new `.rpc`-subsys/pkg
primitive, like an `RPCMngr/Server` type eventually.
- add todo to maybe mv `.get_parent()` elsewhere?
- move masked `._hard_mofo_kill()` to bottom.
Mostly just multi-line code styling again: always putting standalone
`'f\n'` on separate LOC so it reads like it renders to console. Oh and
a level drop to `.runtime()` for rx-msg reports.