tractor

Commit Graph

Author	SHA1	Message	Date
Gud Boi	03c42e1333	Drop `xfail` from `test_moc_reentry_during_teardown` The per-`ctx_key` locking fix in `f086222d` intended to resolve the teardown race reproduced by the new test suite, so the test SHOULD now pass. TLDR, it doesn't Bp Also add `collapse_eg()` to the test's ctx-manager stack so that when run with `pytest <...> --tpdb` we'll actually `pdb`-REPL the RTE when it hits (previously an assert-error). (this commit-msg was generated in some part by [`claude-code`][claude-code-gh]) [claude-code-gh]: https://github.com/anthropics/claude-code	2026-04-06 18:29:07 -04:00
Gud Boi	cab366cd65	Add xfail test for `_Cache.run_ctx` teardown race Reproduce the piker `open_cached_client('kraken')` scenario: identical `ctx_key` callers share one cached resource, and a new task re-enters during `__aexit__` — hitting `assert not resources.get()` bc `values` was popped but `resources` wasn't yet. Deats, - `test_moc_reentry_during_teardown` uses an `in_aexit` event to deterministically land in the teardown window. - marked `xfail(raises=AssertionError)` against unpatched code (fix in `9e49eddd` or wtv lands on the `maybe_open_ctx_locking` or thereafter patch branch). Also, add prompt-io log for the session. (this patch was generated in some part by [`claude-code`][claude-code-gh]) [claude-code-gh]: https://github.com/anthropics/claude-code Prompt-IO: ai/prompt-io/claude/20260406T193125Z_85f9c5d_prompt_io.md	2026-04-06 18:17:04 -04:00
Gud Boi	85f9c5df6f	Add per-`ctx_key` isolation tests for `maybe_open_context()` Add `test_per_ctx_key_resource_lifecycle` to verify that per-key user tracking correctly tears down resources independently - exercises the fix from 02b2ef18 where a global `_Cache.users` counter caused stale cache hits when the same `acm_func` was called with different kwargs. Also, add a paired `acm_with_resource()` helper `@acm` that yields its `resource_id` for per-key testing in the above suite. (this patch was generated in some part by [`claude-code`][claude-code-gh]) [claude-code-gh]: https://github.com/anthropics/claude-code Prompt-IO: ai/prompt-io/claude/20260406T172848Z_02b2ef1_prompt_io.md	2026-04-06 14:37:47 -04:00
Gud Boi	ebe9d5e4b5	Parametrize `test_resource_cache.test_open_local_sub_to_stream` Namely with multiple pre-sleep `delay`-parametrizations before either, - parent-scope cancel-calling (as originally) or, - depending on the new `cancel_by_cs: bool` suite parameter, optionally just immediately exiting from (the newly named) `maybe_cancel_outer_cs()` a checkpoint. In the latter case we ensure we don't inf sleep to avoid leaking those tasks into the `Actor._service_tn` (though we should really have a better soln for this).. Deats, - make `cs` args optional and adjust internal logic to match. - add some notes around various edge cases and issues with using the actor-service-tn as the scope by default.	2026-04-06 14:37:47 -04:00
Gud Boi	0b0c83e9da	Drop `name=__name__` from all `get_logger()` calls Use new implicit module-name detection throughout codebase to simplify logger creation and leverage auto-naming from caller mod . Main changes, - drop `name=__name__` arg from all `get_logger()` calls (across 29 modules). - update `get_console_log()` calls to include `name='tractor'` for enabling root logger in test harness and entry points; this ensures logic in `get_logger()` triggers so that all `tractor`-internal logging emits to console. - add info log msg in test `conftest.py` showing test-harness log level Also, - fix `.actor.uid` ref to `.actor.aid.uid` in `._trace`. - adjust a `._context` log msg formatting for clarity. - add TODO comments in `._addr`, `._uds` for when we mv to using `multiaddr`. - add todo for `RuntimeVars` type hint TODO in `.msg.types` (once we eventually get that all going obvi!) (this commit msg was generated in some part by [`claude-code`][claude-code-gh]) [claude-code-gh]: https://github.com/anthropics/claude-code	2026-02-11 21:04:49 -05:00
Tyler Goodlet	4ba3590450	Add `.trionics.maybe_open_context()` locking test Call it `test_lock_not_corrupted_on_fast_cancel()` and includes a detailed doc string to explain. Implemented it "cleverly" by having the target `@acm` cancel its parent nursery after a peer, cache-hitting task, is already waiting on the task mutex release.	2025-08-18 21:07:12 -04:00
Tyler Goodlet	70664b98de	Well then, I guess it just needed, a checkpoint XD Here I was thinking the bcaster (usage) maybe required a rework but, NOPE it's just bc a checkpoint was needed in the parent task owning the `tn` which spawns `get_sub_and_pull()` tasks to ensure the bg allocated `an`/portal is eventually cancel-called.. Ah well, at least i started a patch for `MsgStream.subscribe()` to make it multicast revertible.. XD Anyway, I tossed in some checks & notes related to all that unnecessary effort since I do think i'll move forward implementing it: - for the `cache_hit` case always verify that the `bcast` clone is unregistered from the common state subs after `.subscribe().__aexit__()`. - do a light check that the implicit `MsgStream._broadcaster` is always the only bcrx instance left-leaked into that state.. that is until i get the proper de-allocation/reversion from multicast -> unicast working. - put in mega detailed note about the required parent-task checkpoint.	2025-08-18 21:07:12 -04:00
Tyler Goodlet	1c425cbd22	Tool-up `test_resource_cache.test_open_local_sub_to_stream` Since I recently discovered a very subtle race-case that can sometimes cause the suite to hang, seemingly due to the `an: ActorNursery` allocated behind the `.trionics.maybe_open_context()` usage; this can result in never cancelling the 'streamer' subactor despite the `main()` timeout-guard? This led me to dig in and find that the underlying issue was 2-fold, - our `BroadcastReceiver` termination-mgmt semantics in `MsgStream.subscribe()` can result in the first subscribing task to always keep the `MsgStream._broadcaster` instance allocated; it's never `.aclose()`ed, which makes it tough to determine (and thus trace) when all subscriber-tasks are actually complete and exited-from-`.subscribe()`.. - i was shield waiting `.ipc._server.Server.wait_for_no_more_peers()` in `._runtime.async_main()`'s shutdown sequence which would then compound the issue resulting in a SIGINT-shielded hang.. the worst kind XD Actual changes here are just styling, printing, and some mucking with passing the `an`-ref up to the parent task in the root-actor where i was doing a conditional `ActorNursery.cancel()` to mk sure that was actually the problem. Presuming this is fixed the `.pause()` i left unmasked should never hit.	2025-08-18 21:07:06 -04:00
Tyler Goodlet	d6b0ddecd7	Improve bit of tooling for `test_resource_cache.py` Namely while what I was actually trying to solve was why `TransportClosed` was getting raised from `Portal.cancel_actor()` but still useful edge case auditing either way. Also opts into the `debug_mode` fixture with apprope timeout adjustment B)	2025-07-13 15:26:37 -04:00
Tyler Goodlet	4aa89bf391	Bump timeout on resource cache test a bitty bit.	2025-03-14 14:14:53 -04:00
Tyler Goodlet	21a9c47496	Parameterize over cache keying methods: kwargs and "key"	2021-12-16 18:02:03 -05:00
Tyler Goodlet	67dc0d014c	Add basic `maybe_open_context()` caching test	2021-12-16 18:02:03 -05:00
Tyler Goodlet	9b1d8bf7b0	Of course, increase the timeout for windows..	2021-12-16 18:02:03 -05:00
Tyler Goodlet	f617da6ff1	Add timeout around test and prints for guidance	2021-12-16 18:02:03 -05:00
Tyler Goodlet	4a0252baf2	Add task-cached stream test	2021-12-16 18:02:03 -05:00

15 Commits (007b45857b17aac5cb71d4cb32932c09b657b878)