tractor

Commit Graph

Author	SHA1	Message	Date
Tyler Goodlet	f1b242f913	Block SIGINT handling while in the debugger This seems to prevent a certain class of bugs to do with the root actor cancelling local tasks and getting into deadlock while children are trying to acquire the tty lock. I'm not sure it's the best idea yet since you're pretty much guaranteed to get "stuck" if a child activates the debugger after the root has been cancelled (at least "stuck" in terms of SIGINT being ignored). That kinda race condition seems to still exist somehow: a child can "beat" the root to activating the tty lock and the parent is stuck waiting on the child to terminate via its nursery.	2020-09-28 08:54:21 -04:00
Tyler Goodlet	9e1d9a8ce1	Add an internal context stack This aids with tearing down resources after the crash handling and debugger have completed. Leaving this internal for now but should eventually get a public convenience function like `tractor.context_stack()`.	2020-09-24 10:12:33 -04:00
Tyler Goodlet	09daba4c9c	Explicitly handle `debug_mode` flag correctly	2020-09-24 10:12:33 -04:00
Tyler Goodlet	8b6e9f5530	Port to new debug api, set `_is_root` state flag on startup	2020-09-24 10:12:33 -04:00
Tyler Goodlet	150179bfe4	Support entering post mortem on crashes in root actor	2020-09-24 10:12:33 -04:00
Tyler Goodlet	291ecec070	Maybe not sticky by default	2020-09-24 10:12:33 -04:00
Tyler Goodlet	bd157e05ef	Port to service nursery	2020-09-24 10:12:33 -04:00
Tyler Goodlet	fd5fb9241a	Sparsen some lines	2020-09-24 10:12:33 -04:00
Tyler Goodlet	ebb21b9ba3	Support re-entrant breakpoints Keep an actor local (bool) flag which determines if there is already a running debugger instance for the current process. If another task tries to enter in this case, simply ignore it since allowing entry may result in a deadlock where the new task will be sync waiting on the parent stdio lock (a case that will never arrive due to the current debugger's active use of it). In the future we may want to allow FIFO queueing of local tasks where instead of ignoring re-entrant breakpoints we allow tasks to async wait for debugger release, though not sure the implications of that since you'd likely want to support switching the debugger to the new task and that could cause deadlocks where tasks are inter-dependent. It may be more sane to just error on multiple breakpoint requests within an actor.	2020-09-24 10:12:33 -04:00
Tyler Goodlet	f9ef3fc5de	Cleanups and more comments	2020-09-24 10:12:33 -04:00
Tyler Goodlet	68773d51fd	Always expose the debug module	2020-09-24 10:12:33 -04:00
Tyler Goodlet	abaa2f5da0	Drop uneeded `parent_chan_cs()` cancel call	2020-09-24 10:12:33 -04:00
Tyler Goodlet	efd7095cf8	Add pdbpp as dep	2020-09-24 10:12:32 -04:00
Tyler Goodlet	f7cd2be039	Play with re-entrant trace	2020-09-24 10:12:10 -04:00
Tyler Goodlet	8eb9a742dd	Add multi-process debugging support using `pdbpp` This is the first step in addressing #113 and the initial support of #130. Basically this allows (sub)processes to engage the `pdbpp` debug machinery which read/writes the root actor's tty but only in a FIFO semaphored way such that no two processes are using it simultaneously. That means you can have multiple actors enter a trace or crash and run the debugger in a sensible way without clobbering each other's access to stdio. It required adding some "tear down hooks" to a custom `pdbpp.Pdb` type such that we release a child's lock on the parent on debugger exit (in this case when either of the "continue" or "quit" commands are issued to the debugger console). There's some code left commented in anticipation of full support for issue #130 where we're need to actually capture and feed stdin to the target (remote) actor which won't necessarily being running on the same host.	2020-09-24 10:12:10 -04:00
Tyler Goodlet	e7ee0fec34	Pass a copy of the expected exposed modules	2020-09-24 10:12:10 -04:00
Tyler Goodlet	1d1c881fd7	WIP debugging test script	2020-09-24 10:12:10 -04:00
Tyler Goodlet	b06d4b023e	Add support for "debug mode" When enabled a crashed actor will connect to the parent with `pdb` in post mortem mode.	2020-09-24 10:12:10 -04:00
Tyler Goodlet	b11e91375c	Initial attempt at multi-actor debugging Allow entering and attaching to a `pdb` instance in a child process. The current hackery is to have the child make an rpc to the parent and ask it to hijack stdin, once complete the child enters a `pdb` blocking method. The parent then relays all stdin input to the child thus controlling the "remote" debugger. A few things were added to accomplish this: - tracking the mapping of subactors to their parent nurseries - in the root actor, cancelling all nurseries under the root `trio` task on cancellation (i.e. `Actor.cancel()`) - pass a "runtime vars" map down the actor tree for propagating global state	2020-09-24 10:12:10 -04:00
Tyler Goodlet	8c97f7bbb3	Create runtime variables	2020-09-24 10:12:10 -04:00
goodboy	196cf14211	Merge pull request #152 from guilledk/gh_actions Add Github Actions CI support	2020-09-03 12:22:27 -04:00
Guillermo Rodriguez	5e3ce765dd	Drop mac support, will continue the experiment on another branch	2020-09-03 10:41:09 -03:00
Guillermo Rodriguez	ad68ff665f	Missing a platform.system() check	2020-09-03 09:57:04 -03:00
Guillermo Rodriguez	c993e36e95	Simplified CI detection	2020-09-03 09:44:24 -03:00
Guillermo Rodriguez	03e5852acf	Added some missing CI integration pieces	2020-09-02 13:19:42 -03:00
Guillermo Rodriguez	4d7a16b304	Lower timeout and added spawn_backend to name of jobs	2020-09-02 11:31:10 -03:00
Guillermo Rodriguez	406ded7311	Experimental mac testing	2020-09-02 11:18:12 -03:00
Guillermo Rodriguez	3595317b00	Removed --disable-vnet parameter to pytest that was left after experimenting with this file in the multihost testing branch	2020-09-02 11:16:05 -03:00
Guillermo Rodriguez	865e932107	Initial commit	2020-09-02 11:12:08 -03:00
goodboy	1cbc098721	Merge pull request #150 from guilledk/typelog_sphinx_theme Install docs requirements in travis tests	2020-08-31 11:39:40 -04:00
Guillermo Rodriguez	f05db6841d	Install docs requirements in travis tests!	2020-08-31 12:33:25 -03:00
goodboy	440dae4859	Merge pull request #137 from guilledk/typelog_sphinx_theme Changed docs theme to typelog	2020-08-31 11:22:19 -04:00
Guillermo Rodriguez	a6f7b0df7c	Small grammar fix	2020-08-31 12:17:59 -03:00
Guillermo Rodriguez	1bee78837b	Added logo, fixed github links and grammar issues	2020-08-31 11:49:14 -03:00
Guillermo Rodriguez	13de7991d9	Add link to trio process spawning docs	2020-08-31 10:08:32 -03:00
Guillermo Rodriguez	3536e73df7	Changed docs theme to typelog, also removed all mentions to trio-run-in-process.	2020-08-31 10:08:04 -03:00
goodboy	4da16325f3	Merge pull request #144 from goodboy/dereg_on_channel_aclose Fix for dereg failure on manual stream close leading to an internal nursery composition rework.	2020-08-13 13:56:47 -04:00
Tyler Goodlet	451170bb63	Pass explicit kwargs to new discovery test funcs	2020-08-13 13:26:08 -04:00
Tyler Goodlet	ec5d443ee5	Always log actor errors	2020-08-13 11:55:22 -04:00
Tyler Goodlet	863a4b7933	Update copyright date	2020-08-13 11:55:03 -04:00
Tyler Goodlet	0c8dcd0ec5	Use allocated arbiter port in local reg test	2020-08-13 11:54:37 -04:00
Tyler Goodlet	1ae0efb033	Make rpc_module_paths a list	2020-08-13 11:53:45 -04:00
Tyler Goodlet	8a995beb6a	Docs fixes	2020-08-08 22:29:57 -04:00
Tyler Goodlet	292513b353	Module define default accept addr	2020-08-08 20:58:04 -04:00
Tyler Goodlet	b3eba00c3a	Appease the great mypy	2020-08-08 20:57:43 -04:00
Tyler Goodlet	42be410076	Handle mp accept_addr	2020-08-08 20:27:43 -04:00
Tyler Goodlet	acd5b80f4c	Add close channel test with remote arbiter	2020-08-08 15:17:04 -04:00
Tyler Goodlet	c821690834	Actor cancellation is now more latent; loosen timeing	2020-08-08 15:16:10 -04:00
Tyler Goodlet	7f74182a8a	Never allow more then info logging in daemon; causes blocking	2020-08-08 15:15:43 -04:00
Tyler Goodlet	8477d21499	Restructure actor runtime nursery scoping In an effort acquire more deterministic actor cancellation, this adds a clearer and more resilient (whilst possibly a bit slower) internal nursery structure with explicit semantics for clarifying the task-scope shutdown sequence. Namely, on cancellation, the explicit steps are now: - cancel all currently running rpc tasks and wait for them to complete - cancel the channel server and wait for it to complete - cancel the msg loop for the channel with the immediate parent - de-register with arbiter if possible - wait on remaining connections to release - exit process To accomplish this add a new nursery called the "service nursery" which spawns all rpc tasks instead of using the "root nursery". The root is now used solely for async launching the msg loop for the primary channel with the parent such that it is (nearly) the last thing torn down on cancellation. In the future it should also be possible to have `self.cancel()` return a result to the parent once the runtime is sure that the rest of the shutdown is atomic; this would allow for a true unbounded shield in `Portal.cancel_actor()`. This will likely require that the error handling blocks in `Actor._async_main()` are moved "inside" the root nursery block such that the msg loop with the parent truly is the last thing to terminate.	2020-08-08 14:55:41 -04:00

1 2 3 4 5 ...

451 Commits (f1b242f913b9bf3d84fbf6b8e9d7d154bb3ef9d5) All Branches Search

451 Commits (f1b242f913b9bf3d84fbf6b8e9d7d154bb3ef9d5)

All Branches