piker

Commit Graph

Author	SHA1	Message	Date
Tyler Goodlet	34dd6ffc22	Add a configurable timeout around backend live feed startup For now make it a larger value but ideally in the long run we can tune it to specific backends and expose it in the config(s).	2023-06-27 13:41:47 -04:00
Tyler Goodlet	8233d12afb	Detect and fill time gaps in tsdb history For now, just detect and fill in gaps (via fresh backend queries) in the shm buffer but eventually i'm pretty sure we can just write these direct to the parquet file as well. Use the new `.data._timeseries.detect_null_time_gap()` to find and fill in the `ShmArray` index range, re-check it and enter a prompt if it didn't totally fill. Also, - do a massive cleanup and removal of all unused/commented code. - drop the duplicate frames tracking, don't think we need it after removing multi-frame concurrent queries. - change backfill loop variable `end_dt` -> `last_start_dt` which is more semantically correct. - fix logic to backfill any missing sub-sequence portion for any frame query that overruns the shm buffer prependable space by detecting the available rows left to insert and only push those. - add a new `shm_push_in_between()` helper to match.	2023-06-27 13:41:47 -04:00
Tyler Goodlet	f25248c871	Add `.data._timeseries` utility mod Org all the new (time) gap detection routines here and also move in the `slice_from_time()` epoch -> index converter routine from `._pathops` B)	2023-06-27 13:41:47 -04:00
Tyler Goodlet	0dcfcea6ee	Finally get partial backfills after tsdb load workinnn It took a little while (and a lot of commenting out of old no longer needed code) but, this gets tsdb (from parquet file) loading before final backfilling from the most recent history frame until the most recent tsdb time stamp! More or less all the convoluted concurrency shit we had for coping with `marketstore` IPC junk is no longer needed, particularly all the query size limits and accompanying load loops.. The recent frame loading technique/order has now changed though since we'd like to show charts asap once tsdb history loads. The new load sequence is as follows: - load mr (most recent) frame from backend. - load existing history (one shot) from the "tsdb" aka parquet files with `polars`. - backfill the gap part from the mr frame back to the tsdb start incrementally by making (hacky) `ShmArray.push(start=<blah>)` calls and not updating the `._first.value` while doing it XD Dirtier deatz: - make `tsdb_backfill()` run per timeframe in a separate task. - drop all the loop through timeframes and insert `dts_per_tf` crap. - only spawn a subtask for the `start_backfill()` call which in turn only does the gap backfilling as mentioned above. - mask out all the code related to being limited to certain query sizes (over gRPC) as was restricted by marketstore.. not gonna go through what all of that was since it's probably getting deleted in a follow up commit. - buncha off-by-one tweaks to do with backfilling the gap from mr frame to tsdb start.. mostly tinkered it to get it all right but seems to be working correctly B) - still use the `broadcast_all()` msg stuff when doing the gap backfill though don't have it really working yet on the UI side (since previously we were relying on the shm first/last values.. so this will be "coming soon" :)	2023-06-27 13:41:47 -04:00
Tyler Goodlet	7a5c43d01a	Support injecting a `info: dict` to `Sampler.broadcast_all()` calls	2023-06-27 13:41:47 -04:00
Tyler Goodlet	6dc3ed8d6a	Expose a `force_reformat: bool` up through graphics stack	2023-06-27 13:41:47 -04:00
Tyler Goodlet	4f4860cfb0	Update shm.push() type sig style	2023-06-27 13:41:47 -04:00
Tyler Goodlet	1e683a4b91	Another guard around sampling subscriber popped race..	2023-06-27 13:41:47 -04:00
Tyler Goodlet	c52e889fe5	First draft history loading rework It was a concurrency-hack mess somewhat due to all sorts of limitations imposed by marketstore (query size limits, strange datetime/timestamp errors, slow table loads for large queries..) and we can drastically simplify. There's still some issues with getting new backfills (not yet in storage) correctly prepended: there's sometimes little gaps due to shm races when reading history indexing vs. when the live-feed startup finishes. We generally need tests for all this and likely a better rework of the feed layer's init such that we're showing history in chart afap instead of waiting on backfills or the live feed to come up. Much more to come B)	2023-06-27 13:41:47 -04:00
Tyler Goodlet	0ba3c798d7	Drop `bar_wap` from default ohlc field set Turns out no backend (including kraken) requires it and really this kinda of measure should be implemented and recorded from our fsp layer instead of (hackily) sometimes expecting it to be in "source data".	2023-06-27 13:41:47 -04:00
Tyler Goodlet	af64152640	.data.history: update to new naming -> `._source.def_iohlcv_fields` -> `.storage.StorageClient`	2023-06-27 13:41:47 -04:00
Tyler Goodlet	bf21d2e329	Rename default OHLCV `np.dtype` descriptions Use `def_iohlcv_fields` for a name and instead of copying and inserting the index field pop it for the non-index version. Drop creating `np.dtype()` instances since `numpy`'s apis accept both input forms so this is simpler on our end.	2023-06-27 13:41:47 -04:00
Tyler Goodlet	e82538eded	.data: export ohlc dtypes at top level	2023-06-27 13:41:47 -04:00
Tyler Goodlet	1ec9b0565f	Move `.data.cli` to `.storage.cli`	2023-06-27 13:41:47 -04:00
Tyler Goodlet	7ab97fb21d	Add marketstore client as storage-backend module To kick off our (tsdb) storage backends this adds our first implementing a new `Storage(Protocol)` client interface. Going foward, the top level `.storage` pkg-module will now expose backend agnostic APIs and helpers whilst specific backend implementations will adhere to that middle-ware layer. Deats: - add `.storage.marketstore.Storage` as the first client implementation, moving all needed (import) dependencies out from `.service.marketstore` as well as `.ohlc_key_map` and `get_client()`. - move root `conf.toml` loading from `.data.history` into `.storage.__init__.open_storage_client()` which now takes in a `name: str` and does all the work of loading the correct backend module, its config, and determining if a service-instance can be contacted and a client loaded; in the case where this fails we raise a new `StorageConnectionError`. - add a new `.storage.get_storagemod()` just like we have for brokers. - make `open_storage_client()` also return the backend module such that the history-data layer can make backend specific calls as needed (eg. ohlc_key_map). - fall back to a basic non-tsdb backfill when `open_storage_client()` raises the new connection error.	2023-06-27 13:41:47 -04:00
Tyler Goodlet	29211b200d	Start `piker.storage` subsys: cross-(ts)db middlewares The plan is to offer multiple tsdb and other storage backends (for a variety of use cases) and expose them similarly to how we do for broker and data providers B)	2023-06-27 13:41:47 -04:00
Tyler Goodlet	ae8358a5e7	Tidy up unused imports and doc string	2023-06-27 13:32:18 -04:00
Tyler Goodlet	00a51c0288	Use new `msgspec.structs` api for `.typecast()`	2023-06-27 13:26:52 -04:00
Tyler Goodlet	994564f923	Just warn-print when annots are str values?	2023-06-27 13:26:52 -04:00
Tyler Goodlet	12172cc5cd	Make `.data.types.Struct.typecast()` work via type lookup from `builtins`	2023-06-27 13:26:52 -04:00
Tyler Goodlet	9c80969fd5	.data.validate: add missing endpoint warnings	2023-05-25 16:01:21 -04:00
Tyler Goodlet	f0a346dcc3	Some linting fixes after trying out `ruff`	2023-05-24 17:25:23 -04:00
Tyler Goodlet	1b577eebf6	Change over the UI layer to use `MktPair` Including changing to `LinkedSplits.mkt: MktPair` and adding an explicit setter method for setting it and being sure that nothing breaks in the display system init! For this commit we leave in warning access to `LinkedSplits.symbol` but will remove in following commit.	2023-05-24 15:30:17 -04:00
Tyler Goodlet	bd8e4760d5	Port everything strictly to `Position.mkt` and `Flume.mkt`	2023-05-24 12:16:28 -04:00
Tyler Goodlet	8e97814c1f	Add "no vlm" indication to `FeedInit` Stash it for now in the (now mutable by default) `.shm_write_opts` and have the new `Flume._has_vlm: bool` (only set to false internally by feed layer) which can be read via new public `.has_vlm()` predicate. Move out the old `.ui/_fsp` helper logic to this flume method.	2023-05-24 08:25:14 -04:00
Tyler Goodlet	7f246697b4	Remove remaining `fqsn` usage from code base minus backward compats	2023-05-23 14:16:02 -04:00
Tyler Goodlet	660a94d610	Don't expect `conf.toml`'s network section For testing this is particularly true until we offer a template with whatever (likely localhost) settings planned to ship.	2023-05-22 11:54:36 -04:00
Tyler Goodlet	e4e4cacef3	.data.feed: Less stringency with fqme matching `Flume.mkt.fqme` might not be exactly the same as the local version now since we've had to add some hacks to certain backends (cough ib) to handle `MktPair.src` not being set as an `Asset` (yet).	2023-05-22 11:52:36 -04:00
Tyler Goodlet	907eaa68cb	Pass `mkt: MktPair` to `.open_history_client()` Since porting all backends to the new `FeedInit` + `MktPair` + `Asset` style init, we can now just directly pass a `MktPair` instance to the history endpoint(s) since it's always called after the live feed `.stream_quotes()` ep B) This has a lot of benefits including allowing brokerd backends to have more flexible, pre-processed market endpoint meta-data that piker has already validated; makes handling special cases in much more straight forward as well such as forex pairs from legacy brokers XD First pass changes all crypto backends to expect this new input, ib will come next after handling said special cases..	2023-05-17 16:52:15 -04:00
Tyler Goodlet	12bfabf056	Expose `.accounting.unpack_fqme()`	2023-05-17 16:43:31 -04:00
Tyler Goodlet	ae049eb84f	Pass and use `MktPair` throughout history routines Previously we were passing the `fqme: str` which isn't as extensive nor were we able to pass `MktPair` direct to backend history manager-loading routines (which should be able to rely on always receiving it since currently `stream_quotes()` is always called first for setup). This also starts a slight bit of configuration oriented tsdb info loading (via a new `conf.toml`) such that a user can decide to host their (marketstore) db on a remote host and our container spawning and client code will do the right startup automatically based on the config. \|-> Related to this I've added some comments about doing storage backend module loading which should get actually written out as part of patches coming in #486 (or something related). Don't allow overruns again in history context since it seems it was never a problem?	2023-05-17 10:19:14 -04:00
Tyler Goodlet	b096ee3b7a	Make `FeedInit.shm_write_opts` an empty dict by default	2023-05-16 16:30:30 -04:00
Tyler Goodlet	cfb125beef	`.data.feed`: finally solve startup overruns issue We need to allow overruns during the async multi-broker context spawning init bc some backends might take longer then others to setup (eg. binance vs. kucoin) and result in some context (stream) being overrun by the time we get to the `.open_stream()` phase. Ideally, we can maybe adjust the concurrent setup to be more of a task-per-provider style to avoid this in the future - which would also in theory result in more-immediate per-provider setup in terms showing ready feeds asap. Also, does a bunch of renaming from fqsn -> fqme and drops the lower casing of input symbols instead expecting the caller to know what the data backend it's requesting is going to be able to handle in terms of symbology.	2023-05-13 17:35:46 -04:00
Tyler Goodlet	df96155057	Always allow overruns in sampler context Requires https://github.com/goodboy/tractor/pull/357. Avoid overruns when doing concurrent live feed init over multiple brokers.	2023-05-13 14:06:27 -04:00
Tyler Goodlet	361fc4645c	Drop passing `loglevel` to `stream_quotes()`, level is set when actor spawns	2023-05-09 18:28:51 -04:00
Tyler Goodlet	88f3912b2d	test_ems: doc out some remaining suites	2023-05-09 14:49:46 -04:00
Tyler Goodlet	038b20d13a	wsbs: increase msg rx timeout to 16 secs	2023-05-09 14:49:46 -04:00
Tyler Goodlet	c415bd1ee1	If backend does not provide `bs_mktid`, use the `bs_fqme`	2023-05-09 14:49:46 -04:00
Tyler Goodlet	226c3364c3	Smh, handle `fixture==None` case..	2023-05-09 14:49:46 -04:00
Tyler Goodlet	7a3bce3f33	.data._web_bs: add client module name to log msgs	2023-05-09 14:49:46 -04:00
Tyler Goodlet	0b43e0aa8c	Try having `brokerd` eps defined in `.brokers._daemon` Since it's a bit weird having service specific implementation details inside the general service `._daemon` mod, and since i'd mentioned trying this re-org; let's do it B) Requires enabling the new mod in both `pikerd` and `brokerd` and obviously a bit more runtime-loading of the service modules in the `brokerd` service eps to avoid import cycles. Also moved `_setup_persistent_brokerd()` into the new mod since the naming would place it there even though the implementation really wouldn't (longer run) since we want to split up `.data.feed` layer backend-invoked eps into a separate actor eventually from the "actual" `brokerd` which will be the actor running only the trade control eps (eg. trades_dialogue()` and friends).	2023-05-09 14:49:26 -04:00
Tyler Goodlet	59743b7b73	Rework `NoBsWs` to avoid agen/`trio` incompatibility `trio`'s internals don't allow for async generator (and thus by consequence dynamic reset of async exit stacks containing `@acm`s) interleaving since doing so corrupts the cancel-scope stack. See details in: - https://github.com/python-trio/trio/issues/638 - https://trio-util.readthedocs.io/en/latest/#trio_util.trio_async_generator - `trio._core._run.MISNESTING_ADVICE` We originally tried to address this using `@trio_util.trio_async_generator` in backend streaming code but for whatever reason stopped working recently (at least for me) and it's more or less implemented the same way as this patch but with more layers and an extra dep. I also don't want us to have to address this problem again if/when that lib isn't able to keep up to date with wtv `trio` is doing.. So instead this is a complete rewrite of the conc design of our auto-reconnect ws API to move all reset logic and msg relay into a bg task which is respawned on reset-requiring events: user spec-ed msg recv latency, network errors, roaming events. Deatz: - drop all usage of `AsyncExitStack` and no longer require client code to (hackily) call `NoBsWs._connect()` on msg latency conditions, intead this is all done behind the scenes and the user can instead pass in a `msg_recv_timeout: float`. - massively simplify impl of `NoBsWs` and move all reset logic into a new `_reconnect_forever()` task. - offer use of `reset_after: int` a count value that determines how many `msg_recv_timeout` events are allowed to occur before reconnecting the entire ws from scratch again.	2023-05-09 14:49:26 -04:00
Tyler Goodlet	9d04accf2e	Factor out all history mgmt-logic into a new `.data.history`	2023-05-09 14:49:26 -04:00
Tyler Goodlet	4131ff1152	Rename `bs_mktid` -> `bs_fqme` and drop (some) `fqsn`s Since we have made `MktPair.bs_mktid` mean something else now, change all the feed setup var names to instead be more representative of the actual value: `bs_fqme: str` and use the new `MktPair.bs_fqme` where necessary.	2023-05-09 14:49:26 -04:00
Tyler Goodlet	765b8f8e5c	Support both input msg-sequence types The legacy version was a `dict` of `dicts` vs. now we want to be handed a `list[FeedInit]`; process both in a factored way. Drop `FeedInit.bs_mktid` since it's already defined on `.mkt.bs_mktid` and we don't really need it top level.	2023-05-09 14:49:26 -04:00
Tyler Goodlet	4129d693be	Add `.data.validate` checker for live feed layer More or less a replacement for what @guilledk did with the initial attempt at a "broker check" type script a while back except in this case we're going to always run this validation routine and it now uses a new `FeedInit` struct to ensure backends are delivering the right schema-ed data during startup. Also allows us to stick deprecation warnings / and or strict API compat errors all in one spot (at least for live feeds). Factors out a bunch of `MktPair` related adapter-logic into a new `.validate.valiate_backend()` which warns to the backend implementer via log msgs all the problems outstanding. Ideally we do our backend module endpoint scan-and-complain regarding missing feature support from here as well (eg. search, broker/trade ctl, ledger processing, etc.).	2023-05-09 14:49:26 -04:00
Tyler Goodlet	53a41ba93d	Add subsys log to new `.data._util`	2023-05-09 14:49:26 -04:00
Tyler Goodlet	0917b580c9	Flip `.feed` and `._sampling` over to new stuff In `.feed` and `._sampling` move to using the new `tractor.Context.open_stream(allow_overruns: bool)` (cough, A BREAKING CHANGE). Also set `Flume.mkt` during construction in `.feed.open_feed()`.	2023-05-09 14:49:26 -04:00
Tyler Goodlet	611d86d988	Change `Flume.symbol` -> `.mkt: MktPair` Might as well try and flip it over to the new type; make appropriate dict serialization changes in `.to_msg()`. Alias back to `.symbol: Symbol` with a property.	2023-05-09 14:49:26 -04:00
Tyler Goodlet	2cf7daca30	Another fqsn -> fqme rename	2023-05-09 14:49:26 -04:00
Tyler Goodlet	2d609dceac	Drop `loglevel` from `spawn_args` inputs to `maybe_spawn_daemon()`	2023-05-09 14:49:26 -04:00
Tyler Goodlet	4c1d174801	Expect `loglevel: str` in brokerd root task ep Set the level right after spawn and once for the lifetime of the daemon.	2023-05-09 14:49:26 -04:00
Tyler Goodlet	6272cae8d4	Drop more `Optional` usage on our `Struct`	2023-05-09 14:49:25 -04:00
Tyler Goodlet	0f3041724b	Use `MktPair` for `Flume.symbol` when used by backend Initial attempt at getting the sampling and shm layer to use the new mkt info meta-data type. Draft out a potential `BackendInitMsg: msgspec.Struct` for validating the init msg returned from the `stream_quotes()` start value; obvs don't actually use it yet.	2023-05-09 14:49:25 -04:00
Tyler Goodlet	452cd7db8a	Optionally load `MktPair` in `Flume`s	2023-05-09 14:49:25 -04:00
Tyler Goodlet	2cc80d53ca	First stage port of `.data.feed` to `MktPair` Add `MktPair` handling block for when a backend delivers a `mkt_info`-field containing init msg. Adjust the original `Symbol`-style `'symbol_info'` msg processing to do `Decimal` defaults and convert to `MktPair` including slapping in a hacky `_atype: str` field XD General initial name changes to `bs_mktid` and `_fqme` throughout!	2023-05-09 14:49:25 -04:00
Tyler Goodlet	7eb0b1d249	Comment about `Struct.typecast()` conflict with frozen instances	2023-05-09 14:49:25 -04:00
Tyler Goodlet	55b6cba31e	Encode a `mktpair` field if passed in msg by caller	2023-05-09 14:49:25 -04:00
Tyler Goodlet	7a8e615fa6	Explicitly decode tick sizes as decimal for symbol loading in `Flume`	2023-05-09 14:49:25 -04:00
Tyler Goodlet	9e2eff507e	Drop shm logging levels to debug over warning	2023-05-09 14:49:25 -04:00
Tyler Goodlet	56f736e7ca	Drop use of `Symbol.brokers` everywhere	2023-05-09 14:49:25 -04:00
Tyler Goodlet	9f03484c4d	Move all fqsn parsing and `Symbol` to new `accounting._mktinfo	2023-05-09 14:49:25 -04:00
Tyler Goodlet	badc30baae	Add an inverse of `float_digits()`: `digits_to_dec()	2023-05-09 14:49:25 -04:00
Tyler Goodlet	0d9acb1cb0	numpy: drop `numpy.float` in py311	2023-05-04 12:01:59 -04:00
jaredgoldman	9bf6f557ed	Label private methods accordingly, remove cryptofeeds module	2023-04-12 19:48:46 -04:00
jaredgoldman	b14b323068	Remove breakpoint in web_bs, ensure we only unsub if ws is connected	2023-04-12 19:48:46 -04:00
jaredgoldman	ac34ca7cad	Add sub method to flow Stash for checkout of master	2023-04-12 19:48:46 -04:00
Tyler Goodlet	8e91e215b3	WIP - ensure `asyncio` pumps the event loop each send	2023-04-12 19:48:46 -04:00
jaredgoldman	c751c36a8b	Update trade message format	2023-04-12 19:48:46 -04:00
jaredgoldman	ad9d645782	WIP - setup basic history and streaming client	2023-04-12 19:48:46 -04:00
jaredgoldman	5fdec8012d	Add cryptofeeds data feed module, Add Kucoin backend client wip	2023-04-12 19:48:46 -04:00
Tyler Goodlet	12e196a6f7	Catch `KeyError` on bcast errors which pop the sub Not sure how i missed this (and left in handling of `list.remove()` and it ever worked for that?) after the `samplerd` impl in `5ec1a72` but, this adjusts the remove-broken-subscriber loop to catch the correct `set.remove()` exception type on a missing (likely already removed) subscription entry.	2023-03-10 18:20:22 -05:00
Tyler Goodlet	712f1a47a0	Require `step: float` input to `slice_from_time()` There's been way too many issues when trying to calculate this dynamically from the input array, so just expect the caller to know what it's doing and don't bother with ever hitting the error case of calculating and incorrect value internally.	2023-03-10 18:20:22 -05:00
Tyler Goodlet	29418e9655	Avoid index-from-time slicing including gaps Not sure why this was ever allowed but, for slicing to the sample before whatever target time stamp is passed in we should definitely not return the prior index as for the slice start since that might include a very large gap prior to whatever sample is scanned to have the earliest matching time stamp. This was essential to fixing overlay intersect points searching in our ``ui.view_mode`` machinery..	2023-03-10 18:20:22 -05:00
Tyler Goodlet	5dd69b2295	Better handle dynamic registry sampler broadcasts In situations where clients are (dynamically) subscribing while broadcasts are starting to taking place we need to handle the `set`-modified-during-iteration case. This scenario seems to be more common during races on concurrent startup of multiple symbols. The solution here is to use another set to take note of subscribers which are successfully sent-to and then skipping them on re-try. This also contains an attempt to exception-handle throttled stream overruns caused by higher frequency feeds (like binance) pushing more quotes then can be handled during (UI) client startup.	2023-03-10 18:20:22 -05:00
Tyler Goodlet	441243f83b	Attempt to report `piker storage -d <fqsn>` errors Not really sure there's much we can do besides dump Grpc stuff when we detect an "error" `str` for the moment.. Either way leave a buncha complaints (como siempre) and do linting fixups..	2023-03-09 15:37:43 -05:00
Tyler Goodlet	b226b678e9	Fix missed `marketstore` mod imports	2023-03-09 15:37:42 -05:00
Tyler Goodlet	afac553ea2	Move all docker and external db code to `piker.service`	2023-03-09 15:37:42 -05:00
Tyler Goodlet	93c81fa4d1	Start `piker.service` sub-package For now just moves everything that was in `piker._daemon` to a subpkg module but a reorg is coming pronto!	2023-03-09 15:37:42 -05:00
Tyler Goodlet	bfe3ea1f59	Set explicit `marketstore` container startup timeout	2023-03-09 15:37:42 -05:00
Tyler Goodlet	56629b6b2e	Hardcode `cancel` log level for `ahabd` for now	2023-03-09 15:37:42 -05:00
Tyler Goodlet	7694419e71	Background docker-container logs processing Previously we would make the `ahabd` supervisor-actor sync to docker container startup using pseudo-blocking log message processing. This has issues, - we're forced to do a hacky "yield back to `trio`" in order to be "fake async" when reading the log stream and further, - blocking on a message is fragile and often slow. Instead, run the log processor in a background task and in the parent task poll for the container to be in the client list using a similar pseudo-async poll pattern. This allows the super to `Context.started()` sooner (when the container is actually registered as "up") and thus unblock its (remote) caller faster whilst still doing full log msg proxying! Deatz: - adds `Container.cuid: str` a unique container id for logging. - correctly proxy through the `loglevel: str` from `pikerd` caller task. - shield around `Container.cancel()` in the teardown block and use cancel level logging in that method.	2023-03-09 15:37:42 -05:00
Tyler Goodlet	8c66f066bd	Deliver es specific ahab-super in endpoint startup config	2023-03-09 15:37:42 -05:00
Tyler Goodlet	959e423849	Add warning around detach flag to docker client	2023-03-09 15:37:42 -05:00
Tyler Goodlet	7b196b1b97	Support startup-config overrides to `ahabd` super With the addition of a new `elastixsearch` docker support in https://github.com/pikers/piker/pull/464, adjustments were made to container startup sync logic (particularly the `trio` checkpoint sleep period - which itself is a hack around a sync client api) which caused a regression in upstream startup logic wherein container error logs were not being bubbled up correctly causing a silent failure mode: - `marketstore` container started with corrupt input config - `ahabd` super code timed out on startup phase due to a larger log polling period, skipped processing startup logs from the container, and continued on as though the container was started - history client fails on grpc connection with no clear error on why the connection failed. Here we revert to the old poll period (1ms) to avoid any more silent failures and further extend supervisor control through a configuration override mechanism. To address the underlying design issue, this patch adds support for container-endpoint-callbacks to override supervisor startup configuration parameters via the 2nd value in their returned tuple: the already delivered configuration `dict` value. The current exposed values include: { 'startup_timeout': 1.0, 'startup_query_period': 0.001, 'log_msg_key': 'msg', }, This allows for container specific control over the startup-sync query period (the hack mentioned above) as well as the expected log msg key and of course the startup timeout.	2023-03-09 15:37:42 -05:00
Tyler Goodlet	fe0695fb7b	First draft storage layer cli Adds a `piker storage` subcmd with a `-d` flag to wipe a particular fqsn's time series (both 1s and 60s). Obviously this needs to be extended much more but provides a start point.	2023-03-09 15:37:42 -05:00
Tyler Goodlet	3a4794e9d1	Backward-compat: don't require `'lot_tick_size'` In order to support existing `pps.toml` files in the wild which don't have the `asset_type, price_tick_size, lot_tick_size` fields, we need to only optionally read them and instead expect that backends will write the fields going forward (coming in follow patches). Further this makes some small asset-size (vlm accounting) quantization related adjustments: - rename `Symbol.decimal_quant()` -> `.quantize_size()` since that is explicitly what this method is doing. - and expect an input `size: float` which we cast to decimal instead of doing it inside the `.calc_size()` caller code. - drop `Symbol.iterfqsns()` which wasn't being used anywhere at all.. Additionally, this drafts out a new replacement market-trading-pair data type to eventually replace `.data._source.Symbol` -> `MktPair` which we aren't using yet, but serves as the documentation-driven motivator ;) and, it relates to https://github.com/pikers/piker/issues/467.	2023-03-02 19:22:19 -05:00
Guillermo Rodriguez	f5b8b9a14f	Add sym registry to PaperBoi as well as a sym ref on Transaction Add decimal quantize API to Symbol to simplify by-broker truncation Add symbol info to `pps.toml` Move _assert call to outside the _async_main context manager Minor indentation and styling changes, also convert a few prints to log calls Fix multi write / race condition on open_pps call Switch open_pps to not write by default Fix integer math kraken syminfo _tick_size initialization	2023-03-01 21:06:48 -03:00
jaredgoldman	342aec648b	Skip zero test and change use Path when creating a config folder in marketstore	2023-02-28 13:51:47 -05:00
jaredgoldman	4b72d3ba99	Add backpressure setting back as it wasn't altering test behaviour	2023-02-28 13:51:47 -05:00
algorandpa	0dec2b9c89	Enable backpressure during data-feed layer startup to avoid overruns	2023-02-25 18:59:39 -05:00
Guillermo Rodriguez	47bf45f30e	Merge pull request #464 from pikers/elasticsearch_integration Elasticsearch integration	2023-02-24 16:38:37 -03:00
Esmeralda Gallardo	b96e2c314a	Minor style changes and removed unnecesary comments	2023-02-24 15:11:15 -03:00
Esmeralda Gallardo	f96d6a04b6	Fixed UnboundLocalError on _ahab. Added test for marketstore's initialization	2023-02-22 13:28:07 -03:00
Guillermo Rodriguez	acc6249d88	Remove unnesesary arguments to some pikerd functions, fix container init error by switching from log reading to quering es health endpoint, fix install on ci and add more logging.	2023-02-21 20:45:10 -03:00
Esmeralda Gallardo	b5cdf14036	Modified elasticsearch file name to 'elastic' to avoid name errors. Applied changes suggested in the pr.	2023-02-21 13:34:29 -03:00
Guillermo Rodriguez	bf9ca4a4a8	Generalize ahab to support elasticsearch logs and init procedure	2023-02-21 13:34:29 -03:00
Guillermo Rodriguez	17a4fe4b2f	Trim unnecesary stuff left from marketstore copy, also fix elastic config name for docker build, add elasticsearch to dependencies	2023-02-21 13:34:28 -03:00
Esmeralda Gallardo	0dc24bd475	Added dockerfile, yaml file and script to statrt an elasticsearch's docker instance.	2023-02-21 13:34:26 -03:00
Tyler Goodlet	e01220af14	Type annot tweaks to feeds mod	2023-02-21 10:54:18 -05:00
Tyler Goodlet	ebf53e32bd	Fix return type annot for `slice_from_time()`	2023-02-13 12:27:58 -05:00
Tyler Goodlet	433697cc4f	Add cached refs to last 1d xy outputs For the purposes of avoiding another full format call we can stash the last rendered 1d xy pre-graphics formats as `IncrementalFormatter.x/y_1d: np.ndarray`s and allow readers in the viz and render machinery to use this data easily for things like "only drawing the last uppx's worth of data as a line". Also add a `.flat_index_ratio: float` which can be used similarly as a scalar applied to indexes into the src array but instead when indexing (flattened) 1d xy formatted outputs. Finally, this drops the way overdone/noisy `.__repr__()` meth we had XD	2023-02-13 12:27:58 -05:00
Tyler Goodlet	d622b4157c	Only draw up to 2nd last datum for OHLC bars paths	2023-02-13 12:27:58 -05:00
Tyler Goodlet	92ce1b3304	Only handle hist discrepancies when market is open We obviously don't want to be debugging a sample-index issue if/when the market for the asset is closed (since we'll be guaranteed to have a mismatch, lul). Pass in the `feed_is_live: trio.Event` throughout the backfilling routines to allow first checking for the live feed being active so as to avoid breakpointing on false +ves. Also, add a detailed warning log message for when actually investigating a mismatch.	2023-02-13 12:27:58 -05:00
Tyler Goodlet	a8e1796a8b	Comment bad x-range bp for now	2023-02-13 12:27:58 -05:00
Tyler Goodlet	5ced05aab0	Breakpoint bad (-ve or too large) x-ranges to m4 This should never really happen but when it does it appears to be a race with writing startup pre-graphics-formatter array data where we get `x_end` epoch value subtracting some really small offset value (like `-/+0.5`) or the opposite where the `x_start` is epoch and `x_end` is small. This adds a warning msg and `breakpoint()` as well as guards around the entire code downsampling code path so that when resumed the downsampling cycle should just be skipped and avoid a crash.	2023-02-13 12:27:58 -05:00
Tyler Goodlet	7afc9301ac	Handle last-in-view time slicing edge case Whenever the last datum is in view `slice_from_time()` need to always spec the final array index (i.e. the len - 1 value we set as `read_i_max`) to avoid a uniform-step arithmetic error where gaps in the underlying time series causes an index that's too low to be returned.	2023-02-13 12:27:58 -05:00
Tyler Goodlet	12c6d58c2a	Drop bp blocks from formatters mod	2023-02-13 12:27:58 -05:00
Tyler Goodlet	63f0567418	Drop `Flume.index_stream()`, `._sampling.open_sample_stream()` replaces it	2023-02-13 12:27:58 -05:00
Tyler Goodlet	6a0c36922e	Drop `._index_step` from formatters and instead defer to `Viz.index_step()`	2023-02-12 13:55:26 -05:00
Tyler Goodlet	fc17187ff4	Drop edge case from `slice_from_time()` Doesn't seem like we really need to handle the situation where the start or stop input time stamps are outside the index range of the data since the new binary search handling via `numpy.searchsorted()` covers this case at minimal runtime cost and with an equally correct output. Allows us to drop some other indexing endpoint internal variables as well.	2023-02-12 13:55:26 -05:00
Tyler Goodlet	a7d78a3f40	Use left-style index search on RHS scan as well	2023-02-12 13:55:26 -05:00
Tyler Goodlet	cdec4782f0	Add commented append slice-len sanity check	2023-02-12 13:55:26 -05:00
Tyler Goodlet	ed1f64cf43	Fix gap detection on RHS; always bin-search on overshot time range	2023-02-12 13:55:26 -05:00
Tyler Goodlet	50ef4efccb	Align step curves the same as OHLC bars	2023-02-12 13:55:26 -05:00
Tyler Goodlet	51f2461e8b	Add `IncrementalFormatter.x_offset: np.ndarray` Define the x-domain coords "offset" (determining the curve graphics per-datum placement) for each formatter such that there's only on place to change it when needed. Obviously each graphics type has it's own dimensionality and this is reflected by the array shapes on each subtype.	2023-02-12 13:55:26 -05:00
Tyler Goodlet	444768d30f	Adjust OHLC bar x-offsets to be time span matched Previously we were drawing with the middle of the bar on each index with arms to either side: +/- some arm length. Instead this changes so that each bar is drawn after each index/timestamp such that in graphics coords the bar span more correctly matches the time span in the x-domain. This makes the linked region between slow and fast chart directly match (without any transform) for epoch-time indexing such that the last x-coord in view on the fast chart is no more then the next time step in (downsampled) slow view. Deats: - adjust in `._pathops.path_arrays_from_ohlc()` and take an `bar_w` bar width input (normally taken from the data step size). - change `.ui._ohlc.bar_from_ohlc_row()` and `BarItems.draw_last_datum()` to match.	2023-02-12 13:55:26 -05:00
Tyler Goodlet	24b384f3ef	Set `path_arrays_from_ohlc(use_time_index=True)` on epoch indexing Allows easily switching between normal array `int` indexing and time indexing by just flipping the `Viz._index_field: str`. Also, guard all the x-data audit breakpoints with a time indexing condition.	2023-02-12 13:55:26 -05:00
Tyler Goodlet	93330954c2	Ugh, use `bool` flag to determine index field..	2023-02-12 13:55:26 -05:00
Tyler Goodlet	3019c35e30	Move `Viz` layer to new `.ui` mod	2023-02-12 13:41:18 -05:00
Tyler Goodlet	3638ae8d3e	Drop unused `read_src_from_key: bool` to `.format_to_1d()`	2023-02-12 13:41:18 -05:00
Tyler Goodlet	0663880a6d	Fix formatter xy ndarray first prepend case First allocation vs. first "prepend" of source data to an xy `ndarray` format must be mutex in order to avoid a double prepend. Previously when both blocks were executed we'd end up with a `.xy_nd_start` that was decremented (at least) twice as much as it should be on the first `.format_to_1d()` call which is obviously incorrect (and causes problems for m4 downsampling as discussed below). Further, since the underlying `ShmArray` buffer indexing is managed (i.e. write-updated) completely independently from the incremental formatter updates and internal xy indexing, we can't use `ShmArray._first.value` and instead need to use the particular `.diff()` output's prepend length value to decrement the `.xy_nd_start` on updates after initial alloc. Problems this resolves with m4: - m4 uses a x-domain diff to calculate the number of "frames" to downsample to, this is normally based on the ratio of pixel columns on screen vs. the size of the input xy data. - previously using an int-index (not epoch time) the max diff between first and last index would be the size of the input buffer and thus would never cause a large mem allocation issue (though it may have been inefficient in terms of needed size). - with an epoch time index this max diff could explode if you had some near-now epoch time stamp minus an x-allocation value: generally some value in `[0.5, -0.5]` which would result in a massive frames and thus internal `np.ndarray()` allocation causing either a crash in `numba` code or actual system mem over allocation. Further, put in some more x value checks that trigger breakpoints if we detect values that caused this issue - we'll remove em after this has been tested enough.	2023-02-12 13:41:18 -05:00
Tyler Goodlet	3bed142d15	Handle time-indexing for fill arrows Call into a reworked `Flume.get_index()` for both the slow and fast chart and do time index clipping to last datum where necessary.	2023-02-12 13:41:18 -05:00
Tyler Goodlet	7aef31701b	Add some commented debug prints for default fmtr	2023-02-12 13:41:18 -05:00
Tyler Goodlet	135627e142	Slicec to an extra index around each timestamp input	2023-02-12 13:41:18 -05:00
Tyler Goodlet	44f50e3d0e	Implement `stop_t` gap adjustments; the good lord said it is the problem	2023-02-12 13:41:18 -05:00
Tyler Goodlet	5ab4e5493e	Add gap detection for `stop_t`, though only report atm	2023-02-12 13:41:18 -05:00
Tyler Goodlet	98438e29ef	Drop `Flume.view_data()`	2023-02-12 13:41:18 -05:00
Tyler Goodlet	d649a7d1fa	Drop old breakpoint	2023-02-12 13:41:18 -05:00
Tyler Goodlet	2669ced629	Drop `_slice_from_time()`	2023-02-12 13:41:18 -05:00
Tyler Goodlet	f2c0987a04	Use uniform step arithmetic in `slice_from_time()` If we presume that time indexing using a uniform step we can calculate the exact index (using `//`) for the input time presuming the data set has zero gaps. This gives a massive speedup over `numpy` fancy indexing and (naive) `numba` iteration. Further in the case where time gaps are detected, we can use `numpy.searchsorted()` to binary search for the nearest expected index at lower latency. Deatz, - comment-disable the call to the naive `numba` scan impl. - add a optional `step: int` input (calced if not provided). - add todos for caching binary search results in the gap detection cases. - drop returning the "absolute buffer indexing" slice since the caller can always just use the read-relative slice to acquire it.	2023-02-12 13:41:18 -05:00
Tyler Goodlet	0bdb7261d1	Flip over to epoch-time based x-domain indexing	2023-02-12 13:41:17 -05:00
Tyler Goodlet	12857a258b	Adjust all `slice_from_time()` calls to not expect mask	2023-02-12 13:41:17 -05:00
Tyler Goodlet	46808fbb89	Rewrite `slice_from_time()` using `numba` Gives approx a 3-4x speedup using plain old iterate-with-for-loop style though still not really happy with this .5 to 1 ms latency.. Move the core `@njit` part to a `_slice_from_time()` with a pure python func with orig name around it. Also, drop the output `mask` array since we can generally just use the slices in the caller to accomplish the same input array slicing, duh..	2023-02-12 13:41:17 -05:00
Tyler Goodlet	a3844f9922	Use step size to determine bar gaps	2023-02-12 13:41:17 -05:00
Tyler Goodlet	a33f58a61a	Move `Flume.slice_from_time()` to `.data._pathops` mod func	2023-02-12 13:41:17 -05:00
Tyler Goodlet	d5844ce8ff	Delegate formatter `.index_field` to the parent `Viz`	2023-02-12 13:41:17 -05:00
Tyler Goodlet	bf88b40a50	Facepalm2: fix array-read-slice, like actually.. We need to subtract the first index in the array segment read, not the first index value in the time-sliced output, to get the correct offset into the non-absolute (`ShmArray.array` read) array.. Further we do** need the `&` between the advance indexing conditions and this adds profiling to see that it is indeed real slow (like 20ms ish even when using `np.where()`).	2023-02-12 13:41:17 -05:00
Tyler Goodlet	e4a0d4ecea	Markup OHLC->path gen with `numba` issue #	2023-02-12 13:41:17 -05:00
Tyler Goodlet	031d7967de	Facepalm: actually return latest index on time slice fail..	2023-02-12 13:41:17 -05:00
Tyler Goodlet	2e67e98b4d	Go with explicit `.data._m4` mod name Since it's a notable and self-contained graphics compression algo, might as well give it a dedicated module B)	2023-02-12 13:41:17 -05:00
Tyler Goodlet	7124a131dd	Move (unused) path gen routines to `.ui._pathops`	2023-02-12 13:41:17 -05:00
Tyler Goodlet	9052ed5ddf	Move qpath-ops routines back to separate mod	2023-02-12 13:41:17 -05:00
Tyler Goodlet	7ec21c7f3b	Rename `.ui._pathops.py` -> `.ui._formatters.py	2023-02-12 13:41:17 -05:00
Tyler Goodlet	382a619a03	Fix from-time index slicing? Apparently we want an `\|` for the advanced indexing logic? Also, fix `read_slc` start to not always be 0 XD	2023-02-12 13:41:17 -05:00
Tyler Goodlet	7f3f6f871a	Move path ops routines to top of mod Planning to put the formatters into a new mod and aggregate all path gen/op helpers into this module. Further tweak include: - moving `path_arrays_from_ohlc()` back to module level - slice out the last xy datum for `OHLCBarsAsCurveFmtr` 1d formatting - always copy the new x-value from the source to `.x_nd`	2023-02-12 13:41:17 -05:00
Tyler Goodlet	6ea04f850d	Drop diff state tracking in formatter This was a major cause of error (particularly trying to get epoch indexing working) and really isn't necessary; instead just have `.diff()` always read from the underlying source array for current index-step diffing and append/prepend slice construction. Allows us to, - drop `._last_read` state management and thus usage. - better handle startup indexing by setting `.xy_nd_start/stop` to `None` initially so that the first update can be done in one large prepend. - better understand and document the step curve "slice back to previous level" logic which is now heavily commented B) - drop all the `slice_to_head` stuff from and instead allow each formatter to choose it's 1d segmenting.	2023-02-12 13:41:17 -05:00
Tyler Goodlet	f3bab826f6	Comment out bps for time indexing	2023-02-12 13:41:17 -05:00
Tyler Goodlet	ac1f37a2c2	Expect `index_field: str` in all graphics objects	2023-02-12 13:41:17 -05:00
Tyler Goodlet	166d14af69	Simplify formatter update methodology Don't expect values (array + slice) to be returned and applied by `.incr_update_xy_nd()` and instead presume this will implemented internally in each (sub)formatter. Attempt to simplify some incr-update routines, (particularly in the step curve formatter, though most of it was reverted to just a simpler form of the original implementation XD) including: - dropping the need for the `slice_to_head: int` control. - using the `xy_nd_start/stop` index counters over custom lookups.	2023-02-12 13:41:17 -05:00

1 2 3 4 5 ...

629 Commits (fb0c8fa7ad63856fcf40c1824928a4be28be7957)