piker

Commit Graph

Author	SHA1	Message	Date
Tyler Goodlet	4a180019f0	Swap out `fuzzywuzzy` for the newer `rapidfuzz` lib	2023-09-13 11:57:02 -04:00
Tyler Goodlet	481618cc51	kraken: handle ws live trading API symbology Of course I missed this first try but, we need to use the ws market pair symbology set (since apparently kraken loves redundancy at least 3 times XD) when processing transactions that arrive from live clears since it's an entirely different `LTC/EUR` style key then the `XLTCEUR` style delivered from the ReST eps.. As part of this: - add `Client._altnames`, `._wsnames` as `dict[str, Pair]` tables, leaving the `._AssetPairs` table as is keyed by the "xname"s. - Change `Pair.respname: str` -> `.xname` since these keys all just seem to have a weird 'X' prefix. - do the appropriately keyed pair table lookup via a new `api_name_set: str` to `norm_trade_records()` and set is correctly in the ws live txn handler task.	2023-08-30 16:32:34 -04:00
Tyler Goodlet	ad37cfbe2f	Break backfill loop on `end_dt < start_dt`	2023-08-29 08:43:14 -04:00
Tyler Goodlet	546049b62f	data.history: handle venue-closure gap edge case	2023-08-25 17:47:30 -04:00
Tyler Goodlet	5ed8544fd1	Bleh, move `.data.types` back up to top level pkg Since it's depended on by `.data` stuff as well as pretty much everything else, makes more sense to expose it as a top level module (and maybe eventually as a subpkg as we add to it).	2023-08-05 15:57:10 -04:00
Tyler Goodlet	100be54641	data.history: add TODO for non-zero epochs and some typing	2023-07-31 17:21:11 -04:00
Tyler Goodlet	08e8990fe3	Do single `ShmArray.array` read on zero-time filtering	2023-07-26 15:41:04 -04:00
Tyler Goodlet	2c6ae5d994	Drop the `gap_dt_unit: str` column We don't need it in `detect_time_gaps()` since doing straight up datetime diffs in `polars` already has a humanized `str` representation but with higher precision like '2d 1h 24m 1s' B)	2023-07-26 15:37:59 -04:00
Tyler Goodlet	7802febd20	Backfill history gaps with pre-gap close	2023-07-26 12:56:06 -04:00
Tyler Goodlet	64329d44e7	Flip `tractor.breakpoint()`s to new `.pause()`	2023-07-26 12:48:19 -04:00
Tyler Goodlet	58cf7ce10e	Add `norm_trade()` ep to validator warnings	2023-07-26 12:39:08 -04:00
Tyler Goodlet	d0f72bf269	Wrap symcache loading into `.from_scratch()` Since we need it both when explicitly reloading and whenever either the file or data in the file doesn't exist.	2023-07-26 12:27:26 -04:00
Tyler Goodlet	759ebe71e9	Allow disabling symcache load via kwarg as well	2023-07-20 15:27:46 -04:00
Tyler Goodlet	e88913e1f3	.data._pathops: drop profiler imports, fix some naming to appease `ruff`	2023-07-20 15:27:22 -04:00
Tyler Goodlet	5e7916a0df	Start `piker.toolz` subpkg for all our tooling B) Since there's a growing list of top level mods which are more or less utils/tools for working with the runtime; begin to move them into a new subpkg starting with a new `.toolz.debug`. Start with, - a new `open_crash_handller()` for doing breakpoints around blocks that might error. - move in what was `piker._profile` into `.toolz.profile` and adjust all importing appropriately.	2023-07-20 15:23:01 -04:00
Tyler Goodlet	dfa13afe22	Allow backends to "bypass" symcache loading Some backends like `ib` don't have an obvious (nor practical) way to easily download the entire symbology set available from all its mkt venues. For such backends loading might require a non-std approach (like using the contract search from some input mkt-key set) and can't be expected to necessarily be supported out of the box. As such, allow annotating a broker sub-pkg module with a `_no_symcache: bool = True` attr which will make `open_symcache()` yield early with an empty `SymbologyCache` instance for use by the caller to fill in the mkt and assets tables in whatever ad-hoc way desired.	2023-07-17 17:12:40 -04:00
Tyler Goodlet	2dab0e2e56	Expose `.data._symcache` stuff at subpkg toplevel The list is `open_symcache()`, `get_symcache()`, `SymbologyCache`, and `Stuct` which seems more or less fine to make part of the public namespace. Also, make `._timeseries.t_unit` an instance of literal to make `ruff` happy?	2023-07-17 01:20:52 -04:00
Tyler Goodlet	e8025d0985	.data.types.Struct: by default include non-members from `.to_dict()`..	2023-07-16 21:32:36 -04:00
Tyler Goodlet	da206f5242	Store "namespace path" for each backend's pair struct Since some backends have multiple venues keyed by the same symbol-pair-name, AND often the market/symbol info for those different market-venues is entirely different (cough binance), we will have to (sometimes) save the struct namespace-path as str for lookup when deserializing a symcache to object form. NOTE: this change is reliant on the following `tractor` dev commit which improves support for constructing a path from object-instance: `bee2c36072` Add a backend(-wide) default struct path stored as a (TOML top level) field `pair_ns_path: str` in the serialized `dict`-table as well as allow for a per pair-`Struct` value optionally defined on each type def; the global is only used if none was defined per struct via a `ns_path: str`. Further deats: - don't write non-struct-member fields to dict for TOML file cache. - always keep object forms, well as objects (in tables).. XD - factor cache loading from `dict` (and thus from TOML or presumably any other interchange form) into a `@classmethod` constructor method B) - all choosing the subtable for `.search()` by name.	2023-07-13 17:58:50 -04:00
Tyler Goodlet	7f4884a6d9	data.types.Struct.to_dict(): discard non-member struct by default	2023-07-12 12:33:30 -04:00
Tyler Goodlet	8b9494281d	Don't verify the history step period for now in `tsdb_backfill()`	2023-07-12 08:45:55 -04:00
Tyler Goodlet	8330b36e58	User/return explicit `symcache` var name in sync case	2023-07-12 08:45:55 -04:00
Tyler Goodlet	243821aab1	Bleh! Ok make `open_symcache()` and `@acm`.. Turns in order to make things much cleaner from inside-the-runtime usage we do probably want to just make the manager async so that we can generate the cache on demand from async UI inits as well as daemon actors.. So change to that and instead make `get_symcache()` the helper that should ONLY be called from sync funcs / offline ledger processing utils!	2023-07-12 08:45:55 -04:00
Tyler Goodlet	8f40e522ef	Add handy `DiffDump`ing for our `.types.Struct` So you can do a `Struct1` - `Struct2` and we dump a little diff `list` of tuples for anal on the REPL B) Prolly can be broken out into it's own micro-patch?	2023-07-12 08:45:55 -04:00
Tyler Goodlet	ddc5f2b441	Use `MktPair.from_msg()` in symcache Since we now fully support interchange-as-dict-msg, use the msg codec API and drop manual `Asset` unpacking. Also, wrap `get_symcache()` in a `pdbp` crash handler block for now B)	2023-07-12 08:45:55 -04:00
Tyler Goodlet	13f231b926	Decode cached mkts and assets back to structs B) As part of loading the cache we can now fill the asset sub-tables: `.mktmaps` and `.assets` with their deserialized struct instances! In theory this might be possible for the backend defined `Pair` structs as well but we need to figure out probably an endpoint to offer the conversion? Also, add a `SymbologyCache.search()` which allows sync code to scan the existing (known via cache) symbol set just like how async code can use the (much slower) `open_symbol_search()` ctx endpoint 💥	2023-07-12 08:45:55 -04:00
Tyler Goodlet	c8c28df62f	Much (much) better symbology cache refinements For starters rename the cache type to `SymbologyCache` and fill out its interface to include an (async) `.reload()` which can be used to populate the in-mem asset-table sets such that any tractor-runtime task can actually directly call it. Use a symcache file name schema of `_cache/<backend>.symcache.toml`. Dirtier deatz: - make `.open_symcache()` a `@cm` such that it can be used from sync code and will actually call `trio.run()` in the case where it needs to do a full (re)load; also don't write on exit only on reloads. - add `.get_symcache()` a simple non-ctx-mngr reader which again can mostly be called willy-nilly from sync code without the full runtime being up (but likely will only work if symcache files already exist for the backend).	2023-07-12 08:45:55 -04:00
Tyler Goodlet	005023275e	Add a symbology cache subsys New mod is `.data._symcache` and it needs backend clients to declare `Client.get_assets()` and `.get_mkt_pairs()` to generate the cache files which now go in the config dir under `_cache/`.	2023-07-12 08:45:55 -04:00
Tyler Goodlet	9748b22d34	Always include the src asset for (parquet file names) for fiat pairs	2023-07-12 08:45:55 -04:00
Tyler Goodlet	ea270d3396	.data.ticktools: add reverse flag, better docs Since it may be handy to get the latest ticks first, add a `reverse: bool` to `iterticks()` and add some cleaner logic and a proper doc string to `frame_ticks()`.	2023-06-27 15:47:05 -04:00
Tyler Goodlet	621634b5a2	Move `frame_ticks()` and tick-type defs into `.ticktools`	2023-06-27 15:47:05 -04:00
Tyler Goodlet	eacc59226f	rename `.data._normalize` -> `.ticktools`	2023-06-27 15:47:05 -04:00
Tyler Goodlet	7b4472e37e	data._sampling.frame_ticks(): slight rework to generalize	2023-06-27 15:47:05 -04:00
Tyler Goodlet	cdf9105d0d	Export `Flume` and `Feed` from `piker.data`	2023-06-27 13:48:03 -04:00
Tyler Goodlet	35359861bb	.brokers._daemon: add notes around needed brokerd respawn tech	2023-06-27 13:41:47 -04:00
Tyler Goodlet	d42aa60325	Define the flattened "fundamental double auction" emitted tick type set	2023-06-27 13:41:47 -04:00
Tyler Goodlet	f81ea64cab	Drop unused `Union`	2023-06-27 13:41:47 -04:00
Tyler Goodlet	6b2e85e4b3	Add type-annots to sampler subscription method internals	2023-06-27 13:41:47 -04:00
Tyler Goodlet	9eeea51165	Define shm buffer sizing in `.data.history` Also adjust sizing such that the history buffer will backfill the last six years by default (in 1m OHLC) and the hft buffer will do only 3 days worth. Also ensure the fsp layer passes the src shm's buffer size when allocating since the size is now required by allocators in the shm apis.	2023-06-27 13:41:47 -04:00
Tyler Goodlet	33ec27715b	Sync shm mod with dev version in `tractor`, drop buffer sizing vars, require `size: int` to all allocators	2023-06-27 13:41:47 -04:00
Tyler Goodlet	e1be098406	Only hard re-render `Viz`s matching backfill deats Avoid unnecessarily re-rendering the wrong (1min OHLC history) chart and/or other such charts with update tasks listening to the sampler stream. Instead only redraw in tasks which are updating vizs which match the actual details of the backfill event. We can probably also eventually match against a range tuple (emitted in the msg) and then have the task further only update the formatter layer unless the range is actually in view?	2023-06-27 13:41:47 -04:00
Tyler Goodlet	dd3e4b5a1f	Emit backfill details in broadcasts Send both the `Viz.name` and `timeframe: int` so that the UI side can match against them and only update a lone curve in a single plot.	2023-06-27 13:41:47 -04:00
Tyler Goodlet	0484e97382	Try to not overrun shm during gap backfilling..	2023-06-27 13:41:47 -04:00
Tyler Goodlet	c8f8724887	Mask out all the duplicate frame detection	2023-06-27 13:41:47 -04:00
Tyler Goodlet	f8ab3bde35	Allow sampler step events to overrun; only 1s period	2023-06-27 13:41:47 -04:00
Tyler Goodlet	c1201c164c	Parametrize index margin around gap detection segment	2023-06-27 13:41:47 -04:00
Tyler Goodlet	34dd6ffc22	Add a configurable timeout around backend live feed startup For now make it a larger value but ideally in the long run we can tune it to specific backends and expose it in the config(s).	2023-06-27 13:41:47 -04:00
Tyler Goodlet	8233d12afb	Detect and fill time gaps in tsdb history For now, just detect and fill in gaps (via fresh backend queries) in the shm buffer but eventually i'm pretty sure we can just write these direct to the parquet file as well. Use the new `.data._timeseries.detect_null_time_gap()` to find and fill in the `ShmArray` index range, re-check it and enter a prompt if it didn't totally fill. Also, - do a massive cleanup and removal of all unused/commented code. - drop the duplicate frames tracking, don't think we need it after removing multi-frame concurrent queries. - change backfill loop variable `end_dt` -> `last_start_dt` which is more semantically correct. - fix logic to backfill any missing sub-sequence portion for any frame query that overruns the shm buffer prependable space by detecting the available rows left to insert and only push those. - add a new `shm_push_in_between()` helper to match.	2023-06-27 13:41:47 -04:00
Tyler Goodlet	f25248c871	Add `.data._timeseries` utility mod Org all the new (time) gap detection routines here and also move in the `slice_from_time()` epoch -> index converter routine from `._pathops` B)	2023-06-27 13:41:47 -04:00
Tyler Goodlet	0dcfcea6ee	Finally get partial backfills after tsdb load workinnn It took a little while (and a lot of commenting out of old no longer needed code) but, this gets tsdb (from parquet file) loading before final backfilling from the most recent history frame until the most recent tsdb time stamp! More or less all the convoluted concurrency shit we had for coping with `marketstore` IPC junk is no longer needed, particularly all the query size limits and accompanying load loops.. The recent frame loading technique/order has now changed though since we'd like to show charts asap once tsdb history loads. The new load sequence is as follows: - load mr (most recent) frame from backend. - load existing history (one shot) from the "tsdb" aka parquet files with `polars`. - backfill the gap part from the mr frame back to the tsdb start incrementally by making (hacky) `ShmArray.push(start=<blah>)` calls and not updating the `._first.value` while doing it XD Dirtier deatz: - make `tsdb_backfill()` run per timeframe in a separate task. - drop all the loop through timeframes and insert `dts_per_tf` crap. - only spawn a subtask for the `start_backfill()` call which in turn only does the gap backfilling as mentioned above. - mask out all the code related to being limited to certain query sizes (over gRPC) as was restricted by marketstore.. not gonna go through what all of that was since it's probably getting deleted in a follow up commit. - buncha off-by-one tweaks to do with backfilling the gap from mr frame to tsdb start.. mostly tinkered it to get it all right but seems to be working correctly B) - still use the `broadcast_all()` msg stuff when doing the gap backfill though don't have it really working yet on the UI side (since previously we were relying on the shm first/last values.. so this will be "coming soon" :)	2023-06-27 13:41:47 -04:00

1 2 3 4 5 ...

575 Commits (30d55fdb275373516dd2133ae6f850a6acd6d7a3)