-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How does Tick determine what is a "useful" time series? #2
Comments
think this has different answers depending on the dimensionality of the series.. |
in 1D (like views on a page) you could make a case that volume is a good indicator of useful, or maybe variance? in >1D I bet covariance would be a good starting point. A useless time series then is one that is always zero, or more generally alway the same. |
hey also I bet there is a proper answer to this in terms of information content. Like a useful timeseries is one that is hard to compress, has low entropy etc. There's a lovely green book on my desk by Mackay that has opinions... |
I am more curious as to how this makes sense for a user Whatever determination you use, it means that there will be a result for some queries and no result for others. And neither of those results would necessarily mean "because there was no data" which is confusing to me. |
That is a very fat book! |
This is quite implementation specific, I think. We could implement tick such that a user knows what time series are being made available once it starts listening to the stream. But you are right, if I have A, B & C and Tick determined that A, A & B are the only useful time series but the user was interested in B & C. I don't know how to handle that. It almost becomes a back to the drawing board problem to solve. |
What about thinking of it more as a compression problem? If there is no Alternatively, a "no information" response to a query is an interesting M
|
Still wrapping my head around thinking of it as a compression problem...just grabbed the green book
But is it useful if I am looking for something very specific? |
only if it can be explained simply |
If you have timeseries for each key, couldn't you create what A&B would be? why do you need a time series for groups? |
Oh right, intersection vs exclusive. oh well |
Can I have table "key" with row "co occurrence" by time? |
No. Consider for example the following stream:
##Time series:
And if my question is give me all the times Deep was in NYC, I can't decipher it from the above time series. I can however decipher it from
|
Great! I spent 5 minutes building time series by hand from a stream of imaginary JSON. |
sorry 😧 |
Not sure if I understand. Isn't that the same as having more than one column as a primary key? If so, it becomes the same as what I mentioned in the example |
what is wrong with that? |
amen re: explaining no data |
So we could go one of the three ways here:
The text was updated successfully, but these errors were encountered: