HSE Bootcamp
We had price of Bitcoin for every day since 2022. The task was to predict price for next week (July 2023).
Actual price, seasonality and other
Actually, there were too many anomalies, but these are the most influencing
Backtest is something like cross-validation for timeseries. We predict validation data on first step and then we can use it on the second step
I was using ETNA library from Tinkoff. This library was designed special for timeseries tasks. Main advantages are simplicity and confidence in the absence of leakage.
🎯 Baseline is simple NaiveModel
More useful to think about errors.
💀 The competition took place in August 2023. However test data was from July 2023 (month before). So everyone could just find real price of Bitcoin and easily make submission. And so many refused to continue to participate