-
Notifications
You must be signed in to change notification settings - Fork 612
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
curious what you would recommend for real-time training + prediction models? #491
Comments
After spending some time looking at hrnn and implementations, I switched gears to something simpler to support continuous learning https://github.com/online-ml/river |
If anyone's curious I'm building an open source version here https://github.com/victusfate/concierge Todo: on server startup get all events since last model training and update each model |
There are two different things you can do here with implicit to get near-realtime updates with the ALS model :
|
This is great news, I'd love to compare the results to river-ml since I have more experience with implicit. Also worth noting I got the deployed system to work great. I gather all user item ratings hourly for a full training (snapshot model). When new servers come up they load this model and then delta train from a redis ordered set of all user item ratings since the last model snapshot. In addition live models receive real time updates via redis pubsub. This way at scale, I can have multiple predictor http servers all yielding similar results (can't guarantee they all receive all updates in the same order), but they are generally convergent. |
In the case where a user is new, but the server is incapable to fit it yet into the model (as @victusfate explained, cause a pub/sub flow to add new users/items should preferably have certain delay for performance optimisation); How could I recommend to this new user? Should I use the (This assumption is made by not knowing the truly relevance of the |
@sorenrife I ended up using popular results for new users in my current deployment using implicit (just hourly trained atm), and I think you can take the same approach with live model updates (keep an active popularity rank going as ratings come in) something like this (grabbing code snippets from my hourly training) -> df is a pandas data set
and in the rankings method
|
I admire the api, efficiency, and results of implicit.
I'm finding a need for real time training + prediction in some of my company's systems, and started searching around for ideas/implementations. Has anyone had experience working with this?
Realize this is off topic from implicit (totally understand if it's closed).
Starting to look for ideas here:
The text was updated successfully, but these errors were encountered: