Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[R-package] Allow data to live in memory mapped file #2184

Closed
nbenn opened this issue May 19, 2019 · 3 comments
Closed

[R-package] Allow data to live in memory mapped file #2184

nbenn opened this issue May 19, 2019 · 3 comments

Comments

@nbenn
Copy link

nbenn commented May 19, 2019

In R, two packages exists that allow for working with matrices as memory mapped files: bigmemory and more recently bigstatsr. As the underlying data structures are regular column major arrays, it should be not hard to extend the LightGBM R-package to process such arrays in similar fashion as the already implemented base::matrix() objects.

I'll put together a proof of concept and I'm opening this issue to discuss whether this could be of interest for the wider LightGBM/R community.

@nbenn
Copy link
Author

nbenn commented May 19, 2019

A proof of concept is available from master...nbenn:issue-#2184

To run, a patched version of bigstatsr is needed, available as

devtools::install_github("nbenn/bigstatsr@data-ptr")

@jameslamb
Copy link
Collaborator

Thanks @nbenn ! I'll take a look in the next few days. Appreciate you making a PoC

@guolinke guolinke closed this as completed Aug 1, 2019
@StrikerRUS
Copy link
Collaborator

Closed in favor of being in #2302. We decided to keep all feature requests in one place.

Welcome to contribute this feature! Please re-open this issue (or post a comment if you are not a topic starter) if you are actively working on implementing this feature.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants