Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

I/O metaissuse #2125

Closed
3 of 5 tasks
Keno opened this issue Jan 25, 2013 · 7 comments
Closed
3 of 5 tasks

I/O metaissuse #2125

Keno opened this issue Jan 25, 2013 · 7 comments
Assignees
Labels
io Involving the I/O subsystem: libuv, read, write, etc.
Milestone

Comments

@Keno
Copy link
Member

Keno commented Jan 25, 2013

  1. Bikeshed socket API
  2. Write buffering
  3. Performance improvements for reading (probably needs I/O benchmarks)
  4. Complete UV based file API (preferably after Struct patch merge)
  5. Move initial I/O detection into julia and make STDIO object appropriate julia objects accordingly (depends on 4)
@ghost ghost assigned Keno Jan 25, 2013
@ViralBShah
Copy link
Member

This list sounds so juicy - looking forward to seeing how it all works when it is done.

@kmsquire
Copy link
Member

I've been looking forward to this as well! One idea I discussed briefly with @StefanKarpinski was to have transparent reading/writing of compressed (gzip, bzip2, xz) files (with a way to access the raw stream if needed). The current gzip interface is somewhat limited, so it would be nice to integrate with/build on the I/O system once this work has moved forward.

For gzip and xz, http://git.tukaani.org/?p=libxzfile.git might be worth looking at.

@dmbates
Copy link
Member

dmbates commented Jan 25, 2013

+1

A binary, saved R dataset (the RDX2 format) can be compressed with gzip, bzip2 or xz and xz is preferred for very large datasets. Right now I need to write them out with gzip compression and use

using DataFrames, RDA
require("gzip")
using GZip

ff = gzopen(<filename>)
fr = read_RDA(ff)
close(ff)

@johnmyleswhite
Copy link
Member

Yes, this would be great to have.

-- John

On Jan 25, 2013, at 12:45 PM, dmbates [email protected] wrote:

+1

A binary, saved R dataset (the RDX2 format) can be compressed with gzip, bzip2 or xz and xz is preferred for very large datasets. Right now I need to write them out with gzip compression and use

using DataFrames, RDA
require("gzip")
using GZip

ff = gzopen()
fr = read_RDA(ff)
close(ff)

Reply to this email directly or view it on GitHub.

@JeffBezanson
Copy link
Member

Highest priority is eliminating the need for the Proceed with caution! warning.

@ViralBShah
Copy link
Member

Where do we stand on this one w.r.t. 0.2?

@JeffBezanson
Copy link
Member

Subsumed by #3887.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
io Involving the I/O subsystem: libuv, read, write, etc.
Projects
None yet
Development

No branches or pull requests

6 participants