unshift stream support #452

twiggy · 2021-05-03T22:19:55Z

A few issues have pointed out that the stream becomes broken/altered after fromStream is called.

It's a pretty serious side effect that wouldn't be obvious at all.

We are currently working around this issue by calling stream.read(xxxx) and then stream.unshift. Maybe unshift could be an argument to stream.read() ... or maybe the chunk that was read could be passed back.

In our case we get a stream from the HTTP request and want to detect the file type, but then immediately need to pass the stream to AWS's S3 SDK ... a bit weird, and we could probably just write the file, but unshift seems to work for now.

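// read() returns null until enough data is buffered, so the stream must already be readable.
const leadingChunk = stream.read(4100);
// fromBuffer() resolves to undefined when the type is not recognized.
const {ext, mime} = (await fileType.fromBuffer(leadingChunk)) ?? {};
// Push the chunk back so the next consumer sees the stream from the start.
stream.unshift(leadingChunk);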
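A slightly more defensive version of the same workaround is sketched below. It assumes a Node.js byte stream (not object mode); `peekChunk` is a hypothetical helper name, not part of file-type. It waits until `size` bytes are buffered (or the stream ends early), then pushes the bytes back with `unshift()`.

```javascript
// Hypothetical helper (not part of file-type): resolve with up to `size`
// leading bytes of a Node.js Readable, leaving the stream contents intact.
function peekChunk(stream, size) {
  return new Promise((resolve, reject) => {
    const cleanup = () => {
      stream.off('readable', onReadable);
      stream.off('end', onEnd);
      stream.off('error', onError);
    };
    const onReadable = () => {
      // read(size) returns null until `size` bytes are buffered,
      // or returns whatever is left once the stream has ended.
      const chunk = stream.read(size);
      if (chunk === null) return;
      cleanup();
      stream.unshift(chunk); // put the bytes back for the next consumer
      resolve(chunk);
    };
    const onEnd = () => { cleanup(); resolve(null); }; // empty stream
    const onError = (err) => { cleanup(); reject(err); };

    stream.on('readable', onReadable);
    stream.on('end', onEnd);
    stream.on('error', onError);
    onReadable(); // data may already be buffered
  });
}

// Usage sketch, inside an async function:
//   const leadingChunk = await peekChunk(stream, 4100);
//   const type = leadingChunk && await fileType.fromBuffer(leadingChunk);
//   // `stream` can now be handed to the next consumer.
```

The same caveat applies as with the original snippet: only the first `size` bytes are inspected, so detection can still fail for formats that need more data.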

Borewit · 2021-05-04T09:58:59Z

> A few issues have pointed out that the stream becomes broken/altered after fromStream is called.

I think you mean that the concept of a stream has been misunderstood in other issues. fromStream reads from a stream, and so the stream gets (partially) consumed. That is how streams work.

At the same time, I do understand the desire to peek ahead on a stream, determine the file type from it, and then read from the stream as if nothing happened. stream(readableStream) was designed for that use case, and it is very similar to the example you provided. You can see the limitation directly: how much data will you buffer? (This should be better documented; I need to finalize #434.)
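To make the buffering limitation concrete, here is a toy illustration. It is not file-type's implementation; `looksLikeTar` is a hypothetical detector written only for this example. It relies on the fact that a tar header stores the magic string "ustar" at byte offset 257, so a sample shorter than that cannot identify the format.

```javascript
// Toy detector for illustration only (not file-type's code).
// A tar header stores the magic string "ustar" at byte offset 257.
const TAR_MAGIC_OFFSET = 257;
const TAR_MAGIC = Buffer.from('ustar');

function looksLikeTar(sample) {
  const end = TAR_MAGIC_OFFSET + TAR_MAGIC.length;
  if (sample.length < end) return false; // sample too short to decide
  return sample.subarray(TAR_MAGIC_OFFSET, end).equals(TAR_MAGIC);
}

// Build a fake 512-byte tar header block containing only the magic string.
const header = Buffer.alloc(512);
header.write('ustar', TAR_MAGIC_OFFSET, 'latin1');

console.log(looksLikeTar(header.subarray(0, 100))); // false: magic lies beyond the sample
console.log(looksLikeTar(header));                  // true: sample covers offset 257
```

Whatever sample size is chosen, some format may keep its distinguishing bytes beyond it.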

An attempt to overcome the limited buffer length in stream(): Make stream() method, stream based, by using teeing #300
Similar misunderstanding related to consuming streams: fromBuffer and stream methods give different results for same ai file #426

> In our case we get a stream from the HTTP request and want to detect the file type, but then immediately need to pass the stream to AWS's S3 SDK ... a bit weird, and we could probably just write the file, but unshift seems to work for now.

Writing the file first sounds like a reasonable solution to me. You simply do not know how much of the file you need to determine its file type. The fact that we can determine the file type of many files from the very first few bytes does not change the fact that for some files we need much more data.
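A minimal sketch of the write-the-file-first approach, using only Node.js built-ins. `spoolToTempFile` and the `upload` function in the usage note are hypothetical names, and the `fileType.fromFile` call assumes the same file-type API generation as `fileType.fromBuffer` used earlier in this thread.

```javascript
const fs = require('node:fs');
const os = require('node:os');
const path = require('node:path');
const { pipeline } = require('node:stream/promises');

// Hypothetical helper: write the whole stream to a temporary file
// and resolve with the path of that file.
async function spoolToTempFile(stream) {
  const dir = await fs.promises.mkdtemp(path.join(os.tmpdir(), 'upload-'));
  const filePath = path.join(dir, 'body');
  // pipeline() rejects if either side fails and cleans up both streams.
  await pipeline(stream, fs.createWriteStream(filePath));
  return filePath;
}

// Usage sketch, inside an async function (`upload` is a placeholder):
//   const filePath = await spoolToTempFile(req);
//   const type = await fileType.fromFile(filePath); // undefined if unrecognized
//   await upload(fs.createReadStream(filePath), type);
//   await fs.promises.rm(path.dirname(filePath), { recursive: true });
```

This trades disk I/O and latency for the guarantee that detection can read as much of the file as it needs.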

Borewit · 2021-07-05T06:30:43Z

Related to #399.

Improve document stream() detection limitation. Related: #426, #452

Borewit closed this as completed Jul 5, 2021

Borewit added a commit that referenced this issue Jul 12, 2021

Make stream() sample size configurable

d3b2543

Improve document stream() detection limitation. Related: #426, #452

Borewit added a commit that referenced this issue Jul 20, 2021

Make stream() sample size configurable

0b84897

Improve document stream() detection limitation. Related: #426, #452

Borewit added a commit that referenced this issue Jul 20, 2021

Make stream() sample size configurable

c635507

Improve document stream() detection limitation. Related: #426, #452

Borewit added a commit that referenced this issue Jul 22, 2021

Make stream() sample size configurable

8dc4c7d

Improve document stream() detection limitation. Related: #426, #452

Borewit added a commit that referenced this issue Jul 22, 2021

Make stream() sample size configurable

f68ae1b

Improve document stream() detection limitation. Related: #426, #452

Borewit added a commit that referenced this issue Jul 22, 2021

Make stream() sample size configurable

2a26e42

Improve document stream() detection limitation. Related: #426, #452
