Backend: File upload, add generating S3 presigned urls for PUT and GET #1170

Closed

pbn4 opened this issue Apr 23, 2021 · 5 comments

@pbn4
Contributor

pbn4 commented Apr 23, 2021

Blocker for #1153

@seanmalbert
Collaborator

Hey @pbn4,

Can you flesh out your thoughts on this, so we can attempt to size it? Were you thinking of a direct upload solution like the one outlined here: https://devcenter.heroku.com/articles/s3-upload-node#direct-uploading? What about bucket strategy (i.e. one bucket per client, or can we get away with one bucket for all?) and file naming conventions?

From what I can tell, we currently aren't using the aws-sdk package to interact with S3 storage, so would this be the first implementation of this?

@pbn4 pbn4 added the discussion label May 5, 2021
@pbn4
Contributor Author

pbn4 commented May 5, 2021

Were you thinking of a direct upload solution like the one outlined here: https://devcenter.heroku.com/articles/s3-upload-node#direct-uploading?

Yes, this is exactly what I wanted us to implement.

What about bucket strategy (i.e. one bucket per client, or can we get away with one bucket for all?) and file naming conventions?

I'd go with file naming conventions and two buckets (one for public reads and one for private reads). Right now there are two use cases I know of:

  • the listing hero image (Listings Management Fieldset: Listing Photo #1177), which could follow a pattern like /listing_<listing_id>/images/hero-image-name.jpg; this is a public-read, private-write bucket (the pattern does not matter much here as long as it's unique),
  • uploading paper applications per listing (cc: @slowbot), which could follow the same pattern, but reading those would require a presigned URL too (same as for writing).

From what I can tell, we currently aren't using the aws-sdk package to interact with S3 storage, so would this be the first implementation of this?

Yes, this would be the first implementation.

There are React libraries for the frontend that handle this kind of thing, e.g. https://react-dropzone-uploader.js.org/ (see its S3 uploads manual). The backend could return a standardized message in response to a presigning request. It's up to the frontend folks to evaluate whether the tool suits our needs, though.
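
To make the proposal concrete, here is a minimal sketch of the server-side presigning, assuming aws-sdk v2 and its getSignedUrlPromise helper; the bucket names, key layout, and response shape below are placeholders, not decisions:

```ts
// Sketch only: server-side S3 presigning with aws-sdk v2.
import S3 from "aws-sdk/clients/s3";

const s3 = new S3({ region: process.env.AWS_REGION });

// Presigned PUT URL for uploads, e.g. a listing hero image.
export async function createUploadUrl(listingId: string, fileName: string) {
  const key = `listing_${listingId}/images/${fileName}`;
  const url = await s3.getSignedUrlPromise("putObject", {
    Bucket: "bloom-public-assets", // assumed: the public-read, private-write bucket
    Key: key,
    Expires: 300, // URL validity in seconds
  });
  // A standardized message the frontend uploader can consume
  return { url, key, method: "PUT" };
}

// Presigned GET URL for private reads, e.g. paper applications.
export async function createDownloadUrl(key: string): Promise<string> {
  return s3.getSignedUrlPromise("getObject", {
    Bucket: "bloom-private-assets", // assumed: the private-read bucket
    Key: key,
    Expires: 300,
  });
}
```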

@seanmalbert
Collaborator

When we were discussing this last week, @jaredcwhite mentioned using Cloudinary. Giving it more thought, I think this is the way to go: it cuts down on the overhead of having to deal with image transformations in the future (and allows us to do them right away). Plus, there's a free tier that gives us 25 credits; 1 credit is good for 1,000 transformations, 1 GB of storage, or 1 GB of bandwidth. Transformations only count the first time an asset is transformed, so requesting an asset with a transformation specified in its URL won't count against us each time. If storage or bandwidth becomes an issue, Cloudinary also allows us to specify an S3 bucket to cut down on costs; given the low number of listings, I don't think it'll be an issue.

I created a Cloudinary account for Exygy to try out some things, and I think it'll work well. I can add the API secret to Heroku, so you can grab it for your local environment there too. Cloudinary also supports signed URLs for documents we want to keep private, but I think we only need to handle the public use case of listings for MVP.

So here's what I think should happen (attempting to write this so that anyone can pick this up). This outline is in broad strokes, so if you need any clarification on any part, please ask.

Backend

  1. There is already some asset definition on the listing entity and an asset table, which I think can be repurposed.
  2. Create an assets module; you can create the structure by referencing other modules like listings or preferences.
  3. asset.entity.ts should specify these fields:
    • id
    • created_at
    • updated_at
    • file_id (this will be Cloudinary's public_id)
    • label
  4. Because we want to be able to use assets for other entities, don't specify listing_id as a column.
  5. For the controller/service, I think we only need POST/create to start, since the listing itself will be responsible for getting its images.
  6. Upon receiving the asset, the service needs to handle two operations (see the sketch after this list):
    1. performing a signed upload stream to Cloudinary (see the documentation at https://github.com/cloudinary/cloudinary_npm#cloudinaryupload_stream; I believe they have promisified versions of this),
    2. using the response of the upload to create a record with the public_id; you can use the filename as the label.
  7. The response of the create service is to be used by a FileUpload component added to ui-components/src/forms, so file_id and label should be sufficient (you can construct the asset URL from the public ID, or you could return the full URL).
  8. The listing entity contains a JSON asset column. For now we can either keep using this (since we're still keeping the file_id and label keys) or save them to the asset table. To minimize the amount of work now, I think we can stick with what's there.
  9. On getting a listing, the file ID should now be the public ID, and we will have consistent sizing (at least to start with), so when you fetch the listing you can construct the image URL server-side and specify the crop method (c_fill will probably be good) plus the width and height; the URL would contain, for example, /h_500,w_500,c_fill/.
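
For steps 6 and 9, the service logic could look roughly like this, assuming the cloudinary npm package's v2 API and its bundled TypeScript types; the promisified wrapper and the createAsset/imageUrl names are illustrations, not settled interfaces:

```ts
import { v2 as cloudinary, UploadApiResponse } from "cloudinary";

// Step 6.1: wrap the library's callback-based upload_stream in a Promise.
function uploadStream(buffer: Buffer): Promise<UploadApiResponse> {
  return new Promise((resolve, reject) => {
    const stream = cloudinary.uploader.upload_stream((error, result) =>
      error || !result ? reject(error) : resolve(result)
    );
    stream.end(buffer);
  });
}

// Step 6.2: upload, then build the record to persist in the asset table.
export async function createAsset(file: Express.Multer.File) {
  const result = await uploadStream(file.buffer);
  return { file_id: result.public_id, label: file.originalname };
}

// Step 9: construct the image URL server-side with a fill crop,
// yielding delivery URLs containing /h_500,w_500,c_fill/.
export function imageUrl(fileId: string): string {
  return cloudinary.url(fileId, { width: 500, height: 500, crop: "fill", secure: true });
}
```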

Future considerations:
At a later date, we may want to upload the files in chunks. For now, if we can keep the file size to a reasonable limit, we should be fine.

Frontend

  1. In ui-components/src/forms create a FileUpload.tsx which uses the react-dropzone-uploader package linked above in @pbn4's comment (a sketch follows this list).
  2. In addition to some of the standard props the other form inputs receive, it should accept:
    • upload url
    • accepts (the type of assets it should accept, defaulting to "image/*")
    • defaultValues
  3. There are a myriad of examples out there using react-dropzone with Cloudinary, most of which show direct unsigned uploads, so if you're going off an example you found, the difference is that you're going to send the file(s) to our backend for the secure upload to Cloudinary.
  4. As files are added, you should be able to check each file's size and keep track of the total. We want to notify the user when the limit is exceeded and not upload those files.
  5. You should get back from the response a file_id and label; these are the values that will get saved with the listing.
  6. Since the server-side listing service should be constructing the imageUrl for ImageCard (at least for now), you shouldn't have to change anything client-side to render the image for the listing.
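
A rough sketch of that component, using react-dropzone-uploader's getUploadParams/onChangeStatus hooks; the prop names, the size limit, and the backend response shape are assumptions to settle during implementation:

```tsx
import React from "react"
import Dropzone, { IFileWithMeta, StatusValue } from "react-dropzone-uploader"
import "react-dropzone-uploader/dist/styles.css"

interface FileUploadProps {
  uploadUrl: string // the backend POST /assets endpoint (assumed route)
  accepts?: string
  onUploaded: (asset: { file_id: string; label: string }) => void
}

const FileUpload = ({ uploadUrl, accepts = "image/*", onUploaded }: FileUploadProps) => {
  // Send each file to our backend, which performs the signed upload to Cloudinary
  const getUploadParams = () => ({ url: uploadUrl })

  const handleChangeStatus = ({ xhr }: IFileWithMeta, status: StatusValue) => {
    if (status === "done" && xhr) {
      // The backend responds with the file_id and label to save with the listing
      onUploaded(JSON.parse(xhr.response))
    }
  }

  return (
    <Dropzone
      getUploadParams={getUploadParams}
      onChangeStatus={handleChangeStatus}
      accept={accepts}
      maxSizeBytes={5 * 1024 * 1024} // assumed per-file limit (step 4)
    />
  )
}

export default FileUpload
```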

@pbn4
Contributor Author

pbn4 commented May 18, 2021

@seanmalbert OK by me. A side note: we do not have to push a stream through the server for private uploads; Cloudinary supports the same flow as the one I proposed with S3: create a presigned upload URL server-side and expose it to the frontend (manual), so the upload can happen browser-side.
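
For reference, that browser-side flow could be signed like this; a sketch using cloudinary.utils.api_sign_request, where the endpoint shape and the CLOUDINARY_KEY variable name are assumptions:

```ts
import { v2 as cloudinary } from "cloudinary";

// Sign the upload params server-side so the browser can upload
// directly to Cloudinary without the file passing through us.
export function createUploadSignature() {
  const timestamp = Math.round(Date.now() / 1000);
  const signature = cloudinary.utils.api_sign_request(
    { timestamp },
    process.env.CLOUDINARY_SECRET as string // set in Heroku config vars
  );
  // The browser then POSTs the file plus api_key, timestamp, and signature to
  // https://api.cloudinary.com/v1_1/<cloud_name>/image/upload
  return { timestamp, signature, apiKey: process.env.CLOUDINARY_KEY };
}
```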

@seanmalbert
Collaborator

I added the Cloudinary account keys to Heroku's config vars on https://dashboard.heroku.com/apps/bloom-reference-backend/settings. The only two that need to be kept secure are CLOUDINARY_SECRET and CLOUDINARY_ENV (if you need that one).
