mediaInfo-python-lambda

An AWS Lambda function that retrieves metadata from video files when they are uploaded to an S3 bucket.

General info

This Lambda uses the MediaInfo library to retrieve metadata from video files (MPEG-4, AVI, MPEG-TS, MPEG-PS, FLV, H.264/AVC, DivX, H.263, H.265 ...) stored in an S3 bucket. The code is based in part on the tutorial and code from https://github.com/kchokshifox/MediaEvaluator and on the MediaInfo library from https://mediaarea.net/en/MediaInfo, which is open-source software under a BSD-style license: "Copyright (c) 2002-2021 MediaArea.net SARL. All rights reserved." Add the metadata fields you need to the response (lines 58-81). This example includes wrapperType, codec, bitRate, width, height, and resolution, because most video formats carry them in their metadata.
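As a rough illustration of how fields might be added to the response, the sketch below picks values out of a MediaInfo result that has already been parsed into dictionaries (for example with xmltodict). The function name `build_response`, the track layout, and the key names (`Format`, `BitRate`, `Width`, `Height`) are assumptions made for this example and are not taken from mediaEvaluator.py; adjust them to whatever your MediaInfo version actually returns.

```python
def build_response(tracks):
    """Collect commonly available video fields from a list of track dicts.

    `tracks` is assumed to be a list of dicts, each carrying an '@type'
    key ('General', 'Video', 'Audio', ...). The key names used below are
    illustrative assumptions, not the exact names used by the project.
    """
    general = next((t for t in tracks if t.get("@type") == "General"), {})
    video = next((t for t in tracks if t.get("@type") == "Video"), {})

    width = video.get("Width")
    height = video.get("Height")
    return {
        "wrapperType": general.get("Format"),
        "codec": video.get("Format"),
        "bitRate": video.get("BitRate"),
        "width": width,
        "height": height,
        # Derived field; further fields can be added in the same way.
        "resolution": f"{width}x{height}" if width and height else None,
    }


# Hypothetical parsed output for a single MP4 file.
sample_tracks = [
    {"@type": "General", "Format": "MPEG-4"},
    {"@type": "Video", "Format": "AVC", "BitRate": "5000000",
     "Width": "1920", "Height": "1080"},
]
response = build_response(sample_tracks)
print(response)
```

Using `.get()` throughout means a missing field yields `None` rather than raising an error, which is convenient when formats do not all expose the same metadata.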

Authors

Technologies

The project is created with:

Python
AWS EC2 or Docker with Amazon Linux image
AWS CLI
AWS S3
AWS IAM roles
AWS Lambda

Install

Open VSCode to add any additional metadata fields that you need. Add your MongoDB Atlas DB API to line 122. Be sure to create the collection in MongoDB first. Zip the folder and upload it to an S3 bucket. Create a Lambda function with the Python 3.7 runtime. Create an IAM role with full access permissions to Lambda and full access permissions to S3. Do not add this Lambda to a VPC. The resource-based policy on your Lambda should look like the following:

{
  "Version": "2012-10-17",
  "Id": "default",
  "Statement": [
    {
      "Sid": "lambda-de46be74-cbf3-4f18-87ca-484754dcaddd",
      "Effect": "Allow",
      "Principal": {
        "Service": "s3.amazonaws.com"
      },
      "Action": "lambda:InvokeFunction",
      "Resource": "arn:aws:lambda:us-east-1:<aws_account_number>:function:<lambda_name>",
      "Condition": {
        "StringEquals": {
          "AWS:SourceAccount": "<aws_account_number>"
        },
        "ArnLike": {
          "AWS:SourceArn": "<ARN_s3_video_bucket>"
        }
      }
    }
  ]
}
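If you prefer to attach this statement from a script rather than the console, a minimal sketch using the boto3 `add_permission` call might look like the following. The helper names, the statement ID, and the sample values are hypothetical; the AWS call itself needs boto3 and valid credentials, so it is kept in a separate function and not executed here.

```python
def invoke_permission_kwargs(function_name, account_id, bucket_arn):
    """Build keyword arguments for the Lambda AddPermission API call."""
    return {
        "FunctionName": function_name,
        "StatementId": "s3-invoke-mediainfo",  # hypothetical statement ID
        "Action": "lambda:InvokeFunction",
        "Principal": "s3.amazonaws.com",
        "SourceAccount": account_id,
        "SourceArn": bucket_arn,
    }


def grant_s3_invoke(function_name, account_id, bucket_arn):
    """Attach the statement to the function (needs boto3 and credentials)."""
    import boto3  # imported lazily so the builder above works without boto3

    client = boto3.client("lambda")
    return client.add_permission(
        **invoke_permission_kwargs(function_name, account_id, bucket_arn)
    )


# Placeholder values for illustration only.
kwargs = invoke_permission_kwargs(
    "my-mediainfo-lambda", "123456789012", "arn:aws:s3:::my-video-bucket"
)
print(kwargs)
```

When the S3 trigger is added through the Lambda console, an equivalent statement is normally created for you, so this step is only needed for scripted setups.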

Apply the IAM role you just created to the Lambda. Then add the following permissions to the S3 bucket that holds your videos:

{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Principal": {
                "AWS": "<full_ARN_for_the_mediainfo_lambda>"
            },
            "Action": [
                "s3:GetObject",
                "s3:GetObjectAcl"
            ],
            "Resource": "arn:aws:s3:::<bucket_name>/*"
        }
    ]
}
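The same bucket policy can be generated and applied from Python. The sketch below is one possible way to do it, assuming boto3 is installed and credentials are configured; the function names and sample ARN are hypothetical. Note that `put_bucket_policy` replaces any policy already on the bucket, so merge statements first if the bucket has an existing one.

```python
import json


def bucket_policy(principal_arn, bucket_name):
    """Return the read-only bucket policy as a Python dict."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Effect": "Allow",
                "Principal": {"AWS": principal_arn},
                "Action": ["s3:GetObject", "s3:GetObjectAcl"],
                "Resource": f"arn:aws:s3:::{bucket_name}/*",
            }
        ],
    }


def apply_bucket_policy(principal_arn, bucket_name):
    """Upload the policy to the bucket (needs boto3 and credentials)."""
    import boto3  # imported lazily so bucket_policy() works without boto3

    s3 = boto3.client("s3")
    s3.put_bucket_policy(
        Bucket=bucket_name,
        Policy=json.dumps(bucket_policy(principal_arn, bucket_name)),
    )


# Placeholder principal and bucket name for illustration only.
policy = bucket_policy(
    "arn:aws:iam::123456789012:role/my-mediainfo-role", "my-video-bucket"
)
print(json.dumps(policy, indent=4))
```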

In the Lambda console, use the 'Upload from' button to load the zip file by its S3 URI. Next, add a trigger to your Lambda function for the S3 bucket that will hold the video files. Upload a video to test the setup.
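For scripted setups, the trigger can also be configured with the boto3 `put_bucket_notification_configuration` call. The following is only a sketch: the helper names and the `.mp4` suffix filter are assumptions added for illustration, and the call overwrites any notification configuration already present on the bucket. The Lambda's resource-based policy must already allow S3 to invoke it.

```python
def notification_config(lambda_arn, suffix=".mp4"):
    """Build an S3 notification configuration that invokes the Lambda
    for newly created objects whose key ends with `suffix`."""
    return {
        "LambdaFunctionConfigurations": [
            {
                "LambdaFunctionArn": lambda_arn,
                "Events": ["s3:ObjectCreated:*"],
                "Filter": {
                    "Key": {
                        "FilterRules": [{"Name": "suffix", "Value": suffix}]
                    }
                },
            }
        ]
    }


def add_trigger(bucket_name, lambda_arn, suffix=".mp4"):
    """Apply the configuration to the bucket (needs boto3 and credentials)."""
    import boto3  # imported lazily so the builder above works without boto3

    s3 = boto3.client("s3")
    s3.put_bucket_notification_configuration(
        Bucket=bucket_name,
        NotificationConfiguration=notification_config(lambda_arn, suffix),
    )


# Placeholder ARN for illustration only.
config = notification_config(
    "arn:aws:lambda:us-east-1:123456789012:function:my-mediainfo-lambda"
)
print(config)
```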

Testing

For testing purposes, I recommend replacing lines 90-92 in mediaEvaluator.py with the following:

bucketName = event['bucketName']
objectName = event['objectKey']

You can now use the Test feature in the AWS Lambda console by entering the following as your test event:

{
    "bucketName": "<s3_video_files_bucket>",
    "objectKey": "<file_name_as_stored_in_s3_bucket>"
}
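Instead of swapping lines back and forth between testing and production, one option is a small helper that accepts both event shapes. This is a sketch rather than the project's code: `extract_location` is a hypothetical name, and it relies on the standard S3 notification layout (`Records[0].s3.bucket.name` and `Records[0].s3.object.key`), where object keys arrive URL-encoded.

```python
from urllib.parse import unquote_plus


def extract_location(event):
    """Return (bucket, key) from an S3 notification or a manual test event."""
    if "Records" in event:
        s3_info = event["Records"][0]["s3"]
        # Keys in S3 notifications are URL-encoded (spaces arrive as '+').
        key = unquote_plus(s3_info["object"]["key"])
        return s3_info["bucket"]["name"], key
    # Manual test event in the shape shown above.
    return event["bucketName"], event["objectKey"]


# Hypothetical events for illustration.
test_event = {"bucketName": "my-video-bucket", "objectKey": "clip.mp4"}
s3_event = {
    "Records": [
        {"s3": {"bucket": {"name": "my-video-bucket"},
                "object": {"key": "holiday+clip.mp4"}}}
    ]
}
print(extract_location(test_event))
print(extract_location(s3_event))
```

With a helper like this, the handler could call `bucketName, objectName = extract_location(event)` and work unchanged for both console tests and real uploads.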



reedtlr/mediaInfo-python-lambda
