The Image Recognition and Processing Backend demonstrates how to use AWS Step Functions to orchestrate a serverless processing workflow using AWS Lambda, Amazon S3, Amazon DynamoDB and Amazon Rekognition. This workflow processes photos uploaded to Amazon S3 and extracts metadata from the image such as geolocation, size/format, time, etc. It then uses image recognition to tag objects in the photo. In parallel, it also produces a thumbnail of the photo.
This repository contains sample code for all the Lambda functions depicted in the diagram below as well as an AWS CloudFormation template for creating the functions and related resources. There is also a test web app that you can run locally to interact with the backend.
- An image is uploaded to the
PhotoRepo
S3 bucket under the "Incoming/" prefix - The S3 upload event triggers the
ImageProcStartExecution
Lambda function, which kicks off an execution of theImageProc
state machine in AWS Step Functions, passing in the S3 bucket and object key as input parameters. - The
ImageProc
state machine has the following sub-steps:
- Read the file from S3 and extract image metadata (format, EXIF data, size, etc.)
- Based on output from previous step, validate if the file uploaded is a supported file format (png or jpg). If not, throw
NotSupportedImageType
error and end execution. - Store the extracted metadata in the
ImageMetadata
DynamoDB table - In parallel, kick off two processes simultaneously:
- Call Amazon Rekognition to detect objects in the image file. If detected, store the tags in the
ImageMetadata
DynamoDB table - Generate a thumbnail and store it under the "Thumbnails/" prefix in the
PhotoRepo
S3 bucket
- Call Amazon Rekognition to detect objects in the image file. If detected, store the tags in the
You can use the test web app to upload images and see the result of the image recognition and processing workflow.
The backend infrastructure can be deployed in US West - Oregon (us-west-2) using the provided CloudFormation template. Click Launch Stack to launch the template in the US West - Oregon (us-west-2) region in your account:
(In the last page of the wizard, make sure to:
- Click the checkboxes to give AWS CloudFormation permission to "create IAM resources" and "create IAM resources with custom names"
- Follow the instructions to "Create Change Set"
- Click "Execute" )
Option 2: Launch the CloudFormation Template in a different region than US West - Oregon (us-west-2)
If you would like to deploy the template to a different region (must be a region that supports Amazon Rekognition and AWS Step Functions, e.g. US East (N.Virginia) or EU (Ireland), you need a S3 bucket in the target region, and then package the Lambda functions into that S3 bucket by using the aws cloudformation package
utility.
First, In the terminal, go to the lambda-functions
folder. Then prepare npm dependencies for the following Lambda functions:
cd lambda-functions
cd create-s3-event-trigger-helper && npm install && cd ../thumbnail && npm install && cd ../extract-image-metadata && npm install && cd ..
Set environment variables for later commands to use:
REGION=[YOUR_TARGET_REGION]
S3BUCKET=[REPLACE_WITH_YOUR_BUCKET]
Then go to the cloudformation
folder and use the aws cloudformation package
utility
cd ../cloudformation
python inject_state_machine_cfn.py -s state-machine.json -c image-processing.serverless.yaml -o image-processing.complete.yaml
aws cloudformation package --region $REGION --s3-bucket $S3BUCKET --template image-processing.complete.yaml --output-template-file image-processing.output.yaml
Last, deploy the stack with the resulting yaml (image-processing.output.yaml
) through the CloudFormation Console or command line:
aws cloudformation deploy --region $REGION --template-file image-processing.output.yaml --stack-name photo-sharing-backend --capabilities CAPABILITY_IAM
You can use the test web app to see the backend working in action.
The web app needs references to the resources created from the CloudFormation template above. To do so, follow these steps:
- Go to CloudFormation console
- Go to the Output section of the stack you just launched in the previous section
- Open the Config.ts file in the webapp/app/ folder, and fill in the corresponding values from the CloudFormation stack output
This web app is built using Angular2 and TypeScript, which relies heavily on node and npm.
Verify that you are running at least node v4.x.x
and npm 3.x.x
by running node -v
and npm -v
in a terminal/console window.
In a terminal, go to the webapp
folder, then type
npm install
npm start
This compiles the application, starts a local server, and opens a browser that loads the test web application.
Pick any username to log in (This is a test app to showcase the backend so it's not using real user authentication. In an actual app, you can use Amazon Cognito to manage user sign-up and login.)
The username will be used in storing ownership metadata of the uploaded images.
Create new or select existing albums to upload images to.
Upload images and see status updates when:
- Upload to S3 bucket succeeds
- The AWS Step Function execution is started. The execution ARN is provided in the UI so you can easily look up its details in the Step Functions Console
- The AWS Step Function execution completes
A sample set of extracted image metadata and recognized tags, along with the thumbnail generated in the Step Function execution is displayed for each uploaded image.
Below is the diagram of the state machine being executed every time a new image is uploaded (you can explore this in the Step Functions Console):
To remove all resources created by this example, do the following:
- Delete all objects from the S3 bucket created by the CloudFormation stack.
- Delete the CloudFormation stack.
- Delete the CloudWatch log groups associated with each Lambda function created by the CloudFormation stack.
The following sections explain all of the resources created by the CloudFormation template provided with this example.
- PhotoRepoS3Bucket - An S3 bucket that stores the incoming uploaded images and resized thumbnails
- AlbumMetadataDDBTable - A DynamoDB table that stores information about albums (owner, creation dates, etc.)
- ImageMetadataDDBTable - A DynamoDB table that stores metadata about each image uploaded to the system (size, extracted GPS coordinates, tags detected by Amazon Rekognition, etc.)
- ImageProcStateMachine - An Step Functions state machine that orchestrates the multi-step image processing workflow
- ExtractImageMetadataFunction - A Lambda function that uses the GraphicsMagick library to extract metadata from the uploaded image (format, size, EXIF info, etc.)
- StoreImageMetadataFunction - A Lambda function that stores the extracted metadata into
ImageMetadataDDBTable
- RekognitionFunction - A Lambda function that invokes the Amazon Rekognition API to detect labels in the uploaded image
- StoreRekognizedTagsFunction - A Lambda function that stores the labels detected by Amazon Rekognition into the
ImageMetadataDDBTable
- GenerateThumbnailFunction - A Lambda function that generates a thumbnail for the uploaded images
- ImageProcStartExecutionFunction - A Lambda function that kicks off the
ImageProcStateMachine
every time a new object is uploaded into S3 bucket under "Incoming/" prefix - CreateS3EventTriggerFunction - A Lambda function that the CloudFormation template calls to create the S3 event trigger to invoke ImageProcStartExecutionFunction as a custom resource
- DescribeExecutionFunction - A Lambda function that queries the AWS Step Functions service on the status of a given execution
- TestClientIdentityPool - A Cognito Identity Pool used by the test web app to sign API requests to query the metadata DynamoDB tables and Step Functions execution status, and read/write from the
PhotoRepoS3Bucket
This CloudFormation template chose not to create one IAM role for each Lambda function and consolidated them, simply to reduce the number of IAM roles it takes up in your account. When developing your application, you might instead create individual IAM roles for each Lambda function to follow the Least Privilege principle.
- BackendProcessingLambdaRole - An IAM role assumed by Lambda functions that make up the
ImageProcStateMachine
and theImageProcStartExecutionFunction
which kicks off the state machine execution. This role provides logging permissions and access to read/write thePhotoRepoS3Bucket
, theImageMetadataDDBTable
, call theDetectLabels
Amazon Rekognition API and start state machine execution in Step Functions. - CustomResourceHelperRole - An IAM role the Lambda functions that are used by
CreateS3EventTriggerFunction
for creating custom resources in the CloudFormation template - StateMachineRole - An IAM role assumed by the
ImageProcStateMachine
during execution. It has permission to invoke Lambda functions. - DescribeExecutionFunctionRole - An IAM role assumed by
DescribeExecutionFunction
. It has permission toDescribeExecution
API in Step Functions. - TestClientIAMRole - An IAM role assumed by the
TestClientIdentityPool
Cognito Identity pool
This reference architecture sample is licensed under Apache 2.0.