Skip to content

Welcome

AI ​​Solution Kit provides a series of machine learning functions based on deep learning, such as text recognition, natural language understanding, face detection, and image understanding. You can easily use out-of-the-box AI features by calling APIs, while seamlessly integrating with other services provided by AWS.

APIs

The solution automatically creates RESTful APIs through Amazon API Gateway. After deploying the solution, you can send HTTP POST requests to use the AI features.

The table below lists the supported APIs. You can click the Details link to view the test method and sample code of each API interface. For more information, see API Reference Guide.

Optical Character Recognition (OCR)

API Name Description API
Lite OCR (Simplified Chinese) Recognize and extract Simplified Chinese, numbers, alphabetical characters and symbols. Return the information such as text or coordinates. Details
Lite OCR (Traditional Chinese) Recognize and extract Traditional Chinese, numbers, alphabetical characters and symbols from images. Return the information such as text or coordinates. Details
Advanced OCR (Multilingual) Recognize and extract Simplified/Traditional Chinese, Vietnamese, Japanese, Korean, English, numbers, alphabetical characters and symbols. Return the information such as text or coordinates. Details
Document Layout Analysis Convert document images to a Markdown/JSON format output, with table format in Markdown/HTML. Details
Custom OCR You can customize the OCR template, extract the structured text information in cards and tickets, and display the results in the key-value format. Details
Car License Plate Recognize text on Chinese car license plate. Details

Face and Body

API Name Description API
Face Detection Detect the face in an image, and map the detected facial features and contour key point information to 64 vector coordinates. Details
Face Comparison Compare two faces of the same person and return a confidence score of the similarity. Details
Human Attribute Recognition Recognize the attributes of the human body in the image, and return the human body position coordinates and attribute analysis in each area, including the semantic information of 16 attributes such as gender, age, and clothing. Details
Human Image Segmentation Segment human bodies from background and return the alpha channel which is a color component representing the degree of transparency of a color. Details

Image Understanding

API Name Description API
Image Similarity Compare two images by calculating the cosine distance from the image feature vector and converting it into confidence, and return similarity score. Details
Object Recognition Recognize objects in the image and return the region information and confidence score for each object. 300 types of objects are supported. Details
Pornography Detection Detect pornographic image in three dimensions (normal, sexy, porn) and return confidence scores. Details

Vision Production

API Name Description API
Image Super Resolution Upscale the resolution and enhance details in the images, and support image enhancement. Details

Natural Language Understanding (NLU)

API Name Description API
Text Similarity Compare two Chinese words or sentences and return similarity score. Details
General NLU Support a variety of Chinese text understanding tasks, such as text classification, sentiment analysis, extraction, and customizable labeling systems. Details

This implementation guide describes architectural considerations and configuration steps for deploying AI Solution Kit in the AWS cloud. It includes links to CloudFormation templates that launches and configures the AWS services required to deploy this solution using AWS best practices for security and availability.

The guide is intended for IT architects, developers, DevOps, data engineers with practical experience architecting in the AWS Cloud.