Text Detection

You are currently viewing the documentation for the latest version (2.2.1). To access a different version, click the "Switch version" button located in the upper-right corner of the page.

■ If you are not sure which version of the product you are currently using, please feel free to contact Mech-Mind Technical Support.

Function

Use the Text Detection model package to run inference on the input image. The model package detects text regions in the image and is typically used together with the Text Recognition model package.

Applicable to industries such as 3C electronics, automotive, and packaging for detecting characters, labels, serial numbers, and more.

Input and Output

After you import the model package in the Deep Learning Model Package Inference Step, the following input and output ports are displayed.

Input

Input Port	Data Type	Description
Image	Image/Color	Image input to this port will be used for deep learning model package inference. Displays when the Input Data Type is 2D image.
Surface Data	Surface	Surface data input to this port will be used for deep learning model package inference. Displays when the Input Data Type is Surface data.

Input Port

Data Type

Description

Image

Image/Color

Image input to this port will be used for deep learning model package inference. Displays when the Input Data Type is 2D image.

Surface Data

Surface

Surface data input to this port will be used for deep learning model package inference. Displays when the Input Data Type is Surface data.

Output

Output Port	Data Type	Description
Visualization Output	Image/Color	Visualized results.
Text Images	Image/Color[]	Detected text area. This port is displayed when the Input Data Type is 2D image.
Surface Data	Surface[]	The text area detected from an image. This port is displayed when the Input Data Type is Surface data.
Confidence	Number[]	Confidence in the detected text area.

Output Port

Data Type

Description

Visualization Output

Image/Color

Visualized results.

Text Images

Image/Color[]

Detected text area. This port is displayed when the Input Data Type is 2D image.

Surface Data

Surface[]

The text area detected from an image. This port is displayed when the Input Data Type is Surface data.

Confidence

Number[]

Confidence in the detected text area.

Parameter Description

The following parameters need to be adjusted when the text detection model package is imported into this Step.

Model Package Settings

Parameter	Description
Model Manager Tool	Parameter description: This parameter is used to open the deep learning model package management tool and import the deep learning model package. The model package file is a .dlkpack file exported by Mech-DLK. Instruction: Refer to Deep Learning Model Package Management Tool for the usage.
Model Name	Parameter description: After a Deep Learning Model Package is imported, this parameter is used to select the imported model package for this step. Tuning instruction: After importing a deep learning model package with the Deep Learning Model Package Management Tool, select the corresponding model package name from the drop-down list.
Release Original Model Package After Switching	Parameter description: Controls whether the resources used by the original model package are released upon the switch. Default setting: Selected. Instruction: If selected, when the Step switches to another model package, the system immediately releases the resources of the original model package, even if it is still used by other Steps. If not selected, the system releases the resources of the original model package only when it is no longer used by any Step.
Model Package Type	Parameter description: Once a Model Name is selected, the Model Package Type will be filled automatically.
Input Batch Size	Parameter description: The number of images processed during each inference.
GPU ID	Parameter description: This parameter is used to select the device ID of the GPU that will be used for the inference. Tuning instruction: Once you have selected the model name, you can select the GPU ID in the drop-down list of this parameter.
Input Data Type	Parameter description: This parameter is used to specify the type of input data. The corresponding input ports will be displayed after the parameter is selected. It supports 2D image and surface data input.

Parameter

Description

Model Manager Tool

Parameter description: This parameter is used to open the deep learning model package management tool and import the deep learning model package. The model package file is a .dlkpack file exported by Mech-DLK.
Instruction: Refer to Deep Learning Model Package Management Tool for the usage.

Model Name

Parameter description: After a Deep Learning Model Package is imported, this parameter is used to select the imported model package for this step.
Tuning instruction: After importing a deep learning model package with the Deep Learning Model Package Management Tool, select the corresponding model package name from the drop-down list.

Release Original Model Package After Switching

Parameter description: Controls whether the resources used by the original model package are released upon the switch.
Default setting: Selected.
Instruction: If selected, when the Step switches to another model package, the system immediately releases the resources of the original model package, even if it is still used by other Steps. If not selected, the system releases the resources of the original model package only when it is no longer used by any Step.

Model Package Type

Parameter description: Once a Model Name is selected, the Model Package Type will be filled automatically.

Input Batch Size

Parameter description: The number of images processed during each inference.

GPU ID

Parameter description: This parameter is used to select the device ID of the GPU that will be used for the inference.
Tuning instruction: Once you have selected the model name, you can select the GPU ID in the drop-down list of this parameter.

Input Data Type

Parameter description: This parameter is used to specify the type of input data. The corresponding input ports will be displayed after the parameter is selected. It supports 2D image and surface data input.

Preprocessing

Parameter

Description

ROI File

Parameter description: This parameter is used to set or modify the ROI of the input image.

Tuning instruction: Once the deep learning model is imported, a default ROI will be applied. If you need to edit the ROI, click the Open the editor button. Edit the ROI in the pop-up Set ROI window, and fill in the ROI name.

Instructions for Setting ROI: Hold down the left mouse button and drag to select an ROI, and then click the left mouse button again to confirm. If you need to re-select the ROI, please click the left mouse button and drag again. The coordinates of the selected ROI will be displayed in the “ROI Properties” section. Click the OK button to save and exit.

Before the inference, please check whether the ROI set here is consistent with the one set in Mech-DLK. If not, the recognition result may be affected.

During the inference, the ROI set during model training, i.e. the default ROI, is usually used. If the position of the object changes in the camera’s field of view, please adjust the ROI.

If you would like to use the default ROI again, please delete the ROI file name below the Open the editor button.

Postprocessing

Parameter	Description
Inference Configuration	Parameter description: Configures the inference settings for a Text Detection model package. Click Open the editor to open the inference configuration window. Instruction: Refer to Inference Configuration Tool for detailed parameter description.

Parameter

Description

Inference Configuration

Parameter description: Configures the inference settings for a Text Detection model package. Click Open the editor to open the inference configuration window.
Instruction: Refer to Inference Configuration Tool for detailed parameter description.

Visualization Settings

Parameter	Description
Draw Result on Image	Parameter description: Once enabled, the detection results will be displayed on the image. Default value: Disabled. Instruction: Set the parameter according to the actual requirement.
Customize Font Size	Parameter description: This parameter determines whether to customize the font size in the visualized outputs. Once this option is selected, you should set the Font Size (0–10). The default value is 1.5. Default value: Disabled. Instruction: Set the parameter according to the actual requirement.

Parameter

Description

Draw Result on Image

Parameter description: Once enabled, the detection results will be displayed on the image.
Default value: Disabled.
Instruction: Set the parameter according to the actual requirement.

Customize Font Size

Parameter description: This parameter determines whether to customize the font size in the visualized outputs. Once this option is selected, you should set the Font Size (0–10). The default value is 1.5.
Default value: Disabled.
Instruction: Set the parameter according to the actual requirement.

Is this page helpful?

Leave a feedback

Text Detection

Function

Input and Output

Input

Output

Parameter Description

Model Package Settings

Preprocessing

Postprocessing

Visualization Settings

Is this page helpful?

Thanks for your support!

You can give a feedback in any of the following ways:

We Value Your Privacy