Text Detection

You are currently viewing the documentation for the latest version (2.2.0). To access a different version, click the "Switch version" button located in the upper-right corner of the page.

■ If you are not sure which version of the product you are currently using, please feel free to contact Mech-Mind Technical Support.

Function

Use the Text Detection model package to run inference on the input image. The model package detects text regions in the image and is typically used together with the Text Recognition model package.

Applicable to industries such as 3C electronics, automotive, and packaging for detecting characters, labels, serial numbers, and more.

intro

Input and Output

After the model package is imported into the Deep Learning Model Package Inference Step, the following input and output ports are displayed.

Input

Input ports Data type Description

Image

Image/Color

Image input to this port will be used for deep learning model package inference. Displays when the input data type is 2D image.

Surface data

Surface

Surface data input to this port will be used for deep learning model package inference. Displays when the input data type is Surface data.

Output

Output ports Data type Description

Visualize outputs

Image/Color

Visualized results

Text image

Image/Color[]

Detected text area. This port is displayed when the input data type is 2D image.

Surface data

Surface[]

The text area detected from an image. This port is displayed when the input data type is Surface data.

Confidence

Number[]

Confidence in the detected text area.

Parameter Description

The following parameters need to be adjusted when the text detection model package is imported into this Step.

Model Package Settings

Parameter Description

Model manager tool

Parameter description: This parameter is used to open the deep learning model package management tool and import the deep learning model package. The model package file is a .dlkpack file exported by Mech-DLK.
Tuning instruction: Refer to Mech-DLKDeep Learning Model Package Management ToolMech-DLK for the usage.

Model name

Parameter description: After a Deep Learning Model Package is imported, this parameter is used to select the imported model package for this step.
Tuning instruction: After importing a deep learning model package with the Deep Learning Model Package Management Tool, select the corresponding model package name from the drop-down list.

Release original model package after switching

Description: Controls whether the resources used by the original model package are released upon the switch.
Default setting: Selected.
Instruction: If selected, when the Step switches to another model package, the system immediately releases the resources of the original model package, even if it is still used by other Steps. If not selected, the system releases the resources of the original model package only when it is no longer used by any Step.

Model package type

Parameter description: Once a Model Name is selected, the Model Package Type will be filled automatically.

Input batch size

Description: The number of images processed during each inference.

GPU ID

Parameter description: This parameter is used to select the device ID of the GPU that will be used for the inference.
Tuning instruction: Once you have selected the model name, you can select the GPU ID in the drop-down list of this parameter.

Input data type

Description: This parameter is used to specify the type of input data. The corresponding input ports will be displayed after the parameter is selected. Support 2D image and surface data input.

Preprocessing

Parameter Description

ROI path

Description: This parameter is used to set or modify the ROI of the input image.

Tuning instruction: Once the deep learning model is imported, a default ROI will be applied. If you need to edit the ROI, click the Open the editor button. Edit the ROI in the pop-up Set ROI window, and fill in the ROI name.

Instructions for Setting ROI: Hold down the left mouse button and drag to select an ROI, and then click the left mouse button again to confirm. If you need to re-select the ROI, please click the left mouse button and drag again. The coordinates of the selected ROI will be displayed in the “ROI Properties” section. Click the OK button to save and exit.

Before the inference, please check whether the ROI set here is consistent with the one set in Mech-DLK. If not, the recognition result may be affected.

During the inference, the ROI set during model training, i.e. the default ROI, is usually used. If the position of the object changes in the camera’s field of view, please adjust the ROI.

If you would like to use the default ROI again, please delete the ROI file name below the Open the editor button.

Postprocessing

Parameter Description

Inference configuration

Description: Configures the inference settings for the text detection model package inference. Click Open the editor to open the inference configuration window.
Instruction: Refer to Inference Configuration Tool for detailed parameter description.

Visualization Settings

Parameter Description

Draw result on image

Description: Once enabled, the detection result will be displayed on the image.
Default value: Disabled.
Instruction: Set the parameter according to the actual requirement.

Customize Font Size

Parameter description: This parameter determines whether to customize the font size in the visualized outputs. Once this option is selected, you should set the Font Size (0–10). The default value is 1.5.
Default value: Disabled.
Instruction: Set the parameter according to the actual requirement.

Is this page helpful?

You can give a feedback in any of the following ways:

We Value Your Privacy

We use cookies to provide you with the best possible experience on our website. By continuing to use the site, you acknowledge that you agree to the use of cookies. If you decline, a single cookie will be used to ensure you're not tracked or remembered when you visit this website.