Use the Text Recognition Module

You are currently viewing the documentation for the latest version (2.6.2). To access a different version, click the "Switch version" button located in the upper-right corner of the page.

■ If you are not sure which version of the product you are currently using, please feel free to contact Mech-Mind Technical Support.

Taking an image dataset of numbers (click to download) as an example, this topic will show you how to use the Text Recognition module to recognize and output the characters in an image. The characters that can be recognized include numbers, letters, and some special symbols.

You can also use your own data. The usage process is overall the same, but the labeling part is different.

Preparations

Create a new project and add the Text Recognition module: Click New Project after you opened the software, name the project, and select a directory to save the project. Then, click in the upper-right corner and add the Text Recognition module.

Import the image data: Unzip the downloaded data file. Click the Import/Export button in the upper left corner, select Import Folder, and import the image data.

If duplicate images are detected in the image data, you can choose to skip, import, or set an tag for them in the pop-up Import Images dialog box. Since each image supports only one tag, adding a new tag to an already tagged image will overwrite the existing tag. When importing a dataset, you can choose whether to replace duplicate images.

Dialog box for the Import Images or Import Folder option:

Dialog box for the Import Dataset option:

When you select Import Dataset, you can only import datasets in the DLKDB format (.dlkdb), which are datasets exported from Mech-DLK.
The texts in the imported images should be oriented toward the positive direction (0°).
The Text Detection or Object Detection module can precede the Text Recognition module for better recognition results. In this case, you can select Import Import From Previous Module to import data.
- If a Text Detection module precedes it, make sure the Rectify image(s) feature is enabled to rectify images to 0°. Typically, the Rectify image(s) feature reliably carries out its designated task. However, occasionally, few images oriented at 0° may inadvertently undergo rectification to 180°. In such situations, it is advisable to exercise discretion in practical applications.

Select an ROI: Click the ROI Tool button and adjust the frame to set an ROI that covers the text areas of all images. Then, click the button in the lower right corner of the ROI to save the setting. Setting the ROI can avoid interferences from the background.
Divide training and validation sets: You can click and drag the slider to adjust the image ratio. Please make sure that both the training set and validation set include all kinds of texts to be detected. If the default training set and validation set cannot meet this requirement, please right-click the name of the image and then click Switch to training set or Switch to validation set to adjust the set to which the image belongs.

Data Labeling

Select the Text Recognition Tool from the toolbar to label the images. When the Text Recognition Tool is used to make a selection, the recognition result will automatically appear right under the selection frame. Manual verification and confirmation are required. Therefore, making a valid selection and confirming a correct recognition result in time are conducive to improving model quality.

When labeling a image, you can click Pre-trained labeling or VFM labeling for automatic labeling. After the automatic labeling is completed, you can manually adjust the labeling results.
For adjustment in VFM Labeling Tool, please follow those steps:
1. Right-click on and select VFM Labeling Tool.
2. Click on the upper area of the labeling area.
3. In the window of VFM Labeling Settings, you can modify the VFM labeling results.

In the process of labeling, the feature of Text Template can be used. Text Template is for the batch replacement of labeled results. Please follow these steps:

Click Text Template under the Labeling tab in the right side of the main interface.
Click + New in the upper-right corner of the Text Template tab.
In the New Text Template window, enter text(s) and click OK.
Select an image in the image list. Click Apply beside the newly created text template. The applied text template will be used in the labeling result.

Train Models

Train the model: Keep the default training parameter settings and click on Train to start training the model.
Monitor training progress through training information: On the Training tab, the training information panel allows you to view real-time model training details.
View training progress through the Training Chart window: Click the Show chart button under the Training tab to view real-time changes in the model’s accuracy and loss curves during training. An overall upward trend in the accuracy curve and a downward trend in the loss curve indicate that the current training is running properly.

Stop training early based on actual conditions (optional): When the model accuracy has met the requirements, you can save time by clicking the Training center button, selecting the project from the task list, and then clicking stop trainig to stop training. You can also wait for the model training to complete and observe parameter values such as the highest accuracy to make a preliminary assessment of the model’s performance.

If the accuracy curve shows no upward trend after many epochs, it may indicate a problem with the current model training. Stop the model training process, check all parameter settings, examine the training set for missing or incorrect labels, correct them as needed, and then restart training.

Model Validation

Validate the model: After the training is completed, click Validate to validate the model and check the results.
Check the model’s validation results in the training set: After validation is complete, you can view the validation result quantity statistics in the Validation statistics section under the Validation tab. Click the View full report button to open the Detailed Report window and view detailed validation statistics.

If the model training performance is unsatisfactory, please check the labels, adjust the training parameter settings, and restart the training. You can also click the Export report button at the bottom-right corner of the Detailed Report window to export the report to your local device.
Restart training: After adding newly labeled images to the training set, click the Train button to restart training.
Recheck model validation results: After training is complete, click the Validate button again to validate the model and review the validation results on each dataset.
Fine-tune the model (optional): You can enable developer mode and turn on Finetune in the Training Parameter Settings dialog box. For more information, see Iterate a Model.

Continuously optimize the model: Repeat the above steps to continuously improve model performance until it meets the requirements.

If you want to adjust validation results, please follow those steps:

Set filtering rules in Filtering Rule Settings in the validation tab.
In Filtering Rule Settings window, you can modify validation results by adding filtering items.

Model Export

Click Export and select a directory to save the trained model.

The exported model can be used in Mech-DLK SDK. Click here to view the details.

Is this page helpful?

Leave a feedback

Thanks for your support!

You can give a feedback in any of the following ways:

We Value Your Privacy

We use cookies to provide you with the best possible experience on our website. By continuing to use the site, you acknowledge that you agree to the use of cookies. If you decline, a single cookie will be used to ensure you're not tracked or remembered when you visit this website.

I accept

I don't accept