Let’s consider a concrete example:
If the AI is to learn the order number in a particular document, the following training data (Ground Truth) is required:
These values define the so-called Bounding Box, which is the convex hull that surrounds the text characters and determines its position.
Integration of the BLU DELTA Learn-API
The BLU DELTA AI offers its own interface, the Learn-API. This allows corrected values to be fed directly into the AI training process, whether from a workflow or a business process. The Learn-API accepts both the document and the essential document information as well as their position data.
It’s crucial that position data is captured during the correction in one’s own process or interface and then forwarded to the Learn-API. The interface must display the documents as images with a resolution of 300 dpi and allow the user to highlight text fields in the document.
In optimal integration, only the documents or invoices that the BLU DELTA AI is uncertain about are shown to the workflow. An operator reviews and, if necessary, corrects these by marking the relevant word, which is then taken over directly by the Learn-API. In the background, the workflow transfers this information directly to the Learn-API, so this correction is considered in the next training.
Optionally, the Learn-API can also accept values without position information. In such cases, a smart algorithm combined with AI tries to determine the Ground Truth (i.e., the corresponding position data for the values). However, this is not always successful. If this fails, a BLU DELTA Data Labeler must manually add the missing information, making the system learn slower.
Of course, the Learn-API can also only take over the documents. In this case, all training data must be manually checked. The information is prioritized by relevance (cluster size, active learning) and captured manually by the customer or a BLU DELTA Data Labeler before being included in the training.
For more technical details about the BLU DELTA Learn-API, visit www.bludelta.dev.
If you would like to find out more about data collection with BLU DELTA KI, we look forward to hearing from you.