Rossum Connector
Configuring Rossum in Data Capture
As with the integration of Mindee, using Rossum in the Data Capture module requires you to first create an account via the registration process (Rossum Registration).
Once the account has been created, configuration is carried out according to the following steps:
- Access the Data Capture configuration menu.
- Open the “Rossum” submenu.
- Enter your Rossum login details:
  - Username;
  - Password associated with your account.
- For accounts created after November 2022, also enter the domain chosen when the account was created.
- Click the “Login” button to establish the link between Data Capture and the Rossum platform and enable data extraction.
Once the connection is successful, the Data Capture model will automatically use Rossum to analyze the submitted documents and extract data in a structured manner, using its document intelligence engine.
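To illustrate what the login step involves at the API level, the sketch below builds the kind of request a login could send. It is not Data Capture's actual implementation. The host patterns (`<domain>.rossum.app` for accounts with a domain, `api.elis.rossum.ai` otherwise) and the `/auth/login` path are assumptions to verify against Rossum's API reference. `build_login_request` is a hypothetical helper that only prepares the request and sends nothing.

```python
import json
from typing import Optional

# Assumed host patterns; check Rossum's API reference before relying on them.
LEGACY_BASE_URL = "https://api.elis.rossum.ai/v1"
DOMAIN_BASE_URL = "https://{domain}.rossum.app/api/v1"


def build_login_request(username: str, password: str,
                        domain: Optional[str] = None) -> dict:
    """Return URL, headers and JSON body for a login call (hypothetical helper)."""
    if not username or not password:
        raise ValueError("username and password are required")
    # Accounts that have a domain use a domain-specific host (assumption).
    base = DOMAIN_BASE_URL.format(domain=domain) if domain else LEGACY_BASE_URL
    return {
        "url": f"{base}/auth/login",
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"username": username, "password": password}),
    }
```

The domain is optional here to mirror the form, where it is only requested for accounts created after November 2022.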
Automatic import of account-related configurations
Once the connection to an account has been successfully established, all configurations associated with that account are automatically imported into Data Capture.
This includes:
- Document processing settings defined on the third-party platform (Mindee or Rossum);
- Extraction templates configured to identify and structure data;
- Any other configurations required to ensure complete and operational integration with Data Capture.
This automation saves significant time and ensures consistency between the extraction platform and the Data Capture model.
Tracking of non-imported configurations
In the event of an error or interruption during the import of configurations, the non-imported items are automatically listed to facilitate their processing.
These configurations are presented in a logical order, so that the import can be resumed step by step in the appropriate sequence until it is complete.
This tracking mechanism allows you to:
- Quickly identify missing or partially imported items;
- Manually restart the import if necessary;
- Avoid inconsistencies in the Data Capture model configuration.
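The tracking behavior described above can be pictured with a small sketch. It is an illustration only: `import_in_order` and the item names are hypothetical, and stopping at the first failure is an assumption about how the ordered list of non-imported items might be produced.

```python
from typing import Callable, Iterable, List, Tuple


def import_in_order(items: Iterable[str],
                    import_one: Callable[[str], None]) -> Tuple[List[str], List[str]]:
    """Import items in the given order and stop at the first failure.

    Returns (imported, pending). The pending list keeps the original order,
    so the import can be resumed later in the same sequence.
    """
    ordered = list(items)
    imported: List[str] = []
    for index, name in enumerate(ordered):
        try:
            import_one(name)
        except Exception:
            # The failed item and everything after it remain to be imported.
            return imported, ordered[index:]
        imported.append(name)
    return imported, []
```

Resuming then amounts to calling `import_in_order(pending, import_one)` again once the cause of the failure has been resolved.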
Rossum Schema
Once the configurations have been successfully imported, one additional step remains: finalizing the configuration of the schema integrated into Data Capture.
This phase is essential to ensure that the extraction process runs smoothly.
It includes the following actions:
- Enter the associated document types: each imported schema must be linked to the relevant document types (for example, invoice or purchase order). This association directs the analysis according to the nature of the document being processed.
- Define data capture rules: next, configure the mapping and extraction rules by specifying how the data extracted by Rossum should be interpreted, transformed, and integrated into the Data Capture model.
This step ensures that the raw data from the OCR matches the fields expected in your application.
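As a rough picture of what a mapping rule does, the sketch below converts raw extracted strings into typed target fields. Everything in it is hypothetical: the source field identifiers, the target field names, and the `apply_mapping` helper are illustrative and do not reflect how Data Capture stores or evaluates its rules.

```python
from datetime import date
from decimal import Decimal
from typing import Any, Callable, Dict, Tuple

# Hypothetical rules: extracted field id -> (target field name, converter).
MAPPING_RULES: Dict[str, Tuple[str, Callable[[str], Any]]] = {
    "invoice_id": ("invoiceNumber", str.strip),
    "date_issue": ("invoiceDate", date.fromisoformat),
    "amount_total": ("totalAmount", Decimal),
}


def apply_mapping(extracted: Dict[str, str],
                  rules: Dict[str, Tuple[str, Callable[[str], Any]]] = MAPPING_RULES
                  ) -> Dict[str, Any]:
    """Interpret and transform raw values; skip unmapped or empty fields."""
    result: Dict[str, Any] = {}
    for source_id, raw in extracted.items():
        if source_id not in rules or raw == "":
            continue
        target, convert = rules[source_id]
        result[target] = convert(raw)
    return result
```

Each rule pairs a target field with a converter, so the interpretation and transformation steps sit side by side and unmapped fields are simply ignored.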
Data Capture settings – Rossum
When configuring Rossum in Data Capture, certain specific settings must be entered in addition to the standard settings to ensure optimal extraction process performance:
- Rossum account: verify that the Rossum account is correctly connected and authenticated. All extraction actions will be performed from this account.
- Queue: specify the Rossum queue to be used for document processing. This queue determines the flow in which documents are analyzed.
- Timeout: set the maximum time allowed for processing a document. Extraction usually takes between 30 and 40 seconds, but this setting can be adjusted as needed to avoid interruptions or timeout errors.
From the Rossum configuration menu, you can also:
- view the different queues available;
- view the workspaces associated with the logged-in account, making it easier to select and manage active configurations.
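The role of the timeout setting can be illustrated with a polling loop that gives up after a fixed delay. This is a sketch under assumptions, not the module's actual code: `wait_for_extraction`, the 60-second default, and the 2-second polling interval are hypothetical, chosen only to leave some margin above the usual 30 to 40 seconds.

```python
import time
from typing import Callable


def wait_for_extraction(is_done: Callable[[], bool],
                        timeout_s: float = 60.0,
                        poll_interval_s: float = 2.0,
                        clock: Callable[[], float] = time.monotonic,
                        sleep: Callable[[float], None] = time.sleep) -> float:
    """Poll is_done() until it returns True or timeout_s has elapsed.

    Returns the elapsed time in seconds; raises TimeoutError otherwise.
    """
    start = clock()
    while not is_done():
        if clock() - start >= timeout_s:
            raise TimeoutError(f"extraction not finished after {timeout_s} s")
        sleep(poll_interval_s)
    return clock() - start
```

A timeout that is too short interrupts documents that would have finished, while one that is too long delays the detection of a stuck extraction.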
Rossum Batch
Rossum also offers a batch processing mechanism via a dedicated email address. Documents sent or forwarded to this address are automatically received, analyzed, and processed by Rossum, without manual intervention.
This mode of operation is particularly suitable for automating the receipt and processing of documents on a large scale.
On the Data Capture side, the corresponding templates are generated from Rossum's batch processing.
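For readers who want to script the sending side, the sketch below builds a single email carrying several PDF attachments using Python's standard library. The addresses and the `build_batch_email` helper are hypothetical. The real dedicated address must be taken from your Rossum account, and the message still has to be sent through your own mail server.

```python
from email.message import EmailMessage
from typing import Dict


def build_batch_email(sender: str, queue_inbox: str,
                      documents: Dict[str, bytes]) -> EmailMessage:
    """Build one email with every document attached as a PDF (filename -> bytes)."""
    message = EmailMessage()
    message["From"] = sender
    message["To"] = queue_inbox
    message["Subject"] = f"Batch of {len(documents)} document(s)"
    message.set_content("Documents for automatic processing.")
    for filename, payload in documents.items():
        message.add_attachment(payload, maintype="application",
                               subtype="pdf", filename=filename)
    return message
```

The resulting message could then be handed to `smtplib.SMTP(...).send_message(message)`, with the server details depending on your environment.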