If you are using the Limina container, the OCR system is configured through environment variables. For detailed instructions, refer to the sections on Environment Variables and the Guide for Integrating Limina DEID container with Azure OCR. You can configure more than one OCR systems.Once configured, the OCR system can be selected either via environment variables or directly in the REST request.
Standard OCR
| Attribute | Rating |
|---|---|
| Accuracy | Medium |
| Speed | High |
| Cost | Low |
When to choose Standard OCR?
- Most cost efficient OCR solution is needed.
Azure Computer Vision OCR
| Attribute | Rating |
|---|---|
| Accuracy | High |
| Speed | High |
| Cost | Medium |
PAI_OCR_SYSTEM=azure_computer_vision as an environment variable at container startup or specify it directly in the request parameters. Check out our Guide for Integrating Limina DEID container with Azure OCR for setup information.
The control flow is similar to the Standard OCR mode.
When to choose Azure Computer Vision OCR?
- Handwriting support is needed.
- Rotated document support is needed.
- Enhanced language support is needed.
- Wide range of supported languages, such as Japanese. See Azure OCR Language support for more details.
Azure Document Intelligence
| Attribute | Rating |
|---|---|
| Accuracy | High |
| Speed | High |
| Cost | High |
PAI_OCR_SYSTEM=azure_doc_intelligence as an environment variable at container startup or specify it directly in the request parameters. Check out our Guide for Integrating Limina DEID container with Azure OCR for setup information.
The control flow is similar to the Standard OCR mode.
When to choose Azure Document Intelligence?
- Need advanced features for complex layouts like tables and forms.
- Handwriting support is needed.
- Rotated document support is needed.
- Enhanced language support is needed.
- Wide range of supported languages. See Azure OCR Language support for more details.
Hybrid OCR
| Attribute | Rating |
|---|---|
| Accuracy | Medium |
| Speed | High |
| Cost | Medium |
PAI_OCR_SYSTEM=hybrid either as an environment variable or specify it in the request parameters, instead of PAI_OCR_SYSTEM=azure_computer_vision.
The control flow for Hybrid OCR mode is depicted in the following flow chart:
When to choose Hybrid OCR?
- Need a mix of performance and cost effectiveness.
- Need to process mainly PDFs. For other document types the Hybrid OCR will work just like the Standard OCR.