OCR (Optical Character Recognition) automatically recognizes text within images so that it can be extracted for data discovery. It works best on high-quality images and supports multiple languages. Handwritten content is processed on a best-effort basis.
Recommended Settings
For best results, use clear images that meet the recommended resolution and the minimum DPI.
- Resolution: 1024 x 768 pixels or higher
- DPI: Minimum of 300 DPI
- File Types: JPG, JPEG, PNG
Image Quality
- Resolution: OCR processing requires a minimum image resolution of 640 x 480 pixels (approximately 300,000 pixels). However, to optimize text recognition accuracy, an image resolution of 1024 x 768 pixels or higher is recommended.
- DPI (Dots Per Inch): A minimum of 300 DPI is recommended.
- Image Clarity: Text must be clear and readable. Recognition accuracy may drop on blurry or low-quality images.
Supported File Types
OCR supports common image file formats, including JPG, JPEG, and PNG.
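The resolution, DPI, and file type requirements above can be checked before an image is submitted. The following Python sketch is illustrative only and is not part of the product. The helper name check_image_for_ocr is hypothetical, and it makes three assumptions: the caller already knows the image's pixel dimensions and DPI, the resolution limits apply regardless of orientation (so a 768 x 1024 portrait image counts the same as 1024 x 768), and a DPI below 300 is reported as a warning rather than a hard failure.

```python
from pathlib import Path

# Thresholds taken from the requirements listed above.
MIN_LONG_SIDE, MIN_SHORT_SIDE = 640, 480
RECOMMENDED_LONG_SIDE, RECOMMENDED_SHORT_SIDE = 1024, 768
MIN_DPI = 300
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def check_image_for_ocr(filename, width, height, dpi=None):
    """Return (errors, warnings) for an image described by its metadata.

    Hypothetical helper. Errors mean the image does not meet the stated
    minimums; warnings mean it meets them but falls short of the
    recommendations.
    """
    errors, warnings = [], []

    # File type: compare the extension case-insensitively.
    ext = Path(filename).suffix.lower()
    if ext not in SUPPORTED_EXTENSIONS:
        errors.append(f"unsupported file type: {ext or '(none)'}")

    # Resolution: assumed to be orientation-agnostic, so compare the
    # longer and shorter sides rather than width and height directly.
    long_side, short_side = max(width, height), min(width, height)
    if long_side < MIN_LONG_SIDE or short_side < MIN_SHORT_SIDE:
        errors.append(f"resolution {width} x {height} is below 640 x 480")
    elif long_side < RECOMMENDED_LONG_SIDE or short_side < RECOMMENDED_SHORT_SIDE:
        warnings.append(f"resolution {width} x {height} is below 1024 x 768")

    # DPI: many images carry no DPI metadata, so None skips the check.
    if dpi is not None and dpi < MIN_DPI:
        warnings.append(f"{dpi} DPI is below the 300 DPI minimum")

    return errors, warnings


if __name__ == "__main__":
    print(check_image_for_ocr("invoice.png", 1600, 1200, 300))
    print(check_image_for_ocr("photo.gif", 320, 240, 72))
```

An image with an empty errors list meets the stated minimums. Any warnings indicate that recognition accuracy may be lower than it would be at the recommended settings.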
Supported Languages
OCR is compatible with multiple languages, facilitating broad applicability for data discovery across multilingual data sources.
Handwriting Support
Handwritten text is processed on a best-effort basis. Recognition accuracy varies with the clarity and style of the handwriting.