How to detect & localize a text in pdf using OCR in MATLAB
Using OCR to detect and localize text is simple in MATLAB. However, it only works when the input is an image (for example JPG or PNG), not a PDF, so we first need to convert the PDF to an image. Up to MATLAB R2019a, MATLAB does not have a built-in function to convert a PDF to an image. For this example, I am going to use the Python package pdf2image to do the conversion. There is no conflict between using MATLAB and Python: if something works better in Python, we can combine both platforms (MATLAB and Python) through the MATLAB API to complete our objective.
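The conversion step described above can be sketched as a small Python user-defined function. This is only a minimal sketch, not the code shipped with this submission: the function names `page_image_paths` and `pdf_to_images`, the `_pageN.png` naming scheme, and the 200 dpi default are assumptions made for illustration. It relies on `convert_from_path` from pdf2image, which in turn needs Poppler installed on the system.

```python
from pathlib import Path


def page_image_paths(pdf_path, page_count, out_dir=None, ext="png"):
    """Build one output image path per PDF page (hypothetical naming scheme)."""
    pdf = Path(pdf_path)
    folder = Path(out_dir) if out_dir else pdf.parent
    return [
        str(folder / f"{pdf.stem}_page{i}.{ext}")
        for i in range(1, page_count + 1)
    ]


def pdf_to_images(pdf_path, out_dir=None, dpi=200):
    """Render every page of a PDF to a PNG file and return the file paths."""
    # Imported here so the module still loads where pdf2image is not installed.
    from pdf2image import convert_from_path  # needs Poppler on the system

    pages = convert_from_path(str(pdf_path), dpi=dpi)  # list of PIL images
    paths = page_image_paths(pdf_path, len(pages), out_dir)
    for page, path in zip(pages, paths):
        page.save(path, "PNG")
    return paths
```

If this sketch were saved as a module on the Python path (say, a hypothetical `pdf_helper.py`), MATLAB could call it with the `py.` prefix, for example `py.pdf_helper.pdf_to_images('report.pdf')`. The resulting image files could then be read with `imread` and passed to `ocr`.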
Highlights:
Execute a Python user-defined function from MATLAB
Detect and localize text in a PDF
Product Focus:
MATLAB
Computer Vision Toolbox
Written on 16 July 2019
Cite As
Kevin Chng (2024). How to detect & localize a text in pdf using OCR in MATLAB (https://www.mathworks.com/matlabcentral/fileexchange/72156-how-to-detect-localize-a-text-in-pdf-using-ocr-in-matlab), MATLAB Central File Exchange. Retrieved.
Platform Compatibility
Windows, macOS, Linux
OCRforPDF
Version | Published | Release Notes
---|---|---
1.0.3 | | Modify description
1.0.2 | | Change description
1.0.1 | | Change description
1.0.0 | |