OCR for PDF files
As far as I know, only 'pdfminer' is currently used to extract text from a PDF. Images embedded in the PDF are not run through 'tesseract'.
Is there a way to OCR PDF files, or parts of them?