Hi,
I have a sample image below where i’m trying to extract the content using pytesseract
I’ve tried pre-processing it in OpenCV first via:
- Convert to grayscale
- Apply Gaussian Blurring, then Adaptive Thresholding
- Apply Dilation
Using Pytesseract, i can extract the text fine apart from those in the Dividend Period column, and that is because the words are not pronounced enough.
Are there other pre-processing techniques that i can use in OpenCV to increase the font weight of that column so i can extract those text to a reasonable degree?
Appreciate if someone can point me in the right direction (better if you can share a sample code), or is this not possible at all?