The bank module cannot parse PDF anymore
Hello,
I got an issue with local_installer.sh thus I use for now the official pip package instead of Git sources (see my another issue here: #644).
It's very strange because it has worked one time but now it doesn't work because of a PDF parsing issue. Here is the output of woob bank ls --debug
:
2023-07-27 20:46:12,844:DEBUG:woob.backend.cic.browser:3.6:browsers.py:1004:internal_callback Handle https://www.cic.fr/fr/banque/documentinternet.html?_tabi=C&_pid=DocumentsEtContrats&k_typePageDoc=DocsFavoris&k_keyFolder=&k_numPage=1&k_natDoc=&k_from=&k_to=&k_crit=CTRREF%3DCOURANT+100961830800074204001&_fid=GoOuvreDoc&cle=141000746293639 with SubscriptionPage Bug(cic): Please install python3-pdfminer to parse PDF (If --auto-update is passed on the command-line, new versions of the module will be checked automatically) Traceback (most recent call last): File "/home/turman/.local/pipx/venvs/woob/lib/python3.11/site-packages/woob/tools/pdf.py", line 527, in extract_text from pdfminer.pdfdocument import PDFDocument ModuleNotFoundError: No module named 'pdfminer'
During handling of the above exception, another exception occurred:
Traceback (most recent call last): File "/home/turman/.local/pipx/venvs/woob/lib/python3.11/site-packages/woob/tools/pdf.py", line 531, in extract_text from pdfminer.pdfparser import PDFDocument ModuleNotFoundError: No module named 'pdfminer'
During handling of the above exception, another exception occurred:
Traceback (most recent call last): File "/home/turman/.local/pipx/venvs/woob/lib/python3.11/site-packages/woob/core/bcall.py", line 100, in backend_process for subresult in result: File "/home/turman/.local/pipx/venvs/woob/lib/python3.11/site-packages/woob/tools/application/base.py", line 327, in _do_complete_iter for i, sub in enumerate(res): File "/home/turman/.local/share/woob/modules/3.6/woob_modules/creditmutuel/module.py", line 72, in iter_accounts for account in self.browser.get_accounts_list(): ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/turman/.local/pipx/venvs/woob/lib/python3.11/site-packages/woob/browser/browsers.py", line 1121, in inner return func(browser, *args, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/turman/.local/share/woob/modules/3.6/woob_modules/creditmutuel/browser.py", line 779, in get_accounts_list text = extract_text_from_pdf(content) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/turman/.local/pipx/venvs/woob/lib/python3.11/site-packages/woob/tools/pdf.py", line 537, in extract_text raise ImportError('Please install python3-pdfminer to parse PDF') ImportError: Please install python3-pdfminer to parse PDF
---------------------------------------------------------+----------+---------- 2023-07-27 20:46:12,897:DEBUG:woob.backend.cic.browser:3.6:browsers.py:1256:dump_state Stored cookies into storage 2023-07-27 20:46:12,900:DEBUG:woob.config:3.6:yamlconfig.py:90:save Configuration file saved: /home/turman/.config/woob/bank.storage.
I'm using Debian 12 and the python3-pdfminer Debian package is correctly installed.
Maybe it's linked to the fact I had to use pipx instead of pip in order to install woob ?
Thanks for your help, Vincent.