Problem with using fopen

6 Apr 2021

Updated 6 Apr 2021

TestCOA.pdf

The goal is not just to get the words from a PDF, as extractFileText(filename) does, but also the position of each sentence. My approach is to read the PDF and then FlateDecode it to obtain this information. After decoding, the information can look like this:

I found a Python script* that works, and I want to translate it into MATLAB.
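For readers who do not want to open the link, the decoding step can be sketched roughly as follows. This is a minimal sketch of the general FlateDecode idea, not a copy of the linked script: the regular expression is a simplification (a robust parser would use each stream's /Length entry), and `inflate_streams` and `build_sample_pdf_fragment` are hypothetical names introduced only for illustration. The sample content is made up; it only stands in for the kind of text and position operators a decoded content stream may contain.

```python
import re
import zlib

# Captures the raw bytes between "stream" and "endstream" for objects whose
# dictionary mentions FlateDecode. Simplified: real PDFs are better parsed
# via the /Length entry than with a regex.
STREAM_RE = re.compile(rb"FlateDecode.*?stream\r?\n(.*?)endstream", re.S)


def inflate_streams(pdf_bytes):
    """Return the decompressed content of every FlateDecode stream found."""
    decoded = []
    for raw in STREAM_RE.findall(pdf_bytes):
        try:
            # decompressobj tolerates the end-of-line bytes that sit between
            # the compressed data and the "endstream" keyword.
            decoded.append(zlib.decompressobj().decompress(raw))
        except zlib.error:
            # Not a plain zlib stream; skip it.
            continue
    return decoded


def build_sample_pdf_fragment(content):
    """Hypothetical helper: wrap content in a minimal FlateDecode object."""
    body = zlib.compress(content)
    header = b"1 0 obj\n<< /Length %d /Filter /FlateDecode >>\nstream\n" % len(body)
    return header + body + b"\nendstream\nendobj\n"


sample = build_sample_pdf_fragment(b"BT /F1 12 Tf 72 720 Td (Hello) Tj ET")
streams = inflate_streams(sample)
```

The important detail is that the input must be the file's raw bytes; the whole approach depends on the compressed data reaching zlib unmodified.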

...and here comes the problem.

Python:

pdf = open("TestCOA.pdf","rb").read()  # <-- Python reads the file perfectly

Matlab:

fileID = fopen("TestCOA.pdf",'rb','n','us-ascii');

A = fscanf(fileID,'%c')  % <-- reads some characters, but mixed with invalid characters (shown as <?>)

pdf = py.open("TestCOA.pdf","rb").read()  % <-- same result with the Python integration syntax
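One possible explanation for the invalid characters, offered as an assumption rather than a confirmed diagnosis: compressed PDF streams are binary data, and many of their byte values are not valid US-ASCII. Interpreting them as text with an encoding can replace or alter those bytes. The Python sketch below illustrates the effect on a small made-up sample; `ascii_round_trip` is a hypothetical helper, not part of the linked script.

```python
import zlib


def ascii_round_trip(raw):
    """Decode bytes as US-ASCII text (replacing invalid bytes), then re-encode."""
    text = raw.decode("ascii", errors="replace")
    return text.encode("ascii", errors="replace")


original = b"BT 72 720 Td (Hello) Tj ET"
compressed = zlib.compress(original)

# Compressed data contains byte values above 127, which are not valid ASCII.
has_non_ascii = any(b > 127 for b in compressed)

# After a trip through text decoding, the bytes no longer match the original,
# so they can no longer be decompressed reliably.
mangled = ascii_round_trip(compressed)
survives_text_decoding = mangled == compressed
```

If this is what happens here, reading the file as raw bytes instead of characters (for example with fread and a uint8 precision in MATLAB) might avoid the problem, though this has not been checked against TestCOA.pdf.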

I uploaded an example PDF to try it out. I hope someone can help me figure this out. :)

*The full Python script: https://gist.github.com/averagesecurityguy/ba8d9ed3c59c1deffbd1390dafa5a3c2

Answers (0)
