webread is not getting all text on a website

Will Kinsman on 28 Feb 2016
Hi all,
I am trying to write a program that extracts the plain text from a web page. The problem is that webread does not seem to return all of the text on the page (specifically, the table; see the URL in the code below). I see two options:
  1. query a third-party HTML-to-plain-text service that does a better job
  2. find a workaround that captures more text than the webread approach I am using now
Here is my code:
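% Download the page source as a character vector
html = webread('https://finance.yahoo.com/q/bs?s=MXWL');
% Remove <script> blocks, including their contents
txt = regexprep(html,'<script.*?/script>','');
% saveTXT is not a built-in function; presumably my own helper that writes txt to a file
saveTXT(txt,'htmlplaintext');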
Help is greatly appreciated; I love you guys!
Will
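
A minimal sketch of option 2, for illustration only and not taken from the thread. It assumes the table markup is actually present in the string that webread returns; webread does not run JavaScript, so if the page fills the table in after loading, no post-processing of that string will recover it. The sample html string below is hypothetical and stands in for the webread output so the snippet runs offline.

```matlab
% Hypothetical sample standing in for the string returned by webread
html = ['<html><head><style>td{color:red}</style>' ...
        '<script>var x = 1;</script></head><body>' ...
        '<table><tr><td>Total Assets</td><td>100</td></tr></table>' ...
        '</body></html>'];

% Diagnostic: is there any table markup in the downloaded source at all?
hasTable = ~isempty(regexp(html, '<table', 'once', 'ignorecase'));

% Drop script and style blocks together with their contents
txt = regexprep(html, '<script.*?</script>', '', 'ignorecase');
txt = regexprep(txt,  '<style.*?</style>',  '', 'ignorecase');

% Replace every remaining tag with a space so adjacent cells do not run together
txt = regexprep(txt, '<[^>]+>', ' ');

% Collapse runs of whitespace and trim the ends
txt = strtrim(regexprep(txt, '\s+', ' '));

disp(hasTable)
disp(txt)
```

If hasTable is false for the real page, the table text is not in the downloaded source, and the fix would have to come from a different data source rather than from more regular expressions.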

Answers (1)

Walter Roberson on 28 Feb 2016
