Download linked files on a webpage after using webread/urlread

I am trying to scrape content from a webpage. When I use webread and/or urlread with the desired url as input, I get the output in HTML format, with several hyperlinks (see attached image):
txt1 = webread('https://science.house.gov/bills/hr-3458-connected-transportation-and-innovation-act-of-2015');
Screenshot 2019-09-22 at 6.07.28 PM.png
When I hover over the hyperlinks, the associated urls appear at the bottom of the MATLAB window. The hyperlinks shown in the image are the ones corresponding to the files I need. I can extract the text of these hyperlinks using extractbetween. However, I haven't been able to figure out how to read the urls associated with the hyperlinks. Is there a way to do this?
Thanks!

 Réponse acceptée

Ritu
Ritu le 3 Oct 2019
Turns out, I can do this using webread, followed by HTMLTree, and then websave.

Plus de réponses (0)

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by