Welcome!
This is the community forum for my apps Pythonista and Editorial.
For individual support questions, you can also send an email. If you have a very short question or just want to say hello — I'm @olemoritz on Twitter.
.pdf links from a website
-
Is it possible to build a workflow that pulls all the links to .PDFs off a webpage in the built-in browser.
-
Yes, but i can't help in detail. It probably requires a workflow with just a Python script.
Given the webpage address, you'd use
Requests
to get the webpage html, then search for links ending in.pdf
and return them in a list. I imagine you could useRequests
to download the pdfs as well. -
You might be able to pull the HTML directly from the built-in browser, but I’m not 100% sure.
-
See the two links below.... The basic idea is to use
requests
to get the webpage HTML and useBeautifulSoup
to parse that HTML to find the links that end in ".pdf".http://omz-forums.appspot.com/pythonista/post/5903606662299648
http://omz-forums.appspot.com/pythonista/post/5253563362050048