Re: Two questions about grabbing "webpages" with p:load / p:document from Norm Tovey-Walsh on 2025-01-23 (xproc-dev@w3.org from January 2025)

From: Norm Tovey-Walsh <ndw@nwalsh.com>
Date: Thu, 23 Jan 2025 14:32:27 +0000
To: andy.carver@yahoo.com
Cc: "xproc-dev@w3.org" <xproc-dev@w3.org>
Message-ID: <m2ikq5y51g.fsf@nwalsh.com>

"andy.carver@yahoo.com" <andy.carver@yahoo.com> writes:
> 1. Is this (bare-bones HTML) output from the p:load step actually all the spec (or other XProc documentation) has in mind, when speaking of the ability of p:load (or p:document) to retrieve documents (i.e. "webpages") from the Web?

> blank, white screen. For all the lovely data is not in the HTML served -- not, that is, until some AJAX queri(es) retrieve it from the server and add it to the DOM.

Yeah. p:load and p:document retrieve the document as the server sends it; we’re not providing a runtime environment that executes the page’s JavaScript. Sometimes you can dig around a bit, work out what the JavaScript requests and, if there’s a pattern, fetch that data “by hand”. But that’s not always easy.
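
As a rough illustration of the “by hand” approach, here is a minimal Python sketch. It assumes you have already worked out (for example, from the browser’s network inspector) that the page’s script requests JSON from some endpoint. The URL https://example.com/api/items, the "page" parameter, and the helper names build_data_url and fetch_json are all hypothetical placeholders, not anything taken from the page in question.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_data_url(base, params):
    """Build the URL the page's script would request (hypothetical pattern)."""
    return base + "?" + urlencode(params)


def fetch_json(url, opener=urlopen):
    """Request the data endpoint directly and parse the JSON response.

    `opener` defaults to urllib's urlopen; it is a parameter only so the
    function can be exercised without network access.
    """
    req = Request(url, headers={"Accept": "application/json"})
    with opener(req) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Assumed endpoint and parameter; substitute whatever the page really requests.
    url = build_data_url("https://example.com/api/items", {"page": 1})
    print(url)
    # data = fetch_json(url)  # uncomment once the URL points at a real endpoint
```

If the endpoint turns out to follow a stable pattern like this, the same URL can usually be given to p:load directly, since it then names ordinary data rather than a script-driven page.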

> I will mention that I'm a Windows user. So I apologize, if the answer to 2. is kindergarten stuff to Linux gurus :D In any case, I'm hoping for a solution that will work (eventually) in Windows.

Nothing Windows or Linux related here, I’m afraid.

                                        Be seeing you,
                                          norm

--
Norm Tovey-Walsh <ndw@nwalsh.com>
https://norm.tovey-walsh.com/

> One of the great misfortunes of mankind is that even his good

Received on Thursday, 23 January 2025 14:32:35 UTC