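Download the rendered HTML rather than a PDF or screenshots. The Playwright CLI can record the page's network traffic to a HAR file. The HAR holds responses as the server sent them, not the DOM after scripts have run, and it is written when the browser window is closed (playwright open runs headed and does not accept --headless):
npx playwright open --save-har=/tmp/page.har --save-har-glob="**" "<url>"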
Or more directly, dump the fully rendered DOM:
node -e "const{chromium}=require('playwright');(async()=>{const b=await chromium.launch();const p=await b.newPage();await p.goto(process.argv[1],{waitUntil:'networkidle'});console.log(await p.content());await b.close()})();" "<url>" > /tmp/page.html
Then read /tmp/page.html to analyze the page content.