[Feature] PDF snapshot tests #19253

nicojs · 2022-12-04T13:40:46Z

Hi friends 🙋‍♂️. Thanks for creating this valuable project!

I'm looking for ways to test my generated PDF file with decktape using snapshot tests. I would expect such a thing to be possible with playwright since underlying browsers generally support displaying PDF files.

However, I ran into this issue: #6091, which is closed as out-of-scope. I have two questions:

Are there any technical reasons for it to be out of scope?
Would you want to reconsider this stance for 2023 🎆?

aslushnikov · 2022-12-06T08:28:21Z

@nicojs

Are there any technical reasons for it to be out of scope?

The PDF generally falls outside of the web platform, and Playwright focuses on Web (as of 2022).

Would you want to reconsider this stance for 2023 🎆?

As far as I can tell, that's unlikely since these's still a lot to be done for web testing.

In general, the suggested work-around would be to use something like PDF.js to render PDF and then test it as a regular website.

Hope this helps!

nicojs · 2022-12-06T15:09:25Z

Thanks! I didn't know about pdf.js but it seems to work like a charm. Here is my code for anyone that wants to do the same thing:

// print.spec.ts
import test, { expect } from '@playwright/test';

test.describe('Print', () => {
	let numberOfPages: number;

	test.beforeEach(async ({ page, browserName }) => {
		test.skip(browserName !== 'chromium', 'Print pdf is only validated in chromium');
		await page.goto('http://localhost:15005/print.html');
		await page.waitForFunction(() => typeof print=== 'object');
		numberOfPages = await page.evaluate(async () => print.numPages);
	});

	test('no visual regressions of the example slides', async ({ page }) => {
		const theCanvas = page.locator('#the-canvas');
		for (let i = 0; i < numberOfPages; i++) {
			expect(await theCanvas.screenshot()).toMatchSnapshot({ name: `page-${i}.png` });
			await page.evaluate(() => print.nextPage());
		}
	});
});

<!-- print.html -->
<!DOCTYPE html>
<html lang="en">
	<head>
		<meta charset="UTF-8" />
		<meta http-equiv="X-UA-Compatible" content="IE=edge" />
		<meta name="viewport" content="width=device-width, initial-scale=1.0" />
		<title>Document</title>
	</head>
	<body>
		<canvas id="the-canvas"></canvas>
		<script src="https://cdn.jsdelivr.net/npm/[email protected]/build/pdf.min.js"></script>
		<script type="module">
			const pdf = await pdfjsLib.getDocument("./example.pdf").promise;
			let currentPage = 1;
			await renderPage();

			window.print= {
				nextPage: async () => {
					if (currentPage < pdf.numPages) {
						currentPage++;
						await renderPage();
					}
				},
				numPages: pdf.numPages,
			};
			async function renderPage() {
				const page = await pdf.getPage(currentPage);
				const scale = 1.5;
				const viewport = page.getViewport({ scale: scale });
				// Support HiDPI-screens.
				const outputScale = window.devicePixelRatio || 1;

				const canvas = document.getElementById("the-canvas");
				const canvasContext = canvas.getContext("2d");
				canvas.width = Math.floor(viewport.width * outputScale);
				canvas.height = Math.floor(viewport.height * outputScale);
				canvas.style.width = Math.floor(viewport.width) + "px";
				canvas.style.height = Math.floor(viewport.height) + "px";
				const transform =
					outputScale !== 1
						? [outputScale, 0, 0, outputScale, 0, 0]
						: null;

				await page.render({ canvasContext, transform, viewport })
					.promise;
			}
		</script>
	</body>
</html>

codeami · 2023-01-30T06:50:46Z

I found this easier to use: -
https://www.npmjs.com/package/compare-pdf

karlhorky · 2024-06-13T11:54:55Z

@codeami how did you use it in your Playwright tests?

It would be great to get a canonical example and official Playwright team recommendation of PDF snapshot testing with Playwright, using PDFs from URLs.

The version that we currently have uses page.setContent() to create a PDF viewer using PDF.js using the PDF.js docs Hello World example, but it is verbose, and requires changes to our content security policy.

Ideally, we could just use the built-in PDF viewer in Chrome, but it appears that this is not a path that the Playwright team wants to invest in.

karlhorky · 2024-06-13T13:30:18Z

Here's my version of screenshot / snapshot testing with PDF.js, based on the PDF.js docs Hello World example:

// HTML template string no-op for VS Code highlighting / formatting
function html(strings: TemplateStringsArray, ...values: unknown[]) {
  return strings.reduce((result, string, i) => {
    return result + string + (values[i] ?? '');
  }, '');
}

test('PDF has screenshot', async ({ page }) => {
  // Go to page without Content-Security-Policy header, to avoid CSP
  // prevention of script loading from https://mozilla.github.io
  await page.goto('about:blank');

  await page.setContent(html`
    <!doctype html>
    <html>
      <head>
        <meta charset="UTF-8" />
      </head>
      <body>
        <canvas></canvas>
        <script src="https://mozilla.github.io/pdf.js/build/pdf.mjs" type="module"></script>
        <script type="module">
          pdfjsLib.GlobalWorkerOptions.workerSrc =
            'https://mozilla.github.io/pdf.js/build/pdf.worker.mjs';

          try {
            const pdf = await pdfjsLib.getDocument(
               'https://raw.githubusercontent.com/mozilla/pdf.js/ba2edeae/examples/learning/helloworld.pdf',
            ).promise;

            const page = await pdf.getPage(1);
            const viewport = page.getViewport({ scale: 1.5 });

            const canvas = document.querySelector('canvas');
            canvas.height = viewport.height;
            canvas.width = viewport.width;

            await page.render({
              canvasContext: canvas.getContext('2d'),
              viewport,
            }).promise;
          } catch (error) {
            console.error('Error loading PDF:', error);
          }
        </script>
      </body>
    </html>
  `);

  await page.waitForTimeout(1000);

  await expect(page).toHaveScreenshot({ fullPage: true });
});

Also added to playwright-tricks:

https://github.com/karlhorky/playwright-tricks/tree/main?tab=readme-ov-file#screenshot-comparison-tests-of-pdfs

karlhorky · 2024-11-29T10:48:18Z

Oh, it seems like Playwright 1.49.0 introduced rendering of PDFs in page (at least in Chromium), as @dgozman mentions in Changes in Chromium headless in Playwright v1.49 (#33566):

PDF documents are now rendered in the page, instead of being downloaded. We recommend to update your tests accordingly.

Changes in Chromium headless in Playwright v1.49 #33566.

I can confirm that await expect(page).toHaveScreenshot() works when navigating to a PDF URL 🎉

Although I need to figure out how to make it so that it shows the entire PDF and is not cut off... 🤔 (and no gray background)

This is an example snapshot, showing the unfortunate cropping:

gselsidi · 2025-01-08T19:55:52Z

@karlhorky I actually wasted 2 days with the above, the default chrome pdf viewer sets a shadow dom to closed, so you can't interact with anything on the page such as setting the bounding box to capture the whole page as a screenshot, as fullPage: true doesn't work because it's inside an iframe

I got as far as injecting JS to set the shadow-root to open theoretically playwright still can't interact with it, and returns a pretty limited DOM when i query it.

But when I pause the test and inspect the page I see shadow-dom set to open so unsure what's really going on, feel free to go down the rabbit whole with this...

Going to implement the solutions shared above instead of wasting more time.

  await context.addInitScript(() => {
    const originalAttachShadow = Element.prototype.attachShadow;
    Element.prototype.attachShadow = function (init) {
      if (init.mode === "closed") {
        init.mode = "open";
      }
      await console.log("ShadowRoot created and set to open:", this);
      return originalAttachShadow.call(this, init);
    };
  });

  // Capture the new page (tab) opened
  const [newPage] = await Promise.all([
    context.waitForEvent("page"),
    ***your actions to click to get the new page with the pdf***
  ]);```

aslushnikov closed this as completed Dec 6, 2022

This was referenced Dec 14, 2022

Add some e2e tests astefanutti/decktape#253

Closed

test(harness): Add e2e testing harness using playwright astefanutti/decktape#254

Merged

cderv mentioned this issue Oct 4, 2024

Add some in-browser testing using playwright quarto-dev/quarto-cli#10696

Open

5 tasks

yury-s mentioned this issue Dec 12, 2024

[Feature]: Add ability to validate PDF file contents by loading it into the browser or any other means. User should be able to assert different text elements, images/icons presence and CSS properties too. #33971

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] PDF snapshot tests #19253

[Feature] PDF snapshot tests #19253

nicojs commented Dec 4, 2022 •

edited

Loading

aslushnikov commented Dec 6, 2022

nicojs commented Dec 6, 2022

codeami commented Jan 30, 2023

karlhorky commented Jun 13, 2024 •

edited

Loading

karlhorky commented Jun 13, 2024 •

edited

Loading

karlhorky commented Nov 29, 2024 •

edited

Loading

gselsidi commented Jan 8, 2025 •

edited

Loading

[Feature] PDF snapshot tests #19253

[Feature] PDF snapshot tests #19253

Comments

nicojs commented Dec 4, 2022 • edited Loading

aslushnikov commented Dec 6, 2022

nicojs commented Dec 6, 2022

codeami commented Jan 30, 2023

karlhorky commented Jun 13, 2024 • edited Loading

karlhorky commented Jun 13, 2024 • edited Loading

karlhorky commented Nov 29, 2024 • edited Loading

gselsidi commented Jan 8, 2025 • edited Loading

nicojs commented Dec 4, 2022 •

edited

Loading

karlhorky commented Jun 13, 2024 •

edited

Loading

karlhorky commented Jun 13, 2024 •

edited

Loading

karlhorky commented Nov 29, 2024 •

edited

Loading

gselsidi commented Jan 8, 2025 •

edited

Loading