Skip to content

Latest commit

 

History

History
160 lines (110 loc) · 3.74 KB

README.md

File metadata and controls

160 lines (110 loc) · 3.74 KB

puppeteer docker image

docker image with Google Puppeteer installed

and screenshots scripts

nodesource/node

install

docker pull schliflo/docker-puppeteer:latest
# OR with fixed version
docker pull schliflo/docker-puppeteer:19.2.0

before usage

  1. you should pass --no-sandbox, --disable-setuid-sandbox args when launch browser
const puppeteer = require('puppeteer');

(async() => {

    const browser = await puppeteer.launch({
        args: [
            '--no-sandbox',
            '--disable-setuid-sandbox'
        ]
    });

    const page = await browser.newPage();

    await page.goto('https://www.google.com/', {waitUntil: 'networkidle2'});

    browser.close();

})();
  1. if you got page crash with BUS_ADRERR (chromium issue), increase shm-size on docker run with --shm-size argument
docker run --rm -v <path_to_script>:/app/index.js schliflo/docker-puppeteer
  1. If you're seeing random navigation errors (unreachable url) it's likely due to ipv6 being enabled in docker. Navigation errors are caused by ERR_NETWORK_CHANGED (-21) in chromium. Disable ipv6 in your container using --sysctl net.ipv6.conf.all.disable_ipv6=1 to fix:
docker run --sysctl net.ipv6.conf.all.disable_ipv6=1 --rm -v <path_to_script>:/app/index.js schliflo/docker-puppeteer
  1. add --enable-logging for chrome debug logging http://www.chromium.org/for-testers/enable-logging
const puppeteer = require('puppeteer');

(async() => {

    const browser = await puppeteer.launch({args: [
        '--no-sandbox',
        '--disable-setuid-sandbox',

        // debug logging
        '--enable-logging', '--v=1'
    ]});

usage

mount your script to /app/index.js

docker run --rm -v <path_to_script>:/app/index.js schliflo/docker-puppeteer:latest

custom script from dir

docker run --rm \
 -v <path_to_dir>:/app \
 schliflo/docker-puppeteer:latest \
 node my_script.js

screenshots tools

simple screenshot tools in image

docker run --rm -v /tmp/screenshots:/screenshots \
 schliflo/docker-puppeteer:latest \
 <screenshot,full_screenshot,screenshot_series,full_screenshot_series> 'https://www.google.com' 1366x768

screenshot tools syntax

<tool> <url> <width>x<height> [<delay_in_ms>]

  • delay_in_ms: is optional (defaults to 0)
    • Waits for delay_in_ms milliseconds before taking the screenshot

screenshot

docker run --rm -v /tmp/screenshots:/screenshots \
 schliflo/docker-puppeteer:latest \
 screenshot 'https://www.google.com' 1366x768

output: one line json

{
    "date":"2017-09-01T05:03:27.464Z",
    "timestamp":1504242207,
    "filename":"screenshot_1366_768.png",
    "width":1366,
    "height":768
}

got screenshot in /tmp/screenshots/screenshot_1366_768.png

full_screenshot

save full screenshot of page

docker run --rm -v /tmp/screenshots:/screenshots \
 schliflo/docker-puppeteer:latest \
 full_screenshot 'https://www.google.com' 1366x768

screenshot_series, full_screenshot_series

adds datetime in ISO format into filename

useful for cron screenshots

docker run --rm -v /tmp/screenshots:/screenshots \
 schliflo/docker-puppeteer:latest \
 screenshot_series 'https://www.google.com' 1366x768
docker run --rm -v /tmp/screenshots:/screenshots \
 schliflo/docker-puppeteer:latest \
 full_screenshot_series 'https://www.google.com' 1366x768
2017-09-01T05:08:55.027Z_screenshot_1366_768.png
# OR
2017-09-01T05:08:55.027Z_full_screenshot_1366_768.png