Support for custom puppeteer/browser/page instances

Question

Support for custom puppeteer/browser/page instances

Closed this issue 4 years ago · 7 comments

Hi, I found your repo mentioned in this issue: puppeteer/puppeteer#5345 (comment)

First of all, thank you for sharing this workaround.

I was wondering how hard do you think it would be to support custom puppeteer/browser/page instances. I'm using chrome-aws-lambda to generate PDFs on AWS Lambda. That library customizes the way the puppeteer and the browser instances are initialized:

    browser = await chromium.puppeteer.launch({
      args: chromium.args,
      defaultViewport: chromium.defaultViewport,
      executablePath: await chromium.executablePath,
      headless: chromium.headless,
      ignoreHTTPSErrors: true,
    });

That makes it impossible to use your library.

From looking at the code really quickly, it doesn't look like allowing customization would affect the approach in general. To make it backward-compatible, maybe a third object could be received in the pdf(...) function, including factory functions for puppeteer, browser and page (defaulting to the current approach):

pdf(..., ..., {
  puppeteer: () => { ... },
  browser: puppeteer => { ... },
  page: browser => { ... }
}

Customizing the page could allow to set very specific options like setDefaultNavigationTimeout(...).

BTW, I can lend a hand if you think this is a good idea.
Thanks!

Answer 1 · 2020-10-06T20:33:58.000Z

Hi, Thank you for using it. 😁

Thank you for sharing your idea. 👍 That is great.

We just need the Page, so I'm thinking of a public base method that gets Page is input and applies the process, then changes the current pdf method to create the page and pass it to that primary method. This way, we solve your issue and make it possible to use it over none static pages and make it fully controllable from outside.

I can work on it on Friday (our weekend :) ) and love to hear your feedback. Also, I'm entirely open to PR.

Answer 2 · 2020-10-09T14:05:14.000Z

Please try the v1.1.0 and the new pdfPage method.

Answer 3 · 2020-10-11T15:16:34.000Z

@PejmanNik This is great! Thank you for adding this feature, and sorry for the late response.

I've been testing it a little bit and works locally. There are still a couple of issues when it comes to AWS Lambda (I didn't foresee this). Since the library has a hard dependency on puppeteer, the package size is too large (Lambda has a size limit on the package size). When using chrome-aws-lambda in a Lambda environment, only puppeteer-core is really needed (lightweight), since a pre-compiled binary is included already with the chrome-aws-lambda library. More details here: https://github.com/alixaxel/chrome-aws-lambda#install

My guess is that it should be possible to add puppeteer as an peer dependency and puppeteer-core as a regular dependency (PDFOptions and Page are both part of puppeteer-core). For pdf, puppeteer would be required, but not for pdfPage. In that case, the puppeteer import statement would need to happen inside the pdf function.

Answer 4 · 2020-10-11T16:46:03.000Z

I've created a PR to try out that alternative: #3

Answer 5 · 2020-10-13T14:30:42.000Z

Thank you for your PR 👍
I think there is still room to decrease the size of the package. I can remove the bundle.js file from the npm package so I think, we have a release tonight including your PR and some minor changes

Answer 6 · 2020-10-13T18:05:57.000Z

Thank you @federicojasson. we have a new release for your great contribution

https://github.com/PejmanNik/puppeteer-report/releases/tag/v2.0.0

Answer 7 · 2020-10-13T21:17:11.000Z

@PejmanNik Yay! I can confirm I was able to deploy a working application into AWS Lambda. Thank you too for working on these changes.