A library to solve hcaptcha challenges that are automated within puppeteer. You can automatically set response values where they should be so the only thing left for you is submitting the page or you can get the response token. Average response time is rougly 13 - 20 seconds with TensorFlow's Image Recognition.
npm i puppeteer-hcaptcha
await hcaptcha(page);
page
<Page> - Puppeteer Page Instance
await hcaptchaToken(url);
url
<string> - URL of page with captcha on it
Automatically set respone value (see demo)
// Require puppeteer extra and puppeteer stealth
const puppeteer = require("puppeteer-extra");
const pluginStealth = require("puppeteer-extra-plugin-stealth");
// Require our hcaptcha method
const { hcaptcha } = require("puppeteer-hcaptcha");
// Tell puppeteer to use puppeteer stealth
puppeteer.use(pluginStealth());
(async () => {
// Instantiate a new browser object
// Ignore errors associated to https
// Can be headless but for example sake we want to show the browser
// Set your desired arguments for your puppeteer browser
const browser = await puppeteer.launch({
ignoreHTTPSErrors: true,
headless: false,
args: [
`--window-size=600,1000`,
"--window-position=000,000",
"--disable-dev-shm-usage",
"--no-sandbox",
'--user-data-dir="/tmp/chromium"',
"--disable-web-security",
"--disable-features=site-per-process",
],
});
// Get browser pages
const [page] = await browser.pages();
// Send page to your url
await page.goto("URL OF PAGE WITH CAPTCHA ON IT");
// Remove the page's default timeout function
await page.setDefaultNavigationTimeout(0);
// Call hcaptcha method passing in our page
await hcaptcha(page);
// Your page is ready to submit.
// Captcha solving should be the last function on your page so we
// don't have to worry about the response token expiring.
/**
* Example:
* await page.click("loginDiv > loginBtn");
*/
})();
Return response token only (see demo)
// Require our hcaptchaToken method
const { hcaptchaToken } = require("puppeteer-hcaptcha");
(async () => {
// Create Start Time
const startTime = Date.now();
// Call hcaptchaToken method passing in your url
let token = await hcaptchaToken("URL OF PAGE WITH CAPTCHA ON IT");
// Get End Time
const endTime = Date.now();
// Log timed result to console
console.log(`Completed in ${(endTime - startTime) / 1000} seconds`);
// P0_eyJ0eXAiOiJ...
console.log(token);
})();
- Thanks to Futei, JimmyLaurent, Nayde, DinoHorvat, and Tal
- Temporary inclusion of my forked version of ghost-cursor until PR is accepted.
- Should fix issue requests as timestamps are required to be associated with mouse movements send with requests.
- Fixed asset url to reflect new hCaptcha asset url
- Fixed code inconsistency
- Setup TensorFlow tasks to run in parallel using
Promise.All
which seems to have drastically improved speeds solving #23 - Test results seem to mostly be between the 13 - 16 second range (with a few outliers between 19 - 20)
- Will continue looking into ways to get back to the old speeds from using Google Cloud Vision
- Looking into the potential of using C++ or C# as a backend for true threading with the help of edge-js or something similar
- Fixed issue where
useragents.json
file couldn't be found
- Removed Google Cloud Vision from dependencies
- Integrated TensorFlow Image Recognition instead
- Created fix for checking answer requests failing
- Cleaned up functions
- Documented all functions within code
- Removed setting the
g-recaptcha-response
as hCaptcha no longer requires this
- Added functions to dynamically get HSW/HSL version for getting tasklist
- Updated headers to properly request for tokens
- Added fix for cloudflare sites with regards to HSJ request checks
- Fixed issue finding useragents file
- Reverted changes made in v3.0.1
- Created temporary fix for HSJ requests while I look into a proper solution
- Pushed a fix for when the response token is received from the initial request
- Big changes to solving logic to follow changes that hCaptcha has made to their requests.
-
Made changes to requests based on changes hCaptcha made. Added list of User Agents so that they are randomized on request (seems to speed up response time generally)
-
Fixed issue finding useragents file
- Fixed issues with cloudflare sites not returning solved token (see #2).
- Added Google Vision for image recognition to speed up the process
- Added ability to solve captcha on a page or just return the response token
- Massive shoutout goes to Tal for all the help implementing Google Vision
- The idea to do this came from him. He has a super quick solver for GoLang if you are looking for that. He also added a REST API Version of his solver that can definitely be useful if you don't know anything about GoLang
- Small updates to README documentation
- Initial release
- Huge shoutout to these people for the release of this package
- Futei - Initial project
- JimmyLaurent - Initial Node Module
- Nayde - The idea of porting functionality to be usable through puppeteer
- DinoHorvat - Help with response token issue (workaround was found out before release)
I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
Stems from TensorFlow. Not entirely sure how to fix this but it doesn't impact the solver.