登录页面是最常遇到验证码的地方。无论是 reCAPTCHA v2、v3、旋转门还是图像验证码,CaptchaAI 都可以解决挑战,同时您的自动化处理表单填写和提交。
常见登录验证码类型
| 验证码 | 它看起来如何 | CaptchaAI方法 |
|---|---|---|
| reCAPTCHA v2 | 提交前勾选或质疑 | method=userrecaptcha |
| reCAPTCHA v3 | 隐形评分,屏蔽低分用户 | method=userrecaptcha&version=v3 |
| Cloudflare Turnstile | 登录表单之前的小部件 | method=turnstile |
| 图片验证码 | 要输入的文本图像 | method=base64 |
方法一:HTTP请求(无浏览器)
对于接受标准 POST 请求的登录表单:
import requests
import time
API_KEY = "YOUR_API_KEY"
def solve_recaptcha(site_key, page_url):
resp = requests.get("https://ocr.captchaai.com/in.php", params={
"key": API_KEY, "method": "userrecaptcha",
"googlekey": site_key, "pageurl": page_url
})
task_id = resp.text.split("|")[1]
for _ in range(60):
time.sleep(5)
result = requests.get("https://ocr.captchaai.com/res.php", params={
"key": API_KEY, "action": "get", "id": task_id
})
if result.text == "CAPCHA_NOT_READY": continue
if result.text.startswith("OK|"): return result.text.split("|")[1]
raise Exception(result.text)
raise TimeoutError()
# Login flow
session = requests.Session()
login_url = "https://staging.example.com/qa-login"
# Load login page to get cookies and site key
page = session.get(login_url)
# Extract site_key from the page HTML...
site_key = "6Le-wvkS..."
# Solve CAPTCHA
token = solve_recaptcha(site_key, login_url)
# Submit login form
resp = session.post(login_url, data={
"username": "user@example.com",
"password": "your_password",
"g-recaptcha-response": token
})
if resp.url != login_url:
print("Login successful!")
# session now has auth cookies for subsequent requests
方法 2:Selenium(Python)
对于需要执行 JavaScript 的登录页面:
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
import requests
import time
API_KEY = "YOUR_API_KEY"
options = webdriver.ChromeOptions()
options.add_argument("--no-sandbox")
driver = webdriver.Chrome(options=options)
# Navigate to login page
driver.get("https://staging.example.com/qa-login")
wait = WebDriverWait(driver, 10)
# Fill in credentials
username_field = wait.until(EC.presence_of_element_located((By.NAME, "username")))
username_field.send_keys("user@example.com")
driver.find_element(By.NAME, "password").send_keys("your_password")
# Extract site key and solve
recaptcha = driver.find_element(By.CLASS_NAME, "g-recaptcha")
site_key = recaptcha.get_attribute("data-sitekey")
token = solve_recaptcha(site_key, driver.current_url)
# Inject token
driver.execute_script(
f"document.getElementById('g-recaptcha-response').innerHTML = '{token}';"
)
# Submit
driver.find_element(By.CSS_SELECTOR, 'button[type="submit"]').click()
wait.until(EC.url_changes(driver.current_url))
print(f"Logged in! Now at: {driver.current_url}")
方法 3:Puppeteer (Node.js)
const puppeteer = require("puppeteer");
const axios = require("axios");
const API_KEY = "YOUR_API_KEY";
async function solveRecaptcha(siteKey, pageUrl) {
const submit = await axios.get("https://ocr.captchaai.com/in.php", {
params: {
key: API_KEY,
method: "userrecaptcha",
googlekey: siteKey,
pageurl: pageUrl,
},
});
const taskId = submit.data.split("|")[1];
while (true) {
await new Promise((r) => setTimeout(r, 5000));
const result = await axios.get("https://ocr.captchaai.com/res.php", {
params: { key: API_KEY, action: "get", id: taskId },
});
if (result.data === "CAPCHA_NOT_READY") continue;
if (result.data.startsWith("OK|")) return result.data.split("|")[1];
throw new Error(result.data);
}
}
(async () => {
const browser = await puppeteer.launch({ headless: "new" });
const page = await browser.newPage();
await page.goto("https://staging.example.com/qa-login");
// Fill credentials
await page.type("#username", "user@example.com");
await page.type("#password", "your_password");
// Get site key and solve
const siteKey = await page.$eval(".g-recaptcha", (el) =>
el.getAttribute("data-sitekey")
);
const token = await solveRecaptcha(siteKey, page.url());
// Inject and submit
await page.evaluate(
(t) => (document.getElementById("g-recaptcha-response").innerHTML = t),
token
);
await page.click('button[type="submit"]');
await page.waitForNavigation();
console.log("Logged in:", page.url());
await browser.close();
})();
处理多因素+验证码
有些网站将验证码与多重身份验证结合起来:
# Step 1: Solve CAPTCHA and submit login
token = solve_recaptcha(site_key, login_url)
resp = session.post(login_url, data={
"username": "user@example.com",
"password": "your_password",
"g-recaptcha-response": token
})
# Step 2: Handle MFA page (if redirected)
if "verify" in resp.url or "mfa" in resp.url:
# Your MFA code logic here
mfa_code = get_mfa_code()
resp = session.post(resp.url, data={"code": mfa_code})
# Step 3: Verify logged in
assert "dashboard" in resp.url
故障排除
| 问题 | 原因 | 处理方式 |
|---|---|---|
| 登录返回验证码页面 | 令牌已过期 | 60秒内解决并提交 |
| 密码正确且“凭据无效” | CSRF 令牌缺失 | 从登录页面提取并包含 CSRF 令牌 |
| 登录后会话丢失 | Cookie 未保留 | 使用requests.Session()或浏览器cookie |
| reCAPTCHA v3 尽管有令牌也会阻止 | 分数太低 | CaptchaAI 针对高分进行优化;验证动作参数 |
常问问题
我可以自动登录任何网站吗?
CaptchaAI 处理 CAPTCHA 组件。您的自动化需要处理登录表单本身(填写字段、提交、管理 cookie)。合法性取决于您访问该帐户的授权。
我需要浏览器来自动登录吗?
并非总是如此。许多登录页面接受标准 HTTP POST 请求。仅当登录需要执行 JavaScript 或复杂交互时才使用浏览器。
登录后如何处理会话维护?
在 Python 中使用 requests.Session() 或在浏览器实例中维护 cookie。所有后续经过身份验证的请求都需要登录时的会话 cookie。
相关指南
- 使用 Python 处理 Selenium 验证码
- 使用 Node.js 解决 Puppeteer CAPTCHA 问题
- 剧作家验证码处理