puppeteer採集http請求記錄

原創

2020-06-17 06:29

chrome headless爬取http訪問記錄，代碼如下

const Puppeteer = require("puppeteer");

(async () => {
    const browser = await Puppeteer.launch({
        headless: true
    }).catch(() => browser.close);

    const page = await browser.newPage();
    await page.setRequestInterception(true);
    page.on('request', request => {
        console.log(request.url());
        request.continue();
    });
    page.on('response', response => {
        console.log(response.url());
    });
    page.on('requestfailed', request => {
        console.log(request.url());
    });
    page.on('requestfinished', request => {
        console.log(request.url());
    });

    await page.goto('https://www.baidu.com').catch(() => browser.close);
    //await page.waitFor(500);

    await browser.close();
})()

以上是一個簡單的例子；

基於以上思路，實現靜態資源、api爬取，然後做URL相似度分析存儲，供漏洞掃描使用，迭代中，敬請期待。

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

puppeteer採集http請求記錄

再談23種設計模式（3）：行爲型模式（學習筆記）

Power Automate Desktop 安裝完，登錄後老是提示one driver 錯誤

微前端學習筆記(4):從微前端到微模塊之EMP與hel-micro方案探索

微前端學習筆記（1）：微前端總體架構概述，從微服務發微

985 碩士程序員，空窗 4 個月沒有 Offer！

一文搞懂 Spring 循環依賴

賽博鬥地主——使用大語言模型扮演Agent智能體玩牌類遊戲。

VScode右鍵打開(添加到右鍵)

記一次 .NET某工控視覺自動化系統卡死分析

WindowsServer--SQL Server搭建主從同步實現讀寫分離 - 事務性分發

puppeteer採集http請求記錄

exec: "google-chrome": executable file not found in %PATH%

init failed: unable to detect the containing GOPATH

Suricata日誌同步至Elasticsearch

Suricata鏡像流量解析--Suricata部署

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結