Python 自動截圖-蒐集企業外部風險資料

原創

2020-07-05 22:35

最近在東亞天津分行對公業務部實習，實習生做一些無用的工作。在對企業貸款前風評的時候外部公衆信息的蒐集也是其中一環。對於貸款企業首先需要繪製其股權圖，然後根據企業法人董事，及受益人（25%股權以上）的關聯企業進行外部公衆信息收集。
用python寫了自動截圖的軟件
這樣我就可以做更核心的一些事務了😟

百度搜索一開始我在想怎麼點擊翻頁，其實有更加簡便的規則，詳細參考

from selenium import webdriver
from selenium.webdriver.chrome.options import Options
import os
import time
def get_image(url, pic_name):
#設置chrome爲headless（無界面）開啓模式
    options = webdriver.ChromeOptions()
    options.add_argument("--headless")
    driver = webdriver.Chrome(chrome_options=options)
#控制瀏覽器寫入並轉到鏈接
    driver.get(url)
    time.sleep(3)
    picture_name = driver.title
#用js獲取頁面的寬高
    width = driver.execute_script("return document.documentElement.scrollWidth")
    height = driver.execute_script("return document.documentElement.scrollHeight")
    print(width,height)
#將瀏覽器的寬高設置成剛剛獲取的寬高
    driver.set_window_size(width, height)
    time.sleep(3)
#截圖並關掉瀏覽器
    driver.save_screenshot('d:\\BEATJ\\'+picture_name+pic_name)
    url = 'https://www.baidu.com/s?wd='+ke_word+'&pn=10'
    pic_name = r'2.png'
    driver.get(url)
    time.sleep(3)
    picture_name = driver.title
    width = driver.execute_script("return document.documentElement.scrollWidth")
    height = driver.execute_script("return document.documentElement.scrollHeight")
    print(width,height)
    driver.set_window_size(width, height)
    time.sleep(3)
    driver.save_screenshot('d:\\BEATJ\\' + picture_name + pic_name)
    driver.close()
#你輸入的參數
at=[' 洗錢 恐怖分子融資 賄賂',' 貪污 挪用 盜用',' 制裁 禁運 逃稅',' 走私 販毒 人口販賣','奴役 強迫勞動 剝削']
for line in open("keyword.txt","r",encoding='UTF-8').readlines():
    line1=line.replace('\n','')
    line=line1
    for att in at:
        ke_word=line+' '+att
        url = 'https://www.baidu.com/s?wd='+ke_word+'&pn=00'
        pic_name = r'1.png'
        get_image(url, pic_name)

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

Python 自動截圖-蒐集企業外部風險資料

【面試準備】又一次失敗的面試經歷，題目離譜～資深軟件測試工程師

dotnet 8 版本與銀河麒麟V10和UOS系統的 glibc 兼容性

YOJ 102嚴神的數列

Python金融-市場利率分析學習筆記

YOJ 101 嚴神的遊戲

ICPC NEAU 2020 支付

YOJ208 嚴神的機器人 III

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結