An example of scraping the 學信網 (CHSI) login page with Python

Original

宇风-飞扬

2019-08-13 19:03

This post shares an example of logging in to 學信網 (CHSI) with Python and scraping a page that sits behind the login. Hopefully it serves as a useful reference.

We use 學信網 as the example and scrape the personal information shown after logging in.

**If anything is unclear, follow these steps:**

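1. Using Firefox as the example: open the page that requires login -> press F12 to open the developer tools (or right-click and choose Inspect Element) -> open the Network tab -> log in on the page -> in the Network tab, find the request that was submitted with POST and click it -> open its POST panel. The fields listed there are the form data we have to construct when submitting the login ourselves.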
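The POST panel usually shows two kinds of fields: the ones you typed (username, password) and hidden ones the page filled in for you. As a minimal sketch of that idea, the snippet below collects every hidden input from a login page and merges it with the credentials. The sample HTML, the token value, and the names `HiddenInputCollector` and `build_login_form` are assumptions made for illustration; the real page may carry different fields.

```python
from html.parser import HTMLParser


class HiddenInputCollector(HTMLParser):
    """Collect name/value pairs of every <input type="hidden"> in a page."""

    def __init__(self):
        super().__init__()
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        if tag != "input":
            return
        attr = dict(attrs)
        if (attr.get("type") or "").lower() == "hidden" and attr.get("name"):
            self.fields[attr["name"]] = attr.get("value") or ""


def build_login_form(page_html, username, password):
    """Merge the page's hidden fields with the credentials typed by the user."""
    collector = HiddenInputCollector()
    collector.feed(page_html)
    form = dict(collector.fields)
    form["username"] = username
    form["password"] = password
    return form


# Hypothetical login page fragment; the real page's fields may differ.
SAMPLE_PAGE = """
<form method="post">
  <input type="text" name="username">
  <input type="password" name="password">
  <input type="hidden" name="lt" value="LT-12345-example">
  <input type="hidden" name="_eventId" value="submit">
</form>
"""

form = build_login_form(SAMPLE_PAGE, "xxx", "xxx")
print(form)
```

Comparing the printed dictionary with what the browser's POST panel shows is a quick way to spot a field the script is still missing.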

import requests
from bs4 import BeautifulSoup

headers = {
  'User-Agent':'Mozilla/5.0 (Windows NT 10.0; WOW64; rv:54.0) Gecko/20100101 Firefox/54.0',
  'Referer':'https://account.chsi.com.cn/passport/login?service=https://my.chsi.com.cn/archive/j_spring_cas_security_check',
}

session = requests.Session()
session.headers.update(headers)
username = 'xxx'
password = 'xxx'
url = 'https://account.chsi.com.cn/passport/login?service=https://my.chsi.com.cn/archive/j_spring_cas_security_check'
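def login(username,password,lt,_eventId='submit'):   # simulate the login request
  # build the form data, mirroring the fields seen in the browser's POST request
  data = {
      '_eventId':_eventId,
      'lt':lt,
      'password':password,
      'submit':u'登錄',
      'username':username,
  }
  # the session already carries the headers set above
  return session.post(url,data=data)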

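def get_lt(url):    # fetch the login page and extract the hidden lt token
  html = session.get(url)
  # html.text is already decoded text, so from_encoding is not needed
  soup = BeautifulSoup(html.text,'lxml')
  # the first hidden <input> on the page is assumed to hold lt
  lt=soup.find('input',type="hidden")['value']
  return lt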

lt = get_lt(url)  # read the lt token from the login form (學信網 as the example)
login(username,password,lt)
login_url = 'https://my.chsi.com.cn/archive/gdjy/xj/show.action'  # a page that requires login
per_html = session.get(login_url)
soup = BeautifulSoup(per_html.text,'lxml')
print(soup)
# print each info table, then the text of every cell in it
for tag in soup.find_all('table',class_='mb-table'):
  print(tag)
  for tag1 in tag.find_all('td'):
    title = tag1.get_text()
    print(title)
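The script prints whatever page comes back, whether or not the login actually worked. A rough sanity check is to look at the URL the session ended up on after redirects. The sketch below assumes that a failed login leaves you on the `account.chsi.com.cn/passport/login` page; that behaviour is an assumption, not something confirmed here, and `still_on_login_page` is a hypothetical helper name.

```python
from urllib.parse import urlsplit

# Assumed location of the login form, taken from the login URL used above.
LOGIN_HOST = "account.chsi.com.cn"
LOGIN_PATH = "/passport/login"


def still_on_login_page(final_url):
    """Return True if the final URL still points at the login form."""
    parts = urlsplit(final_url)
    return parts.hostname == LOGIN_HOST and parts.path.startswith(LOGIN_PATH)
```

For instance, calling `still_on_login_page(per_html.url)` right after `session.get(login_url)` would flag the case where the request was bounced back to the login form, so the table parsing can be skipped instead of silently printing nothing.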

That is the complete example of scraping the 學信網 login page with Python. I hope it gives you a useful reference.
