中國裁判文書下載:selenium路線

成功現狀:
在這裏插入圖片描述

一、requests路線

requests路線需要對js進行解密,對js解密時遇到三個參數

docid
cipher
__RequestVerificationToken

這三個參數主要是針對文書列表頁面的
在這裏插入圖片描述
試圖在這個頁面獲取相應的鏈接,
js解密後,其中ciphertext參數需要感謝大神:越學越害怕
後面的docid和__RequestVerificationToken都非常簡單

但是將這些參數傳入後,請求仍然出現狀態碼202或者狀態碼200但是內容爲None
在這裏插入圖片描述

最後放棄了requests路線

二、selenium路線

問題一:namespace error 命名空間錯誤

參見《selenium的namespace錯誤與無法定位:NamespaceError: An attempt was made to create…andUnable to locate elemen》,鏈接: link.

雖然原因不明,不知道爲什麼報這個錯誤,但是這個錯誤與標籤有關,只要不是特殊標籤即可
如果存在特殊標籤,比如裁判文書網中的下面這個元素的初始標籤:diy:lawyee

<diy:lawyee id="_view_1545184311000" var-name="_var_name_1545184311000" style="display: inline;"> <div class="LM_con clearfix" style="background: none;"> <div class="fl con_left clearfix" style="display: none;"> <a href="../181217BMTKHNT2W0/index.html"> </a> <a id="chartListBtn" href="javascript:;"> </a> </div> <div class="fr con_right">共檢索到 <span>1559125</span> 篇文書,顯示前600條</div> </div> <div class="LM_tool clearfix"> <div class="fl tool_PX tool_On" data-value="s50"> <a href="javascript:;">法院層級</a> </div> <div class="fl tool_PX " data-value="s51"> <a href="javascript:;">裁判日期</a> </div> <div class="fl tool_PX " data-value="s52"> <a href="javascript:;">審判程序</a> </div> <!-- <div class="fl tool_PX " data-value="s52"> <a href="javascript:;">審判程序</a> </div> --> <div class="fr tool_All"> <a class="AllSelect" href="javascript:;"><input type="checkbox" id="AllSelect"><label for="AllSelect">全選</label></a> <a class="AllKeep" href="javascript:;">批量收藏</a> <a class="AllDownload" href="javascript:void(0);">批量下載</a> </div> </div>   <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 刑事複覈 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="3dc5113e6389402aafbeab200113faf6"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=3dc5113e6389402aafbeab200113faf6" class="caseName" target="_blank">劉永權搶劫在法定刑以下量刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">中華人民共和國最高人民法院</span> <span class="ah">(2019)最高法刑核87677387號</span> <span class="cprq">2019-10-29</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,原審被告人劉永權以<span style="color:red">非法佔有</span>爲目的,採取暴力手段強行劫取他人財物,其行爲已構成搶劫罪,且屬於入戶搶劫。劉永權雖不具有法定減輕處罰情節,但其犯罪情節較輕,並能如實供述罪行,認罪態度較好,且已將...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="3dc5113e6389402aafbeab200113faf6" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="3dc5113e6389402aafbeab200113faf6" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 刑事審判監督 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="21e9f0002d8c4d29baf6aabf00c11a3c"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=21e9f0002d8c4d29baf6aabf00c11a3c" class="caseName" target="_blank">徐國慶 貪污罪 駁回申訴通知書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">(2019)最高法刑申230號</span> <span class="cprq">2019-08-14</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院經審查認爲,你利用擔任河南省寧陵縣國有林場場長的職務便利,隱瞞已用林場債權頂抵部分應付款項的事實,多列支出予以報銷,<span style="color:red">非法佔有</span>公款25200元。原審認定你犯貪污罪的事實清楚,證據確實、充分,定罪準...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="21e9f0002d8c4d29baf6aabf00c11a3c" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="21e9f0002d8c4d29baf6aabf00c11a3c" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 刑事審判監督 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="06d6f4b4fa2644708563aa8601126387"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=06d6f4b4fa2644708563aa8601126387" class="caseName" target="_blank">搶劫刑事決定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">(2017)最高法刑申407號</span> <span class="cprq">2019-07-05</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,原審法院認定申訴人劉十民因成某的丈夫冉某勇盜竊其財物,使用暴力拉走成某所收廢品的事實清楚,證據確實充分。劉十民所提成某同意其拉走廢品和沒有使用暴力的申訴理由不成立。鑑於:(1)冉某勇盜竊劉十民的財物,負有返還財產、賠償損失的義務。成某所收購的廢品,屬...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="06d6f4b4fa2644708563aa8601126387" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="06d6f4b4fa2644708563aa8601126387" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="143b5d3f9ba64acfae8eaa9a00beb973"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=143b5d3f9ba64acfae8eaa9a00beb973" class="caseName" target="_blank">趙志紅故意殺人、強姦案死刑複覈裁定</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-07-02</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人趙志紅故意非法剝奪他人生命,其行爲已構成故意殺人罪;違背婦女意志,採用暴力、脅迫等手段強姦婦女,其行爲已構成強姦罪;以<span style="color:red">非法佔有</span>爲目的,採用暴力、脅迫手段劫取他人財物,其行爲又構成搶劫...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="143b5d3f9ba64acfae8eaa9a00beb973" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="143b5d3f9ba64acfae8eaa9a00beb973" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="3e068515464d4e03a68eab2700dc9b5e"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=3e068515464d4e03a68eab2700dc9b5e" class="caseName" target="_blank">王洪喜搶劫死刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-06-18</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人王洪喜以<span style="color:red">非法佔有</span>爲目的,採用暴力手段劫取他人財物,並致人死亡,其行爲已構成搶劫罪。犯罪性質惡劣,情節、後果嚴重,社會危害性大,應依法懲處。第一審判決、第二審裁定認定的事實清楚,證據確...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="3e068515464d4e03a68eab2700dc9b5e" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="3e068515464d4e03a68eab2700dc9b5e" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="b8f6b6fe813945109411ab1b00a5d177"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=b8f6b6fe813945109411ab1b00a5d177" class="caseName" target="_blank">劉樂搶劫、故意殺人死刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-06-03</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人劉樂以<span style="color:red">非法佔有</span>爲目的,採取殺人的暴力手段劫取他人財物,其行爲已構成搶劫罪;劉樂故意非法剝奪他人生命,其行爲構成故意殺人罪,應依法並罰。劉樂沉迷賭博,殺害妻子,情節惡劣,後果嚴重;其殺...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="b8f6b6fe813945109411ab1b00a5d177" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="b8f6b6fe813945109411ab1b00a5d177" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="8051cc11053748f7b85cab0901177455"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=8051cc11053748f7b85cab0901177455" class="caseName" target="_blank">於京平搶劫、強姦死刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-04-09</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人於京平以<span style="color:red">非法佔有</span>爲目的,採用暴力手段劫取他人財物,又強行與被害人發生性關係,其行爲已構成搶劫罪和強姦罪,應依法數罪併罰。於京平在搶劫過程中爲制服被害人反抗,用隨身攜帶的斧子多次擊打被...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="8051cc11053748f7b85cab0901177455" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="8051cc11053748f7b85cab0901177455" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="341ef75432434ac29b87ab0901177549"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=341ef75432434ac29b87ab0901177549" class="caseName" target="_blank">賈謙龍搶劫死刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-04-01</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人賈謙龍以<span style="color:red">非法佔有</span>爲目的,採用暴力手段劫取他人財物,其行爲已構成搶劫罪。賈謙龍預謀搶劫,當場殺害一名被害人,犯罪手段殘忍,情節、後果嚴重,罪行極其嚴重,應依法懲處。第一審判決、第二審裁...</p> </div>  <div class="list_Association">  <h4>[關聯文書]</h4>   <div class="guanLian"> <span>本篇</span><b></b> <a href="../181107ANFZ0BXSK4/index.html?docId=341ef75432434ac29b87ab0901177549" target="_blank" title=""><i class="guanlianAnyou">其他</i><i>最高人民法院 </i><i>無</i><i>2019-04-01</i><i></i></a> </div>  <div class="guanLian"> <span></span><b></b> <a href="../181107ANFZ0BXSK4/index.html?docId=97fe25f949ee4d5fb136a9d300113d78" target="_blank" title=""><i class="guanlianAnyou">刑事二審</i><i>河北省高級人民法院 </i><i>(2018)冀刑終249號</i><i>2018-09-13</i><i></i></a> </div>  <div class="guanLian"> <span></span><b></b> <a href="../181107ANFZ0BXSK4/index.html?docId=fe7964f3f00a42c7ac04ab2700b109e2" target="_blank" title=""><i class="guanlianAnyou">刑事一審</i><i>河北省邯鄲市中級人民法院 </i><i>(2018)冀04刑初2號</i><i>2018-03-28</i><i>判決</i></a> </div>  </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="341ef75432434ac29b87ab0901177549" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="341ef75432434ac29b87ab0901177549" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="fb5ecd6b76fd4709b0f4ab0e00c35e20"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=fb5ecd6b76fd4709b0f4ab0e00c35e20" class="caseName" target="_blank">張治剛搶劫死刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-04-01</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人張治剛以<span style="color:red">非法佔有</span>爲目的,夥同他人採用暴力手段劫取財物,其行爲已構成搶劫罪。張治剛夥同他人經預謀後搶劫無證營運出租車,不顧司機求饒將其殺害,併爲掩蓋罪證拋屍枯井,犯罪情節惡劣,手段殘忍...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="fb5ecd6b76fd4709b0f4ab0e00c35e20" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="fb5ecd6b76fd4709b0f4ab0e00c35e20" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="5b12fee3ad674bb189d7aaf501095b44"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=5b12fee3ad674bb189d7aaf501095b44" class="caseName" target="_blank">張斌搶劫死刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-03-31</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人張斌以<span style="color:red">非法佔有</span>爲目的,採取暴力手段劫取他人財物,其行爲已構成搶劫罪。張斌持械入戶搶劫並致人死亡,犯罪情節特別惡劣,後果嚴重,實屬罪行極其嚴重,應依法懲處。第一審判決、第二審裁定認定的...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="5b12fee3ad674bb189d7aaf501095b44" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="5b12fee3ad674bb189d7aaf501095b44" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="76646deadbd2432e8f0eaaf501095bd8"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=76646deadbd2432e8f0eaaf501095bd8" class="caseName" target="_blank">趙晏飛搶劫死刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-03-22</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人趙晏飛以<span style="color:red">非法佔有</span>爲目的,以暴力手段入戶劫取他人財物,致一人死亡,其行爲已構成搶劫罪。趙晏飛經預謀,深夜入戶搶劫,並持刀捅刺被害人頸部致被害人死亡,犯罪手段殘忍,後果嚴重,情節特別惡劣...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="76646deadbd2432e8f0eaaf501095bd8" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="76646deadbd2432e8f0eaaf501095bd8" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="becc9a11b4bd4696a424aaf501095bc1"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=becc9a11b4bd4696a424aaf501095bc1" class="caseName" target="_blank">谷憲武搶劫死刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-03-12</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人谷憲武以<span style="color:red">非法佔有</span>爲目的,使用暴力劫取他人財物,其行爲已構成搶劫罪。谷憲武因賭博輸錢而實施搶劫,並殺害被害人,犯罪動機卑劣,情節特別惡劣,實屬罪行極其嚴重,應依法懲處。原審判決、高級人...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="becc9a11b4bd4696a424aaf501095bc1" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="becc9a11b4bd4696a424aaf501095bc1" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 其他 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="88b891c7c7c74f0d954caaf701657b09"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=88b891c7c7c74f0d954caaf701657b09" class="caseName" target="_blank">高中強搶劫死刑複覈刑事裁定書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">無</span> <span class="cprq">2019-01-30</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,被告人高中強以<span style="color:red">非法佔有</span>爲目的,採用暴力手段劫取他人財物,其行爲已構成搶劫罪。高中強搶劫並致人死亡,犯罪手段殘忍,犯罪情節特別惡劣,實屬罪行極其嚴重,應依法懲處。第一審判決、第二審裁定認定的...</p> </div>  <div class="list_Association">  <h4 style="height: 40px;"></h4>   </div> <div class="List_label2 clearfix"> <div class="fr"> <a data-value="88b891c7c7c74f0d954caaf701657b09" class="a_sc" href="javascript:void(0)"><i class="fa fa-heart-o"> </i> 收藏</a> <a data-value="88b891c7c7c74f0d954caaf701657b09" class="a_xz" href="javascript:void(0)"><i class="a_xzBox"></i> 下載</a> </div> </div> </div>  <div class="LM_list"> <div class="List_label clearfix"> <div class="labelOne"><img src="../images/list/one.png"></div> <div class="labelTwo"> 刑事再審 </div> <div class="labelThree"><img src="../images/list/three.png"></div> <!-- <span class="on_1">推薦案例</span> -->     </div> <div class="list_title clearfix"> <a class="AllSelect" href="javascript:void(0)"><input type="checkbox" class="ListSelect" data-value="971d62400160436aaefda9df0112219c"></a>  <h4><a href="../181107ANFZ0BXSK4/index.html?docId=971d62400160436aaefda9df0112219c" class="caseName" target="_blank">趙明利詐騙再審刑事判決書</a></h4> </div> <div class="list_subtitle"> <span class="slfyName">最高人民法院</span> <span class="ah">(2018)最高法刑再6號</span> <span class="cprq">2019-01-03</span> </div> <div class="list_reason">  <h4>[裁判理由]</h4>  <p>本院認爲,原審被告人趙明利在與東北風冷軋板公司的冷軋板購銷交易過程中,主觀上沒有<span style="color:red">非法佔有</span>的目的,客觀上亦未實施虛構事實、隱瞞真相的行爲,其行爲不符合詐騙罪的構成要件,不構成詐騙罪。理由如下:.......... </diy:lawyee>

只要出現這種特殊標籤就會報namespace error

問題二:元素的動態變化

主要是下一頁這個元素,這個元素會動態變化
第一頁的下一頁元素爲:
在這裏插入圖片描述

//*[@id="_view_1545184311000"]/div[18]/a[8]

第二頁的元素爲:
在這裏插入圖片描述

//*[@id="_view_1545184311000"]/div[18]/a[9]

由a[8]變爲a[9]

一開始試圖通過,對於翻頁的次數的變化進行更新,但是失敗了,失敗原因在於只顯示前600份裁判文書,但是由於限定條件的不同,查找的裁判文書數量就不同,導致不好判斷
思路是這樣的:

# 下一頁xpath:
            '''
             if 7<=index <36:
                try:
                    button1 = wait.until(EC.presence_of_element_located((By.XPATH, '//div[@class="left_7_3"]/a[14]')))#問題並不是全爲a8
                    time.sleep(1)
                    button1.click()
                except:
                    pass

            elif 36 <= index < 40:
                try:
                    button1 = wait.until(EC.presence_of_element_located((By.XPATH, '//div[@class="left_7_3"]/a[%d]' % int(
                        49-index))))
                    time.sleep(1)
                    button1.click()
                except:
                    pass
            elif index ==40:
                pass
            '''

然後試圖通過分段函數進行解決,就是提取查找到的文書數量計算要翻的頁數:

在這裏插入圖片描述

"""目的:減少遍歷次數"""
        time.sleep(1)
        condition = browser.find_element_by_xpath('//div[@class="LM_con clearfix"]/div[@class="fr con_right"]/span')
        print(condition.text)  # 不能直接//text()原因不明
        conditions = math.ceil(int(condition.text) / 15)  # 最長12,最短6
        print(conditions)

但是還是不成功!!!!

最後突然想到,下一頁 這個元素正好是最後一個元素可以直接使用xpath語法選中最後一個

button_ = wait.until(EC.presence_of_element_located((By.XPATH, '//div[@class="left_7_3"]/a[last()]')))
button_.click()

問題三、只顯示前600份裁判文書

需要爬所有,但是隻顯示前600份
,沒有辦法直接爬取所有,因爲裁判文書網限定了只顯示前600份

解決方法:通過限定搜索條件如:
在這裏插入圖片描述
這個方法是某個博主(暫時想不起來)給的思路,這個博主是通過北京市的律師來限定的,但是我沒有找到律師的名字,所有我是通過限定區域,如xx區縣
在這裏插入圖片描述

準確說這樣,還是有問題,某些區縣仍然超過600,但是相對較少了,即使超過了也不多,通過設置篩選可以進一步等待後續處理,如:

"""目的:減少遍歷次數"""
        time.sleep(1)
        condition = browser.find_element_by_xpath('//div[@class="LM_con clearfix"]/div[@class="fr con_right"]/span')
        print(condition.text)  # 不能直接//text()原因不明
        conditions = math.ceil(int(condition.text) / 15)  # 最長12,最短6
        print(conditions)
        if int(condition.text) > 600:
            with open('超過600頁.txt','a+',encoding='utf-8')as file:
                file.write('出現超過600條的裁判文書,其所在區域爲:'+ str(i.strip()) +',其數量爲:'+str(condition.text) + '\n')
            logging.warning('出現超過600條的裁判文書,其所在區域爲:'+ str(i.strip()) +',其數量爲:'+str(condition.text))

問題四:彈出框的處理

參見《selenium彈窗之windows下載文件彈窗點擊方法》 ,使用pyautogui進行鍵鼠自動化

但是據說還有selenium的內部方法

"""下載無彈窗
profile = webdriver.FirefoxProfile()
profile.set_preference('browser.download.dir', '‪I:\\360下載\\firefox')
profile.set_preference('browser.download.folderList', 2)
profile.set_preference('browser.download.manager.showWhenStarting', False)
profile.set_preference('browser.helperApps.neverAsk.saveToDisk', 'application/zip')
"""

但是不知道爲什麼我的不行,所以註釋掉了

三、selenium路線的缺陷

有兩個缺陷,一個是速度慢,另一個是彈出框暫時沒有辦法解決
後續可能需要使用scrapy+selenium,以及使彈出框不在彈出

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章