The Python Challenge Level-4 Solution
先附上我在Github上存放的代碼倉庫: The Python Challenge
依然是先查看網頁源代碼。發現有個鏈接’…nothing=12345’
點進去之後發現又給我們一個數字,引導我們去下一個鏈接,於是可以知道,只要鏈接爬到了終點,那麼就會出現別的提示信息
果然,爬到終點之後,提示我們除以二,繼續爬,所以只要把代碼改個數字就好,我的代碼如下:
#!/usr/bin/env python3
# -*- coding: utf-8 -*-
__author__ = 'Yuuki_Dach'
from urllib import request
import re
mainUrl = 'http://www.pythonchallenge.com/pc/def/linkedlist.php?nothing='
subUrl = '8022' # first try is '12345'
subUrl2 = ''
i = 400
while subUrl != subUrl2:
subUrl2 = subUrl
pyUrl = mainUrl + subUrl
req = request.Request(pyUrl)
resp = request.urlopen(req)
urlContent = resp.read()
pattern = re.compile(b'\d+', re.S)
contents = re.findall(pattern, urlContent)
for content in contents:
subUrl = content.decode("ascii")
print(subUrl)
最後得到’peak’。只不過注意後綴要改回’.html’就行了