Automatically downloading and renaming the 100 songs on the Baidu Music new-songs chart

Original

yyl910606

2019-02-22 21:22

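Runtime environment: Windows, Linux

Python version: 2.x

Parts of this script are copied from code posted by an expert on OSChina (開源中國); I hope they will forgive me.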
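
The script below relies on Python 2's `urllib.URLopener` and `urllib.urlretrieve`. For readers on Python 3, where the URL-fetching functions live in `urllib.request`, the same browser-style request could be sketched roughly as follows. This is only a sketch under assumptions: `build_request` and `fetch_html` are hypothetical helper names that do not appear in the original script, and the chart URL may no longer respond the way the script expects.

```python
import urllib.request

CHART_URL = 'http://music.baidu.com/top/new'    # chart URL taken from the script
USER_AGENT = ('Mozilla/5.0 (Windows NT 5.1; rv:14.0) '
              'Gecko/20100101 Firefox/14.0.1')  # same header the script sends


def build_request(url, user_agent=USER_AGENT):
    """Hypothetical helper: wrap a URL in a Request with a browser User-Agent."""
    return urllib.request.Request(url, headers={'User-Agent': user_agent})


def fetch_html(url, timeout=10):
    """Hypothetical helper: download a page and decode it as UTF-8 text."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as response:
        return response.read().decode('utf8', errors='replace')
```

Calling `fetch_html(CHART_URL)` performs a real network request, so it is left out of the block; `build_request` on its own can be inspected offline.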

#!/usr/bin/python
#coding:utf8

import re,urllib

#url='http://music.baidu.com'
url='http://music.baidu.com/top/new'             # Baidu new-songs top 100 chart
openurl=urllib.URLopener()
headers = ('User-Agent','Mozilla/5.0 (Windows NT 5.1; rv:14.0) Gecko/20100101 Firefox/14.0.1')
openurl.addheaders = [headers]                   # pose as a regular browser
data=openurl.open(url).read()                    # fetch the page HTML
datadata=data.decode('utf8')                     # UTF-8-decoded copy (not used below; the regexes run on the raw bytes)
music_sid=re.findall(re.compile(r"'sid': '(.*?)', 'sname'"),data)  # non-greedy match of each song id (sid)
music_sname=re.findall(re.compile(r"'sname': '(.*?)', 'author'"),data)  # match each song title
music_author=re.findall(re.compile(r"'author': '(.*?)' }"),data)   # match each author or performer
 
url_log=open('downurl.txt','w')                              # text file for the download URLs, opened in write mode
for  i in range(len(music_sid)):                             # loop once per song sid
#   url_log.write(sid[i]+name[i]+'\n')
#   print  music_sid[i] + '  ' + music_sname[i] + '-' + music_author[i]
    print str(i) + ':            '+ music_sname[i] + '-' + music_author[i]   # index plus song title and artist
    da=openurl.open('http://music.baidu.com/song/%s/download'% str(music_sid[i])).read() # open this song's download page
    downurl=re.findall(re.compile(r'downlink="/data/music/file\?link=(.*)" type'),da)  # match the download URL
    if not downurl:                                          # no download link found on the page: skip this song
        continue
    url_log.write(downurl[0]+'\n')                           # log the download URL; this line can be commented out
    print '%s music file download Ing ........................'%music_sname[i]    # progress message
    urllib.urlretrieve(downurl[0],music_sname[i]+'-'+music_author[i]+'.mp3')   # download the song and name it title-artist.mp3
    print '-'*50
url_log.close()
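
The three `re.findall` calls above pull parallel lists of IDs, titles and artists out of the chart page, and the loop then builds each file name from the title and artist. A minimal offline sketch of that step, written for Python 3, might look like this. The sample fragment is invented to match the patterns the script searches for and is not copied from the real page. `parse_songs` and `safe_filename` are hypothetical helpers. Two things differ from the original script and are assumptions of this sketch: a single combined pattern is used instead of three separate ones, and the file name is cleaned of characters that Windows does not allow.

```python
import re

# Invented fragment shaped like the data the script's regexes expect.
SAMPLE_HTML = """
{ 'sid': '1001', 'sname': 'Song A', 'author': 'Singer X' }
{ 'sid': '1002', 'sname': 'Live/Remix?', 'author': 'Singer Y' }
"""

# One pattern captures all three fields of a record at once, so the
# fields of one song cannot get out of step with those of another.
SONG_PATTERN = re.compile(
    r"'sid': '(.*?)', 'sname': '(.*?)', 'author': '(.*?)' }")


def parse_songs(html):
    """Return a list of (sid, title, artist) tuples found in the page text."""
    return SONG_PATTERN.findall(html)


def safe_filename(title, artist):
    """Build 'title-artist.mp3', replacing characters invalid in file names."""
    name = '%s-%s' % (title, artist)
    return re.sub(r'[\\/:*?"<>|]', '_', name) + '.mp3'


for sid, title, artist in parse_songs(SAMPLE_HTML):
    print(sid, safe_filename(title, artist))
```

Without some cleanup of this kind, a title containing `/` would be interpreted as a directory separator when passed to `urlretrieve` as the target path.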
