Straight to the code, so I don't forget it myself.
<?php
echo "Hello World!";
$url = "http://www.youdao.com/w/ok/#keyfrom=dict2.top"; // URL to crawl
//$url = "https://www.runoob.com/html/html-links.html"; // URL to crawl
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_HEADER, false);
// Skip SSL certificate and host name checks (fine for a quick test, unsafe for anything real)
curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, false);
curl_setopt($ch, CURLOPT_SSL_VERIFYHOST, false);
// Set the user agent (URLs ending in .top return no content without this.)
curl_setopt($ch, CURLOPT_USERAGENT, "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.22 (KHTML, like Gecko) Chrome/25.0.1364.172 Safari/537.22");
// Return the response as a string instead of printing it directly
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
// Follow redirects
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, 1);
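$response = curl_exec($ch);
if ($response === false) {
    // curl_exec() returns false on failure; print the error code and message
    echo curl_errno($ch) . ": " . curl_error($ch);
} else {
    echo $response;
}
curl_close($ch);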
?>
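This fetches the whole page first; pulling out just the parts I want comes next. It looks like that calls for regular expressions, and on that subject I am completely clueless...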
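As a starting point for that extraction step, here is a minimal sketch using `preg_match_all()`. Everything specific in it is an assumption for illustration: the sample `$html` string is made up (the real page markup is not shown here), the choice of `<li>` elements as the target is arbitrary, and `extract_list_items()` is a hypothetical helper name. In the real script, `$response` would be passed in place of `$html`.

```php
<?php
// Hypothetical sample markup; the actual structure of the fetched page is not known here.
$html = '<div class="trans-container"><ul><li>adj. fine</li><li>adv. all right</li></ul></div>';

// Hypothetical helper: returns the plain text of every <li> element found in $html.
function extract_list_items(string $html): array
{
    // \b keeps "<li" from matching tags such as <link>.
    // (.*?) is a non-greedy capture of the element body.
    // The "s" flag lets "." match newlines; "i" ignores tag case.
    if (!preg_match_all('/<li\b[^>]*>(.*?)<\/li>/si', $html, $matches)) {
        return []; // no match, or a regex error
    }
    // $matches[1] holds the first capture group of each match.
    return array_map(function ($item) {
        // Drop any nested tags and surrounding whitespace.
        return trim(strip_tags($item));
    }, $matches[1]);
}

print_r(extract_list_items($html));
```

Regular expressions work for simple, predictable markup like this, but they break easily on nested or irregular HTML. For anything more involved, PHP's built-in `DOMDocument` with `DOMXPath` is probably the sturdier choice.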