Python数据分析与挖掘实战 (Python Data Analysis and Mining in Practice), chapter 6-1

Original

2018-08-26 19:14

Python数据分析与挖掘实战, p. 153

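# -*- coding: utf-8 -*-
import pandas as pd
from scipy.interpolate import lagrange  # Lagrange interpolating polynomial

inputfile = 'G:/学习资料/统计/chapter6/demo/data/missing_data.xls'  # raw data with gaps
outputfile = 'G:/学习资料/统计/chapter6/demo/tmp/missing_data_processed1.xls'  # filled result
data = pd.read_excel(inputfile, header=None)  # reading .xls requires xlrd


def ployinterp_column(s, n, k=5):
    """Interpolate position n of column s from up to k values on each side."""
    # reindex() returns NaN for labels outside the index (e.g. negative ones)
    # instead of triggering the FutureWarning that s[list] produces.
    y = s.reindex(list(range(n - k, n)) + list(range(n + 1, n + 1 + k)))
    y = y[y.notnull()]  # drop missing neighbours
    return lagrange(y.index, list(y))(n)  # evaluate the polynomial at n


for i in data.columns:
    for j in range(len(data)):
        if data[i].isnull()[j]:
            # .loc writes to the DataFrame itself; data[i][j] = ... is chained assignment
            data.loc[j, i] = ployinterp_column(data[i], j)

data.to_excel(outputfile, header=False, index=False)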
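To see what the interpolation step computes without needing the Excel file, here is a minimal pure-Python sketch. The names lagrange_eval and fill_missing are hypothetical helpers written for this illustration; they are not part of scipy or of the book's code. Unlike the loop above, this simplified version interpolates only from the originally known values, not from gaps that were filled earlier.

```python
def lagrange_eval(xs, ys, x):
    """Evaluate at x the Lagrange polynomial through the points (xs[i], ys[i])."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = yi
        for j, xj in enumerate(xs):
            if j != i:
                term *= (x - xj) / (xi - xj)  # basis polynomial factor
        total += term
    return total


def fill_missing(values, k=5):
    """Return a copy of values with each None replaced by an interpolated value.

    Uses the known values among the k positions on each side of the gap,
    mirroring the window taken in ployinterp_column.
    """
    filled = list(values)
    for n, v in enumerate(values):
        if v is None:
            window = [m for m in range(n - k, n + k + 1)
                      if m != n and 0 <= m < len(values) and values[m] is not None]
            if window:  # leave the gap alone if no neighbour is known
                filled[n] = lagrange_eval(window, [values[m] for m in window], n)
    return filled


samples = [0.0, 1.0, 4.0, None, 16.0, 25.0]  # squares of 0..5 with one gap
result = fill_missing(samples)
print(result)
```

Because the five known points lie on y = x², the interpolating polynomial reproduces that curve and the gap at position 3 is filled with a value very close to 9.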

Problems encountered:

1. ImportError: No module named 'xlrd'

Install xlrd from the cmd command line with: pip install xlrd
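A quick way to confirm whether the module is visible to the interpreter that runs the script is sketched below. The helper name module_available is an assumption made for this example; it uses only the standard library.

```python
import importlib.util


def module_available(name):
    """Return True if the named top-level module can be found for import."""
    return importlib.util.find_spec(name) is not None


if not module_available("xlrd"):
    print("xlrd is missing; install it with: pip install xlrd")
```

If the message still appears after installing, pip may have installed xlrd into a different Python installation than the one running the script.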

2. File "C:\Users\Administrator\AppData\Local\Programs\Python\Python35\lib\site-packages\pandas\core\series.py", line 842
return self.loc[key]
FutureWarning:
Passing list-likes to .loc or [] with any missing label will raise
KeyError in the future, you can use .reindex() as an alternative.

See the documentation here:
https://pandas.pydata.org/pandas-docs/stable/indexing.html#deprecate-loc-reindex-listlike
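After consulting that page, changing loc[key] to reindex(key) resolves the problem. In the code above, that means writing s.reindex(key) in ployinterp_column instead of indexing s with the list of labels, so that labels missing from the index come back as NaN.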
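The behaviour of reindex with missing labels can be checked on a tiny Series. This is a sketch with made-up values; the variable names are chosen for illustration only. In recent pandas releases the equivalent lookup s[wanted] raises KeyError, as the warning announced.

```python
import pandas as pd

s = pd.Series([10.0, 20.0, 30.0])  # labels 0, 1, 2
wanted = [-1, 0, 1, 5]             # -1 and 5 are not in the index

window = s.reindex(wanted)         # missing labels become NaN, no error
known = window[window.notnull()]   # keep only the labels that exist

print(list(known.index))           # labels that survived: 0 and 1
```

This is why the interpolation window can safely extend past the start or end of a column: the out-of-range positions simply drop out before the polynomial is built.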
