pandas讀寫excel文件

https://www.cnblogs.com/everfight/p/pandas_select_rows.html
https://blog.csdn.net/midion9/article/details/89000131
https://blog.csdn.net/fengqiaoxian/article/details/80415354

萬能的pandas能處理excel的20多萬行數據。xlsxwriter處理不了20多萬行,但是能處理五萬行數據,能處理的最多的數據都沒測試過。
一、讀取excel:
Pandas中根據列的值選取多行數據:
#選取等於某些值的行記錄 用
df.loc[df[‘column_name’] == some_value]

二、寫入excel:
寫入excel主要通過pandas構造DataFrame,調用to_excel方法實現。

三、代碼實現。
注意excel格式要正確:第一行可以是數據的含義,從第二行開始是得處理的數據。

from pandas import DataFrame, read_excel, ExcelWriter

def readData():
excelFile = r’C:\Users\sparrow\Desktop\PupilData(20191011073414).xlsx’
data = DataFrame(read_excel(excelFile))

df1 = data[['Time(s)', 'LeftPupil(mm)', 'RightPupil(mm)', 'AveragePupil(mm)']]

df2 = df1.loc[df1['Time(s)'] >= 147].loc[df1['Time(s)'] <= 156.1]
df3 = df1.loc[df1['Time(s)'] >= 286].loc[df1['Time(s)'] <= 294.1]
df4 = df1.loc[df1['Time(s)'] >= 313].loc[df1['Time(s)'] <= 324.1]
df5 = df1.loc[df1['Time(s)'] >= 324].loc[df1['Time(s)'] <= 364.1]
df6 = df1.loc[df1['Time(s)'] >= 372].loc[df1['Time(s)'] <= 397.1]
df7 = df1.loc[df1['Time(s)'] >= 497].loc[df1['Time(s)'] <= 505.1]
df8 = df1.loc[df1['Time(s)'] >= 591].loc[df1['Time(s)'] <= 630.1]
df9 = df1.loc[df1['Time(s)'] >= 640].loc[df1['Time(s)'] <= 663.1]

writer = ExcelWriter('output3.xlsx')
df2.to_excel(writer, '1')
df3.to_excel(writer, '2')
df4.to_excel(writer, '3')
df5.to_excel(writer, '4')
df6.to_excel(writer, '5')
df7.to_excel(writer, '6')
df8.to_excel(writer, '7')
df9.to_excel(writer, '8')
writer.save()
print('ok')

readData()

來瞧瞧運行結果:
在這裏插入圖片描述

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章