Python DataFrame: the index "disappears" after removing some rows, and how to rebuild it

Original

2018-10-19 23:12

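While processing a dataset today I ran into a problem: after removing some rows from a pandas DataFrame in Python, the index seemed to disappear and iterating over the rows failed.
The error looked like this: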

Traceback (most recent call last):
  File "D:/pycreate/tianchi_糖尿病/data_pre/split_data.py", line 53, in <module>
    handler_data()
  File "D:/pycreate/tianchi_糖尿病/data_pre/split_data.py", line 32, in handler_data
    print(indexdf["S"][i])
  File "D:\ANACONDA\ana3.5.2\lib\site-packages\pandas\core\series.py", line 766, in __getitem__
    result = self.index.get_value(self, key)
  File "D:\ANACONDA\ana3.5.2\lib\site-packages\pandas\core\indexes\base.py", line 3103, in get_value
    tz=getattr(series.dtype, 'tz', None))
  File "pandas\_libs\index.pyx", line 106, in pandas._libs.index.IndexEngine.get_value
  File "pandas\_libs\index.pyx", line 114, in pandas._libs.index.IndexEngine.get_value
  File "pandas\_libs\index.pyx", line 162, in pandas._libs.index.IndexEngine.get_loc
  File "pandas\_libs\hashtable_class_helper.pxi", line 958, in pandas._libs.hashtable.Int64HashTable.get_item
  File "pandas\_libs\hashtable_class_helper.pxi", line 964, in pandas._libs.hashtable.Int64HashTable.get_item
KeyError: 31

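After some digging, I found the cause: I had removed some abnormal rows from the original data. Boolean filtering keeps the original index labels of the surviving rows, so the index is left with gaps, and indexdf["S"][i] looks rows up by label rather than by position.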

    # This removes some of the original index labels (30-32 in this case), leaving gaps in the index
    indexdf = indexdf[indexdf["EE"] != 0]
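
The effect is easy to reproduce on a small frame. The sketch below uses made-up toy data (the column values are hypothetical, not taken from the original file) to show that the filtered frame keeps the old labels and that a lookup with a removed label raises KeyError:

```python
import pandas as pd

# Hypothetical toy data standing in for the original file
df = pd.DataFrame({"S": [10, 20, 30, 40], "EE": [1, 0, 1, 1]})

# Drop the row where EE == 0 (label 1)
filtered = df[df["EE"] != 0]

print(list(filtered.index))  # [0, 2, 3]  (label 1 is gone)
print(len(filtered))         # 3

# range(len(filtered)) yields 0, 1, 2, but label 1 no longer exists
try:
    filtered["S"][1]
    lookup_failed = False
except KeyError:
    lookup_failed = True

print(lookup_failed)  # True
```

This is the same situation as the KeyError: 31 in the traceback: the loop asks for a label that the filter removed.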

Solution

    # Rebuild the index so that iterating by position works again
    indexdf = indexdf.reset_index(drop=True)
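
As a minimal sketch of what reset_index does, again on hypothetical toy data: with drop=True the old labels are discarded and replaced by 0..n-1, while without it pandas keeps the old labels in a new column named "index".

```python
import pandas as pd

# Hypothetical toy data standing in for the original file
df = pd.DataFrame({"S": ["a", "b", "c", "d"], "EE": [1, 0, 1, 1]})
filtered = df[df["EE"] != 0]

# drop=True throws the old labels away and renumbers from 0
reindexed = filtered.reset_index(drop=True)
print(list(reindexed.index))  # [0, 1, 2]

# Label-based lookups with 0..len-1 now succeed
values = [reindexed["S"][i] for i in range(len(reindexed))]
print(values)  # ['a', 'c', 'd']

# Without drop=True the old labels are kept as an extra column
kept = filtered.reset_index()
print(list(kept.columns))  # ['index', 'S', 'EE']
```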

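Code:

    import pandas as pd

    # Read the whitespace-separated annotation file and name its five columns
    indexdf = pd.read_table("0.ann", sep=r"\s+", names=["T", "TC", "S", "E", "name"])
    # Mark rows whose "E" value contains ";" with 0 so they can be filtered out
    indexdf["EE"] = indexdf["E"].apply(lambda x: x if ";" not in x else 0)
    indexdf = indexdf[indexdf["EE"] != 0]
    # Rebuild the index so that lookups with 0..len-1 work again
    indexdf = indexdf.reset_index(drop=True)
    for i in range(len(indexdf)):
        print(indexdf["S"][i])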
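
If the original labels need to be preserved, resetting the index is not the only option. The sketch below (hypothetical toy data again) shows three alternatives that should work on the filtered frame as it is: positional access with .iloc, iterating over the column directly, and iterating over the surviving labels with .loc.

```python
import pandas as pd

# Hypothetical toy data standing in for the original file
df = pd.DataFrame({"S": ["a", "b", "c", "d"], "EE": [1, 0, 1, 1]})
filtered = df[df["EE"] != 0]  # index labels are now 0, 2, 3

# 1. Positional access: .iloc ignores the labels entirely
by_position = [filtered["S"].iloc[i] for i in range(len(filtered))]

# 2. Iterate over the column's values directly, no index needed
direct = list(filtered["S"])

# 3. Iterate over the labels that actually exist
by_label = [filtered.loc[label, "S"] for label in filtered.index]

print(by_position)  # ['a', 'c', 'd']
print(direct)       # ['a', 'c', 'd']
print(by_label)     # ['a', 'c', 'd']
```

Which one fits best depends on whether the old row labels still carry meaning later in the pipeline; if they do not, reset_index(drop=True) remains the simplest fix.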
