06, Inspecting a df and index operations: extracting a small df (m rows, n columns), all column names, index operations

Original

2020-07-05 00:42

1, All column names: data.columns

Goal: get all column names
Result: an Index object
Get a single column name: res[n]
Code:

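import pandas as pd

if __name__ == '__main__':
    # Show every column when printing
    pd.set_option('display.max_columns', None)
    # Read the CSV file
    data = pd.read_csv("titanic_train.csv")
    # Get the column names (a pandas Index object)
    res = data.columns
    # tolist() must be called to get a plain Python list of the names
    cols = res.tolist()
    print(res)
    print(type(res))
    # Index objects support positional access
    res02 = res[2]
    print(res02)
    print(type(res02))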
=======================================
Index(['PassengerId', 'Survived', 'Pclass', 'Name', 'Sex', 'Age', 'SibSp',
       'Parch', 'Ticket', 'Fare', 'Cabin', 'Embarked'],
      dtype='object')
<class 'pandas.core.indexes.base.Index'>
Pclass
<class 'str'>
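
As a small supplement, the sketch below shows the difference between the Index object and the list returned by tolist(). It does not read titanic_train.csv; the three-column DataFrame is a hypothetical stand-in, so the values are assumptions made only for illustration.

```python
import pandas as pd

# Hypothetical stand-in for a few columns of titanic_train.csv
data = pd.DataFrame({
    "PassengerId": [1, 2, 3],
    "Survived": [0, 1, 1],
    "Pclass": [3, 1, 3],
})

cols = data.columns             # pandas Index object
names = data.columns.tolist()   # plain Python list of str
third = cols[2]                 # positional access into the Index
has_pclass = "Pclass" in cols   # membership test on the Index

print(names)       # ['PassengerId', 'Survived', 'Pclass']
print(third)       # Pclass
print(has_pclass)  # True
```

A list is convenient when the names need to be modified or passed to code that does not know about pandas; the Index itself is immutable.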

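2, Extract a small df: data[['PassengerId','Sex','Age','Survived']].loc[3:6]

Idea: select the columns first, then the rows
Rows 4, 5, 6, 7 (index labels 3 through 6; a .loc slice includes both endpoints), columns ['PassengerId','Sex','Age','Survived']
Code: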

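import pandas as pd

if __name__ == '__main__':
    # Show every column when printing
    pd.set_option('display.max_columns', None)
    # Read the CSV file
    data = pd.read_csv("titanic_train.csv")
    # Select four columns, then the rows labeled 3 through 6 (inclusive)
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    print(res)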
===================================================================
   PassengerId     Sex   Age  Survived
3            4  female  35.0         1
4            5    male  35.0         0
5            6    male   NaN         0
6            7    male  54.0         0
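
The same m-row, n-column block can also be taken in one step. The sketch below compares label-based .loc with position-based .iloc on a hypothetical ten-row DataFrame (the values are assumptions, not read from the CSV file). Note that .loc[3:6] includes label 6, while .iloc[3:7] stops before position 7.

```python
import pandas as pd

# Hypothetical ten-row stand-in with the default 0..9 index
data = pd.DataFrame({
    "PassengerId": range(1, 11),
    "Survived": [0, 1, 1, 1, 0, 0, 0, 0, 1, 1],
})

# Rows and columns in a single .loc call: labels 3..6, both ends included
by_label = data.loc[3:6, ["PassengerId", "Survived"]]

# Position-based equivalent: positions 3..6, the end position 7 is excluded
by_position = data.iloc[3:7][["PassengerId", "Survived"]]

print(by_label)
print(by_label.equals(by_position))  # True
```

With the default index the labels and positions coincide, which is why both forms return the same four rows here.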

3, Index, view all index labels: res.index

Code:

import pandas as pd

if __name__ == '__main__':
    # Show every column when printing
    pd.set_option('display.max_columns', None)
    # Read the CSV file
    data = pd.read_csv("titanic_train.csv")
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    # Get the row index of the small df
    index = res.index
    print(index)
    print(type(index))
===========================================
RangeIndex(start=3, stop=7, step=1)
<class 'pandas.core.indexes.range.RangeIndex'>
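
A RangeIndex stores only start, stop, and step, much like Python's range. The sketch below, using a hypothetical single-column DataFrame, shows how to read those attributes and how to turn the index into an ordinary list.

```python
import pandas as pd

# Hypothetical DataFrame with the default 0..9 index
df = pd.DataFrame({"x": range(10)})
sub = df.loc[3:6]   # labels 3..6, inclusive

idx = sub.index
print(idx.tolist())                    # [3, 4, 5, 6]
print(idx.start, idx.stop, idx.step)   # 3 7 1
print(len(idx))                        # 4
```

The stop value is 7 rather than 6 because, as with range, the stop is exclusive.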

4, Index, reset the index: res.reset_index(drop=True, inplace=True)

Code:

import pandas as pd

if __name__ == '__main__':
    # Show every column when printing
    pd.set_option('display.max_columns', None)
    # Read the CSV file
    data = pd.read_csv("titanic_train.csv")
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    print(res)
    # drop=True discards the old labels; inplace=True modifies res itself
    res.reset_index(drop=True, inplace=True)
    print(res)
======================================================
   PassengerId     Sex   Age  Survived
3            4  female  35.0         1
4            5    male  35.0         0
5            6    male   NaN         0
6            7    male  54.0         0
======================================================
   PassengerId     Sex   Age  Survived
0            4  female  35.0         1
1            5    male  35.0         0
2            6    male   NaN         0
3            7    male  54.0         0
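
For comparison, here is a minimal sketch of what the drop argument changes, written in the non-inplace style where reset_index returns a new DataFrame. The four-row DataFrame is a hypothetical stand-in for the slice above.

```python
import pandas as pd

# Hypothetical stand-in for the sliced rows, keeping the labels 3..6
df = pd.DataFrame({"PassengerId": [4, 5, 6, 7]}, index=[3, 4, 5, 6])

dropped = df.reset_index(drop=True)   # old labels are discarded
kept = df.reset_index()               # old labels become a column named "index"

print(dropped.index.tolist())   # [0, 1, 2, 3]
print(kept.columns.tolist())    # ['index', 'PassengerId']
print(df.index.tolist())        # [3, 4, 5, 6], the original is unchanged
```

Assigning the result to a new name keeps the original around, which can be easier to follow than inplace=True when the old labels are still needed.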

5, Index, custom labels: res.index = pd.Series(["a","b","c","d"])

Code:

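import pandas as pd

if __name__ == '__main__':
    # Show every column when printing
    pd.set_option('display.max_columns', None)
    # Read the CSV file
    data = pd.read_csv("titanic_train.csv")
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    print(res)
    # Assign one new label per row; the length must match the number of rows
    res.index = pd.Series(["a","b","c","d"])
    print(res)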
==========================================================
   PassengerId     Sex   Age  Survived
3            4  female  35.0         1
4            5    male  35.0         0
5            6    male   NaN         0
6            7    male  54.0         0
==========================================================
   PassengerId     Sex   Age  Survived
a            4  female  35.0         1
b            5    male  35.0         0
c            6    male   NaN         0
d            7    male  54.0         0
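
Two details are worth a quick sketch: a plain list works as well as a pd.Series, and the number of labels has to match the number of rows. The DataFrame below is a hypothetical stand-in for the four rows above.

```python
import pandas as pd

# Hypothetical stand-in for the four selected rows
df = pd.DataFrame({
    "PassengerId": [4, 5, 6, 7],
    "Sex": ["female", "male", "male", "male"],
})

# A plain list of labels is accepted
df.index = ["a", "b", "c", "d"]
print(df.index.tolist())   # ['a', 'b', 'c', 'd']

# Too few (or too many) labels raises ValueError and leaves df untouched
try:
    df.index = ["a", "b"]
    mismatch_error = None
except ValueError as exc:
    mismatch_error = exc

print(type(mismatch_error).__name__)   # ValueError
```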

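6, Selecting data with a custom index:

Idea:
1, Usage: works just like the default index
2, Can a range be selected: yes (with .loc, both endpoint labels are included)
Code: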

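import pandas as pd

if __name__ == '__main__':
    # Show every column when printing
    pd.set_option('display.max_columns', None)
    # Read the CSV file
    data = pd.read_csv("titanic_train.csv")
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    res.index = pd.Series(["a","b","c","d"])
    print(res)
    # A single label returns that row as a Series
    print(res.loc['a'])
    # A label slice returns a df and includes both 'b' and 'd'
    print(res.loc['b':'d'])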
===========================================================
   PassengerId     Sex   Age  Survived
a            4  female  35.0         1
b            5    male  35.0         0
c            6    male   NaN         0
d            7    male  54.0         0
================================
PassengerId         4
Sex            female
Age                35
Survived            1
Name: a, dtype: object
================================
   PassengerId   Sex   Age  Survived
b            5  male  35.0         0
c            6  male   NaN         0
d            7  male  54.0         0
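
To round this off, the sketch below contrasts label-based and position-based selection once the index holds strings. It uses a hypothetical four-row DataFrame rather than the CSV file.

```python
import pandas as pd

# Hypothetical stand-in with the custom labels a..d
df = pd.DataFrame({"PassengerId": [4, 5, 6, 7]}, index=["a", "b", "c", "d"])

row = df.loc["a"]            # single label -> Series
block = df.loc["b":"d"]      # label slice, 'd' is included
by_position = df.iloc[1:4]   # positions 1..3, position 4 is excluded
picked = df.loc[["a", "d"]]  # list of labels -> only those rows

print(block.index.tolist())          # ['b', 'c', 'd']
print(block.equals(by_position))     # True
print(picked["PassengerId"].tolist())  # [4, 7]
```

Positions remain available through .iloc after the labels change, so a custom index adds a way to address rows without taking the old one away.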
