06 ,df 查看,索引操作 : 提取小 df ( m行n列 ),所有字段名,索引操作

1 ,所有字段名 : data.columns

  1. 目的 : 得到所有字段名
  2. 得到 : index 對象
  3. 取一個字段名 : res[n]
  4. 代碼 :
if __name__ == '__main__':
    # 全列顯示 :
    pd.set_option('display.max_columns', None)
    # 讀文件 csv
    data = pd.read_csv("titanic_train.csv")
    # 取數據
    res = data.columns
    res.tolist
    print(res)
    print(type(res))
    res02 = res[2]
    print(res02)
    print(type(res02))
=======================================
Index(['PassengerId', 'Survived', 'Pclass', 'Name', 'Sex', 'Age', 'SibSp',
       'Parch', 'Ticket', 'Fare', 'Cabin', 'Embarked'],
      dtype='object')
<class 'pandas.core.indexes.base.Index'>
Pclass
<class 'str'>

2 ,提取小 df :data[[‘Sex’,‘Age’,‘Survived’]].loc[3:6]

  1. 思路 : 先提取列,再提取行
  2. 4,5,6,7 行,[‘PassengerId’,‘Sex’,‘Age’,‘Survived’] 列
  3. 代碼 :
if __name__ == '__main__':
    # 全列顯示 :
    pd.set_option('display.max_columns', None)
    # 讀文件 csv
    data = pd.read_csv("titanic_train.csv")
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    print(res)
===================================================================
   PassengerId     Sex   Age  Survived
3            4  female  35.0         1
4            5    male  35.0         0
5            6    male   NaN         0
6            7    male  54.0         0

3 ,索引, 查看所有索引: res.index

  1. 代碼 :
if __name__ == '__main__':
    # 全列顯示 :
    pd.set_option('display.max_columns', None)
    # 讀文件 csv
    data = pd.read_csv("titanic_train.csv")
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    # 取索引
    index = res.index
    print(index)
    print(type(index))
===========================================
RangeIndex(start=3, stop=7, step=1)
<class 'pandas.core.indexes.range.RangeIndex'>

4 ,索引,重新索引 : res.reset_index(drop=True, inplace=True)

  1. 代碼 :
if __name__ == '__main__':
    # 全列顯示 :
    pd.set_option('display.max_columns', None)
    # 讀文件 csv
    data = pd.read_csv("titanic_train.csv")
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    print(res)
    res.reset_index(drop=True, inplace=True)
    print(res)
======================================================
   PassengerId     Sex   Age  Survived
3            4  female  35.0         1
4            5    male  35.0         0
5            6    male   NaN         0
6            7    male  54.0         0
======================================================
   PassengerId     Sex   Age  Survived
0            4  female  35.0         1
1            5    male  35.0         0
2            6    male   NaN         0
3            7    male  54.0         0

5 ,索引,自定義 : res.index = pd.Series([“a”,“b”,“c”,“d”])

  1. 代碼 :
if __name__ == '__main__':
    # 全列顯示 :
    pd.set_option('display.max_columns', None)
    # 讀文件 csv
    data = pd.read_csv("titanic_train.csv")
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    print(res)
    res.index = pd.Series(["a","b","c","d"])
    print(res)
==========================================================
   PassengerId     Sex   Age  Survived
3            4  female  35.0         1
4            5    male  35.0         0
5            6    male   NaN         0
6            7    male  54.0         0
==========================================================
   PassengerId     Sex   Age  Survived
a            4  female  35.0         1
b            5    male  35.0         0
c            6    male   NaN         0
d            7    male  54.0         0

6 ,自定義索引取數據 :

  1. 思路 :
    1 ,使用 : 像正常索引一樣使用
    2 ,是否可以選取區間 : 可以
  2. 代碼 :
if __name__ == '__main__':
    # 全列顯示 :
    pd.set_option('display.max_columns', None)
    # 讀文件 csv
    data = pd.read_csv("titanic_train.csv")
    res = data[['PassengerId','Sex','Age','Survived']].loc[3:6]
    res.index = pd.Series(["a","b","c","d"])
    print(res)
    print(res.loc['a'])
    print(res.loc['b':'d'])
===========================================================
   PassengerId     Sex   Age  Survived
a            4  female  35.0         1
b            5    male  35.0         0
c            6    male   NaN         0
d            7    male  54.0         0
================================
PassengerId         4
Sex            female
Age                35
Survived            1
Name: a, dtype: object
================================
   PassengerId   Sex   Age  Survived
b            5  male  35.0         0
c            6  male   NaN         0
d            7  male  54.0         0
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章