pandas之重塑和軸向旋轉

原創

reb0rn初代

2020-06-22 00:32

重塑和軸向旋轉用於重新排列表格型數據的基礎運算。

對於DataFrame，主要功能有：

（1）stack：將數據的列“旋轉”爲行（2）unstack：將數據的行“旋轉”爲列

例1：（其中行列索引均爲字符串）

data = DataFrame(np.arange(6).reshape((2,3)),index=pd.Index(['O','C'],name='state'),columns=pd.Index(['one','two','three'],name='number'))
data
Out[3]: 
number  one  two  three
state                  
O         0    1      2
C         3    4      5

result=data.stack()     #使用該數據的stack方法即可將列轉換爲行，得到一個Series
result
Out[5]: 
state  number
O      one       0
       two       1
       three     2
C      one       3
       two       4
       three     5
dtype: int32

result.unstack()       #對於一個層次化索引的Series，你可以用unstack將其重排爲一個DataFrame
Out[6]: 
number  one  two  three
state                  
O         0    1      2
C         3    4      5

result.unstack(0)      #默認情況下，操作的是最內層（stack也是如此）。傳入分層級的編號或名稱即可對其他級別進行unstack操作
Out[7]: 
state   O  C
number      
one     0  3
two     1  4
three   2  5

result.unstack('state')
Out[8]: 
state   O  C
number      
one     0  3
two     1  4
three   2  5

（3）如果不是所有的級別值都能在分組中找到的話，則unstack操作可能會引入缺失數據

s1 = Series([0,1,2,3],index=['a','b','c','d'])
s2 = Series([4,5,6],index=['c','d','e'])
data2 = pd.concat([s1,s2],keys=['one','two'])
data2.unstack()
Out[9]: 
       a    b    c    d    e
one  0.0  1.0  2.0  3.0  NaN
two  NaN  NaN  4.0  5.0  6.0

data2.unstack().stack()  #stack默認會濾除缺失數據，因此該運算是可逆的
Out[10]: 
one  a    0.0
     b    1.0
     c    2.0
     d    3.0
two  c    4.0
     d    5.0
     e    6.0
dtype: float64

data2.unstack().stack(dropna=False)
Out[11]: 
one  a    0.0
     b    1.0
     c    2.0
     d    3.0
     e    NaN
two  a    NaN
     b    NaN
     c    4.0
     d    5.0
     e    6.0
dtype: float64

（4）在對DataFrame進行unstack操作時，作爲旋轉軸的級別將會成爲結果中的最低級別：

df = DataFrame({'left':result,'right':result+5},columns=pd.Index(['left','right'],name='side'))
df   
Out[13]: 
side          left  right
state number             
O     one        0      5
      two        1      6
      three      2      7
C     one        3      8
      two        4      9
      three      5     10

df = DataFrame({'left':result,'right':result+5},columns=pd.Index(['left','right'],name='side'))
df
Out[13]: 
side          left  right
state number             
O     one        0      5
      two        1      6
      three      2      7
C     one        3      8
      two        4      9
      three      5     10

df.unstack('state')
Out[14]: 
side   left    right    
state     O  C     O   C
number                  
one       0  3     5   8
two       1  4     6   9
three     2  5     7  10

df.unstack('state').stack('side')
Out[15]: 
state          C  O
number side        
one    left    3  0
       right   8  5
two    left    4  1
       right   9  6
three  left    5  2
       right  10  7

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

pandas之重塑和軸向旋轉

重塑和軸向旋轉用於重新排列表格型數據的基礎運算。

MySQL 核心模塊揭祕 | 18 期 | 鎖在內存里長什麼樣*

使用perf工具生成火焰圖

大齡程序員思考

響應式界面控件DevExtreme * 更強的數據分析和可視化功能

HttpSecurity 是如何組裝過濾器鏈的

數說海南——近6年海南各市縣人口簡單看

長序列中Transformers的高級注意力機制總結

WebStorm 創建 Vue 項目

django2+django-celery-beat+celery4實現任務的動態添加等管理、多臺機器部署

Python數據分析之NumPy數組的計算（通用函數、排序等）

各種學習網址總結-程序猿值得擁有持更

統計作圖函數（和matplotlib、pandas）

pandas之重塑和軸向旋轉

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結

pandas之重塑和軸向旋轉

重塑和軸向旋轉 用於重新排列表格型數據的基礎運算。

重塑和軸向旋轉用於重新排列表格型數據的基礎運算。