【課程2.17】連接與修補 concat、combine_first

連接 - 沿軸執行連接操作

1.連接：concat


s1 = pd.Series([1,2,3])
s2 = pd.Series([2,3,4])
s3 = pd.Series([1,2,3],index = ['a','c','h'])
s4 = pd.Series([2,3,4],index = ['b','e','d'])
print(pd.concat([s1,s2]))
print(pd.concat([s3,s4]).sort_index())
print('-----')
# 默認axis=0，行+行

print(pd.concat([s3,s4], axis=1))
print('-----')
# axis=1,列+列，成爲一個Dataframe
----------------------------------------------------------------------
0    1
1    2
2    3
0    2
1    3
2    4
dtype: int64
a    1
b    2
c    2
d    4
e    3
h    3
dtype: int64
-----
     0    1
a  1.0  NaN
b  NaN  2.0
c  2.0  NaN
d  NaN  4.0
e  NaN  3.0
h  3.0  NaN
-----

2.連接方式：join，join_axes


s5 = pd.Series([1,2,3],index = ['a','b','c'])
s6 = pd.Series([2,3,4],index = ['b','c','d'])
print(pd.concat([s5,s6], axis= 1))
print(pd.concat([s5,s6], axis= 1, join='inner'))
print(pd.concat([s5,s6], axis= 1, join_axes=[['a','b','d']]))
# join：{'inner'，'outer'}，默認爲“outer”。如何處理其他軸上的索引。outer爲聯合和inner爲交集。
# join_axes：指定聯合的index
----------------------------------------------------------------------
     0    1
a  1.0  NaN
b  2.0  2.0
c  3.0  3.0
d  NaN  4.0
   0  1
b  2  2
c  3  3
     0    1
a  1.0  NaN
b  2.0  2.0
d  NaN  4.0

3.覆蓋列名


sre = pd.concat([s5,s6], keys = ['one','two'])
print(sre,type(sre))
print(sre.index)
print('-----')
# keys：序列，默認值無。使用傳遞的鍵作爲最外層構建層次索引

sre = pd.concat([s5,s6], axis=1, keys = ['one','two'])
print(sre,type(sre))
# axis = 1, 覆蓋列名
----------------------------------------------------------------------
one  a    1
     b    2
     c    3
two  b    2
     c    3
     d    4
dtype: int64 <class 'pandas.core.series.Series'>
MultiIndex(levels=[['one', 'two'], ['a', 'b', 'c', 'd']],
           labels=[[0, 0, 0, 1, 1, 1], [0, 1, 2, 1, 2, 3]])
-----
   one  two
a  1.0  NaN
b  2.0  2.0
c  3.0  3.0
d  NaN  4.0 <class 'pandas.core.frame.DataFrame'>

4.修補 pd.combine_first()

df1 = pd.DataFrame([[np.nan, 3., 5.], [-4.6, np.nan, np.nan],[np.nan, 7., np.nan]])
df2 = pd.DataFrame([[-42.6, np.nan, -8.2], [-5., 1.6, 4]],index=[1, 2])
print(df1)
print(df2)
print(df1.combine_first(df2))
print('-----')
# 根據index，df1的空值被df2替代
# 如果df2的index多於df1，則更新到df1上，比如index=['a',1]

df1.update(df2)
print(df1)
# update，直接df2覆蓋df1，相同index位置
----------------------------------------------------------------------

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

Python數據分析實戰【第三章】2.17-Pandas連接與修補 concat、combine_first【python】

【課程2.17】連接與修補 concat、combine_first

1.連接：concat

2.連接方式：join，join_axes

3.覆蓋列名

4.修補 pd.combine_first()

Python數據分析實戰【第三章】2.8-時間模塊：datetime【python】

Python數據分析實戰【第三章】2.5-Pandas數據結構Dataframe：基本概念及創建【python】

Python數據分析實戰【第三章】3.8-Matplotlib面積圖、填圖、餅圖【python】

Python數據分析實戰【第三章】3.11-Matplotlib極座標圖【python】

Python數據分析實戰【第三章】3.10-Matplotlib散點圖、矩陣散點圖【python】

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結

Python數據分析實戰【第三章】2.17-Pandas連接與修補 concat、combine_first【python】

【課程2.17】 連接與修補 concat、combine_first

1.連接：concat

2.連接方式：join，join_axes

3.覆蓋列名

4.修補 pd.combine_first()

【課程2.17】連接與修補 concat、combine_first