如何在熊貓中獲取數據幀的列切片 - How to take column-slices of dataframe in pandas

原創

2021-10-08 09:16

問題：

I load some machine learning data from a CSV file.我從 CSV 文件加載了一些機器學習數據。 The first 2 columns are observations and the remaining columns are features.前兩列是觀測值，其餘列是特徵。

Currently, I do the following:目前，我執行以下操作：

data = pandas.read_csv('mydata.csv')

which gives something like:這給出了類似的東西：

data = pandas.DataFrame(np.random.rand(10,5), columns = list('abcde'))

I'd like to slice this dataframe in two dataframes: one containing the columns a and b and one containing the columns c , d and e .我想將此數據幀分成兩個數據幀：一個包含列a和b ，另一個包含列c 、 d和e 。

It is not possible to write something like不可能寫出類似的東西

observations = data[:'c']
features = data['c':]

I'm not sure what the best method is.我不確定最好的方法是什麼。 Do I need a pd.Panel ?我需要一個pd.Panel嗎？

By the way, I find dataframe indexing pretty inconsistent: data['a'] is permitted, but data[0] is not.順便說一下，我發現數據幀索引非常不一致：允許使用data['a'] ，但不允許使用data[0] 。 On the other side, data['a':] is not permitted but data[0:] is.另一方面，不允許使用data['a':]但允許使用data[0:] 。 Is there a practical reason for this?這有實際的原因嗎？ This is really confusing if columns are indexed by Int, given that data[0] != data[0:1]考慮到data[0] != data[0:1] ，如果列由 Int 索引，這真的很令人困惑

解決方案：

參考一： https://en.stackoom.com/question/ikgT
參考二： https://stackoom.com/question/ikgT

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

如何在熊貓中獲取數據幀的列切片 - How to take column-slices of dataframe in pandas

問題：

解決方案：

Window 安裝 Python 失敗 0x80070643，發生嚴重錯誤

如何在運行時更改約束優先級 - How can I change constraints priority in run time

如何爲地圖創建自己的比較器？ - How can I create my own comparator for a map?

SQL Server 中的表和索引大小 - Table and Index size in SQL Server

How to post an array of complex objects with JSON, jQuery to ASP.NET MVC Controller?

在 Java 中創建地圖 - Create Map in Java

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結