Python pandas（DataFrame）學習筆記5

exam_data  = {'attempts': [1, 3, 2, 3, 2, 3, 1, 1, 2, 1],
              'name': ['Anastasia', 'Dima', 'Katherine', 'James', 'Emily', 'Michael', 'Matthew', 'Laura', 'Kevin', 'Jonas'],
              'qualify': ['yes', 'no', 'yes', 'no', 'no', 'yes', 'yes', 'no', 'no', 'yes'],
              'score': [12.5, 9, 16.5, np.nan, 9, 20, 14.5, np.nan, 8, 19]}    
labels = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j']

import pandas as pd
import numpy as np

df=pd.DataFrame(data=exam_data,index=labels)
print(df)

1.選擇滿足attempt<=2和score>=15條件的數據

print(df.loc[(df['attempts']<=2)&(df['score']>=15)])

2.計算attemps總和

sum_of_attempts=0
for i in df['attempts']:
    sum_of_attempts+=i

# print(" the sum of the examination attempts by the students is：""%d"%sum_of_attempts)
print(sum_of_attempts)  # the answer is 19

3.計算score的平均值

# Method1：推薦使用方法1

sum_of_score=0
j=0
for i in df['score']:
    if pd.isnull(i)==False:
        sum_of_score+=i
        j+=1
    else:
        sum_of_score=sum_of_score
        j=j

mean_of_score=sum_of_score/j
print(mean_of_score)  # the answer is 13.5625

# Methpod2：

df=df.fillna(0)  #把score=NaN修改爲score=0
sum_of_score=0
j=0
for i in df['score']:
    if i!=0:
        sum_of_score+=i
        j+=1
    else:
        sum_of_score=sum_of_score
        j=j

mean_of_score=sum_of_score/j
print(sum_of_score)
print(j)
print(mean_of_score)

4.對score進行排序

print(df.sort_values(axis=0,ascending=False,by=['score']))

5.輸出列名

print(df.columns.tolist())   # 注意 df.columns的輸出形式

6.添加行和刪除行

print(df)
df1=pd.DataFrame({ "attempts": 1,"name": "Suresh",  "qualify": "yes", "score": 15.5} ,index=list("k"))
gd=[df,df1]
result1=pd.concat(gd)
print(result1)
result2=result1.drop('k',axis=0)   # axis=0表示行，axis=1表示列。用來刪除列的另一種方法：del df['column,name']
print(result2)

7.將列數據改爲bool值

df['qualify']=df['qualify']=='yes'
print(df)

8.修改列值

#query查詢
new_data = df.query("name=='James'")
new_data.name = 'Suresh'
df.loc[new_data.index] = new_data
print(df)

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

Python pandas（DataFrame）學習筆記5

.NET有哪些好用的定時任務調度框架

Python 將PDF轉爲PDF/A、PDF/X，以及PDF/A轉回PDF

elk3

Kafka存儲機制

aws語音呼叫調用，告警電話

深度學習框架火焰圖pprof和CUDA Nsys配置指南

爬蟲兩種繞過5s盾的方法

【轉】[C#] WebAPI 防止併發調用二（冥等性）

【轉】[SQL Server]關掉 SSMS 的 IntelliSense

號稱能打敗MLP的KAN到底行不行？數學核心原理全面解析

Latex另起一頁的命令

latex中的重音命令和特殊字母（部分）

Python pandas（DataFrame）學習筆記5

python 入門學習筆記14

Python pandas（DataFrame）學習筆記4

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結