pandas學習-task1

練習一、現有一份關於美劇《權力的遊戲》劇本的數據集,請解決以下問題:
在所有的數據中,一共出現了多少人物?
564個人物
方法一:直接統計

import pandas as pd

df = pd.read_csv(
    r'D:\Desktop\pandas\Learning\joyful-pandas-master\data\Game_of_Thrones_Script.csv')
result = df['Name'].nunique()
print(df)
print(result)


輸出爲:
564

方法二:根據表長統計:
import pandas as pd

df = pd.read_csv(
    r'D:\Desktop\pandas\Learning\joyful-pandas-master\data\Game_of_Thrones_Script.csv')
result = df['Name'].value_counts()
print(df)
print(result)

輸出爲:
tyrion lannister      1760
jon snow              1133
daenerys targaryen    1048
cersei lannister      1005
jaime lannister        945
                      ... 
robett glover            1
tribesmen of vale        1
bolton bannerman         1
watchman                 1
officer                  1
Name: Name, Length: 564, dtype: int64

2.以單元格計數(即簡單把一個單元格視作一句),誰說了最多的話?
import pandas as pd

df = pd.read_csv(
    r'D:\Desktop\pandas\Learning\joyful-pandas-master\data\Game_of_Thrones_Script.csv')
result = df['Name'].value_counts()
man = result.index[0]
print(df)
print(result)
print(man)
輸出結果:
tyrion lannister      1760
jon snow              1133
daenerys targaryen    1048
cersei lannister      1005
jaime lannister        945
                      ... 
lord bolton              1
officer                  1
eddision                 1
pyattpree                1
rodrik cassal            1
Name: Name, Length: 564, dtype: int64
tyrion lannister

3.以單詞計數,誰說了最多的單詞?
import pandas as pd

df = pd.read_csv(
    r'D:\Desktop\pandas\Learning\joyful-pandas-master\data\Game_of_Thrones_Script.csv')
print(df)
result1 = df.groupby('Sentence')['Name'].unique().apply(lambda x: x[0]).value_counts()
result1 = result1.index[0]
print(result1)

結果爲:tyrion lannister

練習二、現有一份關於科比的投籃數據集,請解決如下問題:
1.哪種action_type和combined_shot_type的組合是最多的?
import pandas as pd

df = pd.read_csv(
    r'D:\Desktop\pandas\Learning\joyful-pandas-master\data\Kobe_data.csv')
print(df)
result1 = pd.Series(df['action_type']).value_counts()
result1 = result1.index[0]
print(result1)

結果爲:Jump Shot
2. 在所有被記錄的game_id中,遭遇到最多的opponent是一個支?

import pandas as pd

df = pd.read_csv(
    r'D:\Desktop\pandas\Learning\joyful-pandas-master\data\Kobe_data.csv')
print(df)
result1 = df.groupby('game_id')['opponent'].unique().apply(lambda x: x[0]).value_counts()
result1 = result1.index[0]
print(result1)
結果爲:SAS

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章