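Continuously updated
1. filter
(1) Filtering on a single condition
data = df.filter(df['age'] == 20)
or
data = df.filter('age = 20')
(2) Filtering on multiple conditions
Wrap each condition in parentheses and combine them with | (or) or & (and):
data = df.filter((df['age'] == 20) | (df['gender'] == 'male'))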
1. pyspark version: 2.3.0
2. Official documentation: reduce(f) Reduces the elements of this RDD using the specified