PySpark: checking whether a column value is in a list with isin()

# Filter IS IN list values
li = ["OH", "CA", "DE"]
df.filter(df.state.isin(li)).show()

+--------------------+------------------+-----+------+
|                name|         languages|state|gender|
+--------------------+------------------+-----+------+
|    [James, , Smith]|[Java, Scala, C++]|   OH|     M|
| [Julia, , Williams]|      [CSharp, VB]|   OH|     F|
|[Mike, Mary, Will...|      [Python, VB]|   OH|     M|
+--------------------+------------------+-----+------+

# Filter NOT IS IN list values
# These show all records with NY (NY is not part of the list)
df.filter(~df.state.isin(li)).show()
df.filter(df.state.isin(li) == False).show()