快捷導(dǎo)航

numpy判斷數(shù)值類型、過濾出數(shù)值型數(shù)據(jù)的方法

更新時(shí)間：2018年06月09日 09:40:09 作者：胖大海瘦西湖

今天小編就為大家分享一篇numpy判斷數(shù)值類型、過濾出數(shù)值型數(shù)據(jù)的方法，具有很好的參考價(jià)值，希望對(duì)大家有所幫助。一起跟隨小編過來看看吧

numpy是無法直接判斷出由數(shù)值與字符混合組成的數(shù)組中的數(shù)值型數(shù)據(jù)的，因?yàn)橛蓴?shù)值類型和字符類型組成的numpy數(shù)組已經(jīng)不是數(shù)值類型的數(shù)組了，而是dtype='<U11'。

1、math.isnan也不行，它只能判斷float("nan")：

>>> import math 
>>> math.isnan(1) 
False 
>>> math.isnan('a') 
Traceback (most recent call last): 
 File "<stdin>", line 1, in <module> 
TypeError: a float is required 
>>> math.isnan(float("nan")) 
True 
>>>

2、np.isnan不可用，因?yàn)閚p.isnan只能用于數(shù)值型與np.nan組成的numpy數(shù)組：

>>> import numpy as np 
>>> test1=np.array([1,2,'aa',3]) 
>>> np.isnan(test1) 
Traceback (most recent call last): 
 File "<stdin>", line 1, in <module> 
TypeError: ufunc 'isnan' not supported for the input types, and the inputs could 
 not be safely coerced to any supported types according to the casting rule ''sa 
fe'' 
>>> test2=np.array([1,2,np.nan,3]) 
>>> np.isnan(test2) 
array([False, False, True, False], dtype=bool) 
>>>

解決辦法：

方法1：將numpy數(shù)組轉(zhuǎn)換為python的list，然后通過filter過濾出數(shù)值型的值，再轉(zhuǎn)為numpy, 但是，有一個(gè)嚴(yán)重的問題，無法保證原來的索引

>>> import numpy as np 
>>> test1=np.array([1,2,'aa',3]) 
>>> list1=list(test1) 
>>> def filter_fun(x): 
... try: 
...  return isinstance(float(x),(float)) 
... except: 
...  return False 
... 
>>> list(filter(filter_fun,list1)) 
['1', '2', '3'] 
>>> np.array(filter(filter_fun,list1)) 
array(<filter object at 0x0339CA30>, dtype=object) 
>>> np.array(list(filter(filter_fun,list1))) 
array(['1', '2', '3'], 
 dtype='<U1') 
>>> np.array([float(x) for x in filter(filter_fun,list1)]) 
array([ 1., 2., 3.]) 
>>>

方法2：利用map制作bool數(shù)組，然后再過濾數(shù)據(jù)和索引：

>>> import numpy as np
>>> test1=np.array([1,2,'aa',3])
>>> list1=list(test1)
>>> def filter_fun(x):
... try:
...  return isinstance(float(x),(float))
... except:
...  return False
...
>>> import pandas as pd
>>> test=pd.DataFrame(test1,index=[1,2,3,4])
>>> test
 0
1 1
2 2
3 aa
4 3
>>> index=test.index
>>> index
Int64Index([1, 2, 3, 4], dtype='int64')
>>> bool_index=map(filter_fun,list1)
>>> bool_index=list(bool_index) #bool_index這樣的迭代結(jié)果只能list一次，一次再list時(shí)會(huì)是空，所以保存一下list的結(jié)果
>>> bool_index
[True, True, False, True]
>>> new_data=test1[np.array(bool_index)]
>>> new_data
array(['1', '2', '3'],
 dtype='<U11')
>>> new_index=index[np.array(bool_index)]
>>> new_index
Int64Index([1, 2, 4], dtype='int64')
>>> test2=pd.DataFrame(new_data,index=new_index)
>>> test2
 0
1 1
2 2
4 3
>>>

以上這篇numpy判斷數(shù)值類型、過濾出數(shù)值型數(shù)據(jù)的方法就是小編分享給大家的全部?jī)?nèi)容了，希望能給大家一個(gè)參考，也希望大家多多支持腳本之家。

您可能感興趣的文章: