NumPy: Count values in an array with conditions

Modified: 2024-02-03 | Tags: Python, NumPy

This article explains how to count values in a NumPy array (ndarray) that meet certain conditions.

Contents

Count values in an array with a condition: np.count_nonzero()
Count values row-wise or column-wise
Check if at least one value meets the condition: np.any()
Check if all values meet the condition: np.all()
Multiple conditions
Count NaN and non-NaN values
Count infinity (inf)

The size of the array (total number of elements) can be obtained with the size attribute.

NumPy: Get the number of dimensions, shape, and size of ndarray
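
As a minimal sketch, the size attribute can be combined with a count to get the fraction of elements that meet a condition. The array and the condition here are only illustrative.

```python
import numpy as np

a = np.arange(12).reshape((3, 4))

# Total number of elements
print(a.size)
# 12

# Fraction of elements that satisfy the condition
count = np.count_nonzero(a < 4)
print(count / a.size)
# 0.3333333333333333
```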


The NumPy version used in this article is as follows. Note that functionality may vary between versions.

import numpy as np

print(np.__version__)
# 1.26.1

source: numpy_count.py

Count values in an array with a condition: `np.count_nonzero()`

np.count_nonzero() counts the number of non-zero values in an array.

numpy.count_nonzero — NumPy v1.26 Manual

a = np.arange(12).reshape((3, 4))
print(a)
# [[ 0  1  2  3]
#  [ 4  5  6  7]
#  [ 8  9 10 11]]

print(np.count_nonzero(a))
# 11

source: numpy_count.py

Comparing an ndarray with a scalar using comparison operators such as <, ==, or != is performed element-wise and produces a Boolean ndarray.

print(a < 4)
# [[ True  True  True  True]
#  [False False False False]
#  [False False False False]]

print(a % 2 == 0)
# [[ True False  True False]
#  [ True False  True False]
#  [ True False  True False]]

source: numpy_count.py

Since True is treated as 1 and False as 0, np.count_nonzero() can directly count the number of True values, that is, the number of elements that meet the condition.

print(np.count_nonzero(a < 4))
# 4

print(np.count_nonzero(a % 2 == 0))
# 6

source: numpy_count.py

np.sum() returns the same result, but np.count_nonzero() is generally faster on Boolean arrays.

print(np.sum(a < 4))
# 4

print(np.sum(a % 2 == 0))
# 6

source: numpy_count.py
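
The speed difference can be checked with a rough timing sketch like the one below. The array size, the random data, and the repetition count are arbitrary assumptions, and the measured times depend on the machine and the NumPy version, so no timing output is shown.

```python
import timeit

import numpy as np

# Hypothetical large array for timing; the size and values are arbitrary
rng = np.random.default_rng(0)
big = rng.integers(0, 100, size=1_000_000)
mask = big < 50

# Both functions return the same count
n_count = np.count_nonzero(mask)
n_sum = np.sum(mask)
print(n_count == n_sum)
# True

# Time 100 calls of each function
t_count = timeit.timeit(lambda: np.count_nonzero(mask), number=100)
t_sum = timeit.timeit(lambda: np.sum(mask), number=100)
print(f'count_nonzero: {t_count:.4f} s, sum: {t_sum:.4f} s')
```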

Count values row-wise or column-wise

For multi-dimensional arrays, np.count_nonzero() can process each axis by specifying the axis argument. The default, axis=None, counts non-zero values across the entire array.

For a two-dimensional array, setting axis=0 counts non-zero values column-wise, and axis=1 counts them row-wise.

a = np.arange(12).reshape((3, 4))
print(a)
# [[ 0  1  2  3]
#  [ 4  5  6  7]
#  [ 8  9 10 11]]

print(np.count_nonzero(a < 4))
# 4

print(np.count_nonzero(a < 4, axis=0))
# [1 1 1 1]

print(np.count_nonzero(a < 4, axis=1))
# [4 0 0]

source: numpy_count.py

Setting the keepdims argument to True makes the result have the same number of dimensions as the original array.

NumPy: Meaning of the axis parameter (0, 1, -1)

print(np.count_nonzero(a < 4, keepdims=True))
# [[4]]

print(np.count_nonzero(a < 4, axis=0, keepdims=True))
# [[1 1 1 1]]

print(np.count_nonzero(a < 4, axis=1, keepdims=True))
# [[4]
#  [0]
#  [0]]

source: numpy_count.py

Note that the axis argument was added to np.count_nonzero() in NumPy 1.12 and keepdims in 1.19, whereas np.sum() has supported both since 1.7. If you need axis on a version older than 1.12, or keepdims on a version older than 1.19, consider using np.sum() instead.
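
As a sketch of that fallback, np.sum() accepts the same axis and keepdims arguments on a Boolean array and gives the same counts as the np.count_nonzero() examples above.

```python
import numpy as np

a = np.arange(12).reshape((3, 4))

# Column-wise count of elements less than 4
print(np.sum(a < 4, axis=0))
# [1 1 1 1]

# Row-wise count, keeping the number of dimensions
print(np.sum(a < 4, axis=1, keepdims=True))
# [[4]
#  [0]
#  [0]]
```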

Check if at least one value meets the condition: `np.any()`

np.any() returns True if at least one element in the specified array is True; otherwise, it returns False.

numpy.any — NumPy v1.26 Manual

This function is useful for determining whether any element meets a specified condition.

a = np.arange(12).reshape((3, 4))
print(a)
# [[ 0  1  2  3]
#  [ 4  5  6  7]
#  [ 8  9 10 11]]

print(np.any(a < 4))
# True

print(np.any(a > 100))
# False

source: numpy_count.py

Like np.count_nonzero(), np.any() accepts the axis argument.

print(np.any(a < 4, axis=0))
# [ True  True  True  True]

print(np.any(a < 4, axis=1))
# [ True False False]

source: numpy_count.py

np.any() also supports the keepdims argument.
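
A minimal sketch of keepdims with np.any(), using the same array as above:

```python
import numpy as np

a = np.arange(12).reshape((3, 4))

# The result stays two-dimensional, with one column per row of a
print(np.any(a < 4, axis=1, keepdims=True))
# [[ True]
#  [False]
#  [False]]

print(np.any(a < 4, axis=1, keepdims=True).shape)
# (3, 1)
```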

Check if all values meet the condition: `np.all()`

np.all() returns True if all elements in the specified array are True; otherwise, it returns False.

numpy.all — NumPy v1.26 Manual

This function is useful for determining whether all elements meet a specified condition.

a = np.arange(12).reshape((3, 4))
print(a)
# [[ 0  1  2  3]
#  [ 4  5  6  7]
#  [ 8  9 10 11]]

print(np.all(a < 4))
# False

print(np.all(a < 100))
# True

source: numpy_count.py

Like np.count_nonzero(), np.all() accepts the axis argument.

print(np.all(a < 4, axis=0))
# [False False False False]

print(np.all(a < 4, axis=1))
# [ True False False]

source: numpy_count.py

np.all() also supports the keepdims argument.
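
A minimal sketch of keepdims with np.all(), with and without axis:

```python
import numpy as np

a = np.arange(12).reshape((3, 4))

# Column-wise check, result keeps two dimensions
print(np.all(a < 4, axis=0, keepdims=True))
# [[False False False False]]

# Whole-array check, result is a 1x1 array instead of a scalar
print(np.all(a < 100, keepdims=True))
# [[ True]]
```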

Multiple conditions

To combine multiple conditions, enclose each conditional expression in parentheses () and connect them with & (AND) or | (OR). The negation operator ~ (NOT) can also be used.

a = np.arange(12).reshape((3, 4))
print(a)
# [[ 0  1  2  3]
#  [ 4  5  6  7]
#  [ 8  9 10 11]]

print((a < 4) | (a % 2 == 0))
# [[ True  True  True  True]
#  [ True False  True False]
#  [ True False  True False]]

print(np.count_nonzero((a < 4) | (a % 2 == 0)))
# 8

print(np.count_nonzero((a < 4) | (a % 2 == 0), axis=0))
# [3 1 3 1]

print(np.count_nonzero((a < 4) | (a % 2 == 0), axis=1))
# [4 2 2]

source: numpy_count.py
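
The negation ~ can be combined with the other operators in the same way. The following sketch counts the elements that are not less than 4 and are even; the condition itself is just an illustration.

```python
import numpy as np

a = np.arange(12).reshape((3, 4))

# NOT (a < 4) AND even: matches 4, 6, 8, 10
print(~(a < 4) & (a % 2 == 0))
# [[False False False False]
#  [ True False  True False]
#  [ True False  True False]]

print(np.count_nonzero(~(a < 4) & (a % 2 == 0)))
# 4
```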

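Note that using and or or instead of & or |, or omitting the parentheses, raises an error. The parentheses are required because & and | have higher precedence than comparison operators such as < and ==.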

How to fix "ValueError: The truth value ... is ambiguous" in NumPy, pandas
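
Both failure cases can be reproduced with a short sketch. The try/except wrapper is only there so that the example runs to completion.

```python
import numpy as np

a = np.arange(12).reshape((3, 4))

# Using "or" asks for the truth value of a whole array, which is ambiguous
try:
    (a < 4) or (a % 2 == 0)
except ValueError as e:
    print(type(e).__name__)
# ValueError

# Without parentheses, | binds tighter than < and ==, so the expression is
# parsed as the chained comparison a < (4 | a % 2) == 0, which also fails
try:
    a < 4 | a % 2 == 0
except ValueError as e:
    print(type(e).__name__)
# ValueError
```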

Count `NaN` and non-`NaN` values

For example, NaN can occur when reading a CSV file with missing data.

a_nan = np.genfromtxt('data/src/sample_nan.csv', delimiter=',')
print(a_nan)
# [[11. 12. nan 14.]
#  [21. nan nan 24.]
#  [31. 32. 33. 34.]]

source: numpy_count_nan.py

Since comparing NaN with NaN using == always returns False, use np.isnan() to detect NaN values before counting them.

numpy.isnan — NumPy v1.26 Manual

print(np.nan == np.nan)
# False

print(a_nan == np.nan)
# [[False False False False]
#  [False False False False]
#  [False False False False]]

print(np.isnan(a_nan))
# [[False False  True False]
#  [False  True  True False]
#  [False False False False]]

source: numpy_count_nan.py

Then, as in the previous examples, count the number of True values with np.count_nonzero() or np.sum().

print(np.count_nonzero(np.isnan(a_nan)))
# 3

print(np.count_nonzero(np.isnan(a_nan), axis=0))
# [0 1 2 0]

print(np.count_nonzero(np.isnan(a_nan), axis=1))
# [1 2 0]

source: numpy_count_nan.py

To count non-NaN values, negate the result of np.isnan() with ~ and count the True values in the same way.

print(~np.isnan(a_nan))
# [[ True  True False  True]
#  [ True False False  True]
#  [ True  True  True  True]]

source: numpy_count_nan.py
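
The counting step for non-NaN values might look like the following sketch. To keep it self-contained, a_nan is rebuilt inline with the same values as the CSV example instead of being read from the file.

```python
import numpy as np

# Same values as a_nan above, built inline so that no CSV file is needed
a_nan = np.array([[11, 12, np.nan, 14],
                  [21, np.nan, np.nan, 24],
                  [31, 32, 33, 34]])

print(np.count_nonzero(~np.isnan(a_nan)))
# 9

print(np.count_nonzero(~np.isnan(a_nan), axis=0))
# [3 2 1 3]

print(np.count_nonzero(~np.isnan(a_nan), axis=1))
# [3 2 4]

# Equivalent: total size minus the number of NaN values
print(a_nan.size - np.count_nonzero(np.isnan(a_nan)))
# 9
```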


Count infinity (`inf`)

To check if a value is infinite (inf), use the np.isinf() function, which returns True for both positive and negative infinity.

numpy.isinf — NumPy v1.26 Manual

a_inf = np.array([-np.inf, 0, np.inf])
print(a_inf)
# [-inf   0.  inf]

print(np.isinf(a_inf))
# [ True False  True]

source: numpy_count_inf.py

np.isposinf(), which returns True for positive infinity, and np.isneginf(), which returns True for negative infinity, are also provided.

print(np.isposinf(a_inf))
# [False False  True]

print(np.isneginf(a_inf))
# [ True False False]

source: numpy_count_inf.py

Since infinity can be compared with ==, you can also use == to check if it is positive or negative infinity.

print(a_inf == np.inf)
# [False False  True]

print(a_inf == -np.inf)
# [ True False False]

source: numpy_count_inf.py

Once the Boolean array is obtained, count the True values as in the previous examples.

print(np.count_nonzero(np.isinf(a_inf)))
# 2

print(np.count_nonzero(np.isposinf(a_inf)))
# 1

print(np.count_nonzero(np.isneginf(a_inf)))
# 1

source: numpy_count_inf.py

For operations with infinity (inf) in Python, refer to the following article.

Infinity (inf) in Python
