pandas: Select rows and columns with [] indexing

Modified: 2023-07-25 | Tags: Python, pandas

With pandas.DataFrame and pandas.Series, you can select and extract rows, columns, and individual elements by indexing with []. What you get back depends on the type of the value specified in [].

Select columns of pandas.DataFrame
Select rows of pandas.DataFrame
Get element values of pandas.Series
Get element values of pandas.DataFrame
Notes on integer row and column names

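at, iat, loc, and iloc let you state the selection more explicitly. With them you can also slice columns, or select rows by row name, by row number, or by a list of either.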

Related article: pandas: Get and set values at arbitrary positions with at, iat, loc, iloc
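As a minimal sketch of how these accessors differ from plain [], the example below uses a small DataFrame that is assumed here for illustration only (it is a reduced version of the sample data used in this article). at and loc take labels, iat and iloc take integer positions.

```python
import pandas as pd

# Small illustrative DataFrame (an assumption for this sketch).
df = pd.DataFrame({'col_0': ['00', '10', '20'],
                   'col_1': ['01', '11', '21'],
                   'col_2': ['02', '12', '22']},
                  index=['row_0', 'row_1', 'row_2'])

# at / iat: a single element, by label or by integer position.
value_by_label = df.at['row_1', 'col_2']
value_by_position = df.iat[1, 2]
print(value_by_label, value_by_position)
# 12 12

# loc: label-based; a label slice includes its end label.
cols_by_label = df.loc[:, 'col_0':'col_1']
print(cols_by_label.columns.tolist())
# ['col_0', 'col_1']

# iloc: position-based; here rows are picked by a list of positions.
rows_by_position = df.iloc[[0, 2]]
print(rows_by_position.index.tolist())
# ['row_0', 'row_2']
```

Neither a column slice nor a list of row positions can be expressed with plain [] on a DataFrame, which is why these accessors are worth knowing.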

To assign a column to the index, use set_index().

Related article: pandas: Assign a column of a DataFrame to the index (row names) with set_index
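A short sketch of set_index() follows. The DataFrame and its column names (name, age) are hypothetical and exist only for this illustration.

```python
import pandas as pd

# Hypothetical data: the column names are assumptions for this sketch.
df_plain = pd.DataFrame({'name': ['Alice', 'Bob'], 'age': [24, 42]})

# set_index() returns a new DataFrame by default; df_plain is unchanged.
df_named = df_plain.set_index('name')
print(df_named.index.tolist())
# ['Alice', 'Bob']
print(df_named.columns.tolist())
# ['age']
```

Once the column has become the index, its values can be used as row names in label-based selection.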

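The sample code in this article uses the pandas version shown below. Note that behavior may differ between versions. The following pandas.DataFrame is used as the example throughout.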

import pandas as pd

print(pd.__version__)
# 2.0.3

df = pd.DataFrame({'col_0': ['00', '10', '20', '30', '40'],
                   'col_1': ['01', '11', '21', '31', '41'],
                   'col_2': ['02', '12', '22', '32', '42'],
                   'col_3': ['03', '13', '23', '33', '43']},
                  index=['row_0', 'row_1', 'row_2', 'row_3', 'row_4'])
print(df)
#       col_0 col_1 col_2 col_3
# row_0    00    01    02    03
# row_1    10    11    12    13
# row_2    20    21    22    23
# row_3    30    31    32    33
# row_4    40    41    42    43


`[column name]`: Get a single column as a pandas.Series
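A minimal sketch of single-column selection, rebuilding the sample DataFrame from the start of the article so the block runs on its own:

```python
import pandas as pd

# Same sample data as the DataFrame defined earlier in the article.
df = pd.DataFrame({'col_0': ['00', '10', '20', '30', '40'],
                   'col_1': ['01', '11', '21', '31', '41'],
                   'col_2': ['02', '12', '22', '32', '42'],
                   'col_3': ['03', '13', '23', '33', '43']},
                  index=['row_0', 'row_1', 'row_2', 'row_3', 'row_4'])

# A single column name returns that column as a Series.
col = df['col_2']
print(type(col).__name__)
# Series
print(col.tolist())
# ['02', '12', '22', '32', '42']
print(col.name)
# col_2
```

The resulting Series keeps the row names of the DataFrame as its index and the column name as its name.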

`[list of column names]`: Get one or more columns as a pandas.DataFrame
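The following sketch, again rebuilding the article's sample DataFrame, shows that a list of column names always yields a DataFrame, even when the list holds only one name:

```python
import pandas as pd

# Same sample data as the DataFrame defined earlier in the article.
df = pd.DataFrame({'col_0': ['00', '10', '20', '30', '40'],
                   'col_1': ['01', '11', '21', '31', '41'],
                   'col_2': ['02', '12', '22', '32', '42'],
                   'col_3': ['03', '13', '23', '33', '43']},
                  index=['row_0', 'row_1', 'row_2', 'row_3', 'row_4'])

# Several columns: the result follows the order given in the list.
two_cols = df[['col_2', 'col_0']]
print(two_cols.columns.tolist())
# ['col_2', 'col_0']

# A one-element list still returns a DataFrame, not a Series.
one_col = df[['col_1']]
print(type(one_col).__name__)
# DataFrame
print(one_col.shape)
# (5, 1)
```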

`[slice of row names or row numbers]`: Get one or more rows as a pandas.DataFrame
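A sketch of row selection by slice on the article's sample DataFrame. A slice of row numbers excludes its stop position, as with Python lists, while a slice of row names includes its end label. A bare row name without a slice is looked up as a column name instead, so it raises KeyError here.

```python
import pandas as pd

# Same sample data as the DataFrame defined earlier in the article.
df = pd.DataFrame({'col_0': ['00', '10', '20', '30', '40'],
                   'col_1': ['01', '11', '21', '31', '41'],
                   'col_2': ['02', '12', '22', '32', '42'],
                   'col_3': ['03', '13', '23', '33', '43']},
                  index=['row_0', 'row_1', 'row_2', 'row_3', 'row_4'])

# Slice of row numbers: the stop position (4) is excluded.
by_position = df[1:4]
print(by_position.index.tolist())
# ['row_1', 'row_2', 'row_3']

# Slice of row names: the end label ('row_3') is included.
by_label = df['row_1':'row_3']
print(by_label.index.tolist())
# ['row_1', 'row_2', 'row_3']

# A single row still needs a slice and comes back as a one-row DataFrame.
single_row = df[1:2]
print(single_row.shape)
# (1, 4)

# A bare row name is treated as a column name and is not found.
try:
    df['row_1']
    bare_label_failed = False
except KeyError:
    bare_label_failed = True
print(bare_label_failed)
# True
```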

`[list or Series of bool]`: Get the rows that are True as a pandas.DataFrame
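A sketch of boolean row selection on the article's sample DataFrame. The mask built with isin() is just one convenient way to obtain a bool Series and is an assumption of this example; any bool Series with a matching index would do.

```python
import pandas as pd

# Same sample data as the DataFrame defined earlier in the article.
df = pd.DataFrame({'col_0': ['00', '10', '20', '30', '40'],
                   'col_1': ['01', '11', '21', '31', '41'],
                   'col_2': ['02', '12', '22', '32', '42'],
                   'col_3': ['03', '13', '23', '33', '43']},
                  index=['row_0', 'row_1', 'row_2', 'row_3', 'row_4'])

# A list of bool with one entry per row keeps the rows marked True.
mask_list = [True, False, True, False, True]
picked_by_list = df[mask_list]
print(picked_by_list.index.tolist())
# ['row_0', 'row_2', 'row_4']

# A bool Series works the same way; here it comes from a condition.
mask_series = df['col_0'].isin(['10', '30'])
picked_by_series = df[mask_series]
print(picked_by_series.index.tolist())
# ['row_1', 'row_3']
```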

`[label or number]`: Get the value of a single element in its own type
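A sketch of single-element access on a Series taken from the article's sample DataFrame. The label form is shown with []. For the positional form this sketch uses iloc instead, because treating an integer inside [] as a position on a Series with non-integer labels was deprecated in pandas releases after the 2.0.3 used in this article.

```python
import pandas as pd

# Same sample data as the DataFrame defined earlier in the article.
df = pd.DataFrame({'col_0': ['00', '10', '20', '30', '40'],
                   'col_1': ['01', '11', '21', '31', '41'],
                   'col_2': ['02', '12', '22', '32', '42'],
                   'col_3': ['03', '13', '23', '33', '43']},
                  index=['row_0', 'row_1', 'row_2', 'row_3', 'row_4'])

s = df['col_0']

# A single label returns the element itself, not a Series.
value = s['row_2']
print(value)
# 20
print(type(value).__name__)
# str

# Version-robust positional access to the same element.
same_value = s.iloc[2]
print(same_value)
# 20
```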

`[list of labels or numbers]`: Get one or more element values as a pandas.Series
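A sketch of list-based selection on a Series from the article's sample DataFrame. Labels are passed to [] directly; the list of positions goes through iloc here, an assumption made so the example does not rely on the integer-as-position behavior of [] that later pandas releases deprecated.

```python
import pandas as pd

# Same sample data as the DataFrame defined earlier in the article.
df = pd.DataFrame({'col_0': ['00', '10', '20', '30', '40'],
                   'col_1': ['01', '11', '21', '31', '41'],
                   'col_2': ['02', '12', '22', '32', '42'],
                   'col_3': ['03', '13', '23', '33', '43']},
                  index=['row_0', 'row_1', 'row_2', 'row_3', 'row_4'])

s = df['col_0']

# A list of labels returns a Series with those elements.
picked = s[['row_1', 'row_3']]
print(picked.tolist())
# ['10', '30']

# A one-element list still returns a Series, not a scalar.
one = s[['row_1']]
print(type(one).__name__)
# Series

# Version-robust selection by a list of positions.
by_position = s.iloc[[1, 3]]
print(by_position.index.tolist())
# ['row_1', 'row_3']
```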

`[slice of labels or numbers]`: Get one or more element values as a pandas.Series
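A sketch of slicing a Series taken from the article's sample DataFrame. As with DataFrame rows, a slice of numbers excludes its stop position while a slice of labels includes its end label.

```python
import pandas as pd

# Same sample data as the DataFrame defined earlier in the article.
df = pd.DataFrame({'col_0': ['00', '10', '20', '30', '40'],
                   'col_1': ['01', '11', '21', '31', '41'],
                   'col_2': ['02', '12', '22', '32', '42'],
                   'col_3': ['03', '13', '23', '33', '43']},
                  index=['row_0', 'row_1', 'row_2', 'row_3', 'row_4'])

s = df['col_0']

# Slice of numbers: the stop position (3) is excluded.
by_position = s[1:3]
print(by_position.tolist())
# ['10', '20']

# Slice of labels: the end label ('row_3') is included.
by_label = s['row_1':'row_3']
print(by_label.tolist())
# ['10', '20', '30']

# A slice that selects one element still returns a Series.
single = s[1:2]
print(type(single).__name__)
# Series
```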

`[list or Series of bool]`: Get the elements that are True as a pandas.Series
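A sketch of boolean selection on a Series from the article's sample DataFrame. The isin() condition is only one assumed way to build the bool Series; any condition that produces one works the same.

```python
import pandas as pd

# Same sample data as the DataFrame defined earlier in the article.
df = pd.DataFrame({'col_0': ['00', '10', '20', '30', '40'],
                   'col_1': ['01', '11', '21', '31', '41'],
                   'col_2': ['02', '12', '22', '32', '42'],
                   'col_3': ['03', '13', '23', '33', '43']},
                  index=['row_0', 'row_1', 'row_2', 'row_3', 'row_4'])

s = df['col_0']

# A list of bool with one entry per element keeps those marked True.
picked_by_list = s[[True, False, True, False, True]]
print(picked_by_list.tolist())
# ['00', '20', '40']

# A bool Series built from a condition on the Series itself.
mask = s.isin(['10', '40'])
picked_by_series = s[mask]
print(picked_by_series.index.tolist())
# ['row_1', 'row_4']
```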