当前位置：首页 > 文章列表 > 文章 > python教程 > PandasDataFrame指定行应用函数技巧

PandasDataFrame指定行应用函数技巧

2026-01-28 14:24:37 0浏览收藏

“纵有疾风来，人生不言弃”，这句话送给正在学习文章的朋友们，也希望在阅读本文《Pandas DataFrame 指定索引行应用函数方法》后，能够真的帮助到大家。我也会在后续的文章中，陆续更新文章相关的技术文章，有好的建议欢迎大家在评论留言，非常感谢！

如何在 Pandas DataFrame 中仅对指定索引行应用自定义向量化函数

本文介绍如何高效地将一个返回多值的 Python 函数（如 `computeLeft`）仅应用于 DataFrame 的特定行索引，并自动填充其余位置为 NaN，避免全量计算，兼顾可读性与性能。

在实际数据处理中，我们常需对 DataFrame 的部分行（而非全部）执行复杂计算，并将结果写入多个列。例如，函数 computeLeft(i) 接收索引 i，返回长度为 4 的 NumPy 数组 [2*i, 3*i, 4*i, 5*i]，期望将其结果分别填入 ["val1","val2","val3","val4"] 四列中，但仅针对指定索引（如 [2, 5, 7, 8, 10]），其余行保持为 NaN。

直接使用 np.vectorize 全量计算（如 df[results] = np.vectorize(...)(range(len(df)))）虽可行，但效率低且不满足“选择性执行”需求。正确做法分三步：

预分配目标列并初始化为 NaN：确保所有行都有占位，未计算行自然为空；
构造索引列表：明确要计算的行索引（注意：是 DataFrame 的 index 标签，非 .iloc 位置序号）；
用 df.loc[indices, columns] 定向赋值：结合向量化函数，精准写入。

以下是完整、健壮的实现代码：

import numpy as np
import pandas as pd

def computeLeft(i):
    return np.array([i * 2, i * 3, i * 4, i * 5])

# 向量化（支持输出为 (4,) 形状数组）
computeLeftVec = np.vectorize(computeLeft, signature="()->(4)")

# 假设 df 已存在，例如：
# df = pd.DataFrame(index=range(15), data={"x": range(15)})

results = ["val1", "val2", "val3", "val4"]
df[results] = np.nan  # 预分配，全部设为 NaN

indices_to_change = [2, 5, 7, 8, 10]
# ✅ 关键：loc 使用 index 标签匹配，要求 indices_to_change 中的值必须存在于 df.index 中
df.loc[indices_to_change, results] = computeLeftVec(indices_to_change)

⚠️ 重要注意事项：

df.loc[indices_to_change, ...] 依赖的是 DataFrame 的 index 标签（即行名），而非整数位置。若你的 DataFrame 索引不是默认 RangeIndex（如已重设为字符串或缺失部分整数），请先确认 indices_to_change 中的值确实在 df.index 中，否则会静默忽略或报错。可通过 assert set(indices_to_change).issubset(df.index) 验证。

若需按位置序号（iloc） 而非标签索引操作，请改用：

pos_indices = [2, 5, 7, 8, 10]  # 行位置（0-based）
df.iloc[pos_indices, df.columns.get_indexer(results)] = computeLeftVec(pos_indices)

性能优化建议：对于纯数值运算（如本例），直接用 NumPy 广播替代 np.vectorize 更快：

idx_arr = np.array(indices_to_change)
values = np.column_stack([idx_arr * 2, idx_arr * 3, idx_arr * 4, idx_arr * 5])
df.loc[indices_to_change, results] = values

该方法简洁、语义清晰，充分利用了 Pandas 的索引对齐特性，是生产环境中推荐的选择性函数应用模式。

好了，本文到此结束，带大家了解了《PandasDataFrame指定行应用函数技巧》，希望本文对你有所帮助！关注golang学习网公众号，给大家分享更多文章知识！