如何用Python获取Linux文件系统下全部inode并进行相关计算？

阿华AIGC实验室

2026-5-15

一次性获取所有inode并处理的可行方案

当然有靠谱的方案！我之前做文件系统分析的时候也有过一模一样的需求，不想一次次单独查inode，下面分享几个实用的方法，覆盖命令行和脚本两种场景：

一、用`find`命令快速批量导出（推荐）

这是类Unix系统下最直接的方式，find可以遍历指定路径下的所有文件/目录，一次性输出inode相关信息，后续你可以用管道或者脚本处理这些结果。

比如要导出某个路径下所有inode的编号、文件路径和大小：

# 替换成你要扫描的路径，比如"/"表示整个系统（需sudo）
find /path/to/target -printf "%i %p %s\n"

%i：输出inode编号
%p：输出文件/目录的完整路径
%s：输出文件大小（单位：字节）

如果需要更多信息，比如修改时间，可以加%t，具体参数可以看man find里的-printf选项。导出后，你可以用awk、sed或者写个shell脚本逐行处理，比如筛选特定大小的inode：

find /path/to/target -printf "%i %p %s\n" | awk '$3 > 1048576' # 筛选大小超过1MB的文件

二、底层工具`debugfs`直接读取inode表（适合特殊场景）

如果你需要更底层的inode数据（比如已删除但未被覆盖的inode），可以用debugfs，不过这个需要root权限，且针对具体的磁盘分区操作：

先找到目标分区对应的设备名，比如用df -h查看挂载点对应的设备（比如/dev/sda2）
运行以下命令导出inode表：

sudo debugfs -R 'dump_inode_table /tmp/inode_table.txt' /dev/sda2

导出的/tmp/inode_table.txt里会包含该分区所有inode的详细元数据，不过格式比较原始，需要自己解析，适合做深度文件系统分析的场景。

三、用脚本批量收集（适合复杂逻辑处理）

如果需要在代码里直接处理inode信息，比如Python脚本，用os.walk遍历目录，一次性收集所有inode的统计信息：

import os

def gather_all_inodes(root_dir):
    inode_records = []
    # 遍历目录树
    for dirpath, _, filenames in os.walk(root_dir):
        # 处理当前目录的inode
        try:
            dir_stat = os.stat(dirpath)
            inode_records.append({
                "inode": dir_stat.st_ino,
                "path": dirpath,
                "size": dir_stat.st_size,
                "type": "directory"
            })
        except OSError as e:
            print(f"无法访问目录 {dirpath}: {e}")
            continue
        
        # 处理目录下的文件
        for fname in filenames:
            file_path = os.path.join(dirpath, fname)
            try:
                file_stat = os.stat(file_path)
                inode_records.append({
                    "inode": file_stat.st_ino,
                    "path": file_path,
                    "size": file_stat.st_size,
                    "type": "file"
                })
            except OSError as e:
                print(f"无法访问文件 {file_path}: {e}")
                continue
    return inode_records

# 调用示例
all_inodes = gather_all_inodes("/path/to/scan")
# 逐行处理收集到的inode数据
for record in all_inodes:
    print(f"Inode {record['inode']} | {record['type']} | {record['path']} | 大小: {record['size']}字节")

这个方法的优势是可以灵活加入自定义逻辑，比如过滤硬链接、统计inode使用情况等。