Unix文件系统和pwd命令实现详解

一、Unix文件系统基础概念

1.1 核心数据结构

inode（索引节点）

每个文件/目录都有一个唯一的inode
存储文件的元数据（不包括文件名）
包含：文件类型、权限、所有者、大小、时间戳、数据块指针等

目录结构

目录本质上是特殊的文件
内容包含文件名到inode号的映射表
每个目录至少包含两个条目：
- . 指向当前目录的inode
- .. 指向父目录的inode

1.2 文件系统的层次结构

根目录 (/)
├── bin/     (基本命令)
├── etc/     (配置文件)
├── home/    (用户目录)
│   └── user/
├── dev/     (设备文件)
└── var/     (可变数据)

二、pwd命令的工作原理

2.1 基本功能

pwd = Print Working Directory
显示当前工作目录的绝对路径
遵循POSIX标准

2.2 两种实现方式

方式1：使用系统调用（标准方法）

#include <unistd.h>
#include <stdio.h>

char *getcwd(char *buf, size_t size);

方式2：手动遍历目录树

通过不断查找".."目录并与当前目录比较来确定路径。

三、pwd的C语言实现

3.1 使用getcwd()的简单实现

#include <stdio.h>
#include <unistd.h>
#include <stdlib.h>

#define PATH_MAX 4096

int main() {
    char cwd[PATH_MAX];

    if (getcwd(cwd, sizeof(cwd)) != NULL) {
        printf("%s\n", cwd);
        return 0;
    } else {
        perror("getcwd() error");
        return 1;
    }
}

3.2 手动实现的pwd（理解原理）

#include <stdio.h>
#include <stdlib.h>
#include <sys/stat.h>
#include <sys/types.h>
#include <dirent.h>
#include <string.h>
#include <unistd.h>

ino_t get_inode(const char *path) {
    struct stat stat_buf;
    if (stat(path, &stat_buf) == -1) {
        perror("stat");
        exit(1);
    }
    return stat_buf.st_ino;
}

void print_path_to(ino_t this_inode) {
    ino_t parent_inode;
    char its_name[BUFSIZ];

    if (get_inode("..") == this_inode) {
        // 到达根目录
        return;
    }

    // 切换到父目录
    chdir("..");

    // 在当前目录中查找对应inode的文件名
    DIR *dir_ptr = opendir(".");
    if (dir_ptr == NULL) {
        perror("opendir");
        exit(1);
    }

    struct dirent *direntp;
    while ((direntp = readdir(dir_ptr)) != NULL) {
        if (direntp->d_ino == this_inode) {
            strcpy(its_name, direntp->d_name);
            break;
        }
    }

    closedir(dir_ptr);

    // 递归获取父目录路径
    parent_inode = get_inode(".");
    print_path_to(parent_inode);

    printf("/%s", its_name);
}

int main() {
    // 保存起始目录
    char start_dir[BUFSIZ];
    if (getcwd(start_dir, BUFSIZ) == NULL) {
        perror("getcwd");
        exit(1);
    }

    // 打印路径
    print_path_to(get_inode("."));
    printf("\n");

    // 返回起始目录
    chdir(start_dir);

    return 0;
}

四、详细工作原理

4.1 路径解析算法

获取当前目录的inode

struct stat st;
stat(".", &st);
ino_t current_inode = st.st_ino;

获取父目录的inode

stat("..", &st);
ino_t parent_inode = st.st_ino;

递归查找

如果当前目录inode == 父目录inode，说明到达根目录
否则，在父目录中查找具有当前inode的文件名
切换到父目录，继续递归

4.2 关键系统调用

系统调用	功能	使用示例
`stat()`	获取文件状态信息	`stat(".", &buf)`
`chdir()`	改变工作目录	`chdir("..")`
`opendir()`	打开目录流	`opendir(".")`
`readdir()`	读取目录项	`readdir(dir_ptr)`
`getcwd()`	获取当前目录	`getcwd(buf, size)`

五、实际示例

假设目录结构：

/
├── home/
│   └── user/
│       ├── docs/
│       └── projects/

在 /home/user/docs 中执行pwd：

获取docs的inode：1004 进入..（user目录）在user目录中查找inode 1004对应的文件名："docs" 获取user的inode：1003 进入..（home目录）在home目录中查找inode 1003对应的文件名："user" 获取home的inode：1002 进入..（根目录）发现当前inode == 父目录inode，到达根目录输出：/home/user/docs