当前位置：首页 > C > 正文

深入理解C语言哈希函数（从零开始构建高效哈希表）

主机测评网
C
2025-12-17
1042

在计算机科学中，C语言哈希函数 是一种将任意长度的数据映射为固定长度值（通常是一个整数）的算法。这种技术广泛应用于数据库索引、缓存系统、编译器符号表等场景。本教程将手把手教你如何在 C 语言中设计并实现一个简单但高效的哈希函数和哈希表，即使你是编程小白也能轻松上手！

什么是哈希函数？

哈希函数的核心作用是：给定一个输入（比如字符串），快速计算出一个“指纹”——即一个整数，这个整数通常用于数组下标，从而实现 O(1) 的平均查找时间。

深入理解C语言哈希函数（从零开始构建高效哈希表） C语言哈希函数哈希表实现 C语言数据结构哈希冲突处理第1张

设计一个简单的哈希函数

最经典的字符串哈希函数之一是 djb2，由 Daniel J. Bernstein 提出。它简单、快速且分布均匀。下面是我们用 C 语言实现的版本：

// djb2 哈希函数实现unsigned long hash_djb2(const char *str) {    unsigned long hash = 5381;    int c;    while ((c = *str++)) {        hash = ((hash << 5) + hash) + c; // hash * 33 + c    }    return hash;}

这段代码中，我们从初始值 5381 开始，对字符串中的每个字符进行位运算和加法操作，最终返回一个无符号长整型哈希值。

构建哈希表（Hash Table）

有了哈希函数，我们就可以构建一个完整的哈希表。哈希表通常由一个数组组成，每个数组元素（称为“桶”）可以存储一个或多个键值对。为了简化，我们使用链地址法（Chaining）来处理 哈希冲突处理 ——即不同键产生相同哈希值的情况。

首先定义数据结构：

// 键值对节点typedef struct Node {    char *key;    int value;    struct Node *next;} Node;// 哈希表结构#define TABLE_SIZE 100Node *hash_table[TABLE_SIZE] = {NULL};

接下来是插入和查找函数：

// 插入键值对void insert(const char *key, int value) {    unsigned long index = hash_djb2(key) % TABLE_SIZE;        Node *new_node = (Node*)malloc(sizeof(Node));    new_node->key = strdup(key);    new_node->value = value;    new_node->next = hash_table[index];    hash_table[index] = new_node;}// 查找值int get(const char *key) {    unsigned long index = hash_djb2(key) % TABLE_SIZE;    Node *current = hash_table[index];        while (current != NULL) {        if (strcmp(current->key, key) == 0) {            return current->value;        }        current = current->next;    }        // 未找到    return -1;}

完整示例与测试

下面是一个完整的可运行示例，展示了如何使用上述代码：

#include <stdio.h>#include <stdlib.h>#include <string.h>// 此处插入上面定义的 hash_djb2、Node、hash_table、insert、get 函数int main() {    insert("apple", 10);    insert("banana", 20);    insert("cherry", 30);        printf("apple: %d\n", get("apple"));     // 输出 10    printf("banana: %d\n", get("banana"));   // 输出 20    printf("orange: %d\n", get("orange"));   // 输出 -1（未找到）        return 0;}