当前位置：首页 > C > 正文

C语言游程编码详解（手把手教你实现Run-Length Encoding数据压缩算法）

主机测评网
C
2025-12-10
322

在计算机科学中，游程编码（Run-Length Encoding，简称RLE）是一种简单而高效的数据压缩技术。它特别适用于包含大量连续重复字符的数据，比如黑白图像、日志文件或某些类型的文本。本教程将带你从零开始，用C语言实现一个完整的游程编码与解码程序，即使是编程新手也能轻松掌握！

什么是游程编码？

游程编码的基本思想是：将连续重复的字符（称为“游程”）替换为“重复次数 + 字符”这一对信息。例如：

原始字符串：AAAAABBBCCD
压缩后结果：5A3B2C1D

可以看到，原本11个字符被压缩成了8个字符，实现了初步的数据压缩效果。

C语言游程编码详解（手把手教你实现Run-Length Encoding数据压缩算法） C语言游程编码游程压缩算法 C语言数据压缩 Run-Length Encoding教程第1张

C语言实现游程编码

下面我们用C语言编写两个函数：rle_encode() 用于压缩，rle_decode() 用于解压。

1. 编码函数（压缩）

#include <stdio.h>#include <string.h>#include <stdlib.h>char* rle_encode(const char* input) {    if (input == NULL) return NULL;        int len = strlen(input);    if (len == 0) return strdup("");        // 分配足够大的缓冲区（最坏情况：每个字符都不同）    char* output = (char*)malloc(len * 4); // 每个字符最多占4字节（如999A）    if (!output) return NULL;        int out_index = 0;    int i = 0;        while (i < len) {        char current = input[i];        int count = 1;                // 统计连续相同字符的数量        while (i + count < len && input[i + count] == current) {            count++;        }                // 将计数和字符写入输出缓冲区        out_index += sprintf(output + out_index, "%d%c", count, current);        i += count;    }        return output;}

2. 解码函数（解压）

char* rle_decode(const char* encoded) {    if (encoded == NULL) return NULL;        int len = strlen(encoded);    if (len == 0) return strdup("");        // 估算最大输出长度（保守估计）    char* output = (char*)malloc(len * 10);    if (!output) return NULL;        int out_index = 0;    int i = 0;        while (i < len) {        // 读取数字部分        int count = 0;        while (i < len && encoded[i] >= '0' && encoded[i] <= '9') {            count = count * 10 + (encoded[i] - '0');            i++;        }                // 读取字符        if (i < len) {            char ch = encoded[i];            for (int j = 0; j < count; j++) {                output[out_index++] = ch;            }            i++;        }    }        output[out_index] = '\0';    return realloc(output, out_index + 1); // 调整内存大小}

完整测试程序

int main() {    const char* original = "AAAAABBBCCD";    printf("原始字符串: %s\n", original);        char* compressed = rle_encode(original);    printf("压缩后: %s\n", compressed);        char* decompressed = rle_decode(compressed);    printf("解压后: %s\n", decompressed);        // 验证是否还原成功    if (strcmp(original, decompressed) == 0) {        printf("✅ 压缩与解压成功！\n");    } else {        printf("❌ 出现错误！\n");    }        // 释放内存    free(compressed);    free(decompressed);        return 0;}