tanszek:oktatas:techcomm:lzw_coding
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
tanszek:oktatas:techcomm:lzw_coding [2024/10/07 08:16] – [Step 2: Start Encoding] knehez | tanszek:oktatas:techcomm:lzw_coding [2024/11/19 07:54] (current) – knehez | ||
---|---|---|---|
Line 6: | Line 6: | ||
1. **Initialize a dictionary** with all single characters in the input data (e.g., all ASCII characters if the input is text). | 1. **Initialize a dictionary** with all single characters in the input data (e.g., all ASCII characters if the input is text). | ||
+ | |||
2. **Scan the input** string character by character, building longer and longer substrings that exist in the dictionary. | 2. **Scan the input** string character by character, building longer and longer substrings that exist in the dictionary. | ||
+ | |||
3. **Add new substrings** to the dictionary and output the dictionary index for the current substring when it can no longer be extended by the next character in the input. | 3. **Add new substrings** to the dictionary and output the dictionary index for the current substring when it can no longer be extended by the next character in the input. | ||
+ | ---- | ||
=== Example of LZW Compression === | === Example of LZW Compression === | ||
Line 75: | Line 78: | ||
The final output (in terms of dictionary indices) is: | The final output (in terms of dictionary indices) is: | ||
- | ``` | + | |
- | 65 66 256 258 | + | < |
- | ``` | + | |
This represents the compressed version of the original string `" | This represents the compressed version of the original string `" | ||
Line 83: | Line 85: | ||
--- | --- | ||
- | ### Decoding LZW | + | ==== Decoding LZW ==== |
To decode an LZW-compressed string, you use the same dictionary-based approach but in reverse. The decoder reconstructs the dictionary entries as it processes the encoded output, using the indices to retrieve the corresponding substrings. | To decode an LZW-compressed string, you use the same dictionary-based approach but in reverse. The decoder reconstructs the dictionary entries as it processes the encoded output, using the indices to retrieve the corresponding substrings. | ||
- | Let’s go through the decoding process of the encoded sequence | + | Let’s go through the decoding process of the encoded sequence |
- | 1. **65**: Look up in the dictionary. It maps to `" | + | 1.) **65**: Look up in the dictionary. It maps to `" |
- | - **Decoded string so far**: `" | + | |
- | 2. **66**: Look up in the dictionary. It maps to `" | + | |
- | | + | |
- | 3. **256**: Look up in the dictionary. It maps to `"AB"`. | + | 2.) **66**: Look up in the dictionary. It maps to `"B"`. |
- | - **Decoded string so far**: `" | + | |
- | 4. **65**: Look up in the dictionary. It maps to `" | + | |
- | | + | |
- | The original | + | 3.) **256**: Look up in the dictionary. It maps to `" |
+ | |||
+ | * **Decoded | ||
+ | |||
+ | 4.) **258**: Look up in the dictionary. It maps to `" | ||
+ | |||
+ | * **Decoded string so far**: | ||
+ | |||
+ | The original string '' | ||
--- | --- | ||
- | ### Summary of LZW Compression | + | ==== Summary of LZW Compression |
1. **Dictionary Initialization**: | 1. **Dictionary Initialization**: | ||
+ | |||
2. **Scanning Input**: Input data is scanned character by character, grouping together characters to form substrings. | 2. **Scanning Input**: Input data is scanned character by character, grouping together characters to form substrings. | ||
+ | |||
3. **Dictionary Expansion**: | 3. **Dictionary Expansion**: | ||
+ | |||
4. **Output**: The dictionary indices corresponding to the substrings are output, resulting in a compressed sequence of codes. | 4. **Output**: The dictionary indices corresponding to the substrings are output, resulting in a compressed sequence of codes. | ||
Line 115: | Line 124: | ||
+ | C implementation of LZW encoding: | ||
+ | |||
+ | <sxh c> | ||
+ | #include < | ||
+ | #include < | ||
+ | |||
+ | #define MAX_sxhS 100 | ||
+ | |||
+ | char sxhTable[MAX_sxhS][10] = {" | ||
+ | |||
+ | int tableElements = 5; | ||
+ | |||
+ | int isInsxhTable(char *s) | ||
+ | { | ||
+ | for(int i = 0; i < tableElements; | ||
+ | { | ||
+ | if(strcmp(sxhTable[i], | ||
+ | { | ||
+ | return i; | ||
+ | } | ||
+ | } | ||
+ | return -1; | ||
+ | } | ||
+ | |||
+ | int main() | ||
+ | { | ||
+ | char text[] = " | ||
+ | char *p = text; | ||
+ | int bufferEnd = 1; | ||
+ | |||
+ | int lastFoundIndex = -1; | ||
+ | while(*p != ' | ||
+ | { | ||
+ | char subStr[10]; | ||
+ | strncpy(subStr, | ||
+ | subStr[bufferEnd] = ' | ||
+ | int foundIndex = isInsxhTable(subStr); | ||
+ | if(foundIndex != -1) | ||
+ | { | ||
+ | | ||
+ | | ||
+ | | ||
+ | } | ||
+ | p += strlen(subStr) - 1; | ||
+ | bufferEnd = 1; | ||
+ | strcpy(sxhTable[tableElements++], | ||
+ | printf(" | ||
+ | } | ||
+ | } | ||
+ | </ |
tanszek/oktatas/techcomm/lzw_coding.1728288971.txt.gz · Last modified: 2024/10/07 08:16 by knehez