Dictionary encoding is a loss-less compression technique to compress a list of words. Dictionary encoding works by maintaining a mapping of unique words to their compressed value, and replacing occurrence of the words with the compressed value Here is an example: Input String:
“USA,USA,USA,USA,Mexico,Canada,Mexico,Mexico,Mexico,Argentina” Potential Encoder Output String:
“USA,Mexico,Canada,Argentina:0,0,0,0,1,2,1,1,1,3”