A method and apparatus for extracting information from symbolically compressed document images. A deciphering module generates first and second text strings by deciphering respective sequences of template identifiers in first and second symbolically compressed document images. A conditional n-gram module...http://www.google.com.au/patents/US20040042667?utm_source=gb-gplus-sharePatent US20040042667 - Extracting information from symbolically compressed document images