Before a document is scanned, DansGuardian can optionally convert each %XX to a char. If you find documents are getting past the phrase filtering due to encoding, then enable this. However, this can break Big5 and other 16-bit texts.