Using NGRAM for the composition attribute, the six Korean characters of the composite noun information processing institute are indexed as six tokens. This illustration shows the last two tokens.