how

… can the source of online string in any language be located ?  Can intra-text word-frequencies be used to identify the source ?