Permutation DRR In Perm Nrm Btw Perm Nrm


A PDF with a full description of the normalization and its formulas can be found [here].

The PermutationDRRInPermutationNormalizedBetweenPermutationsNormalized normalization extends the PermutationDRRInPermutationNormalized normalization reader. It reads its data through that reader and then uses the data to also apply the normalization between permutations. As a result, all values are transformed linearly so that they average 1 between permutations as well.

Note that the normalization is done per value (ngram) over all permutations, so it gives every ngram the same weight.
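
The per-ngram step described above can be illustrated with a minimal sketch. It is not the LogiLogi implementation: the function name `normalize_between_permutations`, the list-of-dicts data layout, and the treatment of missing ngrams as 0 are all assumptions made for illustration, and "linear" is taken here to mean a plain per-ngram scale factor. The formulas in the PDF remain the authoritative definition.

```python
from typing import Dict, List


def normalize_between_permutations(
    values: List[Dict[str, float]],
) -> List[Dict[str, float]]:
    """Scale each ngram's values so they average 1 across permutations.

    `values` holds one dict per permutation, mapping ngram -> value.
    (Hypothetical layout, assumed for this sketch.)
    """
    if not values:
        return []

    # Collect every ngram that occurs in at least one permutation.
    ngrams = set().union(*values)

    # Mean of each ngram over all permutations; a missing ngram counts as 0
    # (an assumption of this sketch).
    means = {
        ngram: sum(perm.get(ngram, 0.0) for perm in values) / len(values)
        for ngram in ngrams
    }

    # Divide by the per-ngram mean; ngrams whose mean is 0 are left at 0.
    return [
        {
            ngram: (perm.get(ngram, 0.0) / means[ngram]) if means[ngram] else 0.0
            for ngram in ngrams
        }
        for perm in values
    ]


example = [
    {"ab": 2.0, "bc": 10.0},  # permutation 1
    {"ab": 4.0, "bc": 30.0},  # permutation 2
]
normalized = normalize_between_permutations(example)
```

In this example "bc" has much larger raw values than "ab", yet after scaling both average 1 across the two permutations, which is what gives every ngram the same weight.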

This normalization uses temp data. It is slow when run for the first time (and it uses a few GB of space, depending on the size of your data set); after that it becomes much faster, because it just reads in what it calculated before.
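
The compute-once, reread-afterwards behaviour can be sketched as follows. This is only an illustration of the general caching pattern, not how LogiLogi stores its temp data: the helper name `load_or_compute`, the use of `pickle`, and the cache path are assumptions.

```python
import os
import pickle
from typing import Callable


def load_or_compute(cache_path: str, compute: Callable[[], object]) -> object:
    """Return the cached result if present, otherwise compute and store it."""
    if os.path.exists(cache_path):
        # Fast path: a previous run already did the expensive work.
        with open(cache_path, "rb") as fh:
            return pickle.load(fh)

    # Slow path: first run, so do the full calculation.
    result = compute()

    # Write to a temporary file first and rename it, so an interrupted run
    # does not leave a half-written cache behind.
    tmp_path = cache_path + ".tmp"
    with open(tmp_path, "wb") as fh:
        pickle.dump(result, fh)
    os.replace(tmp_path, cache_path)
    return result
```

The first call with a given `cache_path` pays the full cost of `compute`; later calls only read the stored file. The cache has to be deleted by hand if the underlying data changes, since this sketch does no invalidation.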

Useful for finding the ngrams that make the most difference in relative terms (not in absolute terms, because every ngram has an average value of 1, regardless of the frequency differences between them).
Part of the LogiLogi Network: The LogiLogi Foundation - LogiLogi.org - OgOg.org
This is an old version for archival purposes, see www.LogiLogi.org for the current version.
Document last modified Fri, 29 Jul 2005 07:11:10
All content is available under the GNU Free Documentation License. The LogiLogi-system is under the GPL