N-gram frequencies for languages

Using the Methodius library, this has generated frequencies and positions of letters, character bigrams and trigrams, and words.

The content used is various translations of the Universal Declaration of Human Rights.

Click on a token in a table to see it highlighted.

Any of the samples can be edited.

Germanic Languages


Romance Languages

Gallo-Romance

Occitano-Romance

Ibero-Romance

Central Romance


Isolate Languages


Celtic Languages


Semitic Languages