July 23, 2008
Recently I needed a language recognition library to identify the language of specific chunks of text. I asked a network of colleagues here in the Boston area and they came up with the following:
There is also:
And all this led to:
In the event, it seemed simple enough to write my own, using the text collection in TextCat as source material for the n-grams and their associated frequencies.
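To give a feel for how little code this kind of recognizer needs, here is a minimal sketch in Python. It assumes the rank-order ("out-of-place") comparison of character n-gram profiles described by Cavnar and Trenkle, which is the approach TextCat is based on; it is not the code I actually wrote. The function names, the 300-entry profile size, and the three one-sentence training samples are all placeholders for illustration. A real recognizer would build its profiles from a proper corpus such as the TextCat text collection.

```python
from collections import Counter

TOP_K = 300  # assumed profile size; also used as the penalty for a missing n-gram


def ngram_profile(text, max_n=3, top_k=TOP_K):
    """Return the top_k most frequent character n-grams (1..max_n), most frequent first."""
    counts = Counter()
    for word in text.lower().split():
        padded = f"_{word}_"  # mark word boundaries so prefixes and suffixes count
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:top_k]]


def out_of_place(doc_profile, lang_profile, missing_penalty=TOP_K):
    """Sum of rank differences; n-grams absent from the language cost a fixed penalty."""
    rank = {gram: i for i, gram in enumerate(lang_profile)}
    return sum(
        abs(i - rank[gram]) if gram in rank else missing_penalty
        for i, gram in enumerate(doc_profile)
    )


def identify(text, profiles):
    """Return the language whose profile is closest to the text's profile."""
    doc = ngram_profile(text)
    return min(profiles, key=lambda lang: out_of_place(doc, profiles[lang]))


# Placeholder training text (hypothetical): one sentence per language.
SAMPLES = {
    "english": "the quick brown fox jumps over the lazy dog and then the dog sleeps in the sun",
    "french": "le renard brun saute par dessus le chien paresseux et puis le chien dort au soleil",
    "german": "der schnelle braune fuchs springt über den faulen hund und dann schläft der hund in der sonne",
}
PROFILES = {lang: ngram_profile(text) for lang, text in SAMPLES.items()}

if __name__ == "__main__":
    print(identify("the dog sleeps in the sun", PROFILES))
```

With training samples this small the sketch only recognizes text that closely resembles them. Swapping in larger per-language texts changes nothing in the code, only the contents of `SAMPLES`.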