Search⌘ K

FinBERT for Finnish

Explore how FinBERT is designed specifically for the Finnish language, outperforming multilingual BERT models like M-BERT in tasks such as named entity recognition and part-of-speech tagging. Learn about its training data, architecture, and practical usage to improve Finnish NLP solutions.

FinBERT is the pre-trained BERT model for the Finnish language. FinBERT outperforms M-BERT on many downstream Finnish NLP tasks. We learned that M-BERT is trained using the Wikipedia text of 104 languages, but it comprises only 3% Finnish text. FinBERT is trained using the Finnish text from news articles, online discussions, and internet crawling. It consists of about 50K ...