Text Search Languages

The text index and the $text operator supports the following languages:

Changed in version 2.6: MongoDB introduces version 2 of the text search feature. With version 2, text search feature supports using the two-letter language codes defined in ISO 639-1. Version 1 of text search only supported the long form of each language name.

Changed in version 3.2: MongoDB Enterprise includes support for Arabic, Farsi (specifically Dari and Iranian Persian dialects), Urdu, Simplified Chinese, and Traditional Chinese. To support the new languages, the text search feature uses the three-letter language codes defined in ISO 636-3. To enable support for these languages, see Text Search with Basis Technology Rosette Linguistics Platform.

Language Name ISO 639-1 (Two letter codes) ISO 636-3 (Three letter codes) RLP names (Three letter codes)
danish da    
dutch nl    
english en    
finnish fi    
french fr    
german de    
hungarian hu    
italian it    
norwegian nb    
portuguese pt    
romanian ro    
russian ru    
spanish es    
swedish sv    
turkish tr    
arabic   ara  
dari   prs  
iranian persian   pes  
urdu   urd  
simplified chinese or hans     zhs
traditional chinese or hant     zht


If you specify a language value of "none", then the text search uses simple tokenization with no list of stop words and no stemming.