Set the text content language and stop words

 


The text content language and stop word lists are managed via project properties. Project properties' settings apply to every user who accesses the project.

 

This video about word frequency queries discusses the stop words list at 3:10

In server projects, all project users can view project properties (including the text content language and stop words list), but only Project Owners can make changes—refer to Collaborate in a server project for more information.

In this topic


 


Why set the text content language?

NVivo provides the following text content languages: Chinese, English (UK), English (US), French, German, Japanese, Portuguese and Spanish.  

Each text content language has an associated dictionary that is used for:

Each text content language also has an associated 'stop word' list. The words in the stop word list are filtered out (removed) when you:

  • Run a Word Frequency or Text Search query

  • Generate a cluster analysis diagram comparing words in sources or nodes

If you auto code based on existing coding patterns, NVivo excludes the stop words when comparing wording of text passages to previously coded content.

If you auto code themes or sentiment, the text content language determines the language used when processing your sources. The 'stop word' list is not used for this feature. (NVivo Plus only)

Make sure that the text content language is set to match the language of your source materials—refer to Set the text content language for detailed instructions.  If the language of your source materials is not available as a text content language, you should set the text content language to 'Other'. When the text content language is set to 'Other', you cannot look for similar words when you run a Text Search query or Word Frequency query.

Top of Page

What stop words are provided by default?

NVivo provides default stop words for Chinese, English (UK), English (US), French, German, Japanese, Portuguese and Spanish. The default stop words are less significant words like conjunctions or prepositions that may not be meaningful to your analysis. You can view the stop words associated with each language, and you can customize the stop words by adding or removing words from the stop word list.

The text content language setting in your project, determines which list of stop words is in use—for example, the English (US) stop words list, or the Spanish stop words list. If the language of your source materials is not available in the Text content language list, you should select 'Other'. By default, the stop word list for the language 'Other' is empty—you can add your own words to create a stop word list to suit the language of your sources.

Top of Page

Set the text content language for your project

  1. Click the File tab and then click Project Properties.

The Project Properties dialog box opens.

  1. On the General tab, select the language used in your sources from the Text content language list.

  2. Click OK.

NOTE  

  • If the language used in your sources is not available in the Text content language list, select the language 'Other'. By default, the stop word list for the language 'Other' is empty—you can add your own words to create a stop word list to suit the language of your sources.

  • You can also set the default text content language that will apply to any future projects you create, refer to Set application options (General tab options) for more information.

Top of Page

View, add or remove stop words

  1. Click the File tab and then click Project Properties.

The Project Properties dialog box opens.

  1. On the General tab, click the Stop Words button. The Stop Words dialog box opens.

  2. (Optional) Edit the list of stop words. You can add or remove words from the list. Each word must be separated by a space.

  3. Click OK. to close the Stop Words dialog box.

  4. Click OK.

NOTE 

  • You can also add stop words by selecting words displayed in the results of a Word Frequency query. Select the word you want to exclude from the query results, then click Add to Stop Words List, in the Actions group on the Query tab.

  • If you want to change the stop words for more than one language, make sure you click Apply on the Project Properties dialog after changing the stop words for a language (before you select the next language).

  • In a server project, you cannot add or remove stop words when another user has a source (internal, external or memo) open in edit mode.

Top of Page

Restore the default stop words

CAUTION  If you restore the default stop word list for a text content language, any words that you have added to the stop word list are removed.

  1. Click the File tab, and then click Project Properties.

The Project Properties dialog box opens.

  1. On the General tab, click the Stop Words button. The Stop Words dialog box opens.

  2. Click Reset.

  3. Click OK.

Top of Page