

The effect of overexposure usually appears when mainstream attention on a particular topic increases. Unlike exclusion and overgeneralization which can be addressed algorithmically, overexposure stems from research design. The fix – To address overgeneralization, we can use dummy variables rather than the ‘tertium non-datur’ approach for classification. However, if similar confusion happens with your religious views or sexual orientation, you might not take it lightly. Let’s say you receive an email congratulating you for your anniversary on your 28th birthday. Though it holds promising solutions and practical benefits, wrong assumptions can prove counterproductive. Let’s consider automatic inference of user attributes - an interesting NLP task. While exclusion is a side-effect of data, overgeneralization is that of modeling. The fix – Measures to address imbalanced or overfitting data can be used to correct demographic bias in data. In the long run, it can also lead to exclusion of these groups. This hidden ambiguity reinforces already existing demographical differences and makes the technology less user-friendly for the excluded groups. For instance, the use of standard language technology may be easier for white males rather than women or Asian men (as they were included while developing it).


This leads to exclusion or demographic misinterpretation. In Natural Language Processing, overfitting is due to the model’s assumption that all languages are identical to the training sample. And overfitting to this bias can severely affect the applicability of findings. However, the incredibly complex, inconsistent, and fluid human language still creates several challenges that NLP needs to resolve.ĭue to the situatedness of language, every data set carries a demographic bias. With AI becoming an indispensable part of our day-to-day lives, NLP and Natural Language Understanding (NLU) are also growing by leaps and bounds. Alexa, Siri, email and text autocorrect, and chatbots, all use Natural Language Processing (NLP) to process and respond to human language, spoken as well as written. You put it up for search and get all the related synonyms and antonyms. You come across a new word while reading. You want to play a song? You ask Alexa to play it for you.
