Analysis and Mitigation of Religion Bias in Indonesian Natural Language Processing Datasets
Previous studies have shown that various religious identities are misrepresented in Indonesian media. Misrepresentations of other marginalized identities in natural language processing (NLP) datasets have been documented to harm those identities in applications such as automated content moderation, and as such