Subtasks

Task Introduction

Online polarization is the sharp division and hostility between social, political, or identity groups. It has become a growing concern, as it often precedes hate speech, offensive discourse, and social fragmentation.

In its extreme form, polarization can create a fragmented society where individuals or groups are unable to engage in constructive dialogue, leading to a breakdown in community cohesion and social unity. Thus, detecting and mitigating polarization before it escalates is crucial to ensure safer and more inclusive online spaces.

For the first time, we introduce a polarization task aimed at detecting online polarization. The task focuses on identifying multilingual, multicultural, and multi-event polarization, capturing the complexity of online discourse across diverse contexts. Participants may take part in one or more of the following three subtasks: Polarization Detection, Polarization Type Classification, and Polarization Manifestation Identification.

Each of the three subtasks addresses a specific aspect of polarization detection and analysis in multilingual social media content.

Dataset Information

Data sources include news websites, Reddit, blogs, Bluesky, and regional forums, covering events such as elections, conflicts, gender rights, migration, and more. Each language contains 3,000–5,000 annotated instances.

A few sample instances are provided here for review, and additional trial data can be found at: TRIAL DATA

Languages Covered

The task covers 22 languages across different cultural and geographical contexts:

Amharic, Arabic, Bengali, Burmese, Chinese, English, German, Hausa, Hindi, Italian, Khmer, Nepali, Odia, Persian, Polish, Punjabi, Russian, Spanish, Swahili, Telugu, Turkish, Urdu.
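
When working across subtasks, it can help to keep the language list in one place. The following is a minimal sketch: the language names are taken from the list above, and the exclusion of Italian and Russian reflects the note in the Subtask 3 section of this page. The constant names `LANGUAGES` and `SUBTASK3_EXCLUDED` are hypothetical and not part of any official starter kit.

```python
# The 22 task languages, as listed above.
LANGUAGES = [
    "Amharic", "Arabic", "Bengali", "Burmese", "Chinese", "English",
    "German", "Hausa", "Hindi", "Italian", "Khmer", "Nepali", "Odia",
    "Persian", "Polish", "Punjabi", "Russian", "Spanish", "Swahili",
    "Telugu", "Turkish", "Urdu",
]

# Per the note under Subtask 3, these two languages are not part of that subtask.
SUBTASK3_EXCLUDED = {"Italian", "Russian"}

subtask3_languages = [lang for lang in LANGUAGES if lang not in SUBTASK3_EXCLUDED]

print(len(LANGUAGES), "languages overall;", len(subtask3_languages), "in Subtask 3")
```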

Subtask 1: Polarization Detection

If a text exhibits one or more of the specified polarization characteristics, it is classified as polarized. Conversely, social media texts that do not display any of these characteristics are classified as non-polarized. Therefore, a given text is classified into one of two categories:

Text that contains/shows polarized opinion (Yes)
Text that does not contain polarized opinion (No)

Note: Only texts that clearly reflect attitude polarization are classified as such, with consideration of the context and the overall meaning of the text, not just individual words or phrases.

Subtask-1: Binary classification: to determine whether a post contains polarized content (Polarized or Not Polarized).

id	text	polarization
2745	Find yourself a west bank settler gf	1
2738	Fascist oligarchs now control the USA	1
3184	Someone end this lunatic before he starts ethnic cleansing	1
1614	The EU is increasing military aid to Ukraine to one billion euros the press release on the website of the Council of Europe.	0
716	House drafts bill to strike Iran proxies amid IsraelHamas	0
309	Contested races across county early voting	0
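
The sample rows above can be read programmatically. The following is a minimal sketch, assuming the data is distributed as tab-separated text with the `id`, `text`, and `polarization` columns shown in the sample; the actual release format may differ, and `load_binary_rows` is a hypothetical helper name.

```python
import csv
import io
from collections import Counter

# Rows copied from the sample table above; the tab-separated layout is an assumption.
SAMPLE_TSV = (
    "id\ttext\tpolarization\n"
    "2745\tFind yourself a west bank settler gf\t1\n"
    "2738\tFascist oligarchs now control the USA\t1\n"
    "309\tContested races across county early voting\t0\n"
)


def load_binary_rows(tsv_text):
    """Parse tab-separated rows into dicts with an integer id and a 0/1 label."""
    reader = csv.DictReader(
        io.StringIO(tsv_text), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    rows = []
    for record in reader:
        label = int(record["polarization"])
        if label not in (0, 1):
            raise ValueError(f"unexpected label {label!r} for id {record['id']}")
        rows.append(
            {"id": int(record["id"]), "text": record["text"], "polarization": label}
        )
    return rows


rows = load_binary_rows(SAMPLE_TSV)
label_counts = Counter(row["polarization"] for row in rows)
print(label_counts[1], "polarized,", label_counts[0], "not polarized")
```

Checking the label distribution this way is a quick sanity test before training any classifier on the data.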

Go to Subtask-1 Competition

Subtask 2: Polarization Type Classification

For a given social media text, the type or target of polarization is classified as follows:

Political/ideological polarization: This type of polarization focuses on division, intolerance, and conflict between political parties and their followers. It arises when political beliefs and affiliations become more extreme. People may identify more strongly with their political party, leading to deeper divides and a reduced willingness to compromise. It widens ideological differences between political groups.
Racial or ethnic polarization: This type of polarization focuses on ethnic identity or racial origin and incites division, intolerance, and conflict between ethnic groups or races. This type of polarization arises when individuals identify more strongly with their own racial or ethnic group, leading to increased separation, mistrust, or conflict with individuals from other groups.
Religious polarization: This type of polarization focuses on religious identity and incites division, intolerance, and conflict between religious followers.
Gender/sexual polarization: Gender polarization refers to the exclusion, discrimination, and marginalization of individuals based on their gender. Sexual orientation polarization refers to the increasing division and distinction between different sexual orientations within society, often leading to heightened tensions, misunderstandings, conflicts, or marginalization among various groups.
Other: Polarized texts targeting other groups or identities, such as those related to the economy, technology, media, etc.

Subtask-2: Multi-label classification: to identify the target of polarization as one or more of the following categories: Political, Racial/Ethnic, Religious, Gender/Sexual, or Other.

id	text	political	racial/ethnic	gender/sexual
2745	Find yourself a west bank settler gf	1	0	1
2738	Fascist oligarchs now control the USA	1	0	0
3184	Someone end this lunatic before he starts ethnic cleansing	1	1	0
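
Because this is a multi-label subtask, each text carries one 0/1 flag per category rather than a single class. The sketch below turns such a row into the list of active categories. The sample table shows only the `political`, `racial/ethnic`, and `gender/sexual` columns; the `religious` and `other` column names are assumptions based on the category list above, and `active_types` is a hypothetical helper.

```python
# Column names: three are shown in the sample table; "religious" and "other"
# are assumed from the category list and may be named differently in the data.
TYPE_COLUMNS = ["political", "racial/ethnic", "religious", "gender/sexual", "other"]


def active_types(row):
    """Return the categories flagged with 1; columns missing from the row count as 0."""
    return [col for col in TYPE_COLUMNS if int(row.get(col, 0)) == 1]


def multi_hot(row):
    """Return a fixed-order 0/1 vector over TYPE_COLUMNS."""
    return [int(row.get(col, 0)) for col in TYPE_COLUMNS]


# Label values copied from the sample table (ids 3184 and 2738).
row_3184 = {"id": 3184, "political": 1, "racial/ethnic": 1, "gender/sexual": 0}
row_2738 = {"id": 2738, "political": 1, "racial/ethnic": 0, "gender/sexual": 0}

print(active_types(row_3184))
print(multi_hot(row_2738))
```

A fixed column order, as in `multi_hot`, keeps predictions and gold labels aligned when comparing them per category.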

Go to Subtask-2 Competition

Subtask 3: Manifestation Identification

A message or text on social media is considered polarizing if it exhibits one or more of the following characteristics:

Stereotype: This manifestation occurs when a message generalizes certain characteristics of individuals to all members of a group, ignoring individual differences. Stereotypes simplify complex personalities into one-size-fits-all representations.
Vilification: Vilification appears when a text defames or demonizes a particular group, person, or entity, often inciting fear through exaggeration, misrepresentation, or biased framing that portrays the subject in a harmful or negative light.
Dehumanization: This occurs when language strips a group or individual of human qualities or dignity, often by comparing them to animals, machines, or objects, or by otherwise denying their humanity and individuality.
Extreme Language and Absolutism: This manifestation involves the use of extreme or absolutist language that reflects polarized attitudes, such as words like “always,” “never,” “worst,” or “best.” It often presents issues in dichotomous terms such as “us vs. them” or “right vs. wrong.”
Lack of Empathy or Understanding: This occurs when the text shows no empathy or understanding for others’ perspectives or experiences. It may involve marginalizing alternative viewpoints or refusing to understand or relate to them.
Invalidation: Invalidation appears when a text denies or rejects the identity or existence of certain people or groups, dismissing their legitimacy or right to exist.

Subtask-3: Multi-label classification: to classify how polarization is expressed, with multiple possible labels including Stereotype, Vilification, Dehumanization, Extreme Language, Lack of Empathy, or Invalidation.

NOTE: Italian and Russian languages are not included in this subtask.

id	text	stereotype	vilification	extreme_language
2745	Find yourself a west bank settler gf	1	1	0
2738	Fascist oligarchs now control the USA	0	1	1
3184	Someone end this lunatic before he starts ethnic cleansing	1	1	1
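
The definitions on this page tie the subtasks together: a text is treated as polarized when it exhibits at least one manifestation. The sketch below expresses that rule as a consistency check. It is an illustration of the stated definition, not a guarantee about how the released annotations relate to each other. The sample table shows only `stereotype`, `vilification`, and `extreme_language`; the remaining column names and the all-zero row are assumptions for illustration.

```python
# Three column names come from the sample table; "dehumanization",
# "lack_of_empathy", and "invalidation" are assumed spellings.
MANIFESTATION_COLUMNS = [
    "stereotype", "vilification", "dehumanization",
    "extreme_language", "lack_of_empathy", "invalidation",
]


def implied_polarization(row):
    """Return 1 if any manifestation flag is set, else 0 (missing columns count as 0)."""
    return int(any(int(row.get(col, 0)) == 1 for col in MANIFESTATION_COLUMNS))


# Flags copied from the sample table (id 2738).
row_2738 = {"id": 2738, "stereotype": 0, "vilification": 1, "extreme_language": 1}

# Hypothetical row with no manifestation flags set, for contrast.
row_without_flags = {"id": 0, "stereotype": 0, "vilification": 0, "extreme_language": 0}

print(implied_polarization(row_2738), implied_polarization(row_without_flags))
```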

Go to Subtask-3 Competition