AI/automation ethics glossary
Fairness
AI/automation ethics glossary
Fairness
Fairness refers to the use/misuse of an AI/automated system to create or amplify unfair, prejudiced or discriminatory results due to biased data, poor governance, or other factors.
Fairness in AI concerns whether a system's decisions, predictions, and outputs distribute benefits and harms justly across different people and groups. It operates on several levels: individual fairness (treating similar people similarly), group fairness (ensuring comparable outcomes across demographic groups), and procedural fairness (ensuring the process by which decisions are reached is equitable and transparent).
There is no single agreed definition of algorithmic fairness. Mathematically, different fairness criteria — demographic parity, equalized odds, predictive parity, calibration — are often mutually incompatible. This means that designing a "fair" system requires explicit choices about which group or which kind of harm takes priority, choices that are inherently value-laden and contested.
Fairness problems typically emerge at the intersection of data (what was collected, and about whom), models (what patterns were learned), and deployment (what decisions the system is used to make). The issue is compounded when AI is used as a decision-making layer that obscures human responsibility.
AI systems are increasingly embedded in high-stakes decision-making across domains such as healthcare, finance, criminal justice, and employment, and evidence has accumulated showing that these systems can reproduce and amplify structural inequities, leading to ethical, social, and technical concerns.
When AI systems are unfair, they make mistakes in a patterned, structural way that compounds existing social disadvantage. Individuals already marginalised by historical discrimination face compounded harm when automated systems - which carry the appearance of objectivity - replicate or worsen that discrimination at scale and speed no human bureaucracy could match.
When AI fairness norms break down, the consequences span individuals, institutions, and society:
Individual harm. People are wrongly denied jobs, credit, bail, healthcare, or welfare benefits. The harm is often invisible; the person simply receives a rejection with no explanation and no recourse.
Amplification of inequality. Biased systems trained on historically unequal data perpetuate and entrench the same patterns. Biases can perpetuate systemic discrimination and inequality, with detrimental effects on individuals and communities in areas like hiring, lending, and criminal justice.
Erosion of institutional trust. When governments and corporations are shown to use opaque, discriminatory systems, public trust in both AI and the institutions deploying it is damaged - often irreversibly.
Fairness issues in AI rarely stem from malicious intent; instead, they are usually caused by:
Historical data bias. Training data that reflects past human prejudices, structural inequalities, or uneven societal practices.
Representative bias. Data sampling that underrepresents or misrepresents certain demographic groups (e.g., facial recognition trained primarily on lighter skin tones).
Proxy variables. Removing protected attributes (like race) but keeping data points strongly correlated with them (like zip/post codes or neighbourhood data), leading to indirect discrimination.
Flawed objective functions. Optimising an algorithm solely for efficiency or profit without constraining it for equitable outcomes.
The fairness impossibility problem. No model can satisfy all common mathematical definitions of fairness simultaneously when base rates differ across groups. Choosing which definition to satisfy is a political and moral choice, not just a technical one - yet it is typically made by engineers, invisibly.
Accuracy versus equity. Correcting for group disparities sometimes reduces a model's overall accuracy. Deciding who bears that cost, and whether it is worth paying, is genuinely difficult.
Individual versus group fairness. Group-level interventions (e.g. demographic parity) can produce outcomes that are unfair to specific individuals within those groups. The two goals can pull in opposite directions.
Historical data as ground truth. Models trained on historical outcomes treat past human decisions as correct labels. Where those decisions were themselves biased, the model learns discrimination as signal.
Transparency versus gaming. Making a model's fairness criteria publicly known enables accountability, but may also allow bad actors to game the system - submitting applications that satisfy the model's criteria without reflecting genuine merit or need.
Who defines fairness? The communities most affected by algorithmic decisions are rarely the ones defining what fairness means in a given deployment context - raising questions of power, legitimacy, and democratic accountability.
Facebook Cross-check criticised as unfair, under-resourced and opaque
Robo review 'unfairly' targets Bulgarians for benefit fraud investigation
Hingham High School accused of unfairly disciplining students for AI use
Amazon Driveri delivery driver monitoring slammed as inaccurate, unfair
UK sham marriage tool found to disproportionately flag Greeks, Albanians
Poland COVID-19 Cultural Support Fund assessments blasted as unfair
Author: Charlie Pownall 🔗
Published: May 18, 2026
Last updated: May 18, 2026
You are welcome to use, copy, adapt, and redistribute this definition under a CC BY-SA 4.0 licence.
Let us know if you have any comments or suggestions about how to improve this definition, or would like to suggest and/or contribute additional terms to define.