Bias is a systematic difference in the treatment of objects, people or groups compared to others, leading to an imbalance in the distribution of data.
Biases are part of people’s lives. They usually start with habits or unconscious actions (cognitive biases) which, over time, materialize as technical biases (data biases and processing biases). Such a scenario increases or creates risks that could result in untrustful AI systems.
Biases in AI systems arise from human cognitive biases, the characteristics of the data used or the algorithms themselves. Where AI systems are trained on real-world data, there is the possibility that models can learn from, or even amplify, existing biases.
In a statistical context, errors in predictive systems are the difference between the values predicted as model output and the real value of the variables considered in the sample. When the error occurs systematically in one direction or for a subset of data, bias can be identified in the data treatment.
Cognitive biases
Cognitive biases are systematic errors in judgements or decisions common to human beings owing to cognitive limitations, motivational factors and adaptations accumulated throughout life. Sometimes, actions that reveal cognitive biases are unconscious.
For a list of cognitive biases, refer to the sub-guideline Generic risks and biases: Cognitive bias types.
Data biases
Data biases are a type of error in which certain elements of a data set are more heavily weighted or represented than others, painting an inaccurate picture of the population. A biased data set does not accurately represent a model’s use case, resulting in skewed outcomes, low accuracy levels and analytical errors.
For a list of cognitive biases, refer to the sub-guideline Generic risks and biases: Data bias types.
Processing and validation biases
Processing and validation biases arise from systematic actions and can occur in the absence of prejudice, partiality or discriminatory intent. In AI systems, these biases are present in algorithmic processes used in the development of AI applications.
For a list of cognitive biases, refer to the sub-guideline Generic risks and biases: Processing and validation bias types.