Lesson 9 of 96 in this sequence

Not completed yet

Saved on this device only — not synced to an account yet.

Learning path

Step 9 of 96

Continue the sequence

ResumeReturn to this step ReviewRevisit the core idea RemediateGet repair practice Next stepChoose the next lesson manually

Completion

Review objectives and instruction
Complete the activity with manual learner confirmation
Answer every understanding-check item
Show completion feedback and reward only after the learner finishes the check

Progress

Step 9/96
15 content types
AI help source context available
Auto-advance blocked

Review objectives and instruction

AI help

Available only with source grounding, privacy limits, age controls, and measurement.

AI source context

How Bias Gets Built into Algorithms and AI · Machines and Society (Systems & Impact) · 3A-IC-24 · 3B-IC-27

Evidence: local lesson AI-help metadata only; source-grounding, privacy, age-safety, and measurement hooks do not prove real model quality, production telemetry, child AI approval, or AI tutor production readiness

Production AI readiness not claimed

AI source context

Assessment shape

Local lesson assessment shape is complete.

Local shape only; production answer-attempt telemetry, live ownership/RLS evidence, mastery interpretation, release, and production readiness are not claimed here.

Objectives: 5
Activity: Prompt ready
Quiz items: 3 items, 3 answer keys, 3 explanations
Reward: Completion reward ready

Assessment progress stays manual: do not auto-advance through uncertain answers, failed writes, consent, media, payment, learner choice, or AI-assisted checks.

Learning-loop evidence: Content-map contract only; persistence and production learner-progress writes require route/API evidence.

Manual policy: assessment, media, consent, payment, learner choice, failed writes, and AI-uncertain help stay under learner control.

Prerequisite: Complete or review Hiding Complexity with Layers of Abstraction

AI guardrails

Privacy

Do not send raw child free text without consent
Use source excerpts and non-identifying progress state only
Asset is not registry-gated

Age safety

Apply academy audience controls before AI-help claims
Keep child learner AI paths behind guardian/product review evidence before readiness claims

Measurement

Log AI help usage by surface and asset slug
Evaluate answer quality against source-grounded rubric before production-readiness claims
Tie remediation prompts to completion, retry, or review outcomes

Local AI-help guardrails only; they do not prove guardian approval, production telemetry, model evaluation, release, deployment, or AI tutor production readiness.

Source and rights

Source path: koydo-app/scripts/curriculum-authoring
License: ai-generated-koydo
Attribution: Attribution decision needs owner review
AI provenance: claude-opus-4-8

Source review gates: attribution_missing

Local source evidence only; this does not prove publication rights, attribution approval, AI provenance clearance, release, deployment, or production readiness.

Route review

Current route: /library/academy/computer_science/en/lesson/algorithmic-bias-in-ai
Route source: Fallback route needs owner canonical review
Candidate route: /library/academy/computer_science/en/lesson/algorithmic-bias-in-ai

Local route evidence only; fallback candidates need owner-approved canonical route policy before route writes, production discoverability, or readiness claims.

Content gates: canonical_target_web_missing_uses_library_fallback

lessonslessonslesson objectivesmedia scenesdrillspractice setsquizzesfeedbackcharacterscitationsconcept mapsprogressresumereviewremediationnext steps

How Bias Gets Built into Algorithms and AI

with Byte

Byte, a sharp-eyed robot guide with a circuit-board chest panel, stands in a dimly lit data warehouse surrounded by towering stacks of labeled file boxes — some stacks towering high, others nearly empty — projecting a glowing decision flowchart onto a screen while pointing out the uneven piles with a calibrated laser stylus.

Explain how skewed training data causes an algorithm to produce biased outputs.
Identify at least two real-world domains where algorithmic bias has caused documented harm.
Distinguish between bias introduced at the data-collection stage and bias introduced at the model-design stage.
Predict how a specific gap in training data would distort an algorithm's decisions for an underrepresented group.
Evaluate an AI output critically by questioning the source and composition of the training data.

Key terms

Historical bias: Bias that arises when training data faithfully records past discriminatory human decisions.
Proxy variable: A seemingly neutral feature that statistically encodes a protected attribute like race or gender.
Differential performance: When a model's accuracy or error rate varies systematically across demographic subgroups.
False positive rate: The fraction of negative cases the model incorrectly flags as positive predictions.
Label bias: Bias introduced when the humans annotating training data apply prejudiced judgments.

Where Bias Enters the Pipeline

Bias is not a single defect but a family of failure points across the machine-learning pipeline. At collection time, sampling that underrepresents a group starves the model of examples it needs to generalize. At labeling time, annotator prejudice teaches the model to replicate biased judgments. At feature-engineering time, proxy variables let excluded attributes re-enter through correlation. Auditing for fairness therefore means inspecting every stage, not just the final accuracy number, because each stage can independently inject systematic harm that aggregate metrics easily conceal.

Why Aggregate Accuracy Misleads

A model can post an impressive overall accuracy while failing badly on a small subgroup, because that subgroup contributes few rows to the average. Imagine 95 percent accuracy overall but only 60 percent for a group that is 10 percent of the data; the aggregate barely moves while real people in that group face wrong decisions. Responsible evaluation disaggregates metrics by subgroup and compares error types, especially false-positive and false-negative rates, since the same overall score can hide opposite harms across populations.

Worked examples

Explain how a recidivism tool can be unfair even with equal overall accuracy across two groups.

Suppose two groups each have the same overall accuracy, so a naive audit sees no problem.
Disaggregate the errors into false positives (flagging non-reoffenders) and false negatives (missing reoffenders).
If one group has a much higher false-positive rate, innocent members of that group are flagged as high-risk more often.
Equal accuracy can therefore mask a sharply unequal distribution of who bears the cost of the model's mistakes.

Answer: Equal overall accuracy can hide unequal error types; a higher false-positive rate for one group penalizes its innocent members disproportionately.

Hey — I'm Byte, and I need to walk you through something that matters a lot right now. Machine learning works like this: you feed an algorithm a large dataset of examples, it finds statistical patterns, and then it generalizes those patterns to new inputs. Sounds neutral, right? But here is the catch — the algorithm can only learn what is in the data. If the data does not represent the real world fairly, the algorithm's decisions will not be fair either. Imagine training a hiring algorithm on ten years of a company's past hiring decisions. If that company historically hired far more men than women in technical roles, the algorithm learns that 'technical role = male candidate.' It has not been programmed to discriminate — it has been trained on discrimination that already happened. That is called historical bias, and it gets baked right into the model. Data can be skewed in several ways. Underrepresentation means some groups appear far less often than others in the training set. Labeling bias means the humans who labeled the data made biased judgments (for example, marking the same resume as 'strong' or 'weak' depending on the applicant's name). Proxy variables are features that seem neutral but actually encode group membership — like zip code, which can closely track race due to historical housing segregation. Once a biased model is deployed, the harm compounds. A 2016 ProPublica investigation examined a recidivism-prediction tool used by courts to estimate the likelihood that someone would reoffend. Researchers found that, after controlling for prior offenses, age, and charge type, the tool flagged defendants who did not go on to reoffend as high-risk at roughly twice the rate for Black defendants as for white defendants — a disparity in false positive rates that penalizes the innocent more severely by race. The tool was not simply 'more accurate' for one group; it was systematically wrong in a way that fell hardest on Black defendants. Critical evaluation is the defense. Before trusting any algorithmic output, ask: Who collected the training data, and from where? Are all affected groups well represented? What was the label source? Were humans making those labels also subject to bias? Was the model audited on subgroups, or only on aggregate accuracy? Aggregate accuracy can look great while a subgroup is being harmed — that gap is called differential performance. Here is the key insight: an algorithm is not objective just because it is mathematical. It inherits the biases embedded in its training data. Recognizing that is the first move toward building and using AI responsibly.

Activity

Sort each scenario card into the correct bias source: Data Collection, Labeling, or Proxy Variable.

Practice

A loan model excludes race yet still disadvantages one group; name the likely mechanism and explain it.

List three audit questions you would ask before trusting any deployed classification model.

Common mistakes to avoid

Math makes algorithms objectiveAlgorithms inherit whatever bias is present in their training data, so mathematical form does not guarantee fairness.
Removing protected attributes ends biasProxy variables can still encode the excluded attribute, allowing discriminatory patterns to persist indirectly.

Check your understanding

A hiring algorithm trained on a company's past decisions consistently ranks male applicants higher for engineering roles, even when qualifications are equal. What is the most accurate explanation for this outcome?

A predictive-policing algorithm achieves 92% overall accuracy on its test set. A civil-rights researcher argues this score is insufficient proof of fairness. Which concern best supports the researcher's position?

Which of the following best describes a 'proxy variable' in the context of algorithmic bias?

Recap

Machine-learning models learn statistical patterns from data and reproduce any bias that data contains. Bias enters through skewed collection, biased labels, and proxy variables, and aggregate accuracy can hide subgroup harm, so fairness requires disaggregated auditing.

Reflect

What real decision in your community might be shaped by an algorithm trained on biased historical data?