Why “Responsible Use” Matters for Beginners
AI systems can feel confident, fast, and objective. But they can also be wrong, unfair, invasive, or unsafe—often in ways that are hard to notice at first. Responsible use means understanding where AI can fail, what risks it can create for people, and what practical steps you can take to reduce harm. In this chapter, we focus on three major areas that show up in real projects: bias (fairness), privacy (data protection), and reliability (how dependable outputs are).
Limits and Risks: A Simple Map
Many AI risks can be grouped into a few buckets:
Bias and unfair outcomes: Some groups may be treated worse than others due to skewed data, flawed assumptions, or the way a system is deployed.
Privacy and data leakage: Personal or sensitive information can be collected, inferred, stored, or exposed in ways users did not expect.
Reliability problems: Outputs may be incorrect, inconsistent, or overly confident; systems may fail in edge cases or under changing conditions.
Misuse and overreach: A tool built for one purpose may be used for another (for example, using a writing assistant as a medical advisor).
Security threats: Attackers can manipulate inputs, steal data, or exploit system behavior.
Responsible use is not only a technical issue. It includes product decisions (what you allow the system to do), communication (how you describe limitations), and process (how you test and monitor).
Bias: What It Is and Why It Happens
What “Bias” Means in AI
In everyday language, bias often means prejudice. In AI, bias usually refers to systematic differences in outcomes across groups or situations. A biased system might consistently approve fewer loans for one demographic group, misidentify certain faces more often, or rank some applicants lower for reasons unrelated to job performance.
Bias is not always intentional. It often emerges from the combination of data, design choices, and real-world context.
Common Sources of Bias
Skewed or incomplete data: If one group is underrepresented, the system may perform worse for that group. Example: a customer support chatbot trained mostly on messages from one region may misunderstand slang or cultural references from another.
Historical patterns baked into data: If past decisions were unfair, the data can reflect that unfairness. Example: if a company historically promoted fewer women into leadership roles, a model trained on “who got promoted” may learn that pattern.
Proxy features: Even if you remove sensitive attributes (like race), other variables (like ZIP code) can act as proxies and recreate similar effects.
Measurement bias: The thing you measure may not match what you truly care about. Example: using “arrest records” as a measure of “crime” can reflect policing patterns rather than actual crime rates.
Deployment mismatch: A system may be trained in one environment and used in another. Example: a hiring screening tool trained on office roles may be used for field roles, creating unfair filtering.
Fairness Is Not One Single Number
Fairness can mean different things depending on the context. For example:
Equal opportunity: qualified people across groups should have similar chances of being correctly selected.
Equal error rates: mistakes should not disproportionately affect one group.
Consistent treatment: similar cases should get similar outcomes.
These goals can conflict. Improving one fairness metric can worsen another. Responsible use requires choosing a fairness goal that matches the real-world impact and documenting that choice.
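To make the tension concrete, here is a minimal sketch in Python with a small made-up set of decisions (the groups, values, and layout are illustrative, not real data). It computes two of the views above for the same system: the overall selection rate and the rate at which qualified people are selected.

```python
# A small made-up set of decisions: (group, qualified, selected).
# All values are illustrative.
records = [
    ("A", True, True), ("A", True, True), ("A", False, False), ("A", False, False),
    ("B", True, True), ("B", True, True), ("B", True, False), ("B", True, False),
]

def rates(group):
    rows = [r for r in records if r[0] == group]
    qualified = [r for r in rows if r[1]]
    selection_rate = sum(r[2] for r in rows) / len(rows)                # consistent treatment
    qualified_selected = sum(r[2] for r in qualified) / len(qualified)  # equal opportunity
    return selection_rate, qualified_selected

for g in ("A", "B"):
    sel, opp = rates(g)
    print(f"group {g}: selection rate {sel:.2f}, qualified-and-selected rate {opp:.2f}")

# Both groups are selected at the same overall rate (0.50), yet qualified
# people in group B are selected only half as often as in group A.
```

On this toy data the groups look identical on one metric and clearly different on another, which is why the fairness goal has to be chosen and documented rather than assumed.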
Practical Step-by-Step: Bias Checks You Can Do Without Being a Data Scientist
Even as a beginner or non-technical stakeholder, you can run meaningful bias checks by asking structured questions and requesting simple breakdowns.
Step 1: Define who could be harmed. List groups affected by the system (customers, employees, applicants, students). Consider protected characteristics (gender, age, disability) and also context groups (language, region, device type).
Step 2: Define the decision and the stakes. Is the AI deciding something important (loan approval) or low-stakes (music recommendations)? Higher stakes require stricter controls.
Step 3: Ask for performance “slices.” Request that accuracy or error rates be reported separately for different groups. If the vendor or team cannot provide this, treat it as a risk signal.
Step 4: Look for “disparate impact” patterns. Compare outcomes: who is accepted/rejected, flagged/not flagged, prioritized/deprioritized. Large differences deserve investigation; a short sketch after these steps shows one way to run the checks in Steps 3 and 4 on a table of past decisions.
Step 5: Test representative scenarios. Create a small set of realistic examples across groups. For a résumé screener, include equivalent résumés with different names or schools; for a chatbot, include different dialects or accessibility needs.
Step 6: Decide mitigations. Options include changing the data, changing the decision threshold, adding human review, limiting use cases, or providing an appeal process.
Step 7: Document and monitor. Bias can reappear as user behavior changes. Track complaints, outcomes, and periodic audits.
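If you can get a table of past decisions, the checks in Steps 3 and 4 can be approximated in a few lines. The sketch below assumes a CSV named decisions.csv with columns called group, predicted (1 for a positive decision) and actual (the true outcome); the file and column names are placeholders to adapt to your own data.

```python
# Minimal sketch, assuming decisions.csv has columns "group", "predicted"
# (1 for a positive decision, 0 otherwise) and "actual" (the true outcome).
import pandas as pd

df = pd.read_csv("decisions.csv")

# Step 3: performance "slices": accuracy reported separately per group.
accuracy_by_group = (
    df.assign(correct=df["predicted"] == df["actual"])
      .groupby("group")["correct"]
      .mean()
)
print("Accuracy by group:")
print(accuracy_by_group)

# Step 4: a simple disparate-impact check: compare positive-decision rates.
selection_rates = df.groupby("group")["predicted"].mean()
print("Positive-decision rate by group:")
print(selection_rates)
print(f"Min/max ratio: {selection_rates.min() / selection_rates.max():.2f} "
      "(large gaps deserve investigation)")
```

Numbers like these do not prove or disprove bias on their own, but they tell you where to look and what to ask the vendor or team about.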
Example: A Hiring Assistant That Ranks Candidates
Imagine an AI tool that ranks candidates from 1 to 100. Risks include:
The tool may learn to prefer candidates from historically favored schools (proxy for socioeconomic status).
It may penalize career breaks (which can disproportionately affect caregivers).
It may overvalue keywords that correlate with a particular demographic.
Responsible use actions could include: limiting the tool to summarizing résumés rather than ranking; requiring human review for all rejections; auditing rankings by demographic slices; and providing candidates a way to request reconsideration.
Privacy: What Can Go Wrong
Privacy Risks Are Not Only About “Stolen Data”
Privacy problems can happen even when nobody “hacks” anything. Common issues include collecting too much data, using data for unexpected purposes, or exposing sensitive information through outputs.
Key Privacy Concepts (Practical Definitions)
Personal data: information that identifies a person directly (name, email) or indirectly (device ID, location patterns).
Sensitive data: health, financial details, biometrics, precise location, private communications, or data about children.
Consent and purpose limitation: people should know what data is collected and why; data should not be reused for unrelated purposes without a valid basis.
Data minimization: collect only what you need, keep it only as long as necessary.
Access control: only authorized people/systems should access sensitive data.
How AI Can Leak or Infer Sensitive Information
Prompt and chat logs: users may paste private information into an AI assistant (contracts, medical notes). If logs are stored or reviewed, privacy risk increases.
Outputs that reveal secrets: an AI might repeat sensitive text that was included in the conversation or internal documents.
Inference: even if you don’t collect a sensitive attribute, AI can sometimes infer it from patterns (for example, inferring health status from purchase behavior).
Third-party sharing: using an external AI service may send data outside your organization, creating compliance and trust risks.
Practical Step-by-Step: Privacy-Safe Use of AI Tools at Work
Step 1: Classify the data you plan to use. Label it as public, internal, confidential, or highly sensitive. If you cannot confidently classify it, treat it as sensitive.
Step 2: Decide what must never be entered. Create a “do not paste” list: passwords, API keys, customer PII, medical details, unreleased financials, private HR notes.
Step 3: Use redaction and summarization. Replace names with roles (Customer A), remove account numbers, and summarize rather than copy full documents. A small redaction sketch follows these steps.
Step 4: Check tool settings and contracts. Confirm whether inputs are stored, used for training, or shared with subcontractors. Prefer options that disable training on your data and provide retention controls.
Step 5: Limit access. If a team uses an AI tool with sensitive data, restrict who can use it, require authentication, and log usage.
Step 6: Create an incident plan. Decide what to do if someone accidentally pastes sensitive data: who to notify, how to remove it, how to document the event.
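For Step 3, a simple redaction pass can be scripted before any text is pasted into an AI tool. The sketch below is only an illustration: the regular expressions, names, and example text are assumptions, not a complete PII detector, and a real deployment needs tooling and review suited to your own data.

```python
import re

# Illustrative patterns only; they will miss many identifiers and are not
# a substitute for a proper PII-detection tool.
PATTERNS = [
    (re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"), "[EMAIL]"),
    (re.compile(r"\b(?:\d[ -]?){13,16}\b"), "[CARD_NUMBER]"),
    (re.compile(r"\b\d{5,}\b"), "[NUMBER]"),
]

def redact(text: str, names: dict[str, str]) -> str:
    """Replace known names with roles, then strip common identifier patterns."""
    for name, role in names.items():
        text = text.replace(name, role)
    for pattern, placeholder in PATTERNS:
        text = pattern.sub(placeholder, text)
    return text

raw = ("John Smith (john.smith@example.com, card 4111 1111 1111 1111, "
       "order 99815) wants an update on the delayed shipment.")
print(redact(raw, {"John Smith": "Customer A"}))
# Customer A ([EMAIL], card [CARD_NUMBER], order [NUMBER]) wants an update
# on the delayed shipment.
```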
Example: Using an AI Assistant to Draft Customer Emails
A support agent wants help writing a response. Risky approach: pasting the full customer record including address, order history, and payment details. Safer approach: provide only what is needed to write the email.
Bad input: “Customer John Smith, 14 Pine St, card ending 1234, order #9981... write an apology.”
Better input: “Write an apology email for a delayed shipment. The customer is upset and wants an updated delivery estimate. Keep it polite and concise.”
The safer prompt still achieves the goal while reducing exposure of personal data.
Reliability: When AI Sounds Right but Isn’t
Reliability vs. Correctness vs. Usefulness
A reliable AI system behaves consistently and predictably within its intended scope. It may still be wrong sometimes, but it should fail in understandable ways, and it should not pretend to know things it doesn’t. Reliability includes:
Accuracy: how often outputs are correct.
Consistency: similar inputs produce similar outputs.
Robustness: small changes (typos, formatting) don’t cause big failures.
Calibration: confidence matches reality (not overconfident when wrong).
Scope control: the system stays within what it is designed to do.
Common Reliability Failure Modes
Hallucinations (made-up details): a system may generate plausible but false statements, citations, or steps.
Out-of-date or incomplete knowledge: the system may not know recent changes, policies, or product details.
Ambiguity handling: when a prompt is unclear, the system may guess instead of asking clarifying questions.
Edge cases: rare situations (unusual inputs, uncommon languages, atypical customer requests) can cause disproportionate errors.
Automation bias: humans may trust the AI too much, especially when it sounds confident.
Practical Step-by-Step: Making AI Outputs More Reliable in Daily Use
Step 1: State the task and constraints clearly. Include audience, format, and what the AI must not do. Example: “Do not invent numbers; if missing, ask questions.” A prompt sketch after these steps combines Steps 1 to 4.
Step 2: Provide trusted reference material when possible. Paste a short policy excerpt or product spec (redacted if needed) and instruct the AI to use only that source.
Step 3: Ask for uncertainty and checks. Request: “List assumptions,” “What could be wrong?” or “What information is needed to be sure?”
Step 4: Use verification steps. For factual claims, require citations to your provided text or ask the AI to quote the exact line it used. For calculations, ask it to show the math.
Step 5: Use a second-pass review. Have the AI critique its own answer against a checklist (tone, policy compliance, missing info). Then do a human review for high-stakes content.
Step 6: Limit autonomy for high-stakes actions. Don’t let AI send emails, approve refunds, or change records without human confirmation unless you have strong controls and monitoring.
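One way to apply Steps 1 through 4 consistently is to fold them into a reusable prompt template. The sketch below only builds the prompt text, because how you send it depends on the tool you use; the task, audience, and reference text are made up for illustration.

```python
def build_prompt(task: str, reference_text: str) -> str:
    """Combine the task, constraints, and trusted reference into one prompt."""
    return "\n".join([
        f"Task: {task}",
        "Audience: customer support agents. Format: short bullet points.",
        "Constraints:",
        "- Use only the reference text below; do not invent numbers or policies.",
        "- If key information is missing, ask clarifying questions instead of guessing.",
        "- For every factual claim, quote the exact line of the reference you used.",
        "- End with your assumptions and what could be wrong.",
        "Reference text (redacted):",
        reference_text,
    ])

print(build_prompt(
    "Summarize the refund policy for a delayed shipment.",
    "Refunds for shipments delayed more than 10 business days are issued within 5 days.",
))
```

Keeping the constraints in a shared template also makes review easier, because everyone on the team sends requests with the same guardrails.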
Example: A Policy Q&A Bot for Employees
An internal bot answers questions like “How many vacation days do I have?” Reliability risks include outdated policy, misinterpretation, or answering for the wrong country. Responsible design choices, illustrated in the sketch after this list, include:
Require the bot to ask clarifying questions (country, employment type) before answering.
Restrict answers to an approved policy document repository.
Show the policy excerpt used to generate the answer.
Provide a “contact HR” escalation link for uncertain cases.
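The sketch below shows how these choices can fit together. The in-memory “approved policy repository”, country codes, and wording are assumptions for illustration; a real bot would retrieve from a maintained document store, but the shape is the same: clarify first, answer only from an approved excerpt, and escalate otherwise.

```python
from typing import Optional

# Hypothetical approved excerpts keyed by (country, employment type).
# A real bot would query a maintained policy repository instead.
APPROVED_POLICIES = {
    ("DE", "full-time"): "Full-time employees in Germany receive 30 vacation days per year.",
    ("US", "full-time"): "Full-time employees in the US receive 15 vacation days per year.",
}

def answer_vacation_question(country: Optional[str], employment_type: Optional[str]) -> str:
    # Ask clarifying questions before answering.
    if country is None or employment_type is None:
        return "Before I answer: which country do you work in, and what is your employment type?"
    excerpt = APPROVED_POLICIES.get((country, employment_type))
    # Escalate instead of guessing when no approved source covers the case.
    if excerpt is None:
        return "I could not find an approved policy for your situation. Please contact HR."
    # Answer only from the approved excerpt, and show the excerpt used.
    return f'According to the approved policy: "{excerpt}"'

print(answer_vacation_question(None, None))
print(answer_vacation_question("DE", "full-time"))
print(answer_vacation_question("FR", "contractor"))
```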
Human Oversight: When to Keep a Person in the Loop
Human oversight is not just “someone glances at it.” It means designing a workflow where humans can catch errors and where the system’s role is appropriate for the stakes.
Good Candidates for Human-in-the-Loop
High-impact decisions: hiring, firing, credit, insurance, medical advice, legal decisions.
Safety-critical contexts: equipment operation, security monitoring, emergency response.
Vulnerable users: children, patients, people in crisis.
Low tolerance for mistakes: financial reporting, compliance, public statements.
Practical Step-by-Step: Designing Oversight That Works
Step 1: Decide the AI’s role. Assistant (drafting), advisor (suggesting), or decision-maker (automatic). Prefer assistant/advisor roles for higher-stakes areas.
Step 2: Define review checkpoints. Identify where a human must approve: before sending, before publishing, before rejecting an application. A simple approval-gate sketch follows these steps.
Step 3: Provide reasons and evidence. If the AI recommends an action, require it to present supporting factors and highlight uncertainty.
Step 4: Create an appeal or correction path. Users should be able to challenge outcomes and correct data errors.
Step 5: Track outcomes. Monitor error rates, complaints, and near-misses. Update prompts, policies, or system rules based on what you learn.
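A review checkpoint from Step 2 can be as simple as refusing to act until a named human has approved. The sketch below is a minimal illustration under that assumption; the fields, the “send” action, and the example content are made up rather than taken from any specific product.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Draft:
    content: str
    evidence: list            # supporting factors the AI must provide (Step 3)
    uncertainty: str          # where the AI is unsure
    approved_by: Optional[str] = None

def approve(draft: Draft, reviewer: str) -> Draft:
    draft.approved_by = reviewer
    return draft

def send(draft: Draft) -> None:
    # The checkpoint: refuse to act without explicit human approval.
    if draft.approved_by is None:
        raise RuntimeError("Draft has not been approved by a human reviewer.")
    print(f"Sending (approved by {draft.approved_by}):\n{draft.content}")

draft = Draft(
    content="We recommend declining this refund request.",
    evidence=["Order is outside the 30-day window", "No prior exceptions on this account"],
    uncertainty="The delivery date in the record may be wrong.",
)
send(approve(draft, reviewer="j.doe"))
```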
Responsible Prompting and Output Handling
Prompting Practices That Reduce Harm
Ask for neutral language: “Avoid stereotypes; use respectful, inclusive wording.”
Request multiple perspectives: “List potential risks for different user groups.”
Force clarification: “If the request is missing key details, ask up to 3 questions before answering.”
Prevent fabrication: “If you don’t know, say so. Do not invent citations or policies.”
Output Handling Rules for Teams
Responsible use includes what happens after the AI responds.
Label AI-assisted content: internally mark drafts as AI-assisted so reviewers know to verify.
Separate drafting from approval: the person who prompts is not always the best person to approve.
Keep an audit trail: store prompts and outputs for high-stakes workflows (with privacy controls) so you can investigate issues; a minimal logging sketch follows this list.
Don’t store sensitive outputs unnecessarily: minimize retention and restrict access.
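An audit trail can start small. The sketch below appends each prompt/output pair as one JSON line, together with labels a reviewer can filter on; the file name and fields are assumptions for illustration, and redaction, retention limits, and access controls still have to be handled separately.

```python
import json
from datetime import datetime, timezone

def log_interaction(path: str, user: str, prompt: str, output: str, labels: list) -> None:
    """Append one audit record per AI interaction as a JSON line."""
    record = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "user": user,
        "prompt": prompt,      # redact sensitive details before logging
        "output": output,
        "labels": labels,      # e.g. ["ai-assisted", "needs-human-review"]
    }
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(record) + "\n")

log_interaction(
    "audit_log.jsonl",
    user="agent-042",
    prompt="Draft an apology email for a delayed shipment.",
    output="Dear customer, ...",
    labels=["ai-assisted"],
)
```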
Red Flags: When Not to Use AI (or When to Pause)
You cannot explain what data is used or where it goes.
The system affects rights or opportunities, but you have no fairness testing.
Users believe the AI is a qualified professional (doctor, lawyer) when it is not.
The tool frequently produces confident errors and there is no verification step.
There is no way to correct mistakes, appeal decisions, or contact a human.
Security is unclear: no access controls, no logging, no incident plan.
A Practical Responsible-Use Checklist (Printable)
Use this checklist before deploying or widely adopting an AI tool:
Purpose: What is the tool for, and what is it explicitly not for?
Stakes: What harm could occur if it is wrong?
Bias: Have we checked outcomes across relevant groups and scenarios?
Privacy: Are we minimizing data, redacting sensitive info, and controlling retention?
Reliability: Do we have verification steps, scope limits, and escalation paths?
Oversight: Where does a human approve, and how are decisions audited?
Communication: Do users understand limitations and how to report issues?