Alt+←/→to navigatePage7/978

Safety, Ethics & Responsible AI · Page 1 of 1

Responsible LLM Usage

30 min Beginner

Safety, Ethics & Responsible AI

Key Concerns with LLMs

1. Hallucinations (Making Up Facts)

User: "Who was the first president of Argentina?"
GPT-4: "Juan Manuel de Rosas" (WRONG - correct: Manuel Belgrano)

Why? Model is predicting likely text, not retrieving facts!

Mitigation:

Use RAG for factual questions
Fine-tune on factual data
Fact-check outputs
Acknowledge limitations

2. Bias

Training data contains human biases
- Gender bias (women underrepresented in tech)
- Race bias (historical stereotypes)
- Socioeconomic bias (favors wealthy topics)

Model learns and reproduces these biases!

Example:
Prompt: "A successful programmer is..."
Response: "A successful programmer is a smart young man..."
(Subtle bias toward male gender)

Mitigation:

Diverse training data
Bias testing before deployment
Monitoring outputs for bias
Allow user feedback

3. Misuse & Harmful Content

LLMs can be misused for:
- Generating misinformation
- Creating spam/phishing
- Automating harassment
- Synthesizing illegal content

Safeguards (implemented by OpenAI, etc.):
- Refuse harmful requests
- No sexual content involving minors
- No instructions for violence
- No impersonation

4. Privacy Concerns

If you send private data to an LLM API:
- Data goes to company servers
- Company may store/use it for training
- Data could be at risk

Examples:
- Don't send: patient medical records, financial data, trade secrets
- Do send: generic questions, public information

OpenAI Policy:
- Business API data: Not stored or used for training
- ChatGPT free tier: May be stored

5. Deepfakes & Misinformation

LLMs can generate:
- Fake news articles
- Convincing lies
- Manipulated quotes

Impact:
- Election interference
- Stock manipulation  
- Damage to reputations

Defense:
- Media literacy
- Fact-checking AI detection tools
- Digital signatures for authentic content

Responsible Deployment

Transparency

Tell users:

"This response generated by AI"
"May contain errors"
"Not a substitute for expert advice"

Accountability

Have human oversight
Allow feedback/corrections
Monitor for misuse
Document decisions

Fairness Testing

Before deployment:

Test for gender bias
Test for racial bias
Test for socioeconomic bias
Use diverse test scenarios

Data Privacy

Don't send sensitive data to APIs
Use self-hosted models for sensitive data
Comply with regulations (GDPR, etc.)
Get user consent

Regulations & Future

Emerging regulations:
- EU AI Act: Classifies AI by risk
- US Executive Order on AI
- China AI regulation

Requirements:
- Transparency (explain decisions)
- Human oversight (especially high-stakes)
- Bias auditing
- Incident reporting

Expect: More regulation incoming!

Best Practices

Transparency: Disclose AI use
Accuracy: Fact-check, especially factual claims
Fairness: Test for bias
Privacy: Don't send sensitive data
Accountability: Have oversight
Explainability: Help users understand reasoning
Safety: Consider misuse potential
Consent: Ask before using user data

main.py

OUTPUT

▶Click "Run Code" to execute…