Issues¶

Environmental Issues¶

Sources of carbon footprint

train models
data collection
inference
creating new chips

Types¶

Embodied carbon
Operational carbon

Not just the energy cost of running computers, but also energy required to build them

Untitled

Solutions¶

Efficiency of models
Software infra to promote development of efficiency
AutoML instead of grid search
Easy-to-use APIs for efficient models
Efficient computer chips
Efficient datacenters
Microsoft: underwater servers
- 8 x more reliable, using Nitrogen, etc
- No need for fresh water for cooling
Use renewable energy
Transitioning to 100% renewables will not eliminate the carbon footprint of chips
Stop planned obscelence

4 M’s of AI Efficiency¶

Model
Machine
Mechanism
Map: Location

Jevon’s Paradox/Effect¶

Improved efficiency in resource utilization increases the total consumption of resources, due to increased rate of consumption from increased demand

Hence, definition of “efficiency” is important

PUE¶

Power Usage Effectiveness

Lower is better

Typically $\in [1.1, 1.6]$

How the power input is being used for compute & for other supporting consumption such as cooling, lighting, etc. $$ \begin{aligned} \text{PUE} &= \dfrac{\text{Total Energy}}{\text{Productive Energy}} \ &= 1 + \dfrac{\text{Overhead}}{\text{Productive Energy}} \end{aligned} $$

Ethical Issues¶

New technologies are disruptive, not neutral

AI can - change current practices - influence human decisions - regulate human behavior

Responsible AI¶

Researchers and engineers should
- be proactive - not reactive - about ethics
- anticipate how their work will impact society
- incorporate human values throughout all stages of product's lifecycle
Increased public trust means
- Improved marketability
- Higher product adoption

Abstraction¶

Researchers/Engineers often abstracted away from application they’re working on, hence not aware of bad things that can be used for

Data Ownership¶

Data governance

Trade-off: Privacy vs Progress

Model monetization trained on consumer data, eg: GitHub Copilot

Data Bias¶

Dataset curation

Federated learning

Research Inequality¶

Inequitable access to computing resources
Inequitable access to datasets

Bias¶

Solution

Solution	Meaning	Pro	Con
Group unawareness	Sensitive attributes not included as features	Avoids disparate treatment	Possibility of highly correlated features that are proxies of the sensitive attribute Incompatible with group threshold
Group threshold	Counteract historical biases in data by adjusting confidence thresholds independently for each group		Incompatible with group unawareness
Demographic Parity	TP + FP equal for all groups		Introduces false negatives
Equal opportunity	TP + FN equal for all groups		Introduces false positives Incompatible with equal accuracy
Equal accuracy	TP + TN equal for all groups		Higher FN for one group Higher FP for one group Incompatible with equal opportunity

Last Updated: 2024-12-26 ; Contributors: AhmedThahir, web-flow