Machine Learning & Artificial Intelligence
I. Introduction: Understanding AI and ML
Dr Sudheendra S G provides a comprehensive overview of
Machine Learning (ML) and Artificial Intelligence (AI), distinguishing between
the two concepts and exploring key techniques, challenges, and ethical
considerations. The core idea is that "ML is software that learns patterns
from data and uses them to make predictions or decisions."
Key Distinction:
- AI
(Artificial Intelligence): The broader "goal" or
"ambition" – systems that perform tasks we associate with
intelligence. AI encompasses a wide range of approaches, including but not
limited to ML.
- ML
(Machine Learning): A specific "set of techniques" or
"toolbox" within AI. ML involves algorithms that "learn
from data."
II. Families of Machine Learning
Machine Learning is broadly categorized into three main
families:
- Supervised
Learning:
- Concept:
Algorithms learn from "labeled examples" to predict a
"label" or target output.
- Scenario
Examples: Spam filters (predicting "spam" or "not
spam" from subject lines), forecasting house prices, or classifying
moth species based on features like wingspan and mass.
- Core
Idea: Given input-output pairs, the model learns a mapping function.
- Unsupervised
Learning:
- Concept:
Algorithms find structure or patterns in data "without labels."
- Scenario
Example: Grouping news articles into categories based on their
content, without prior knowledge of the categories.
- Core
Idea: Discovering hidden relationships or clusters in data.
- Reinforcement
Learning (RL):
- Concept:
An agent learns by "trial, reward, and punishment" through
interaction with an environment. It aims to develop a "policy"
to maximize cumulative reward.
- Scenario
Examples: Game-playing agents (like AlphaGo), robotics, or navigating
a "gridworld" to reach a goal with rewards for good moves and
penalties for bad ones.
- Core
Idea: Learning optimal actions through feedback from an environment.
III. Core Concepts and Techniques in Supervised Learning
A practical supervised learning scenario involves building a
"moth classifier" to predict species from features like wingspan and
mass. This process introduces several fundamental concepts:
- Features
(Inputs): The measurable properties or attributes of the data used for
prediction (e.g., wingspan in mm, mass in g).
- Label
(Target): The output or outcome that the model is trying to predict
(e.g., moth species: Emperor or Luna).
- Decision
Boundary: A line or plane that separates different classes in a
dataset. Simple models might use straight lines, while complex models can
create more intricate boundaries.
- Training
vs. Testing:Training Data: The portion of the dataset used to teach
the model and identify patterns.
- Test
Data: A separate, "held-out" portion of the dataset used to
evaluate the model's performance on unseen data. This is crucial for
assessing generalization.
- Generalization:
A model's ability to perform well on new, unseen data, not just the data
it was trained on.
- Overfitting:
Occurs when a model learns the training data too well, capturing noise and
specific details rather than underlying patterns. This results in
excellent performance on training data but poor performance on test data.
An "overfit" boundary is "a zig-zag boundary that hugs
every point."
- Underfitting:
Occurs when a model is too simple to capture the underlying patterns in
the data, leading to poor performance on both training and test data. An
"underfit" boundary is "one crude line misclassifies both
clusters."
- Confusion
Matrix: A table used to evaluate the performance of a classification
model. It breaks down predictions into:
- True
Positive (TP): Correctly predicted positive class.
- True
Negative (TN): Correctly predicted negative class.
- False
Positive (FP): Incorrectly predicted positive class (Type I error).
- False
Negative (FN): Incorrectly predicted negative class (Type II error).
- Metrics
from Confusion Matrix:Accuracy: The proportion of correctly classified
instances (TP + TN) / Total. "Accuracy is not enough" when
classes are imbalanced.
- Precision:
Of all instances predicted as positive, how many were actually positive
(TP / (TP + FP)).
- Recall:
Of all actual positive instances, how many were correctly identified (TP /
(TP + FN)).
IV. Algorithmic Approaches
Several algorithms are used to build ML models:
- Decision
Trees & Random Forests:
- Decision
Tree: A series of "IF-THEN rules" that split data based on
feature values to make a prediction.
- Random
Forest: An ensemble method where "many trees vote" to make a
prediction, leading to a "more robust, less overfitting" model.
- Support
Vector Machines (SVM):
- Concept:
SVMs find "the widest margin line/plane that separates classes"
in the data, creating the "best 'buffer zone'" between different
categories.
- Intuition:
Imagine an "elastic band stretched between two pushpin
clusters—widest gap."
- Neural
Networks:
- Concept:
Composed of "layers of simple units (neurons)" that
"combine features with weights, add bias, apply an activation."
- Architecture:
Typically include an input layer, one or more hidden layers (making them
"Deep" if many), and an output layer.
- Components:Weights:
Determine the strength of connections between neurons.
- Bias:
An additional input to a neuron that shifts the activation function.
- Activation
Function: Introduces non-linearity, allowing the network to learn
complex patterns.
- Applications:
"Great for images, speech, language."
V. Ethical Considerations and Challenges
As ML models learn patterns from data, they inevitably
reflect and can amplify societal issues. "Models learn patterns in
data—including biases. Fairness and privacy are design requirements, not
afterthoughts."
Key Dangers:
- Biased
Data → Biased Decisions: If the training data contains historical or
systemic biases, the model will learn and perpetuate these biases, leading
to unfair or discriminatory outcomes. "Data encodes history,
including inequities."
- Privacy
Leaks: ML models, especially those trained on sensitive data, can
inadvertently reveal private information.
- Misuse:
AI/ML technologies can be intentionally misused for harmful purposes.
Mitigation Strategies:
- Data
Level:Balance samples to ensure diverse representation.
- Audit
datasets for biases and document their characteristics.
- Modeling
Level:Measure "per-group metrics" to assess fairness across
different demographic groups.
- Calibrate
"thresholds" to balance precision and recall for different
groups.
- Deployment
Level:Implement "human-in-the-loop" systems for critical
decisions.
- Establish
"monitoring" systems to detect performance degradation or bias
in real-world use.
- Provide
an "appeals process" for individuals affected by automated
decisions.
Guiding Question: When designing and evaluating ML
systems, always ask: "Right for whom? Right compared to what
baseline?"
VI. Misconceptions and Best Practices
- AI
≠ Human-like intelligence: "Most deployed systems are narrow
(great at one task)."
- "More
complex model = always better" is false: Can "overfit and
hurt generalization."
- "Accuracy
is enough" is false: Not when classes are imbalanced; consider
precision/recall.
- "Data
is objective" is false: "Data encodes history, including
inequities; plan for audits."
- Algorithm
Choice: When asked "Which algorithm is best?" the answer is:
"It depends—try a few, compare on held-out data, and mind the
problem’s costs."
VII. Conclusion
"AI is the ambition; ML is the toolbox; data is the
fuel; and evaluation & ethics keep us on the road." A robust
understanding of ML requires not only technical proficiency but also a critical
awareness of its limitations, potential for bias, and the ethical
responsibilities involved in its development and deployment. Always prioritize
separating training from testing, and acknowledge that no model is perfect,
especially with ambiguous data.
No comments:
Post a Comment