AWS-AIF-C01 & AWS CERTIFICATION | 💡 First Principle: Core Terminology (Model, Algorithm, Inferencing, Bias, Fairness)

1.1.3. 💡 First Principle: Core Terminology (Model, Algorithm, Inferencing, Bias, Fairness)

First Principle: A shared, precise vocabulary is essential for effective communication and understanding in AI/ML. Key terms like model, algorithm, inference, bias, and fairness represent fundamental building blocks of any AI system.

Mastering these core terms is crucial for discussing AI/ML concepts accurately.

Algorithm: The procedure or set of rules that a machine learning system uses to learn from data. It's the "how-to" guide for creating a model.
- Example: The decision tree algorithm is a set of rules for splitting data based on its features.
Model: The output of the training process. The algorithm is run on a dataset, and the artifact it produces—which contains the learned patterns—is the model. The model is what you use to make predictions.
- Example: A specific decision tree that has been trained on your customer data to predict churn.
Inferencing (or Prediction): The process of using a trained model to make a prediction on new, unseen data.
- Example: Giving the model a new customer's data and getting back a prediction of "churn" or "no churn."
Bias: A systematic error in an AI system that results in unfair outcomes, often stemming from flawed or unrepresentative data.
- Example: A hiring model that was trained on historical data reflecting past biases might unfairly favor one gender over another.
Fairness: The ethical goal of ensuring that an AI model's predictions do not create discriminatory or unjust outcomes for different demographic groups.
- Example: An insurance pricing model is fair if it provides similar rates to individuals with similar risk profiles, regardless of their race or gender.

Scenario: A project manager says, "We need to run the XGBoost model on our data." You need to gently correct their terminology to ensure clarity for the team.

Reflection Question: How would you clarify that the team will be using the "XGBoost algorithm" to train a "model" on the data, and then use that model for inferencing? Why is this precision important?

💡 Tip: Remember the sequence: You use an Algorithm to train a Model on data. You then use that Model for Inferencing. You must evaluate the model for Bias to ensure Fairness.