1.3.2. Understanding Model Sources (Pre-trained vs. Custom)
First Principle: The source of an ML model represents a fundamental strategic choice between leveraging existing general intelligence (pre-trained) and building specialized intelligence from scratch (custom).
Not all models are built by your team. You can source them in different ways.
- Pre-trained Models: These are models trained by a third party (like AWS or a research institution) on a massive, general-purpose dataset.
- Source: AWS AI Services (e.g., Rekognition, Comprehend), open-source model hubs (e.g., Hugging Face).
- When to use: For common, well-defined problems like standard object recognition, sentiment analysis, or language translation.
- Advantage: Fast to implement, no ML expertise required, leverages state-of-the-art architectures.
- Custom Models: These are models you train yourself, from scratch, using your own specific dataset and business logic.
- Source: Trained by your team using a platform like Amazon SageMaker.
- When to use: For specialized problems unique to your business where pre-trained models don't apply (e.g., predicting churn for your specific customers, forecasting demand for your unique products).
- Advantage: Highly tailored to your specific problem and data, can provide a competitive advantage.
Scenario: A project requires two AI features: one to detect generic objects (cars, trees) in photos, and another to predict the likelihood of failure for your company's proprietary manufacturing equipment.
Reflection Question: Which model source (pre-trained or custom) would you choose for each feature, and why does this choice represent the most efficient use of resources?
š” Tip: The decision rule is simple: If a good pre-trained model exists for your task, use it. Only invest in a custom model when your problem is unique and you have the necessary data and skills.