The Anthropic Developer Platform – From First Call to Production By the end of this lesson, students should be able to: The model tier framework. Claude models are grouped into capability tiers (verify current models and naming at docs.anthropic.com/en/docs/about-claude/models): Opus: Highest capability. Best for complex reasoning, nuanced writing, multi-step analysis, and tasks where quality is the primary constraint. Highest cost per token. Sonnet: Balanced capability and cost. Suitable for most production applications – strong performance across a wide range of tasks without Opus pricing. The common default for production integrations. Haiku: Fastest and cheapest. Strong for classification, extraction, simple summarization, routing, and tasks with well-defined low-complexity requirements. Poor choice for complex reasoning or nuanced generation. The model selection decision framework. For any application, answer four questions: Cost estimation. Token pricing is published at anthropic.com/pricing (always verify current pricing – it changes). General approach: For most applications, Sonnet is the default starting point. Drop to Haiku for high-volume, low-complexity tasks. Move to Opus for tasks where quality failures have real cost. When to change models. Signals to upgrade the model: Signals to downgrade: A developer builds a customer support ticket classification system (routes tickets to the correct team). She starts with Sonnet. At 50,000 tickets/day, the cost projection is higher than planned. She evaluates Haiku: the task is simple classification (8 categories, short input), not complex reasoning. She tests Haiku on 500 historical tickets and measures accuracy: 97% vs. 98.5% for Sonnet. For a routing task, 97% accuracy is acceptable. She switches to Haiku and reduces API cost by 75% without meaningful quality impact. Model selection matched to task complexity. Model selection has latency and availability implications beyond cost. Verify current model availability, context window limits, and rate limits for your target model before committing to it in production architecture. Context window differences between models can affect whether your application works at all if prompt size approaches the limit. Log in and enroll to access lesson quizzes.
Lesson 3: Choosing the Right Model for Your Use Case
Lesson Objectives
Lesson Content
Practical Example
Safety Notes