Next Steps — Tổng hợp, cam kết hành động, và đi sâu hơn — Năng lực & Giới hạn của AI

Bạn đã vào khóa học này với một phiên bản nào đó của câu hỏi:

Bạn sẽ học được

Tổng hợp 4 thuộc tính + training fingerprints thành một working mental model duy nhất
Kết nối Capabilities & Limitations Framework với 4D Framework — hai nửa của một hệ thống
Xác định một thay đổi cụ thể bạn sẽ áp dụng trong AI practice tuần này
Biết đi đâu tiếp theo để đi sâu hơn

Cái bạn mang theo

Hai giai đoạn training → mọi hành vi truy ngược được

Mọi hành vi lạ của AI đều truy ngược về 1 trong 2 fingerprints này. Bạn đã mất khả năng để "bị surprised" theo nghĩa xấu.

Bốn thuộc tính, mỗi cái là một continuum

Mỗi cái có:

Lỗi thường là 2 thuộc tính giao thoa

Hầu hết failures thực tế = 2 properties colliding. Bạn diagnose bằng cách hỏi "Which 2 collided?", không phải "What broke?".

5 collision patterns chính:

Capability zone (bên trái): AI hoạt động tốt, cần verify ít
Limitation zone (bên phải): AI hỏng dễ, cần compensate nhiều
Product features đẩy edge ra xa
NTP × Knowledge → hallucinated specifics
Working Memory × Steerability → long-conversation drift
NTP × Steerability → letter-over-spirit
Knowledge × Steerability → confidently wrong niche
All 4 → complex agentic failures

🔮 Next Token Prediction        Câu trả lời từ đâu ra?
   [well-worn] ←──────→ [novel]
   
🌐 Knowledge                    AI thực sự biết gì?
   [frequent] ←──────→ [rare/niche/post-cutoff]
   
🧠 Working Memory                AI đang chú ý gì?
   [fits] ←──────→ [cliff]
   
🎚️ Steerability                  Bạn kiểm soát đến đâu?
   [concrete] ←──────→ [abstract]

Two halves of one system: 4D + 4 Properties

Capabilities & Limitations Framework và 4D Framework không phải 2 thứ riêng bạn phải juggle. Chúng là 2 mặt của cùng một đồng xu:

Mapping cụ thể:

Next Token Prediction → sharpens Discernment

Knowledge → sharpens Delegation

Working Memory → sharpens Description

Steerability → sharpens both Description and Delegation

Diligence stitches them all

Biết output được generated, không phải retrieved → biết loại scrutiny
Fluency và accuracy là variables độc lập → kiểm tra tách biệt
Biết ở đâu model "dày", ở đâu "mỏng" → biết khi nào giao, khi nào tự cung cấp
Know limits → know when to go elsewhere
Biết context là leverage → biết structure prompt thế nào
Biết cliff exists → biết khi nào start fresh
Biết gap words/intent → biết cách diễn đạt
Biết limitation zone → biết task nào không nên giao nguyên
Verify where fabrication concentrates (NTP)
Cite where knowledge is uneven (Knowledge)
Repeat critical context (WM)
State goals (Steerability)

┌──────────────────────────────────────────────────────────┐
│                                                          │
│  WHAT THE MACHINE DOES              WHAT YOU DO          │
│  ───────────────────                ──────────           │
│                                                          │
│  🔮 Next Token Prediction           🔍 Discernment        │
│  (fluency ≠ accuracy)               (evaluate outputs)    │
│           ↕ explains ↕                                    │
│                                                          │
│  🌐 Knowledge                        📋 Delegation        │
│  (broad, uneven, frozen)            (what to hand off)    │
│           ↕ explains ↕                                    │
│                                                          │
│  🧠 Working Memory                   ✍️ Description        │
│  (fixed window, cliff)              (how to describe      │
│           ↕ explains ↕                 + supply context)   │
│                                                          │
│  🎚️ Steerability                    (Description +        │
│  (pattern-match instructions)        Delegation)          │
│                                                          │
│                                     🛡️ Diligence          │
│                                     (the entire ethical   │
│                                      posture across all)  │
│                                                          │
└──────────────────────────────────────────────────────────┘

Calibrated Trust là một habit, không phải một attitude

Trước khi giao bất cứ gì cho AI, chạy quick internal check:

Bạn không trust AI. Bạn không distrust AI. Bạn locate the task và set habits accordingly.

┌──────────────────────────────────────────────────────────┐
│                                                          │
│  PRE-DELEGATION CHECKLIST (30 giây):                     │
│                                                          │
│  🔮 NTP:                                                 │
│    - Task ở well-worn territory hay sparse?              │
│    - Cần specifics (names, numbers, citations)?          │
│    - Nếu yes/no → cần verify ở đâu?                      │
│                                                          │
│  🌐 Knowledge:                                           │
│    - Topic mainstream hay niche?                         │
│    - Stable hay recent/changing?                         │
│    - Nếu niche/recent → bật search hay supply context?   │
│                                                          │
│  🧠 Working Memory:                                      │
│    - Context có fit comfortably trong window?            │
│    - Critical info ở edges (đầu/cuối) hay middle?        │
│    - Cần chunk hay standing context setup?               │
│                                                          │
│  🎚️ Steerability:                                        │
│    - Instructions concrete hay abstract?                 │
│    - Có goal statement kèm instruction?                  │
│    - Multi-step → cần checkpoints?                       │
│                                                          │
│  THEN ADJUST:                                            │
│    - More verification where fabrication concentrates    │
│    - More context where model can't guess                │
│    - More checkpoints where reasoning runs long          │
│                                                          │
└──────────────────────────────────────────────────────────┘

Hình dạng của framework stays useful

Models sẽ tiếp tục thay đổi. Context windows grow. Hallucination rates drop. Features close gaps. Nhưng AI sẽ tiếp tục là:

Những fact này không expire khi version number tăng. Bạn đã xây một mental model durable on purpose.

Cái sẽ thay đổi — và cái sẽ không

Một predictor có fluency chạy trước accuracy
Có uneven knowledge với cutoff
Làm việc trong finite window
Theo instructions qua gap giữa words và intent

Will change	Won't change
Tỷ lệ hallucination	NTP mechanism → fabrication possible
Knowledge cutoff dates	Knowledge is finite
Context window sizes	Windows still bounded
Specific features available	Gap intent-vs-instruction always exists
Model names/versions	4 property dimensions remain

Một vài ghi chú về điều khóa học không nói

Để bạn có mental model hoàn chỉnh, cần biết cái gì nằm ngoài khóa học này:

1. Agentic systems

Khóa nói về AI trả lời 1 prompt at a time. Agentic AI (Claude với tool use, browser automation, code execution tự chủ) là một chiều kích riêng — collision patterns phức tạp hơn, security risks lớn hơn, cần governance framework riêng.

Reference: Anthropic's work on agentic safety, Claude Code, Claude Plays Pokemon paper.

2. Multi-modal

Khóa chủ yếu nói text. Vision (image understanding), audio, video có properties riêng — ví dụ: hallucination trong image description có patterns khác text.

3. Fine-tuning custom models

Bạn có thể fine-tune Claude/GPT cho domain riêng. Điều này dịch chuyển Knowledge zone + reduce hallucination cho that domain — nhưng có tradeoffs (capabilities general có thể giảm).

4. AI safety & alignment research

Deeper questions về: tại sao model làm những điều không dự đoán được, làm thế nào ensure models behave as intended ở scale. Research areas: interpretability, RLHF refinements, Constitutional AI.

Commitment: Một thay đổi cụ thể tuần này

Đừng để khóa học này chỉ là knowledge sitting in head. Chọn một thay đổi bạn sẽ áp dụng tuần này.

4 categories of change

Category 1: Verification habit

Category 2: Standing context setup

Category 3: Checkpoint protocol

Category 4: Goal-stated prompting

Choose ONE. Write it down.

Every time AI gives specifics (names, numbers, citations) in [specific domain], verify via [specific source] before acting
Example: "Every Claude citation for client work → Google Scholar check"
Set up [Project / Memory / Custom Instructions] for [recurring task]
Example: "Claude Project 'Monthly Financial Review' with uploaded templates + brand guidelines"
For [multi-step task type], insert checkpoint after step [X]
Example: "DCF calculations → pause after revenue projection, verify assumptions before margins"
For [task type where I've been format-only], add goal statement
Example: "Email rewrites → always state 'Goal: [reader outcome]' alongside 'make shorter/better/etc'"

Task where I'll make a change: _________________________

Change I'll make this week: _____________________________

How I'll know it worked: ________________________________

Accountability: _________________________________________
(share với team? set weekly reminder?)

Áp dụng ngay

Bài tập: Your Commitment (15 phút)

Lý do: Khóa học kết thúc ở đây. Nhưng thực hành mới bắt đầu.

Bước 1: Quay lại task list từ Bài 17.0 một lần cuối.

For every task, jot nhanh cảm nhận:

Bước 2: Pick 1 task + 1 change bạn'll make this week:

Options:

Write it down:

Bước 3 (optional): Share với one person — colleague, mentor, partner. Accountability = follow-through.

Verification step
Standing-context setup
Checkpoint in multi-step process
Goal-stated (not just format) prompt habit

Task	NTP zone	Knowledge zone	WM zone	Steerability zone	Mitigations cần
1
2
3
4
5
6

Task: ______________________________________

Change: ____________________________________

By when: ___________________________________

Expected impact: ____________________________

Suy ngẫm bài học (cuối cùng)

Shift lớn nhất trong cách bạn nghĩ về AI behavior từ Bài 17.0 đến giờ?
Which of 4Ds (Delegation, Description, Discernment, Diligence) cảm thấy immediately sharpened bởi những gì bạn vừa học?
Có câu chuyện AI failure nào trước đây giờ bạn'd diagnose hoàn toàn khác?

Tóm tắt khóa học

🎯 Bạn giờ hold một working mental model: 4 thuộc tính như continuums, characteristic failures as property intersections.

🎯 Framework này và 4D Framework là 2 sides of 1 system. Properties giải thích cái 4D competencies đang respond to.

🎯 Calibrated trust nghĩa là locating your task trên mỗi continuum và matching verification + context habits với nơi nó sit.

🎯 Models sẽ tiếp tục thay đổi. Shape của properties stays useful dù exact boundaries shift.

🎯 Mental model durable là mục đích. Bạn không chỉ học AI hôm nay — bạn học cách hiểu AI ngày mai khi công nghệ advance.

Đi sâu hơn

Nếu bạn chưa học AI Fluency: Framework & Foundations

Đây là next step tự nhiên. Nó đi sâu vào 4Ds (Delegation, Description, Discernment, Diligence) — nhưng từ góc nhìn con người. Bạn đã có machine-side context cho 4Ds; khóa kia sẽ complete the picture.

Link: anthropic.com/ai-fluency

Advanced reading

Research papers (Anthropic):

Books:

Hands-on:

Keep testing edges

Properties stay stable, but where the lines sit keep moving as models improve. Every major model release: re-test your assumptions. Your Bài 17.0 task list → quarterly revisit. The habits stay; the specific thresholds may shift.

"Tracing the Thoughts of a Large Language Model" (2025) — interpretability research
"Constitutional AI" (2022) — how Claude được aligned
"Claude Plays Pokemon" (2025) — agentic AI lessons
"Prediction Machines" (Agrawal, Gans, Goldfarb) — economic lens
"The Age of AI" (Kissinger, Schmidt, Huttenlocher) — societal implications
Claude Cookbook (github.com/anthropics/anthropic-cookbook) — code patterns
Claude API docs (docs.anthropic.com) — deep technical

Feedback

Khóa học sẽ improve liên tục nếu có feedback của bạn. Những gì bạn thấy:

Anthropic luôn welcomes feedback — share tại đây (gốc tiếng Anh) hoặc comment trên bài này nếu bạn đang dùng bản Vietnamese.

Giá trị nhất?
Khó hiểu nhất?
Phần nào muốn thêm depth?
Use case của bạn mà framework chưa address?

Lời kết

Ba tuần đầu bạn gặp một colleague mới (AI) bạn không hiểu. Bạn làm việc cùng, nhưng mỗi lần xài là một gamble.

Sau khóa học này, bạn đã get to know this colleague. Không phải theo nghĩa "biết mọi ngôi sao Thuộc Cung nó"; mà theo nghĩa biết khi nào dựa vào, khi nào cẩn trọng, khi nào nhờ verify, khi nào giao cho người khác.

Calibrated trust. Đó là tên của thing bạn vừa build.

Cám ơn vì đã dành 6-8 giờ cho khóa học này. Hy vọng cái bạn mang theo sẽ useful qua các generations của AI tiếp theo — không chỉ hôm nay.

— The Anthropic Academy team

Tài liệu tham khảo

Anthropic — AI Fluency: Framework & Foundations — Next course natural
Anthropic Academy — anthropic.com/learn
Anthropic Research — anthropic.com/research
Claude Documentation — docs.anthropic.com

Nội dung này có hữu ích không?