# 🎯 Persona System Upgrade Summary
**Date**: 2025-11-23
**Version**: v2.4.0 → v3.0.0 → **v3.1.0 (RAG)**
**Goal**: Role-Play → Functional Capability → **True Expert (RAG)**
---
## Key Findings: Analysis of Research Results
### ❌ Why Simple Personas Fail
**CMU study (Hacker News, arXiv)**:
- 162 personas tested
- Objective accuracy: **0% improvement**
- 15.75% of questions: answered correctly because of the persona
- 13.78% of questions: answered **incorrectly** because of the persona (net effect close to zero)
**Conclusion**: Static text personas are ineffective.
### ✅ Patterns That Work
**MCP-based architecture (Anthropic, Block)**:
- Progressive Disclosure: **98.7% token reduction**
- 4-Breakpoint Caching: **90% cost reduction, 79% latency improvement**
- Block case study: **75% time savings**, adopted by 4,000 of 10,000 people
**Knowledge-intensive tasks (arXiv SPP study)**:
- Trivia Creative Writing: **+10%** accuracy
- Logic Puzzle: **+18.5%** accuracy
- Debate Pattern (Medical Q&A): **+15%** (90% agreement level)
**Conclusion**: MCP Tools + Resources + Sampling = real, measurable gains
---
## Upgrade Strategy
### Phase 1: Persona Structure Redesign
**Before (v2.4.0)**:
```
community/410-llm-engineer.txt
- Pure Markdown
- Descriptions only, listed as text
- Not executable
```
**After (v3.0.0)**:
```yaml
---
name: LLM Engineer
tools:
- analyze_transformer_architecture
- design_prompt_template
- estimate_inference_cost
resources:
- llm://papers/{topic}
- llm://benchmarks/{model}/{task}
sampling_enabled: true
---
[Markdown content]
```
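
To make the v3.0.0 format concrete, here is a minimal sketch of how a loader might split a persona file into its frontmatter fields and Markdown body. The `PersonaDefinition` interface and the `parsePersona` function are hypothetical names, not taken from the actual `personaLoader.ts`, and the parser handles only the small YAML subset shown above (scalar `key: value` pairs and flat `- item` lists). A real loader would probably rely on a full YAML parser.

```typescript
// Hypothetical in-memory shape of a persona; field names mirror the frontmatter keys.
interface PersonaDefinition {
  name: string;
  tools: string[];
  resources: string[];
  samplingEnabled: boolean;
  body: string;
}

// Parses only the YAML subset used in the example: `key: value` scalars
// and flat `- item` lists. This is an assumption-laden sketch, not a YAML parser.
function parsePersona(source: string): PersonaDefinition {
  const match = /^---\r?\n([\s\S]*?)\r?\n---\r?\n?([\s\S]*)$/.exec(source);
  if (match === null) {
    throw new Error("Persona file has no YAML frontmatter");
  }
  const header = match[1] ?? "";
  const body = match[2] ?? "";
  const scalars: Record<string, string | undefined> = {};
  const lists: Record<string, string[] | undefined> = {};
  let currentList: string[] | undefined;

  for (const rawLine of header.split(/\r?\n/)) {
    const line = rawLine.trim();
    if (line === "") continue;
    if (line.indexOf("- ") === 0) {
      if (currentList === undefined) {
        throw new Error("List item outside a list: " + line);
      }
      currentList.push(line.slice(2).trim());
      continue;
    }
    const sep = line.indexOf(":");
    if (sep === -1) {
      throw new Error("Unrecognized frontmatter line: " + line);
    }
    const key = line.slice(0, sep).trim();
    const value = line.slice(sep + 1).trim();
    if (value === "") {
      currentList = []; // `key:` with no value opens a list
      lists[key] = currentList;
    } else {
      currentList = undefined; // a scalar ends any open list
      scalars[key] = value;
    }
  }

  const personaName = scalars["name"];
  if (personaName === undefined) {
    throw new Error("Frontmatter is missing `name`");
  }
  return {
    name: personaName,
    tools: lists["tools"] ?? [],
    resources: lists["resources"] ?? [],
    samplingEnabled: scalars["sampling_enabled"] === "true",
    body: body.trim(),
  };
}
```

Failing loudly on a missing `name` or a malformed line keeps a half-converted persona from being registered silently during the migration.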
### Phase 2: MCP Server Enhancement
**New implementation**:
1. **Tools**: Executable functions per persona (500+ Tools)
2. **Resources**: Data served through URI templates
3. **Sampling**: ExpertPrompting, SPP, and Debate patterns
4. **Caching**: 4-Breakpoint strategy
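
As an illustration of the URI-template idea behind Resources, the sketch below compiles a template such as `llm://papers/{topic}` into a matcher that extracts the named parameters from a concrete URI. `compileUriTemplate` is a hypothetical helper written for this summary. It supports only simple `{name}` placeholders that match a single path segment, and it is not the MCP SDK's own template handling.

```typescript
// Compiles a template like "llm://benchmarks/{model}/{task}" into a matcher.
// Only simple `{name}` placeholders are supported (an assumption of this sketch).
function compileUriTemplate(
  template: string,
): (uri: string) => Record<string, string> | null {
  const paramNames: string[] = [];
  const pattern = template
    .split(/(\{[A-Za-z_][A-Za-z0-9_]*\})/)
    .map((part) => {
      const param = /^\{([A-Za-z_][A-Za-z0-9_]*)\}$/.exec(part);
      if (param !== null) {
        paramNames.push(param[1] ?? "");
        return "([^/]+)"; // one path segment per placeholder
      }
      // Escape regex metacharacters in the literal parts of the template.
      return part.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
    })
    .join("");
  const regex = new RegExp("^" + pattern + "$");

  return (uri: string) => {
    const found = regex.exec(uri);
    if (found === null) return null;
    const params: Record<string, string> = {};
    paramNames.forEach((paramName, index) => {
      params[paramName] = decodeURIComponent(found[index + 1] ?? "");
    });
    return params;
  };
}
```

A resource handler could then dispatch on the first template whose matcher returns a non-null result and pass the extracted parameters to the data source.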
### Phase 3: Context Engineering
**Optimizations**:
- Progressive Disclosure (Tool Discovery)
- 4-Breakpoint Caching (System → Tools → Persona → History)
- Just-in-Time Retrieval (lightweight identifiers + dynamic loading)
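
The 4-breakpoint ordering can be sketched as a prompt layout step: segments are emitted from most stable to most volatile, and the last block of each non-empty segment is flagged as a cache breakpoint. The names `PromptBlock`, `SEGMENT_ORDER`, and `layoutPrompt` are assumptions made for illustration. How the flag maps onto a specific provider's caching API is deliberately left out.

```typescript
type SegmentKind = "system" | "tools" | "persona" | "history";

interface PromptBlock {
  segment: SegmentKind;
  text: string;
  cacheBreakpoint: boolean;
}

// Most stable content first, so a change in a later segment
// leaves the cached prefix of the earlier segments intact.
const SEGMENT_ORDER: SegmentKind[] = ["system", "tools", "persona", "history"];

// Flattens the segments in order and marks the final block of each
// non-empty segment as a breakpoint (at most four in total).
function layoutPrompt(segments: Record<SegmentKind, string[]>): PromptBlock[] {
  const blocks: PromptBlock[] = [];
  for (const segment of SEGMENT_ORDER) {
    const texts = segments[segment];
    texts.forEach((text, index) => {
      blocks.push({
        segment,
        text,
        cacheBreakpoint: index === texts.length - 1,
      });
    });
  }
  return blocks;
}
```

With this layout, appending a turn to the history changes only the final segment, so the system, tools, and persona prefixes remain eligible for reuse.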
---
## Expected Outcomes
### Quantitative Metrics
| Metric | Before | After | Improvement |
|------|--------|-------|------|
| Token usage | 150K/conversation | 2K/conversation | **-98.7%** |
| API cost | $0.99/10 calls | $0.21/10 calls | **-78.5%** |
| Latency | 11.5s | 2.4s | **-79%** |
| Knowledge task quality | Baseline | +7-18% | **+15% (average)** |
### Qualitative Effects
- ✅ Performs real tasks (code analysis, cost calculation, prompt design)
- ✅ Provides up-to-date data (papers, benchmarks, best practices)
- ✅ Deeper reasoning (multi-step thinking through Sampling)
- ✅ Cost efficiency (token reuse through Caching)
---
## Generated Documents
### 1. ✅ FUNCTIONAL_PERSONA_UPGRADE_PLAN.md
- **Content**: Full upgrade plan
- **Size**: 4KB
- **Includes**: Per-phase execution plan, persona-to-function mapping
### 2. ✅ examples/410-llm-engineer-functional.txt
- **Content**: Example of a functional persona
- **Size**: 15KB
- **Includes**: YAML frontmatter + 5 Tools + 4 Resources + usage guide
### 3. ✅ TECHNICAL_IMPLEMENTATION_GUIDE.md
- **Content**: Technical implementation guide
- **Size**: 12KB
- **Includes**: personaLoader, tools, resources, sampling, and index.ts integration
### 4. ✅ UPGRADE_SUMMARY.md (this document)
- **Content**: Overall summary and execution plan
- **Size**: 3KB
---
## Immediately Actionable Items
### This Week (Week 1)
1. **Convert 10 core personas**
   ```
   - 101-fullstack-dev.txt
   - 108-devops-engineer.txt
   - 201-ui-ux-designer.txt
   - 223-ux-researcher.txt
   - 326-strategic-oracle.txt
   - 337-scrum-master.txt
   - 410-llm-engineer.txt (✅ example complete)
   - 411-ai-agent-developer.txt
   - 501-world-class-tester.txt
   - 601-science-teacher.txt
   ```
2. **Extend the MCP server's base structure**
   ```
   src/personaLoader.ts (new)
   src/tools.ts (new)
   src/resources.ts (new)
   ```
3. **Integration tests**
   - Tool execution test with 410-llm-engineer
   - Resources URI test
   - Verify caching behavior
### Next Week (Week 2)
1. **Implement 30 core Tools**
   - LLM Engineer: 5 ✅
   - DevOps: 5
   - UX Researcher: 5
   - Other: 15
2. **Implement 5 URI schemes**
   - llm://papers/{topic} ✅
   - llm://benchmarks/{model}/{task} ✅
   - code://examples/{language}
   - design://patterns/{type}
   - research://methods/{method}
3. **Sampling prototype**
   - Basic ExpertPrompting implementation
   - SPP 3-phase implementation
   - Debate Pattern prototype
---
## Prioritization Criteria
### 1. Usage Frequency (Top 10 Personas)
- 101 Fullstack Dev
- 108 DevOps Engineer
- 201 UI/UX Designer
- 326 Strategic Oracle
- 410 LLM Engineer
### 2. Functional Complexity (Number of Tools)
- LLM Engineer: 5 Tools (high)
- DevOps: 3 Tools (medium)
- UX Researcher: 3 Tools (medium)
### 3. Research Validation (Proven Effect)
- ✅ LLM Engineer: ExpertPrompting shown to be effective
- ✅ Strategic Oracle: Debate Pattern shown to be effective
- ✅ UX Researcher: SPP shown to be effective
---
## Success Metrics
### Technical Metrics
```yaml
phase_1_success:
  - 10 personas converted to YAML
  - personaLoader.ts verified working
  - Existing MCP functionality preserved
phase_2_success:
  - 30 Tools executable
  - 5 URI schemes responding
  - Basic Sampling working
phase_3_success:
  - Token usage reduced by 50% or more
  - Latency improved by 30% or more
  - Cache hit rate of 70% or higher
phase_4_success:
  - All 142 personas converted
  - 500+ Tools implemented
  - Production deployment
```
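
The Phase 3 thresholds lend themselves to an automated check. The sketch below computes the token reduction, latency improvement, and cache hit rate from before/after measurements and compares them with the 50% / 30% / 70% targets listed above. `RunMetrics`, `reductionPercent`, and `meetsPhase3Targets` are hypothetical names, and how the measurements are collected is outside the scope of this example.

```typescript
interface RunMetrics {
  tokensPerConversation: number;
  latencySeconds: number;
}

interface Phase3Report {
  tokenReduction: number; // percent
  latencyImprovement: number; // percent
  hitRate: number; // percent
  passed: boolean;
}

// Percentage by which `after` is lower than `before`.
function reductionPercent(before: number, after: number): number {
  if (before <= 0) {
    throw new RangeError("`before` must be positive");
  }
  return ((before - after) / before) * 100;
}

// Applies the phase_3_success thresholds: >= 50% fewer tokens,
// >= 30% lower latency, >= 70% cache hit rate.
function meetsPhase3Targets(
  baseline: RunMetrics,
  current: RunMetrics,
  cacheHits: number,
  cacheLookups: number,
): Phase3Report {
  const tokenReduction = reductionPercent(
    baseline.tokensPerConversation,
    current.tokensPerConversation,
  );
  const latencyImprovement = reductionPercent(
    baseline.latencySeconds,
    current.latencySeconds,
  );
  const hitRate = cacheLookups === 0 ? 0 : (cacheHits * 100) / cacheLookups;
  return {
    tokenReduction,
    latencyImprovement,
    hitRate,
    passed: tokenReduction >= 50 && latencyImprovement >= 30 && hitRate >= 70,
  };
}
```

Feeding in the figures from the quantitative metrics table (150K → 2K tokens, 11.5s → 2.4s) clears the first two thresholds by a wide margin, so in practice the cache hit rate is the target most likely to decide the outcome.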
### Business Metrics
```yaml
adoption:
  - All 5 AI tools working ✅
  - GitHub Stars: 0 → 50+ (target)
  - User feedback: positive
performance:
  - Task completion time: -50%
  - API cost: -70%
  - User satisfaction: +30%
quality:
  - Knowledge task accuracy: +15%
  - Hallucination reduction: 50%
  - Consistency improvement: 80%
```
---
## Key Lessons
### Research-Based Design
1. **Simple personas are ineffective**
   - CMU study: 162 personas → 0% improvement
   - Role-play alone is not enough
2. **The MCP architecture is the core**
   - Tools: executable functions
   - Resources: real-time data
   - Sampling: server-side reasoning
3. **Context Engineering is decisive**
   - Progressive Disclosure: 98.7% reduction
   - 4-Breakpoint Caching: 90% cost reduction
   - Just-in-Time Retrieval: prevents Context Rot
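### Practical Strategy
1. **Use personas selectively by task type**
   - ✅ Knowledge-intensive creative tasks
   - ✅ Open-ended generation tasks
   - ❌ Objective, simple factual queries
2. **Dynamic persona selection**
   - Generate a persona per task instead of using a fixed one
   - Inject expertise with ExpertPrompting
3. **Multi-agent collaboration**
   - SPP: diverge → critique → integrate
   - Debate: tune Agreement Intensity
   - Ensemble: improves stability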
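
The diverge → critique → integrate flow named for SPP above could be orchestrated roughly as follows. This is a sketch of the three phases as this summary describes them, not a reproduction of the prompts used in the SPP paper. The `Complete` callback is a hypothetical stand-in for whatever issues the model call (for example, an MCP sampling request), and the prompt wording is illustrative only.

```typescript
// Hypothetical completion callback; in an MCP server it might wrap a sampling request.
type Complete = (prompt: string) => Promise<string>;

interface SppResult {
  drafts: string[];
  critique: string;
  answer: string;
}

async function runSpp(
  task: string,
  personas: string[],
  complete: Complete,
): Promise<SppResult> {
  if (personas.length === 0) {
    throw new Error("At least one persona is required");
  }

  // Phase 1 (diverge): each persona drafts a proposal independently.
  const drafts = await Promise.all(
    personas.map((persona) =>
      complete(`You are ${persona}. Propose an approach to: ${task}`),
    ),
  );
  const labelled = drafts
    .map((draft, index) => `[${personas[index]}] ${draft}`)
    .join("\n");

  // Phase 2 (critique): review all drafts together.
  const critique = await complete(
    `Critique these proposals for the task "${task}":\n${labelled}`,
  );

  // Phase 3 (integrate): merge drafts and critique into one answer.
  const answer = await complete(
    `Task: ${task}\nProposals:\n${labelled}\nCritique:\n${critique}\nWrite one final answer.`,
  );

  return { drafts, critique, answer };
}
```

Passing the completion function in as a parameter keeps the orchestration testable with a stub and independent of any particular model client.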
---
## References
### Research Papers
- CMU: "162 Personas, 0% Improvement" (arXiv)
- Solo Performance Prompting (SPP) (arXiv)
- Multi-Persona Debate (arXiv)
- ExpertPrompting (arXiv)
### Industry Case Studies
- Block: 75% time savings, adopted by 4,000 people
- Anthropic: Progressive Disclosure, 98.7% reduction
- Trychroma: Context Rot study
### Technical Documentation
- Anthropic MCP Best Practices
- FastMCP Documentation
- LangGraph Multi-Agent
---
## ✅ Next Steps
### Start Immediately
1. [ ] Convert 410-llm-engineer.txt (based on the example)
2. [ ] Implement personaLoader.ts
3. [ ] Implement 3 basic Tools (analyze, design, estimate)
### Goals for This Week
1. [ ] Convert 10 core personas
2. [ ] Extend the MCP server's base structure
3. [ ] Pass integration tests
### Goals in Two Weeks
1. [ ] Implement 30 Tools
2. [ ] Sampling prototype
3. [ ] Start performance benchmarks
---
**Status**: Planning complete, ready for execution
**Priority**: P0 (top priority)
**Owner**: Cascade AI
**Review**: User approval required