feat: test strategy classification for instructions

- Classify rules by testability: pattern, llm-judge, manual
- Add ID, TestStrategy, Testable, Keywords, Pattern fields to Instruction
- Unicode emoji detection via regex ranges
- strategy_counts in /api/instructions response
- ROADMAP.md for evaluation loop implementation