This recipe executes a structured 2-day workshop that transforms raw signal source data into a validated classification taxonomy. Day 1 focuses on knowledge extraction from domain experts. Day 2 focuses on quantification — scoring signal strength, setting false positive thresholds, designing compound signals, and validating against live data. [src1, src3]
Which workshop format?
├── In-person (recommended)
│ └── PATH A: Full 2-day on-site workshop
├── Remote synchronous
│ └── PATH B: 2-day virtual via Miro/FigJam
├── Hybrid (expert remote)
│ └── PATH C: Facilitator on-site, expert via video
└── Async (expert unavailable for 2 days)
└── PATH D: 4x half-day sessions over 2 weeks
| Path | Format | Cost | Speed | Output Quality |
|---|---|---|---|---|
| A: In-person | Full 2-day on-site | $5K-$8K | 5 days total | Excellent |
| B: Remote | 2-day virtual | $3K-$5K | 5 days total | Good |
| C: Hybrid | Mixed on-site/remote | $4K-$6K | 5 days total | Good |
| D: Async | 4 half-day sessions | $3K-$5K | 10-14 days | Adequate |
Duration: 1 day · Tool: Document preparation + data staging
Prepare materials: print audit report, load sample data into shared spreadsheet, create taxonomy template, prepare 20+ company examples for validation, draft and circulate agenda. [src3]
Verify: All materials prepared, sample data accessible, participants confirmed. · If failed: Workshop can proceed without sample data but Day 2 validation will be weaker.
Duration: 3 hours · Tool: Structured interview + whiteboard
Semi-structured interviews: trigger events, behavioral changes 3-6 months before purchase, relevant public data, false signals. Map onto preliminary signal hierarchy grouped by category. [src1, src5]
Verify: Minimum 20 signal types across 3+ categories. · If failed: Prompt expert with audit report examples.
Duration: 3 hours · Tool: Audit report walkthrough
Walk each data source with domain expert: extractable signals, individual strength, source-to-hierarchy mapping, gaps, compound opportunities, trigger event sequences.
Verify: Hierarchy mapped to sources. Compound opportunities documented. · If failed: Demo sample data to expert for interpretation.
Duration: 3 hours · Tool: Scoring framework + spreadsheet
Score each signal: individual strength (1-10), reliability (1-5), timeliness (1-5), compound multiplier (1.0-2.0). Set false positive threshold explicitly. [src2, src4]
Verify: All signals scored, threshold documented. · If failed: Use median scores, document dissent for calibration.
Duration: 3 hours · Tool: Sample data + taxonomy + scoring model
Validate against 50+ company records. Apply scoring, compare to expert judgment, calculate false positive/negative rates, identify edge cases. [src4]
Verify: False positive rate within threshold. Edge cases documented. · If failed: Tighten weights, add qualifying signals, revalidate.
Duration: 2-3 days · Tool: Document + JSON schema generation
Produce deliverables: taxonomy document, JSON schema, calibration dataset, false positive analysis, implementation notes.
Verify: Reviewed by domain expert and engagement lead. · If failed: Flag as “provisional,” schedule review call.
{
"output_type": "signal_taxonomy",
"format": "JSON schema + document",
"sections": [
{"name": "signal_hierarchy", "type": "object", "description": "Category > type > indicator tree with weights"},
{"name": "scoring_model", "type": "object", "description": "Strength, reliability, timeliness, compound multiplier"},
{"name": "false_positive_threshold", "type": "number", "description": "Maximum acceptable FP rate"},
{"name": "compound_signals", "type": "array", "description": "Multi-source combinations with multipliers"},
{"name": "validation_dataset", "type": "array", "description": "50+ scored examples with outcomes"},
{"name": "edge_cases", "type": "array", "description": "Exceptions and special handling rules"}
]
}
| Quality Metric | Minimum Acceptable | Good | Excellent |
|---|---|---|---|
| Signal types identified | > 20 | > 30 | > 50 |
| Validation examples tested | 50 | 75 | 100+ |
| False positive rate | < 30% | < 20% | < 10% |
| Signal categories covered | 3 | 4 | 5+ |
| Compound signals designed | 2 | 5 | 10+ |
If below minimum: Extend workshop by half day or schedule follow-up session.
| Error | Likely Cause | Recovery Action |
|---|---|---|
| Domain expert unavailable | Schedule conflict | Reschedule or switch to async format |
| Fewer than 15 signals | Narrow vertical or inexperienced expert | Prompt with audit examples; consider second expert |
| False positive rate > 40% | Taxonomy too broad | Tighten definitions, add qualifiers, revalidate |
| No compound signals found | Signals are independent | Valid outcome — proceed with individual signals |
| Scoring disagreement | Different buyer behavior assumptions | Document both, test empirically in pilot |
| Component | Remote ($3K-$5K) | In-Person ($5K-$8K) | Deep ($7K-$12K) |
|---|---|---|---|
| Pre-workshop prep | $500-$800 | $800-$1.2K | $1.2K-$2K |
| Day 1: Domain interviews | $800-$1.2K | $1.2K-$2K | $2K-$3K |
| Day 2: Scoring + validation | $800-$1.2K | $1.2K-$2K | $2K-$3K |
| Post-workshop docs | $500-$800 | $800-$1.2K | $1.2K-$2K |
| Domain expert compensation | $400-$800 | $800-$1.5K | $1.5K-$2K |
| Total | $3K-$5K | $5K-$8K | $7K-$12K |
Building classification from data patterns alone. Result: classifier flags wrong companies, sales team loses trust. [src2]
Minimum 1 domain expert for full 2 days. Industry knowledge catches false patterns data alone misses.
Finalizing taxonomy without testing against real data. Result: elegant on paper, fails on first dataset. [src4]
Day 2 afternoon dedicated to applying taxonomy to real data. Iterate before leaving the workshop.
Not discussing acceptable error rates. Result: too many irrelevant dossiers overwhelm sales team. [src4]
Agree on maximum false positive rate during Day 2. Write it into the taxonomy as a hard constraint.
Use when designing an intent signal classification system for a specific vertical. This is Phase 2 of the Signal Stack engagement — it transforms raw signal data into a structured taxonomy for automated classification.