Signal Taxonomy Workshop

Type: Execution Recipe Confidence: 0.85 Sources: 5 Verified: 2026-03-29

Purpose

This recipe executes a structured 2-day workshop that transforms raw signal source data into a validated classification taxonomy. Day 1 focuses on knowledge extraction from domain experts. Day 2 focuses on quantification — scoring signal strength, setting false positive thresholds, designing compound signals, and validating against live data. [src1, src3]

Prerequisites

Constraints

Tool Selection Decision

Which workshop format?
├── In-person (recommended)
│   └── PATH A: Full 2-day on-site workshop
├── Remote synchronous
│   └── PATH B: 2-day virtual via Miro/FigJam
├── Hybrid (expert remote)
│   └── PATH C: Facilitator on-site, expert via video
└── Async (expert unavailable for 2 days)
    └── PATH D: 4x half-day sessions over 2 weeks
PathFormatCostSpeedOutput Quality
A: In-personFull 2-day on-site$5K-$8K5 days totalExcellent
B: Remote2-day virtual$3K-$5K5 days totalGood
C: HybridMixed on-site/remote$4K-$6K5 days totalGood
D: Async4 half-day sessions$3K-$5K10-14 daysAdequate

Execution Flow

Step 1: Pre-Workshop Preparation (Day 0)

Duration: 1 day · Tool: Document preparation + data staging

Prepare materials: print audit report, load sample data into shared spreadsheet, create taxonomy template, prepare 20+ company examples for validation, draft and circulate agenda. [src3]

Verify: All materials prepared, sample data accessible, participants confirmed. · If failed: Workshop can proceed without sample data but Day 2 validation will be weaker.

Step 2: Day 1 Morning — Domain Expert Interviews (3 hours)

Duration: 3 hours · Tool: Structured interview + whiteboard

Semi-structured interviews: trigger events, behavioral changes 3-6 months before purchase, relevant public data, false signals. Map onto preliminary signal hierarchy grouped by category. [src1, src5]

Verify: Minimum 20 signal types across 3+ categories. · If failed: Prompt expert with audit report examples.

Step 3: Day 1 Afternoon — Data Source Deep-Dive (3 hours)

Duration: 3 hours · Tool: Audit report walkthrough

Walk each data source with domain expert: extractable signals, individual strength, source-to-hierarchy mapping, gaps, compound opportunities, trigger event sequences.

Verify: Hierarchy mapped to sources. Compound opportunities documented. · If failed: Demo sample data to expert for interpretation.

Step 4: Day 2 Morning — Signal Strength Scoring (3 hours)

Duration: 3 hours · Tool: Scoring framework + spreadsheet

Score each signal: individual strength (1-10), reliability (1-5), timeliness (1-5), compound multiplier (1.0-2.0). Set false positive threshold explicitly. [src2, src4]

Verify: All signals scored, threshold documented. · If failed: Use median scores, document dissent for calibration.

Step 5: Day 2 Afternoon — Live Validation (3 hours)

Duration: 3 hours · Tool: Sample data + taxonomy + scoring model

Validate against 50+ company records. Apply scoring, compare to expert judgment, calculate false positive/negative rates, identify edge cases. [src4]

Verify: False positive rate within threshold. Edge cases documented. · If failed: Tighten weights, add qualifying signals, revalidate.

Step 6: Post-Workshop Documentation (Days 3-5)

Duration: 2-3 days · Tool: Document + JSON schema generation

Produce deliverables: taxonomy document, JSON schema, calibration dataset, false positive analysis, implementation notes.

Verify: Reviewed by domain expert and engagement lead. · If failed: Flag as “provisional,” schedule review call.

Output Schema

{
  "output_type": "signal_taxonomy",
  "format": "JSON schema + document",
  "sections": [
    {"name": "signal_hierarchy", "type": "object", "description": "Category > type > indicator tree with weights"},
    {"name": "scoring_model", "type": "object", "description": "Strength, reliability, timeliness, compound multiplier"},
    {"name": "false_positive_threshold", "type": "number", "description": "Maximum acceptable FP rate"},
    {"name": "compound_signals", "type": "array", "description": "Multi-source combinations with multipliers"},
    {"name": "validation_dataset", "type": "array", "description": "50+ scored examples with outcomes"},
    {"name": "edge_cases", "type": "array", "description": "Exceptions and special handling rules"}
  ]
}

Quality Benchmarks

Quality MetricMinimum AcceptableGoodExcellent
Signal types identified> 20> 30> 50
Validation examples tested5075100+
False positive rate< 30%< 20%< 10%
Signal categories covered345+
Compound signals designed2510+

If below minimum: Extend workshop by half day or schedule follow-up session.

Error Handling

ErrorLikely CauseRecovery Action
Domain expert unavailableSchedule conflictReschedule or switch to async format
Fewer than 15 signalsNarrow vertical or inexperienced expertPrompt with audit examples; consider second expert
False positive rate > 40%Taxonomy too broadTighten definitions, add qualifiers, revalidate
No compound signals foundSignals are independentValid outcome — proceed with individual signals
Scoring disagreementDifferent buyer behavior assumptionsDocument both, test empirically in pilot

Cost Breakdown

ComponentRemote ($3K-$5K)In-Person ($5K-$8K)Deep ($7K-$12K)
Pre-workshop prep$500-$800$800-$1.2K$1.2K-$2K
Day 1: Domain interviews$800-$1.2K$1.2K-$2K$2K-$3K
Day 2: Scoring + validation$800-$1.2K$1.2K-$2K$2K-$3K
Post-workshop docs$500-$800$800-$1.2K$1.2K-$2K
Domain expert compensation$400-$800$800-$1.5K$1.5K-$2K
Total$3K-$5K$5K-$8K$7K-$12K

Anti-Patterns

Wrong: Designing taxonomy without domain expertise

Building classification from data patterns alone. Result: classifier flags wrong companies, sales team loses trust. [src2]

Correct: Domain expert drives taxonomy design

Minimum 1 domain expert for full 2 days. Industry knowledge catches false patterns data alone misses.

Wrong: Skipping live data validation

Finalizing taxonomy without testing against real data. Result: elegant on paper, fails on first dataset. [src4]

Correct: Validate against 50+ real examples

Day 2 afternoon dedicated to applying taxonomy to real data. Iterate before leaving the workshop.

Wrong: Setting implicit false positive thresholds

Not discussing acceptable error rates. Result: too many irrelevant dossiers overwhelm sales team. [src4]

Correct: Explicitly set and document the threshold

Agree on maximum false positive rate during Day 2. Write it into the taxonomy as a hard constraint.

When This Matters

Use when designing an intent signal classification system for a specific vertical. This is Phase 2 of the Signal Stack engagement — it transforms raw signal data into a structured taxonomy for automated classification.

Related Units