Applied Language Systems

Building a Pipeline for Metaphor Language

A structured workflow linking schema-driven annotation, rule-based metaphor detection, and human-facing interpretation.

By Stella Bullo · Updated: 2026-02-20 · Tags: annotation, metaphor modelling, rule-based NLP, localisation

Metaphors shape how we describe ambition, responsibility, guilt, illness, and political change. They are cognitively powerful but computationally difficult. A robust modelling approach requires more than keyword matching; it requires structured linguistic design.

Key idea

Effective metaphor analysis requires infrastructure: human annotation → structured taxonomy → rule-based tagging → interpretable output.

1 · Schema-Driven Annotation

Most annotation platforms prioritise speed and label collection. Linguistic modelling requires something different: explicit theoretical categories that preserve interpretive assumptions.

  • Fields derived from metaphor theory and stance analysis
  • Encoding of phenomenon, stance_tone, and translation_shift
  • Version-controlled schemas preserving modelling logic

Categories are not neutral. They shape what researchers can observe and what systems can learn. A schema-driven interface makes those assumptions visible.

2 · Rule-Based Metaphor Tagger

The Metaphor Tagger formalises conceptual domains such as FIRE (intensity), WEIGHT (burden), and SHARP OBJECTS (intrusion). Detection rules distinguish figurative from literal usage through contextual constraints.

  • Taxonomy derived from corpus-based metaphor research
  • Rule logic informed by narrative analysis
  • Evaluation using Precision, Recall, F1, and agreement metrics
Annotation App
        ↓
Gold-standard dataset
        ↓
Rule-based tagger
        ↓
Evaluation (Precision / Recall / F1 / κ)

3 · Human-Facing Application

Explain My Pain demonstrates how modelling infrastructure becomes interpretable output. Model results are translated into accessible explanations for clinicians and patients.

  • Highlight metaphor usage
  • Explain semantic entailments
  • Support clearer communication

Localisation Layer

Extending the pipeline across English and Spanish enables comparative metaphor modelling. Parallel annotation allows analysis of translation shifts, register variation, and conceptual divergence.

EN / ES parallel corpus
        ↓
Schema with translation_shift field
        ↓
Tagger (EN + ES)
        ↓
Cross-cultural comparison

Conclusion

Metaphor modelling is not a single tool but a structured workflow. When annotation design, taxonomy construction, and rule logic are aligned, linguistic theory becomes executable infrastructure.

This pipeline demonstrates how qualitative insight can be formalised, evaluated, and translated into systems that remain transparent and interpretable.