Where Synonyms End and Concepts Begin
Observation
Empathy and sympathy share zero ontological traits. Jaccard similarity: 0.000. These words that dictionaries treat as near-interchangeable occupy completely disjoint regions of UHT space — empathy activates Signalling and Ethically Significant, while sympathy activates Symbolic and Social Construct. This is the sharpest finding from a systematic test of UHT’s ability to detect concept boundaries between near-synonyms, and it exposes something embeddings structurally cannot: the categorical kind of difference between two concepts, not merely that they differ.
The experiment classified 22 entities forming 8 near-synonym pairs and 3 true-synonym pairs. The question: can UHT distinguish concepts that embeddings cluster together, while correctly keeping genuine synonyms close?
Evidence
Near-synonym results (Hamming distance, Jaccard similarity):
- learning / education: Hamming 8, Jaccard 0.111. Learning activates State-Transforming and Temporal — a personal cognitive process. Education activates Intentionally Designed, Regulated, Economically Significant, and Politicised — an institutional system. The largest gap in the set.
- method / methodology: Hamming 5, Jaccard 0.444. Methodology gains Synthetic, Symbolic, Signalling, Normative, and Temporal over method’s bare procedural core.
- ethics / morality: Hamming 5, Jaccard 0.500. Ethics adds Symbolic (philosophical framework); morality adds Human-Interactive, System-Integrated, Economically Significant, and Politicised (lived practice).
- empathy / sympathy: Hamming 4, Jaccard 0.000.
- knowledge / information: Hamming 4, Jaccard 0.333. Knowledge gains Meta — it is about itself. Information gains Digital/Virtual.
- strategy / tactics: Hamming 3, Jaccard 0.625. Strategy is Symbolic and Normative (direction-setting); tactics is Rule-Governed (execution).
- hypothesis / theory: Hamming 2, Jaccard 0.600. Theory adds Intentionally Designed and Compositional — it has deliberate internal structure.
- accuracy / precision: Hamming 0 — both null-hex (00000000). The sole failure; both are abstract scalar measurement qualities that the trait set cannot capture.
True-synonym results: automobile/car Hamming 2 (Jaccard 0.882), physician/doctor Hamming 1, velocity/speed Hamming 1. Mean true-synonym Hamming: 1.3 versus mean near-synonym Hamming: 3.9.
Score: 7/8 near-synonym pairs discriminated (threshold was 5/8); 3/3 true synonyms preserved (threshold was 2/3). Hypothesis confirmed.
Interpretation
UHT does not merely detect that near-synonyms differ — it explains how. The trait differences between learning and education, between ethics and morality, between strategy and tactics map onto distinctions that philosophers and lexicographers have debated for centuries: personal process versus institutional structure, theoretical framework versus lived practice, direction versus execution. Embeddings compress these pairs into near-identical vectors because they co-occur in identical contexts. UHT separates them because they are ontologically different kinds of thing.
The accuracy/precision failure is itself informative — both are abstract scalar properties, joining viscosity, pressure, density, voltage, wavelength, and gradient in the null-hex category. The trait set lacks a “quantitative measure” or “scalar property” dimension that would give these concepts something to activate.
Action
Created HYP-ACTIVEHYPOTHESES-021 (confirmed, moved to Closed) and RES-CALIBRATIONRESULTS-019. Stored research facts for the empathy/sympathy zero-Jaccard finding, the learning/education divergence, and two new null-hex entries (accuracy, precision). The next session should consider a trait proposal for a “Quantitative/Scalar” trait to rescue the persistent null-hex problem with measurement properties — this would address the accuracy/precision failure and the broader cluster of physics quantities trapped at 00000000.