Regulators demand it. Engineers need it. And your models have to deliver it. Explainable AI (XAI) refers to methods and techniques that make AI model decisions understandable to humans. There are two broad approaches: post-hoc explanation, which approximates why a black-box model reached a decision after the fact, and inherent interpretability, where the model's structure is itself human-readable. Logic-Based Networks belong to the second category — every decision is traceable to a logical rule, with no explanation gap.
Explainable AI (XAI) is the field concerned with making AI systems' decisions understandable — to engineers, to operators, to regulators, and to the people affected by those decisions.
The demand for XAI arises directly from the opacity of modern machine learning. Deep neural networks can achieve remarkable predictive accuracy, but their internal structure — millions of floating-point weights distributed across hundreds of layers — does not map to anything a human can inspect and understand. You can observe what a neural network predicts; you cannot directly observe why.
This matters wherever AI decisions carry consequences. A medical diagnosis model that produces a prediction without explanation leaves clinicians unable to verify or contextualise the output. A credit risk model that denies a loan application without traceable reasoning cannot be audited for bias or challenged under data protection law. An industrial safety system that flags an anomaly without identifying the source gives operators nowhere to look first.
XAI attempts to solve this problem. The question is how.
Post-hoc explanation tools — SHAP (SHapley Additive exPlanations), LIME (Local Interpretable Model-agnostic Explanations), attention visualisation, integrated gradients, and others — work by interrogating a trained model after the fact to produce human-readable explanations of its predictions.
SHAP, for example, estimates each input feature's contribution to a prediction by computing a weighted average of marginal contributions across all possible feature subsets. This produces an attribution score for each feature — a number indicating how much that feature pushed the prediction up or down.
These tools are valuable. For many deployed neural network systems, they are the only practical path to any form of explanation. But they have structural limitations that cannot be engineered away:
An inherently interpretable model is one whose structure can be directly inspected and understood. The explanation is not computed after inference — it is the inference.
Decision trees are the classic example: every path from root to leaf is a sequence of conditions that can be read as a plain logical rule. The model's reasoning for any prediction is the path it took. There is no approximation.
Logic-Based Networks extend this principle to a class of models powerful enough for real-world classification tasks on structured data. LBNs learn propositional clauses — rules of the form "if condition A and condition B and not condition C, then vote for class X." Every inference is the evaluation of those clauses. Every prediction is traceable to the specific clauses that fired. The explanation is exact, not approximate.
The debate between post-hoc explanation and inherent interpretability is sometimes framed as a trade-off: interpretable models sacrifice accuracy for transparency, so post-hoc tools allow you to have both. This framing is increasingly contested — and for edge AI applications, it misrepresents the actual trade-offs.
With an inherently interpretable model, the explanation you give is the reasoning the model used. With post-hoc tools, there is always a gap between the explanation and the actual computation. That gap is small in typical cases and potentially significant in outlier cases — which are often precisely the cases that matter most.
In regulated contexts, being able to explain a specific decision — not the model's general behaviour — is often what the regulation requires. Inherently interpretable models support this without qualification. Post-hoc explanations support it approximately.
Post-hoc explanation tools add code, computation, and maintenance burden. An inherently interpretable model requires none of this; the explainability is already there.
For EU AI Act compliance, GDPR's right to explanation, and sector-specific requirements in healthcare, finance, and safety systems, inherently interpretable models offer a cleaner compliance position. See AI Act compliance for edge AI for more detail.
LBNs learn propositional clauses through a reinforcement-based training process. The trained model is a set of these clauses — logical rules that can be inspected, read aloud, and debated by engineers and domain experts.
For an anomaly detection model trained on vibration sensor data, a clause might read: "if vibration amplitude in the 200–400Hz band is high AND operating temperature is elevated AND the running variance of amplitude has increased over the past 10 seconds: vote anomaly." This is not a simplification or summary; it is the actual learned rule. It can be verified against domain knowledge, challenged if wrong, and used to guide diagnostic action.
This is what explanation means in practice. Not a bar chart of SHAP values that requires a data scientist to interpret, but a logical condition that an engineer can read, evaluate, and act on.
The full technical account of LBN architecture and training is on the Logic-Based AI page.
Most XAI discussion focuses on large models deployed in cloud or server environments. The edge AI context introduces additional constraints that make post-hoc explanation particularly difficult and inherent interpretability particularly valuable.
Post-hoc explanation tools are computationally expensive. Running SHAP on an embedded device with a 32MHz processor and 256KB of RAM is not practical for real-time inference. The explanation tooling that works well in a cloud environment cannot be ported to constrained hardware without significant additional engineering — and often cannot be ported at all.
LBNs on the edge are inherently explainable without any additional computation. The clause evaluation that produces the prediction simultaneously constitutes the explanation. There is nothing extra to compute and no additional memory required.
This matters for deployments where explainability is required at the point of inference — safety systems that must indicate which condition triggered an alert, medical devices that must surface the features driving a classification, industrial monitors that must guide operator response. Explore the explainable edge AI page for a detailed treatment of this intersection.
Regulatory pressure on AI explainability has moved rapidly. Several frameworks are now directly relevant to deployed AI systems:
High-risk AI systems — including those used in healthcare, critical infrastructure, employment, education, and several other categories — must meet transparency and explainability requirements. Systems must provide sufficient information for users to understand the system's outputs and for operators to detect and correct errors. Inherently interpretable models have a significantly cleaner compliance position than black-box models with post-hoc explanation overlays.
The right to explanation under GDPR — specifically Article 22 and Recital 71 — requires that individuals subject to automated decision-making be able to obtain a meaningful explanation of the decision. "Meaningful" is not yet precisely defined in case law, but post-hoc approximations are a weaker position than inherent interpretability.
Medical device regulation (MDR, FDA guidance), financial services regulation (SR 11-7 on model risk management, PRA/FCA expectations), and industrial safety standards (IEC 61508 and related) each impose requirements on model transparency and traceability that are more easily satisfied by inherently interpretable models.
See AI Act compliance for edge AI for a more detailed treatment of the regulatory landscape.
Explainable AI refers to methods and techniques that make the outputs of AI systems understandable to humans. This includes both post-hoc tools that approximate the reasoning of black-box models (such as SHAP and LIME) and inherently interpretable models whose structure is directly human-readable. The goal is to enable trust, auditability, error correction, and regulatory compliance.
Interpretability refers to the degree to which a model's internal workings can be directly understood. Interpretable models — such as decision trees and Logic-Based Networks — produce decisions that can be traced to readable rules. Explainability is broader: it includes post-hoc tools that explain a model's outputs even when the model itself is opaque. An inherently interpretable model is always explainable; an explainable model is not always interpretable. The interpretability vs explainability page covers this distinction in detail.
No. SHAP, LIME, and equivalent tools are designed to explain models whose internal reasoning is not directly accessible — primarily neural networks. Logic-Based Networks are inherently interpretable: every prediction is traceable to the specific logical clauses that fired. No post-hoc approximation is required, and no explanation gap exists.
ModelMill is the platform for training Logic-Based Networks on your sensor data and deploying explainable edge AI on standard microcontroller hardware. The explainability is built in — no SHAP, no LIME, no additional tooling required.