# Explainable AI Engineering
Make machine learning model decisions interpretable and transparent. Covers SHAP values, LIME explanations, feature importance, model cards, interpretability vs. accuracy tradeoffs, and the patterns that build trust in ML predictions.
A model that predicts loan denial must explain why. A model that recommends treatment must justify its reasoning. Explainable AI (XAI) makes the black box transparent — not just for regulatory compliance, but for debugging, trust, and adoption. If nobody understands why the model decided something, nobody will trust it.
## Why Explainability Matters
Regulatory:
☐ EU AI Act requires "meaningful explanation" for high-risk AI
☐ GDPR: Articles 13-15 and 22 require "meaningful information about the logic" of solely automated decisions
☐ US Equal Credit Opportunity Act: Must explain credit denials
☐ Healthcare: Clinicians need to understand AI recommendations
Trust:
☐ Users trust what they understand
☐ "The model says you should..." → skepticism
☐ "Based on your income and employment history..." → trust
Debugging:
☐ Model predicting incorrectly? Explanations show WHY
☐ Feature leakage visible in explanations
☐ Bias detection through feature importance
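The debugging point above can be made concrete: feature leakage shows up as one feature dominating the importances. A minimal sketch using permutation importance in plain Python, with a toy model and a deliberately leaked feature (all names here are illustrative, not from any library):

```python
# Permutation importance: shuffle one feature, measure the accuracy drop.
# A huge drop for a single feature is a classic leakage signature.
import random

def accuracy(model, X, y):
    return sum(model(row) == label for row, label in zip(X, y)) / len(y)

def permutation_importance(model, X, y, n_features, seed=0):
    rng = random.Random(seed)
    base = accuracy(model, X, y)
    importances = []
    for j in range(n_features):
        col = [row[j] for row in X]          # copy of column j
        rng.shuffle(col)
        X_perm = [row[:j] + [v] + row[j + 1:] for row, v in zip(X, col)]
        importances.append(base - accuracy(model, X_perm, y))
    return importances

# Toy "model" that secretly relies only on feature 2.
model = lambda row: 1 if row[2] > 0.5 else 0

rng = random.Random(42)
X = [[rng.random(), rng.random(), 0.0] for _ in range(200)]
y = [rng.randint(0, 1) for _ in range(200)]
for row, label in zip(X, y):
    row[2] = float(label)                    # leak: feature 2 == label

imp = permutation_importance(model, X, y, n_features=3)
print(imp)  # features 0 and 1 score ~0; feature 2 dominates
```

Shuffling the leaked feature collapses accuracy from perfect to chance, which is exactly the signal to go hunting for leakage in the pipeline.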
## SHAP (SHapley Additive exPlanations)
```python
import shap
import xgboost as xgb

# Train model
model = xgb.XGBClassifier()
model.fit(X_train, y_train)

# Generate SHAP explanations
explainer = shap.TreeExplainer(model)
shap_values = explainer(X_test)

# Global explanation: which features matter most overall?
shap.summary_plot(shap_values, X_test)
# Output: ranked feature importance with direction of impact, e.g.:
#   income         ████████████████  (higher income → approved)
#   debt_ratio     ██████████████    (higher debt → denied)
#   employment_yrs ████████████      (more years → approved)
#   credit_score   ██████████        (higher score → approved)

# Local explanation: why was THIS application denied?
shap.waterfall_plot(shap_values[42])
# Output for application #42:
#   Base prediction: 65% approval
#   debt_ratio = 0.85:  -25%  (very high debt)
#   credit_score = 580: -15%  (below threshold)
#   income = $45k:       +5%  (adequate)
#   employment_yrs = 8:  +3%  (stable)
#   Final prediction: 33% approval → DENIED
#
# Explanation: "Your application was denied primarily due to
# a high debt-to-income ratio (0.85) and a credit score below
# our minimum threshold."
```
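SHAP's waterfall is one route to a local explanation; LIME, the other technique this guide covers, is another. Its core idea fits in a few lines: perturb the input, weight samples by proximity to the point being explained, and fit a linear surrogate. A sketch with NumPy (names like `black_box` and `explain_local` are illustrative, not the `lime` package API):

```python
import numpy as np

def black_box(X):
    # Non-linear "model" to explain locally: logistic over an interaction.
    return 1.0 / (1.0 + np.exp(-(3 * X[:, 0] - 2 * X[:, 1] + X[:, 0] * X[:, 1])))

def explain_local(f, x, n_samples=2000, kernel_width=0.75, seed=0):
    """Fit a proximity-weighted linear surrogate to f around point x."""
    rng = np.random.default_rng(seed)
    Z = x + rng.normal(scale=0.5, size=(n_samples, x.size))  # perturb around x
    y = f(Z)
    dist = np.linalg.norm(Z - x, axis=1)
    w = np.exp(-(dist ** 2) / kernel_width ** 2)             # proximity kernel
    A = np.hstack([np.ones((n_samples, 1)), Z])              # intercept + features
    sw = np.sqrt(w)
    coef, *_ = np.linalg.lstsq(A * sw[:, None], y * sw, rcond=None)
    return coef[1:]                                          # local feature weights

x0 = np.array([0.2, -0.1])
weights = explain_local(black_box, x0)
print(weights)  # feature 0 pushes the score up here, feature 1 pushes it down
```

The surrogate's coefficients are the explanation: they say which features push this particular prediction up or down, in the immediate neighborhood of the input.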
## Model Cards
```yaml
# model_card.yaml: documentation for every production model
model:
  name: "Loan Approval Model v3.2"
  type: "XGBoost Classifier"
  version: "3.2.0"
  date_trained: "2024-03-01"
  owner: "risk-ml-team"

intended_use:
  primary: "Automated loan pre-qualification screening"
  users: "Loan officers and automated approval pipeline"
  out_of_scope: "Final lending decisions (human review required)"

training_data:
  source: "Historical loan applications (2018-2023)"
  size: "2.4M applications"
  demographics: "US applicants only"
  known_biases: |
    Training data underrepresents rural applicants (12% vs 18% of population).
    Model may be less accurate for rural regions.

performance:
  overall:
    accuracy: 0.89
    precision: 0.87
    recall: 0.91
    auc_roc: 0.94
  by_subgroup:
    - group: "age_18_30"
      accuracy: 0.85
      notes: "Lower accuracy for younger applicants (less credit history)"
    - group: "age_31_50"
      accuracy: 0.91
    - group: "age_51+"
      accuracy: 0.88

limitations:
  - "Not validated for commercial loans"
  - "Performance degrades on applications with < 2 years of credit history"
  - "Does not account for cryptocurrency assets"

ethical_considerations:
  - "Model does not use protected characteristics (race, gender, religion)"
  - "Proxy variables (zip code) monitored for disparate impact"
  - "Quarterly fairness audit required"
```
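A model card only helps if it exists and is complete, so teams often gate deployment on a card check in CI. A hypothetical sketch (the required-field list mirrors the YAML above; `validate_model_card` is an illustrative helper, not part of any model-card standard):

```python
# CI gate: refuse to deploy a model whose card is missing required fields.
REQUIRED_SECTIONS = {
    "model": ["name", "version", "owner"],
    "intended_use": ["primary", "out_of_scope"],
    "training_data": ["source", "known_biases"],
    "performance": ["overall", "by_subgroup"],
    "limitations": [],
    "ethical_considerations": [],
}

def validate_model_card(card: dict) -> list:
    """Return a list of missing sections/fields; empty means the card passes."""
    problems = []
    for section, fields in REQUIRED_SECTIONS.items():
        if section not in card:
            problems.append(f"missing section: {section}")
            continue
        for field in fields:
            if field not in card[section]:
                problems.append(f"missing field: {section}.{field}")
    return problems

# An incomplete card, e.g. parsed from model_card.yaml.
card = {
    "model": {"name": "Loan Approval Model v3.2", "version": "3.2.0"},
    "intended_use": {"primary": "Automated loan pre-qualification screening"},
    "performance": {"overall": {"accuracy": 0.89}, "by_subgroup": []},
    "limitations": ["Not validated for commercial loans"],
}
print(validate_model_card(card))
# Flags the missing owner, out_of_scope, training_data, ethical_considerations.
```

Failing the build on an incomplete card turns "model card for every production model" from a policy document into an enforced invariant.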
## Anti-Patterns
| Anti-Pattern | Consequence | Fix |
|---|---|---|
| Black box in regulated domain | Regulatory violation, fines | Explainability required for high-risk AI |
| Post-hoc explanations only | Explanations may not reflect actual model logic | Use inherently interpretable models when possible |
| No model card | Nobody knows limitations or biases | Model card for every production model |
| Global explanations only | "Feature X is important overall" says nothing about THIS user | Both global and local explanations |
| Feature importance = causation | Correlation displayed as reason | Clear language: "associated with" not "caused by" |
Explainability is not a nice-to-have — it is the difference between a model that ships and a model that gets blocked by legal, compliance, or user distrust.