John de Graft-Johnson

AI/ML Engineer SME

Skills & Technology

Languages & Frameworks

AI & ML

RAG LLM-as-judge Agentic AI Anthropic Claude LangChain Embeddings pgvector Knowledge Graphs Neo4j XGBoost SHAP scikit-learn Prompt Engineering Responsible AI

Data, Cloud & Ops

Azure Azure Container Apps Cosmos DB AWS AWS Lambda AWS S3 AWS DynamoDB CloudFront API Gateway EventBridge dbt DuckDB Delta Lake PostgreSQL Docker CI/CD LLMOps MLOps Evals Time series

Healthcare & Compliance

HIPAA FHIR SNOMED CT OMOP CDM NICE ESF Tier B NHS Core20PLUS5 UK GDPR Art. 22

Experience —

Eight years across data engineering, applied ML, and AI platform delivery — recently focused on evaluation frameworks, fine-tuning, and responsible-AI tooling.

01
AI Engineering Lead · Swain Solutions LLC
Oct 2025 – Present
Washington, D.C.
- Shipped production agentic LLM systems end-to-end for regulated customers — a 32-tool MCP server for AI proposal intelligence, a Clinical Decision RAG grounded in NICE / SNOMED / FHIR R4, and a LangSmith-instrumented chat inspector with 7 evaluators — embedding with stakeholders to translate ambiguous requirements into shipped, customer-facing surfaces.
- Owned a 9-service Azure-native LLM platform plus a 37-Lambda AWS counterpart: TypeScript / Next.js front-ends on Vercel, FastAPI handlers, Cosmos DB / ADLS medallion data (dbt-duckdb bronze → silver → gold), sub-1ms CUDA GPU inference (BART/BERT); federated learning across 217 tickers with a live options calibration model over 19K+ rows.
- Fine-tuned open-weights models (Gemma family) with LoRA and preference-optimization workflows; built deterministic judges, paired LLM auditors, and weighted composite scorecards covering fairness, factuality, and compliance — instrumented with inspect_petri red-teaming, Fairlearn audits, and LangSmith observability so customers see evidence, not assertions.
- Embedded a 13-point Responsible-AI framework (HIPAA per NIST SP 800-66r2, CMMC, MHRA GMLP) into CI gates, IaC, and Solana-anchored audit trails; every customer deployment ships with a reproducible evidence pack rather than a checklist. Mentored engineers on cross-domain ownership (data + ML + infra + frontend) and customer-facing delivery standards.
PythonTypeScriptNext.jsLangChain / LangGraph / LangSmithMCPRAGLoRA / SFTvLLMAzureAWS
02
Product Analytics Manager · W.R. Grace
Apr 2024 – Oct 2025
Columbia, MD
- Spearheaded enterprise commercial and volume forecasting systems powering $700MM+ revenue planning, capital allocation, and scenario-based performance modeling; embedded with Finance, Operations, and executive stakeholders to translate ambiguous business questions into decision-ready forecasts.
- Shipped a forecast-accuracy measurement and reconciliation framework — instrumented as a production evaluation harness across business units — reducing forecast-to-actual variance by 18%.
- Set analytics platform strategy and cloud data architecture (AWS / Azure) for enterprise-scale modeling and demand planning; advanced ML and predictive-modeling frameworks for product performance, adoption velocity, and market risk; owned data integrity and enterprise analytics standards.
ForecastingScenario ModelingML / PredictiveAWS / AzureGovernance
03
Senior Business Management Analyst (Senior Data Analyst function) · W.R. Grace
Aug 2022 – Apr 2024
Columbia, MD
- Directed enterprise revenue-intelligence program across SAP and Salesforce ecosystems using SQL-driven analytics, delivering visibility into $600MM+ monthly performance and growth signals to executive stakeholders.
- Deployed statistical forecasting and performance models to production, reducing planning cycle time by 25% and stabilizing revenue predictability; engineered centralized pipelines integrating SAP, Salesforce, and operational datasets.
- Shipped executive-facing KPI visualization frameworks (Tableau, Power BI); applied segmentation and cohort modeling to surface evidence-based signals for go-to-market prioritization.
SAP / SalesforceSQLStatistical ForecastingTableau / Power BISegmentation
04
Manufacturing Leadership Program — Data & Process Engineering Rotations · W.R. Grace
Jul 2019 – Aug 2022
Baltimore, MD
- Led data integrity, governance, and compliance programs across regulated reporting environments; established standards and audit frameworks later scaled at the enterprise level.
- Designed Power BI dashboards with automated anomaly detection and process-deviation monitoring across multi-site operations; evidence-based interventions reduced system downtime by 28% and sustained 99% product quality.
- Formulated capacity-planning algorithms and volume-forecasting models in SQL and Python that reduced bottlenecks by 20% and stabilized production planning; refined predictive-modeling frameworks through algorithm tuning and feature engineering.
- Raised data accuracy by 14% through systematic predictive modeling, outlier remediation, and root-cause analysis; maintained scalable data models supporting performance monitoring and early volume forecasting.
Power BIAnomaly DetectionPredictive ModelingGovernanceSQL / Python

Selected Projects ★

Each project is shipped end-to-end — data pipeline, model, governance, deployed UI. Cards with an architecture map can be expanded inline.

LLM Evaluation & Oversight(2)

LLM Eval · OversightOpen Source

AI Proposal Intelligence

Production LLM Evaluation Harness · Scalable-Oversight Pattern

Summary: LLM-as-judge evaluation harness with paired auditors and a weighted composite scorecard — gates AI outputs before release.
Tech: Python · FastAPI · pydantic · pytest · CI/CD
Data & AI: Frontier LLM · LLM evaluation · LLM-as-judge · paired auditors · scalable oversight
Use Cases: Government / FSI proposal QA · AI eval pipelines · Scalable-oversight tooling

GitHub →Try It Out →

AI Eval HarnessOpen Source

Healthcare Dashboard Ops

LLM-as-Judge platform · Power BI · GIS · Forecast models

Summary: One spec produces a Power BI dashboard, GIS choropleth, and 12-month forecast — gated by 16 deterministic + LLM evaluators on 31M CMS Medicaid rows.
Tech: Python · Leaflet · Azure Container Apps
Data & AI: DuckDB · dbt · Power BI · Frontier LLM · LLM evaluation · SARIMA · Prophet
Standards: CMS Medicaid (T-MSIS)
Use Cases: Healthcare AI platforms · BI release gating · Medicaid / payer analytics

GitHub →Try It Out →

RAG / LLM Systems(1)

Clinical RAGLive

Clinical Decision Support RAG Assistant

Evidence-grounded answers · DOI-cited

Summary: DOI-anchored RAG over peer-reviewed biomedical evidence with knowledge-graph entity resolution and low-evidence fallback.
Tech: Python · TypeScript · Next.js · FastAPI
Data & AI: RAG · LangChain · pgvector · Neo4j (knowledge graph)
Standards: DOI citations · peer-reviewed sources
Use Cases: Biomedical evidence assembly · Member triage · Drug-target dossiers · Rare-disease cohorts

GitHub →Try It Out →

Machine Learning & Algorithms(1)

Clinical AILive

Patient Disengagement Prediction

NHS Primary Care · AI Decision Support

Summary: XGBoost early-warning model (AUC 0.94) for GP disengagement, with SHAP explainability, IMD fairness audit, and UK GDPR Art. 22 compliance.
Tech: Python · FastAPI · Next.js
Data & AI: XGBoost · SHAP · Neo4j · Responsible AI · fairness audit
Standards: OMOP CDM · SNOMED CT · QOF · UK GDPR Art. 22
Use Cases: NHS GP practices · ICB risk stratification · Equity-audited clinical AI

GitHub →Try It Out →

Geospatial Analytics(1)

Geospatial AILive

UK Health Map

NHS ICB Risk Visualisation

Summary: Drill-down NHS choropleth (national → ICB → practice) layered with disengagement risk, IMD, and CQC ratings on a Delta Lake silver layer.
Tech: Next.js · TypeScript · Leaflet · Python
Data & AI: Delta Lake · GeoJSON
Standards: NHS ICB · IMD quintiles · CQC ratings
Use Cases: ICB commissioning intelligence · NHS England planning · Population-health overlays

GitHub →Try It Out →

Responsible AI & Governance(1)

Responsible AIPilot

AI Health Equity Audit Tool

Bias Detection · NICE ESF Tier B

Summary: Automated fairness pipeline producing NICE ESF Tier B / Core20PLUS5-aligned equity reports as PDF + machine-readable JSON.
Tech: Python · FastAPI · ReportLab
Data & AI: fairlearn · Responsible AI · equalized-odds difference
Standards: NICE ESF Tier B · NHS Core20PLUS5 · UK GDPR Art. 22
Use Cases: Clinical AI governance · NHS equity audits · Model-monitoring artifacts

GitHub →Try It Out →

Capital Markets & Quant(2)

Algo TradingOpen Source

propfirmbot

Futures Strategy Framework · IBKR Adapter

Summary: MIT-licensed futures-trading framework with a DXY-confluence ORB strategy, broker-agnostic adapter boundary, and HTML backtest reports.
Tech: Python · pandas · ib_insync · pytest
Data & AI: backtest harness · DXY confluence gate · ORB / VCP / liquidity-sweep
Standards: MIT OSS · Interactive Brokers paper account
Use Cases: Prop-firm evaluation runs · Micro-gold ORB trading · Broker-portable strategy research

GitHub →Try It Out →

Capital MarketsLive

StockHub

Equity Research · Macro Signals

Summary: Live equity-research workspace with macro overlays, cached fundamentals, and portfolio risk analytics on a real-time market-data pipeline.
Tech: Next.js · TypeScript · Python · FastAPI · WebSockets
Data & AI: real-time market data · technical indicators · portfolio analytics
Use Cases: Retail equity research · Macro-overlay screening · Portfolio risk monitoring

GitHub →Try It Out →

Skills & Technology

Experience —

AI Engineering Lead · Swain Solutions LLC

Product Analytics Manager · W.R. Grace

Senior Business Management Analyst (Senior Data Analyst function) · W.R. Grace

Manufacturing Leadership Program — Data & Process Engineering Rotations · W.R. Grace

Selected Projects ★

LLM Evaluation & Oversight(2)

AI Proposal Intelligence

Healthcare Dashboard Ops

RAG / LLM Systems(1)

Clinical Decision Support RAG Assistant

Machine Learning & Algorithms(1)

Patient Disengagement Prediction

Geospatial Analytics(1)

UK Health Map

Responsible AI & Governance(1)

AI Health Equity Audit Tool

Capital Markets & Quant(2)

propfirmbot

StockHub