METASTATE I LTD · Innovate UK 2422 · Phase 1 Feasibility

TRUST-EVAL UK Phase 1

A sovereign, privacy-by-design benchmark and reasoning method for evaluating frontier AI on detection of AI-generated financial deception — synthetic identities, deepfake KYC documents, AI-authored fraud narratives, and AI-laundered transaction stories.

Verifiable · Explainable · UK-Sovereign Theme 4 · Fundamental AI Privacy-by-design · Synthetic-only METASTATE I LTD · Company No. 17187306

01The measurement gap

Why a sovereign UK evaluation standard is needed now

AOffence has industrialised.

Frontier generative models now produce convincing synthetic identities, deepfake identity documents and AI-authored fraud narratives at scale. Adversarial tooling is public, productised and improving every model generation.

BDefence has no shared ruler.

There is no UK standard that answers "how well does a given AI model detect AI-generated financial deception?" Institutions test ad-hoc, on private data, with no comparability. Without a benchmark there is no evidence-based procurement, no regulatory standard, and no way to track whether defence is keeping pace.

CWhy this is a sovereign concern.

AI-enabled financial fraud and synthetic-identity attacks are an economic-security threat with national-security adjacency. A sovereign, explainable, UK-grounded measurement capability is exactly the kind of public-interest AI infrastructure the regulatory agenda calls for.

02Method · constrain–generate–verify

The novel AI contribution — evaluated as a Phase 1 research question

Phase 1 investigates whether a reasoning-layer architecture that wraps a frontier detection model with a first-order-logic (FOL) predicate schema — so that every positive decision must produce a machine-checkable justification, or the system is required to abstain — measurably improves verifiability over an unconstrained frontier baseline.

The advance lies jointly in (a) the control algorithm (the constrain–generate–verify loop with a domain predicate library), (b) the rigorous evaluation methodology, and (c) their combination. Prior neuro-symbolic LLM-reasoning work (Logic-LM, LINC, Refiner) addresses general logical reasoning over text; this project specialises the technique for high-stakes adversarial detection with structural evidence guarantees.

At inference, the model emits a candidate decision and a justification over the predicate library. Code verifies the justification. If no verifiable justification can be constructed, the system abstains rather than asserting an unsupported claim.

03The validation instrument

TRUST-EVAL UK · three layers, one stack

Layer 1 · synthetic, no PII

Adversarial corpus + threat taxonomy

Synthetic-identity generator Deepfake-KYC document generator AI-authored fraud narrative gen AI-laundered transaction stories

▼

Layer 2 · verify, not guess

Evidence-gated evaluation harness

Stage 1 · extraction → structured JSON Stage 2 · deterministic code-validation Stage 3 · evidence-gated risk note Constrain–generate–verify reasoning layer

▼

Layer 3 · hosted to edge

Sovereign reference deployment

Hosted frontier endpoint (NVIDIA NIM-compatible) Local NIM container on owned hardware

Five-axis evaluation rubric

Named baseline · same frontier model without the constrain–generate–verify layer · identical seeds, splits, scoring protocol

Detection F1

Standard F1 against adversarial samples across the four attack classes.

Verifiability rate

Share of positive decisions for which a valid machine-checkable justification can be constructed.

Unsupported-claim rate

Share of positive decisions whose justification is rejected by the verifier.

Cost of justification

Tokens consumed per decision (input + output) versus the named baseline.

Abstention quality

Correct abstention when ground truth is deliberately absent, plus protocol re-run reliability.

Threat taxonomy v0.1

Four AI-generated financial-deception attack classes scoped for the Phase 1 PoC

Synthetic identity

Wholly fictitious individuals and composite, real-looking personas. Generated via frontier image generation and structured persona pipelines.

Deepfake KYC document

Passport, national-ID and driving-licence templates (specimen-only) with adversarial perturbation from a document-template engine.

AI-authored fraud narrative

Romance-scam scripts, business-email-compromise pretexts and pig-butchering scenarios authored by frontier LLMs and adversarially refined.

AI-laundered transaction story

Synthetic transaction graphs paired with plausible-cover narratives — a graph generator combined with an LLM cover-story.

04Sovereign by architecture

Anchored on hardware the applicant already owns and operates

The private-by-architecture claim is not aspirational. METASTATE I LTD operates an on-hand heterogeneous sovereign-compute testbed for cross-stack baselining. Frontier-model and tooling access uses the NVIDIA Developer ecosystem (NGC, NIM, DLI). The applicant's affiliated venture firm is a member of the NVIDIA Inception VC Alliance.

Primary inference

Olares One workstation

NVIDIA RTX 5090 Mobile · 24 GB VRAM · 96 GB RAM. Headless always-on inference server for the frontier-model evaluation runs.

ROCm comparator

Morefine M900 + AMD RX 7800 XT

16 GB GDDR6 via OCuLink — a parallel comparator node for cross-stack baselining and ROCm experimentation.

Always-on background

AMD Ryzen AI 7 350 NPU

XDNA architecture · ~50 TOPS · energy-efficient processor for embedding generation, RAG indexing and other always-on auxiliary tasks.

05Phase 1 → Phase 2 trajectory

Four work packages across three months · feasibility, not shipped product

WP1 · Month 1

Taxonomy & governance

Validate the threat taxonomy with a UK practitioner panel; produce the data-governance and ethics framework.

WP2 · Month 1–2

Method & data

Build the privacy-by-design synthetic-data pipeline and PoC test-set; conduct the constrain–generate–verify feasibility study; design the verifier interface.

WP3 · Month 2–3

Harness & evaluation

Build the reproducible scoring harness on the five-axis rubric; run baseline-versus-method comparisons across at least three frontier and open model families.

WP4 · Month 3

Reporting & Phase 2

Phase 2 technical report (mandatory output); consortium letters of intent; Phase 2 collaborative-R&D bid skeleton.

Phase 2Collaborative R&D · 24–32 months

If Phase 1 confirms feasibility, Phase 2 scales the corpus to multi-jurisdictional production-scale, productionises the evaluation harness as a managed service, and secures UK regulatory and bank reference adoption. Phase 2 is a separate UKRI / Innovate UK competition; this site reflects the Phase 1 feasibility study only.

06Team & intended partners

Single-applicant Phase 1 · consortium assembly is itself a Phase 1 deliverable

LeadMETASTATE I LTD

Lead by Vladislav (Slava) Solodkiy: founder of a US-licensed compliance-first digital bank (IFE-065); co-author of venture-capital fund regulation in Singapore (MAS); author of two books on fintech and on AI infrastructure; published commentary on AI-enabled fraud and OSINT-led compliance. Domain authority sits squarely at the intersection the project requires — frontier-AI engagement and regulated-finance evidence standards.

To anchor the neuro-symbolic / FOL technical depth that is adjacent to but outside the applicant's core domain, Phase 1 budgets a small advisory subcontract (a handful of expert-days) with a UK academic or industry specialist in neuro-symbolic LLM reasoning.

Intended Phase 2 consortium · engagement in progress

UK SME · grant claimant

UK fintech SME AI-accounting / payments operator with compliance footprint

Academic collaborator

CCAF Cambridge Cambridge Centre for Alternative Finance · Cambridge Judge Business School

UK SME · domain

UK RegTech Fraud-detection / synthetic-data specialist for benchmark co-design

End-user

UK bank Tier-1 or challenger bank as reference adopter and validation site

07Governance & compliance

Privacy-by-design · UK-resident · evidence-led

A · Data

Synthetic-only corpus.

The benchmark is built entirely on synthetic data — no real personal information enters the system at any stage. UK GDPR Article 4(1) does not apply. ICO synthetic-data guidance followed throughout.

B · Dual-use

Detection-side release only.

Only the detection and evaluation artefacts are released publicly. Adversarial generation material is limited-fidelity and held under governance review; an ethics review is mandatory before any release.

C · Sovereignty

All activities in the UK.

Phase 1 work is carried out by a UK-registered SME on owned UK-based hardware. Exploitation intent is UK-based. The benchmark is offered as a public good toward UK standard-setting.

D · Honesty

Conditional claims.

Phase 1 is a feasibility study. The constrain–generate–verify approach is evaluated as a research question, not asserted as an existing capability. Numeric targets are empirical end-of-Phase-1 targets, to be confirmed by the work itself.

E · IP posture

Conditional, prospective.

If Phase 1 validates the approach, the IP position will rest on trade-secret protection of the synthetic-data generation pipeline and selective patenting of method components — the exact split is itself a Phase 1 deliverable.

F · Exploitation

Open + commercial split.

Open benchmark interface as a public good; commercial evaluation-as-a-service and licensed dataset slices for institutional and vendor use; certified scoring for regulator-aligned procurement.