Evaluating Behaviour and Risk in AI Systems

Behavioural studies, evaluation methods, and strategic thinking on AI quality, for engineers and testers building trustworthy AI.

Independent analysis of AI quality.

Practical thinking on AI quality and strategy.

I write about benchmarks, red-teaming, evaluation frameworks, behavioural testing, production monitoring, and the messy reality in between. No single method solves AI quality; I'm here to map the landscape honestly.