I was afraid of AI reaching mental healthcare and I was wrong.
This is what I learned after putting the work in.
A note before we start. This piece deals directly with suicide, self-harm and severe mental illness, including details from a specific case involving a young personās death. If thatās difficult territory for you right now, please look after yourself and come back to this another time. If youāre struggling, the Samaritans are on 116 123.
And, for transparency, I have no commercial conflicts to declare on this one. None of the companies named (Limbic, ieso, Ranvier) are SomX clients or compete with any of our clients, and nobody has paid for, sponsored, or seen this publication before. The companies appear because their founders came on my podcast and the work is genuinely fascinating and a frontier in healthtech that I want to talk about.
Half of NHS Talking Therapies patients donāt recover. Thatās a strange thing to say without urgency, isnāt it? But we do.
At the last count, 50.2% of patients who completed a course of talking therapy came out of the other side recovered (which, by the way, was the target). Now thereās a new target of 53% by 2028ā29, which the system is proud of working toward, and would mean⦠letās just say it out loud⦠that a national achievement three years from now will be that 47% of mental health patients still donāt get better.
Itās hard to imagine any other clinical specialty setting that target. Itās actually impossible to imagine that. Why are we ok with mental healthcare not being effective for half the population?
And why doesnāt it work? Do we know who needs what, whether the therapy they got was any good, whether they actually got better? And whatās happening now, [trigger warning] when 1.2 million people a week are expressing suicidal ideation to ChatGPT (that wouldāve read as hyperbole a year ago, and which is, in fact, OpenAIās own published number)?
What looks to be finally forcing a change is technology, which the sector (myself included) spent the last couple of years being most afraid of. Not because AI will be scaled up to deliver therapy (thatās a different argument/discussion and Iāll come to that later) but because for the first time in the history of the specialism, the conversation generates a record, the record generates data, and the data makes the guesswork visible.
Historically, mental health has involved a fair amount of⦠well⦠guesswork. Thatās hard to actually say out loud and, by the way, Iām used to an industry that uses sensationalist language for fun (and commercial gain), so I donāt take joy in ever doing that. But hear me out and Iāll try to explain.
A Likert scale in a stressed waiting room

The standard way to diagnose depression in UK primary care, in 2026, is to ask the patient to score nine statements between 0 and 3 on a form they fill out in a waiting room, often having just disclosed to a GP for the first time that theyāre struggling. Thatās called a PHQ-9. It broadly determines whether they get talking therapy, antidepressants, both, or neither, and which kind.
Now, indulge me, imagine the cardiology version. āWeāre going to ask you nine questions about your chest, and then decide between an angioplasty and a triple bypass.ā Yes, itās not a direct comparison, but you get my point. Putting a subjective layer of patient reporting between the biological signal and the diagnostic input is far from ideal. Thatās actually many layers of input and output influenced by the physical and metaphysical.
Ben Finley, who I had on my podcast recently (came up via Medtronic, drug development, and Publicis, now an advisor to Ranvier, a biotech company looking at new biomarkers for mental health conditions) put it pretty directly:
āWeāve normalised the guesswork in mental health in ways that wouldnāt be acceptable in any other field of medicine.ā
Now, skilled clinicians and patient-advocates may argue that the patient reported signal is the signal and that the subjectivity and subsequent answers are actually to be interpreted. In part, I agree, and I fondly remember my liaison psychiatry job really honing those history taking and interpretation skills to find the real signal. I also remember a pain consultant who once said to me āif a patient communicates pain, they are in pain,ā which stuck with me in medicine and in life. But at the same time, I donāt think Benās wrong either. Objective biomarker measurements plus patient reported symptoms is, frankly, a much better data set to inform diagnosis and track recovery.
But weāve never had those biomarkers. Thereās no biomarker to depression like troponin is to an MI. The patientās own self-report on appetite, sleep, mood scores and others are, historically, all weāve had.
The good news is that this is all changing in two places at once. The earlier-stage version is the work coming out of David Nuttās lab on serotonin taste sensitivity as a surrogate biomarker, now being commercialised by Ranvier. You did, indeed, read that right, theyāre creating an actual taste test for mood (you taste less-well in a depressed state). Whether that specific product makes it through trials or not, letās agree that the direction is interesting and exciting, i.e. towards objective, measurable, biochemical correlates of mental state. It looks like weāll get there in the next decade.
How is technology helping?
For context, letās take a typical NHS Talking Therapies diagnosis. The patient has waited weeks for a phone assessment. The phone rings on a Wednesday afternoon. The clinician (overstretched) has fifteen questions to get through. The patient, who hasnāt said any of this out loud before, has to organise their whole life into the right slots in the right order before the clinician can decide which pathway they belong on. About half the time, they end up on the wrong one. Three sessions later, the new therapist works out the diagnosis was wrong and they get moved.
Now picture the newer version (with technology thatās been deployed across 45% of NHS Talking Therapies services). The patient self-refers at 2am on a Tuesday, when theyāre actually willing to talk about it. They have an hour, not ten minutes, and theyāre talking to a system that has read the entire clinical literature on the differential diagnoses they might be presenting with. A clinical reasoning layer behind the chatbot (crucially, not the chatbot itself, which matters, and Iāll come back to) predicts the right pathway. Published in Nature Medicine on 129,400 patients across 28 NHS services, the downstream pathway changes drop by 45%. And diagnostic reliability across depression, PTSD and generalised anxiety hits 93%. See references at the end.
NICE looked at this last year, cautiously. Much of the evidence base is company-produced. Thereās an open question about what happens to people triaged out of services entirely. Both fair, both resolvable. The point is that mental health diagnosis has stopped being a thing only a human can do and started being a thing a machine can do reliably enough to be class-2A regulated.
Itās important to note here that Iām on record saying that āAI should never be placed at the point of human suffering, alone.ā That position was based on a romanticised version of what Iād want healthcare to be. Iām a clinician at heart, my bias toward humans is obvious, but donāt we all want a human to care for us? Well, maybe, but that position turned out to be indefensible against the backdrop of people getting no support at all. I was oddly stubborn for a tech evangelist, but hey, I learned new information and changed my mind. Iād be interested in your views once youāve learned what a clinical reasoning layer is too (coming shortly in this piece).
The therapy nobody watched
The second guesswork problem might be the most structurally awkward of the four, and Iām not sure Iāve fully worked out where I sit on it.





