DeepScience

DeepScience — Mental Health

DeepScience

Mental Health · Daily Digest

June 10, 2026

273

Papers

9/9

Roadblocks Active

Connections

⚡ Signal of the Day

• A wave of computational depression-detection papers arrived today — spanning EEG, speech acoustics, and LLMs — but zero cross-paper connections were found, suggesting parallel development silos rather than a converging field.

• The most practically important finding is a diagnostic audit revealing that EEG foundation models universally encode subject identity 13–89x more strongly than clinical labels, meaning many published accuracy claims may be inflated by patient-fingerprinting rather than genuine biomarker learning.

• Watch the interplay between the speech-based screening papers (InfoShield, EmoTrack, perceptual features) and the LLM-screening benchmark: both tracks are converging on clinical interview data, and the first group to solve privacy preservation at acceptable diagnostic accuracy will define deployment standards.

📄 Top 10 Papers

Exploration of Perceptual Speech Features for Clinical Decision-Support in Mental Health Care

The study finds that vocal irregularities like pitch instability (jitter, shimmer) and patterns in word choice and sentence structure reliably track symptom severity for depression, anxiety, and ADHD across five separate datasets. Crucially, it uses interpretable models with SHAP explanations so clinicians can see which specific speech cues drove each prediction — not just a black-box score. This matters because clinician-interpretable speech biomarkers could support screening in settings where trained professionals are scarce, as long as the proprietary dataset limitation is addressed.

██████████ 0.9 depression-biomarkers Preprint

Read Save Connections

TimeSRL: Generalizable Time-Series Behavioral Modeling via Semantic RL-Tuned LLMs -- A Case Study in Mental Health

TimeSRL converts raw passive-sensing data (phone usage, accelerometer, etc.) into natural-language behavioral descriptions, then uses reinforcement learning to train a language model to predict anxiety and depression scores from those descriptions rather than raw numbers. Testing the model on datasets it was never trained on — a stringent real-world simulation — showed 3–44% error reduction over prior approaches. The semantic bottleneck is the key mechanism: by forcing the model to reason in human-readable behavioral terms, it learns representations that transfer across different populations and sensing setups.

██████████ 0.8 depression-biomarkers Preprint

Read Save Connections

Beyond Augmentation: Score-Guided Pathological Prior for EEG-based Depression Detection

This paper tackles a core practical barrier in EEG-based depression screening: small clinical samples and incompatible electrode configurations across hospitals. Instead of fabricating extra training data, it trains a generative model only on healthy brain signals, then measures how abnormal a patient's EEG looks relative to that healthy baseline — a 'pathological prior' score that supplements the classifier. A spatial adaptation module also handles the problem of different hospitals using different electrode placements, enabling multi-center data pooling that was previously infeasible.

██████████ 0.8 depression-biomarkers Preprint

Read Save Connections

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

Across all 12 tested combinations of EEG foundation models and datasets, frozen model representations encoded individual subject identity 13 to 89 times more strongly than actual clinical labels — and fine-tuning made this worse, not better. When a linear erasure step removed the identity axis before classification, label decoding improved by 6–27 percentage points. This is a structural validity problem: models may appear to detect depression or other conditions when they are actually recognizing individual people, which would make them useless for any new patient.

██████████ 0.8 depression-biomarkers Preprint

Read Save Connections

InfoShield: Privacy-Preserving Speech Representations for Mental Health Screening via Information-Theoretic Optimization

InfoShield strips demographic information (gender, age) from speech features used for depression screening, reducing gender inference from 92.6% to near-chance (55.5%) while maintaining depression classification performance. The key technical contribution is a new estimator called TimeAwareMINE that handles the mismatch between sequential speech signals and static demographic attributes — a problem that caused standard methods to fail. Privacy-preserving speech analysis is a prerequisite for deployment of speech-based mental health tools in regulated healthcare settings.

██████████ 0.8 depression-biomarkers Preprint

Read Save Connections

EmoTrack: Robust Depression Tracking from Counseling Transcripts across Session Regimes

EmoTrack combines LLM-extracted clinical signals with sentence-level semantic embeddings to predict PHQ-8 depression scores from therapy transcripts, achieving 13.5% lower error than the best single-session baseline. An attention-based memory module lets the system also track how symptoms change across multiple therapy sessions — something most prior work ignored. The ability to monitor depression trajectories rather than just snapshots is directly relevant to measuring whether therapy is working.

██████████ 0.8 depression-biomarkers Preprint

Read Save Connections

Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence

Most algorithms for personalizing digital therapeutics treat patient engagement as a fixed context variable, but this paper argues adherence is endogenous — past behavior shapes future engagement capacity in a dynamic feedback loop. It models this with a linear dynamical system and proves that the UCB-BOLD algorithm achieves provably sublinear regret even under this more realistic setup. The practical implication is that treatment recommendation systems should account for engagement momentum rather than treating each session as independent, particularly for patients prone to dropout.

██████████ 0.8 digital-therapeutics Preprint

Read Save Connections

When Symptoms Are Not Enough: Evidence-Weighting Patterns in Large Language Model Psychiatric Screening

Testing five LLMs on 555 structured clinical interview transcripts, accuracy ranged from near-chance (0.49) to 0.86 depending on the disorder and model, with Matthews correlation coefficients — a more stringent metric — topping out at only 0.38. False negatives in anxiety and PTSD screening often occurred not because symptoms were absent but because LLMs weighted protective factors (coping ability, social support) too heavily, mirroring a known clinician bias. This tells us current LLMs cannot be used as standalone screeners but may have value as decision support if their evidence-weighting tendencies are corrected.

██████████ 0.8 digital-therapeutics Preprint

Read Save Connections

Dep-LLM: Training-Free Depression Diagnosis via Evidence-Guided Structured Multi-factor with Reliable LLM Reasoning

Dep-LLM improves LLM-based depression diagnosis without any model retraining by decomposing clinical interview text into five clinically-aligned themes and using token-level probability entropy to flag uncertain reasoning before aggregating evidence. Tested across 21 language models, it consistently outperforms zero-shot prompting on nine classification metrics. The training-free approach is significant for practical deployment: clinical teams could apply it to existing frozen models without the cost or safety concerns of fine-tuning on sensitive patient data.

██████████ 0.8 depression-biomarkers Preprint

Read Save Connections

Exploring Profiles of Cognitive Distortions Associated with Mental Health Disorders

Applying text analysis to a large Reddit dataset of self-reported mental health conditions, the study finds that cognitive distortions — systematic errors in thinking like catastrophizing or mind-reading — are more prevalent across all nine mental health groups compared to controls, but the patterns of distortions look surprisingly similar across different diagnoses. This challenges the assumption that distinct disorders have distinct cognitive profiles, at least as measured in online self-disclosure text. The finding is relevant for transdiagnostic therapy approaches that target distortions across multiple conditions simultaneously.

██████████ 0.7 computational-psychiatry Preprint

Read Save Connections

🔬 Roadblock Activity

Roadblock	Papers	Status	Signal
Computational Psychiatry	146	Active	High volume day dominated by LLM and signal-processing approaches; the EEG identity trap audit raises a methodological red flag that may require re-evaluation of previously published foundation model results.
Depression Biomarkers	60	Active	Multiple converging approaches — EEG anomaly scoring, speech acoustics, and counseling transcript analysis — each showed incremental progress, but the identity trap finding in EEG models is an important caveat for the entire biomarker track.
Digital Therapeutics	55	Active	Adherence modeling and LLM screening benchmarks both advanced today, with a new theoretical framework for handling dynamic patient engagement in treatment recommendation algorithms.
Neuroplasticity Interventions	38	Active	Low direct signal today; behavioral fine-tuning of LLMs to simulate maladaptive patterns is a peripheral contribution to understanding plasticity mechanisms.
Youth Mental Health Crisis	29	Active	Two low-confidence integrative reviews on adolescent mobile phone addiction appeared but provided no new empirical data; substantive youth-focused research was sparse today.
Sleep and Circadian Psychiatry	17	Active	Sleep EEG criticality analysis for predicting cognitive decline showed statistically significant group differences (p≤0.001) in non-REM stages, offering a non-invasive monitoring angle relevant to psychiatric populations.
Neuroinflammation	10	Active	No high-signal papers reached the top tier today; activity remained background-level with no notable mechanistic advances.
Treatment-Resistant Depression	4	Open	Minimal activity; EEG anomaly scoring and foundation model auditing papers touched this roadblock peripherally but did not address treatment resistance directly.
Gut-Brain Axis	3	Open	Effectively silent today with only three papers and none reaching top-paper relevance thresholds.

View Full Analysis

DeepScience — Cross-domain scientific intelligence
Sources: arXiv · OpenAlex · Unpaywall
deepsci.io

Unsubscribe