RoadblockArtificial IntelligenceProgressing

Mechanistic interpretability

Understanding the internal computations of neural networks at the level of individual features and circuits remains extremely challenging. Sparse autoencoders have revealed interpretable features in medium-scale models, but scaling these techniques to frontier models with hundreds of billions of parameters is an open problem. Key questions include whether models represent concepts in superposition, how to extract faithful causal explanations of model behavior, and whether mechanistic understanding can yield practical safety guarantees.

Mechanistic interpretability

Knowing the Self, Understanding the World: A Dual-Cognition Benchmark for UAV Spatio-temporal Reasoning with MLLMs

FVAttn: Adaptive Sparse Attention with Runtime Load Balancing for Video Generation

PagedWeight: Efficient MoE LLM Serving with Dynamic Quality-Aware Weight Quantization

A Blueprint for Equilibrium-Based Differentiable Continuous-Variable Thermodynamic Computing

Vision-Language Assistant for Emotional Reactions to Risky Driving

Cluster-Aware Matching via Laplacian Optimal Transport

Physics-enhanced reinforcement learning for real-time optimal control of dynamical systems

Evaluating Open-Weight LLMs for Generating Structured Threat Information for Autonomous Vehicle Vulnerabilities

Vision-Language-Motion Maps: An Open-Vocabulary, Uncertainty-Aware, Queryable Motion Attribute for 3D Scene Maps

When Does Muon Help Agentic Reinforcement Learning?