Brain-Inspired 'Sleep' Cycles Help AI Remember Longer
Researchers from the University of Texas at San Antonio, the University of Rochester, and other institutions have introduced SHARP — Sleep-based Hierarchical Accelerated Replay — a framework designed to help AI systems learn from data that arrives as a continuous stream without the ability to revisit the past.
Most modern sequence models, including recurrent neural networks (RNNs) and transformer architectures, struggle with so-called "long-range" learning: understanding how events separated by large gaps in time relate to one another. RNNs are limited by how far back they can propagate learning signals, while transformers are constrained by a fixed input window size.
SHARP addresses this by splitting temporal learning into two components: a memory module that builds a structured history of past inputs, and a pattern-recognition module that works over that history. Crucially, it introduces periodic "sleep" phases — offline intervals in which stored memory traces are replayed in compressed, accelerated form and integrated into higher-level representations.
The inspiration comes directly from neuroscience. Studies in rodents have shown that during slow-wave sleep, the brain replays experiences at faster-than-real-time speeds, a process thought to be central to memory consolidation. SHARP applies an analogous mechanism to artificial neural networks.
Tested on standard language modelling benchmarks including text8 and PG-19, SHARP outperformed recurrent baselines, retaining predictive performance on previously seen data while continuing to learn from new inputs. Notably, the framework achieves an exponentially increasing effective temporal context with only linear computational cost — a meaningful efficiency gain for real-world deployment.
Gravity as a Guide for Smarter Wearable AI
Separately, a team from the University of Electro-Communications in Tokyo has tackled a different but equally stubborn problem: getting compact language model-based AI to reliably distinguish low-movement activities — such as standing, sitting, and lying — from more dynamic ones like walking or running, using wearable sensor data.
Recent "SensorLLM" approaches align motion sensor data with language representations before fine-tuning for activity recognition. However, the researchers found that when the underlying model is compressed to a smaller, deployable size — such as TinyLlama — recognition of static postures degrades significantly, even as dynamic activity detection holds up.
Their solution is a "gravity-aware hierarchical routing head," a lightweight add-on that extracts statistical cues related to posture and the direction of gravitational force from existing sensor data. The system uses soft routing to blend two specialised sub-models — one optimised for static classes, one for dynamic — with a load-balancing mechanism to ensure stable training.
On the MHealth dataset, the approach meaningfully improved macro-F1 scores, particularly for static activity classes, while adding minimal computational overhead. The authors describe the paper as a preliminary disclosure, with broader dataset evaluation planned.
Shared Themes
Though addressing different domains, both studies reflect a broader trend in applied AI research: closing the gap between laboratory performance and practical constraints. SHARP targets the challenge of continuous, real-world data streams; the gravity-aware routing method targets the challenge of running capable models on constrained hardware. Both demonstrate that targeted, domain-informed design choices can yield outsized improvements over general-purpose scaling.