惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

A
Arctic Wolf
V
V2EX
P
Proofpoint News Feed
The Hacker News
The Hacker News
GbyAI
GbyAI
G
Google Developers Blog
S
Schneier on Security
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
W
WeLiveSecurity
Security Archives - TechRepublic
Security Archives - TechRepublic
博客园 - Franky
Recent Announcements
Recent Announcements
腾讯CDC
Hacker News - Newest:
Hacker News - Newest: "LLM"
K
Kaspersky official blog
U
Unit 42
Engineering at Meta
Engineering at Meta
J
Java Code Geeks
Google Online Security Blog
Google Online Security Blog
Last Week in AI
Last Week in AI
V
Vulnerabilities – Threatpost
N
News and Events Feed by Topic
O
OpenAI News
量子位
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
Y
Y Combinator Blog
博客园 - 【当耐特】
Vercel News
Vercel News
Hacker News: Ask HN
Hacker News: Ask HN
T
Tor Project blog
Apple Machine Learning Research
Apple Machine Learning Research
Microsoft Security Blog
Microsoft Security Blog
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
AWS News Blog
AWS News Blog
MongoDB | Blog
MongoDB | Blog
S
Security Affairs
A
About on SuperTechFans
Project Zero
Project Zero
D
Darknet – Hacking Tools, Hacker News & Cyber Security
博客园 - 聂微东
Webroot Blog
Webroot Blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
Cloudbric
Cloudbric
T
Tenable Blog
月光博客
月光博客
C
Check Point Blog
宝玉的分享
宝玉的分享
V
Visual Studio Blog
T
The Blog of Author Tim Ferriss
NISL@THU
NISL@THU

cs.NE updates on arXiv.org

Preisach Attention: A Hysteretic Model of Sequential Memory Vector Policy Optimization: Training for Diversity Improves Test-Time Search Cross-Species RSA Reveals Conserved Early Visual Alignment but Divergent Higher-Area Rankings Across Human fMRI and Macaque Electrophysiology Temporal Coding as a Substrate for Sensorimotor Object Inference: A Spiking Reinterpretation of Thousand Brains Architecture Engineering Hybrid Physics-Informed Neural Networks for Next-Generation Electricity Systems: A State-of-the-Art Review Dropout Universality: Scaling Laws and Optimal Scheduling at the Edge-of-Chaos Approximation Theory for Neural Networks: Old and New How to Build Marcus's Algebraic Mind: Algebro-Deterministic Substrate over Galois Fields Genetic Programming with Transformer-Based Mutation for Approximate Circuit Design E-ReCON: An Energy- and Resource-Efficient Precision-Configurable Sparse nvCIM Macro for Conventional and Spiking Neural Edge Inference Weight Decay Regimes in Grokking Transformers: Cheap Online Diagnostics What Do Evolutionary Coding Agents Evolve? Training Neural Networks with Optimal Double-Bayesian Learning optimize_anything: A Universal API for Optimizing any Text Parameter Closed-form predictive coding via hierarchical Gaussian filters Scalable, Energy-Efficient Optical-Neural Architecture for Multiplexed Deepfake Video Detection Information Processing Capacity of Stationary Physical Systems: Theory, Data-efficient Estimation Methods, and Photonic Demonstration GOAL: Graph-based Objective-Aligned Diffusion Solvers for Dynamic Multi-Objective Optimization Self-supervised local learning rules learn the hidden hierarchical structure of high-dimensional data When Fireflies Cluster; Enhancing Automatic Clustering via Centroid-Guided Firefly Optimization Spiker-LL: An Energy-Efficient FPGA Accelerator Enabling Adaptive Local Learning in Spiking Neural Networks Stability and Discretization Error of State Space Model Neural Operators Deep Reinforcement Learning Framework for Diversified Portfolio Management Across Global Equity Markets Evolutionary Extreme Learning Machine of ab-initio Energy Landscapes for Crystal Structure Prediction using Manta Ray Optimization with Levy Flight Scalable neuromorphic computing from autonomous spiking dynamics in a clockless reconfigurable chip MO-CAPO: Multi-Objective Cost-Aware Prompt Optimization Structure Abstraction and Generalization in a Hippocampal-Entorhinal Inspired World Model Bridging Silicon and the Hippocampus: Algebro-Deterministic Memory "VaCoAl" as a Substrate for Vector-HaSH and TEM Towards Code-Oriented LM Embeddings for Surrogate-Assisted Neural Architecture Search Perforated Neural Networks for Keyword Spotting On the Stability of Growth in Structural Plasticity NeuroTrain: Surveying Local Learning Rules for Spiking Neural Networks with an Open Benchmarking Framework An Amortized Efficiency Threshold for Comparing Neural and Heuristic Solvers in Combinatorial Optimization Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning Mechanistic Interpretability of EEG Foundation Models via Sparse Autoencoders Embodied Neurocomputation: A Framework for Interfacing Biological Neural Cultures with Scaled Task-Driven Validation ToolMol: Evolutionary Agentic Framework for Multi-objective Drug Discovery Solve the Loop: Attractor Models for Language and Reasoning A Family of Quaternion-Valued Differential Evolution Algorithms for Numerical Function Optimization Scaling Laws and Tradeoffs in Recurrent Networks of Expressive Neurons Multi-Timescale Conductance Spiking Networks: A Sparse, Gradient-Trainable Framework with Rich Firing Dynamics for Enhanced Temporal Processing Self-organized MT Direction Maps Emerge from Spatiotemporal Contrastive Optimization Breaking Global Self-Attention Bottlenecks in Transformer-based Spiking Neural Networks with Local Structure-Aware Self-Attention Decomposing Evolutionary Mixture-of-LoRA Architectures: The Routing Lever, the Lifecycle Penalty, and a Substrate-Conditional Boundary Causal Explanations from the Geometric Properties of ReLU Neural Networks Prospective Compression in Human Abstraction Learning Parameter-Efficient Neuroevolution for Diverse LLM Generation: Quality-Diversity Optimization via Prompt Embedding Evolution EvoPref: Multi-Objective Evolutionary Optimization Discovers Diverse LLM Alignments Beyond Gradient Descent Discovery of Nonlinear Dynamics with Automated Basis Function Generation Sparsity Moves Computation: How FFN Architecture Reshapes Attention in Small Transformers Evolutionary Ensemble of Agents ARES-LSHADE: Autoresearch-Enhanced LSHADE with Memetic Polish for the GNBG Benchmark AHD Agent: Agentic Reinforcement Learning for Automatic Heuristic Design Globally Optimal Training of Spiking Neural Networks via Parameter Reconstruction Discovering Ordinary Differential Equations with LLM-Based Qualitative and Quantitative Evaluation Same Brain, Different Prediction: How Preprocessing Choices Undermine EEG Decoding Reliability Every Feedforward Neural Network Definable in an o-Minimal Structure Has Finite Sample Complexity GEAR: Genetic AutoResearch for Agentic Code Evolution A Unified Measure-Theoretic View of Diffusion, Score-Based, and Flow Matching Generative Models CoupleEvo: Evolving Heuristics for Coupled Optimization Problems Using Large Language Models MDN: Parallelizing Stepwise Momentum for Delta Linear Attention Graph Normalization: Fast Binarizing Dynamics for Differentiable MWIS Direct From Darwin: Deriving Advanced Optimizers From Evolutionary First Principles On the Influence of the Feature Computation Budget on Per-Instance Algorithm Selection for Black-Box Optimization DALight-3D: A Lightweight 3D U-Net for Brain Tumor Segmentation from Multi-Modal MRI S-AI-Recursive: A Bio-Inspired and Temporal Sparse AI Architecture for Iterative, Introspective, and Energy-Frugal Reasoning QUIVER: Cost-Aware Adaptive Preference Querying in Surrogate-Assisted Evolutionary Multi-Objective Optimization Unifying Dynamical Systems and Graph Theory to Mechanistically Understand Computation in Neural Networks Indian Wedding System Optimization (IWSO): A Novel Socially Inspired Metaheuristic with Operational Design and Analysis Physics-Modeled Neural Networks Elastic Spiking Transformers for Efficient Gesture Understanding MPCS: Neuroplastic Continual Learning via Multi-Component Plasticity and Topology-Aware EWC Combining Trained Models in Reinforcement Learning HERCULES: Hardware-Efficient, Robust, Continual Learning Neural Architecture Search Training Non-Differentiable Networks via Optimal Transport ShiftLIF: Efficient Multi-Level Spiking Neurons with Power-of-Two Quantization Probe-Geometry Alignment: Erasing the Cross-Sequence Memorization Signature Below Chance Benchmarking local Hebbian learning rules for memory storage and prototype extraction Robust volatility updates for Hierarchical Gaussian Filtering Spiking Sequence Machines and Transformers Affinity Is Not Enough: Recovering the Free Energy Principle in Mixture-of-Experts Scalable Learning in Structured Recurrent Spiking Neural Networks without Backpropagation Geometric and dynamical analysis of attractor boundaries and storage limits in kernel Hopfield networks Attractor FCM Physical Foundation Models: Fixed hardware implementations of large-scale neural networks When Does Structure Matter in Continual Learning? Dimensionality Controls When Modularity Shapes Representational Geometry Learning to Forget: Continual Learning with Adaptive Weight Decay Causal Learning with Neural Assemblies NORACL: Neurogenesis for Oracle-free Resource-Adaptive Continual Learning Text-Utilization for Encoder-dominated Speech Recognition Models EdgeSpike: Spiking Neural Networks for Low-Power Autonomous Sensing in Edge IoT Architectures Neuromorphic Graph Anomaly Detection via Adaptive STDP and Spiking Graph Neural Networks EvoTSC: Evolving Feature Learning Models for Time Series Classification via Genetic Programming Analysis and Explainability of LLMs Via Evolutionary Methods Deployment-Aligned Low-Precision Neural Architecture Search for Spaceborne Edge AI SeaEvo: Advancing Algorithm Discovery with Strategy Space Evolution Primitive Recursion without Composition: Dynamical Characterizations, from Neural Networks to Polynomial ODEs MAEO: Multiobjective Animorphic Ensemble Optimization for Scalable Large-scale Engineering Applications Necessary and sufficient conditions for universality of Kolmogorov-Arnold networks Generalization Bounds of Spiking Neural Networks via Rademacher Complexity
From Compression to Deployment: Real-Time and Energy-Efficient FastGRNN on Ultra-Constrained Microcontrollers
[Submitted on 15 Jun 2026] · 2026-06-17 · via cs.NE updates on arXiv.org

View PDF HTML (experimental)

Abstract:The dominant trajectory of modern machine learning has been to scale up: larger models, larger accelerators, larger memory budgets. Yet a multi-year global semiconductor supply constraint and the growing energy and carbon cost of always-online inference expose the fragility of this trajectory and motivate the opposite direction: refactoring AI and ML algorithms to fit the small, ubiquitous microcontrollers already in mass production in wearables, sensors, and edge appliances. We present an end-to-end open-source reproduction of FastGRNN, a compact gated recurrent cell, deployed on two bare-metal targets: the 8-bit Arduino (ATmega328P) and the 16-bit MSP430 (no hardware multiplier; 16 KB Flash; 512 B SRAM). Our compression pipeline combines low-rank weight factorization, iterative hard-thresholding sparsity, and per-tensor Q15 post-training quantization with explicit activation calibration. The deployed model occupies 566 bytes of weights and achieves macro F1 = 0.918 (seed 0; five-seed Q15 mean 0.853+-0.107) on the HAPT test set. It matches a PyTorch reference at 100% prediction agreement across 3,399 test windows (MCU seed 0; 99.91-100% C-equivalent across five seeds). Both platforms sustain real-time 50 Hz streaming inference (9.21 ms per sample on Arduino; 13 ms on MSP430), where a 256-entry sigmoid/tanh look-up table delivers a 30.5x speedup on the multiplier-less MSP430. Four contributions extend the original FastGRNN paper: (i) cross-platform bit-equivalent deterministic inference; (ii) characterization of recurrent warm-up latency (median 74 samples, 1.48 s; worst-case 125 samples, 2.50 s over 100 test windows); (iii) a deployable look-up-table recipe for multiplier-less embedded targets; and (iv) hardware energy characterization showing 17.7 mW active inference power, <0.09 mW idle power, and 96.7% energy reduction with the LUT.

Submission history

From: Emre Can Kızılateş [view email]
[v1] Mon, 15 Jun 2026 19:49:38 UTC (250 KB)