AdLU: Adaptive double parametric activation functions


Güney Duman M., KOPARAL S., ÖMÜR N., ERTÜRK A., Aptoula E.

Digital Signal Processing: A Review Journal, cilt.168, 2026 (SCI-Expanded) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 168
  • Basım Tarihi: 2026
  • Doi Numarası: 10.1016/j.dsp.2025.105579
  • Dergi Adı: Digital Signal Processing: A Review Journal
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Aerospace Database, Applied Science & Technology Source, Communication Abstracts, Compendex, Computer & Applied Sciences, INSPEC
  • Anahtar Kelimeler: Activation functions, AdLU, Deep neural networks, ResNet-18, ResNet-50
  • Kocaeli Üniversitesi Adresli: Evet

Özet

Activation functions are critical components of neural networks, introducing the necessary nonlinearity for learning complex data relationships. While widely used functions such as ReLU and its variants have demonstrated notable success, they still suffer from limitations such as vanishing gradients, dead neurons, and limited adaptability at various degrees. This paper proposes two novel differentiable double-parameter activation functions (AdLU1 and AdLU2) designed to address these challenges. They incorporate tunable parameters to optimize gradient flow and enhance adaptability. Evaluations on benchmark datasets, MNIST, FMNIST, USPS, and CIFAR-10, using ResNet-18 and ResNet-50 architectures, demonstrate that the proposed functions consistently achieve high classification accuracy. Notably, AdLU1 improves accuracy by up to 5.5 % compared to ReLU, particularly in deeper architectures and more complex datasets. While introducing some computational overhead, their performance gains establish them as competitive alternatives to both traditional and modern activation functions.