Piecewise Linear Unit

Proposed by Yucong Zhou,Zezhou Zhu,Zhao Zhong in a paper:Learning specialized activation functions with the Piecewise Linear Unit, Piecewise Linear Unit is a variant of dynamic activation function, that has several good properties:

Most of the flexibilities lie in a bounded region, which can maximize the utilization of learnable parameters
As a universal approximator, PWLU can closely approximate any contiguous, bounded scalar functions
Thanks to the uniform division of intervals, PWLU is efficient in computation, especially for inference

WARNING

Pay attention to the mode argument, you need to set it to 1 to get a dynamic behaviour, otherwise PWLA will act as ReLU. In theory, you can always pass one, but authors got their best results with heavily modified training pipeline, check the paper.

Notes on this project

This is an implementation that I am going to use in my own projects, which means several things:

Project will be upgrading
No extensive testing
Contributions and feedback are welcome

Roadmap

Tested on several real datasets
Efficiency improved
Replicate in JAX-based libraries?

MrGoriay/pwlu-pytorch

Piecewise Linear Unit

WARNING

Notes on this project

Roadmap