Currently, BaseOptimizer handles the normalization of feature library, before the sparse regression take place. It then renormalize the learnt parameters according to the saved norm of each feature in the library as a post-processing step after the sparse regression is called.
However, in some optimizer, such as EvidenceGreedy in BINDy, the optimizer also require the target to be normalized. This is currently being handled inside the specific optimizer. However, it is easy to see that normalizing both feature library and targets have some mathematical advantage, especially in making hyperparameters such as the threshold in STLSQ, as well as the regulariser scaling parameters, more problem and scale independent.
However, it should be stressed that normalizing target will lead to breaking changes. For example, when applying classic SINDy a multi-dimensional dynamical system (say Lorenz63), normalizing target means the effective (scaled) threshold will be different for each target dimension, as different target has different scale. Hence, we recommend that this should be an optional change.
Given the breaking change, we believe this should be discussed in the community before implementation. Welcome to chip in your opinion on this change.
Currently,
BaseOptimizerhandles the normalization of feature library, before the sparse regression take place. It then renormalize the learnt parameters according to the saved norm of each feature in the library as a post-processing step after the sparse regression is called.However, in some optimizer, such as
EvidenceGreedyinBINDy, the optimizer also require the target to be normalized. This is currently being handled inside the specific optimizer. However, it is easy to see that normalizing both feature library and targets have some mathematical advantage, especially in making hyperparameters such as the threshold in STLSQ, as well as the regulariser scaling parameters, more problem and scale independent.However, it should be stressed that normalizing target will lead to breaking changes. For example, when applying classic SINDy a multi-dimensional dynamical system (say Lorenz63), normalizing target means the effective (scaled) threshold will be different for each target dimension, as different target has different scale. Hence, we recommend that this should be an optional change.
Given the breaking change, we believe this should be discussed in the community before implementation. Welcome to chip in your opinion on this change.