TODO
Add decreasing learning rate (probably also easy)
Add Serialization/Deserialization support, probably using serde_pickle for pytorch compatibility
Additional Ressources
Godly Article by ...