Published article "Neural networks made easy (Part 60): Online Decision Transformer (ODT)".

The last two articles were devoted to the Decision Transformer method, which models action sequences in the context of an autoregressive model of desired rewards. In this article, we will look at another optimization algorithm for this method.
































