Self Learning MT5
Recurrent reinforcement learning
Recurrent reinforcement learning (RRL) was a technique to tune financial trading systems for the purpose of utility maximization. The RRL technique is a stochastic gradient ascent algorithm which continuously optimizes a utility measure by using new market information. Although, in most discussions on RRL, the market information usually comprises a series of lagged price returns. The basic RRL trading system is designed to trade a single-asset with a two-position action (long/short), which is produced by using linear combinations of returns and a tanh function.
Profitability and stability are two particularly important factors in a financial trading system. In this study, we use the Sharpe ratio to measure the profitability and we calculate the Sharpe ratio using daily returns. It should be noted that the trading performance of RRL-type trading systems relates directly to the initialization of signal parameters. Therefore, stability refers to the consistency of the Sharpe ratios recorded from independent restarts of the trading system.
What is reinforcement learning?
Reinforcement learning, in the context of artificial intelligence, is a type of dynamic programming that trains algorithms using a system of reward and punishment.
A reinforcement learning algorithm, or agent, learns by interacting with its environment. The agent receives rewards by performing correctly and penalties for performing incorrectly. The agent learns without intervention from a human by maximizing its reward and minimizing its penalty.
Expert Advisor recommended configuration
- Recommended forex Pairs: USDJPY, EURJPY, GBPJPY... All pairs crossed with JPY
- Time-frame: M30 Chart
- Minimum deposit: $100
- Lot - Contract size
- sl_points - Stop loss in points
- tp_points - Take profit in points
- Expert Comment - Here you put a comment to identify the trades made by the EA