How to improve the efficiency of a training strategy's backtest - General

mytarmailS 2023.06.03 08:51 #30901

Forester #:

On page 8 so far. And this is still an introduction)))
It looks like it will be a comparison by Sharpe (but they write that you can use any other indicator) on cross validation.

As I understand it, 4 parameters should be optimised there

summary(my_pbo)
Performance function Omega with threshold 1

      p_bo      slope       ar^2     p_loss 
 0.3714286  1.6891000 -0.0140000  0.3430000

p_bo ( probability of overtraining in backtest) should be close to 0, which indicates a low risk of overtraining.
slope ( slope coefficient of linear regression) should be close to 1, which indicates a strong linear relationship between the performance metric values for the training and test subsets.
ar^2 ( adjusted coefficient of determination) should be close to 1, indicating good linear regression accuracy.
p_loss (the proportion of performance metric values for the test subset that are below a given threshold) should be close to 0, indicating that the majority of performance metric values for the test subset are above a given threshold.

However, it should be noted that these values may depend on the selected performance metric and threshold value

Need multi-criteria Pareto front-to-back multi-criteria optimisation

Elite indicators :) Indicators: LinearRegressionLine Free EA creation

Forester 2023.06.03 08:54 #30902

mytarmailS #:

as I understand it there are 4 parameters to optimise

p_bo ( probability of overtraining in the backtest) should be close to 0, which indicates a low risk of overtraining.
slope ( slope coefficient of linear regression) should be close to 1, indicating a strong linear relationship between the performance metric values for the training and test subsets.
ar^2 ( adjusted coefficient of determination) should be close to 1, indicating good linear regression accuracy.
p_loss (the proportion of performance metric values for the test subset that are below a given threshold) should be close to 0, indicating that the majority of performance metric values for the test subset are above a given threshold.

However, it should be noted that these values may depend on the performance metric chosen and the threshold value

Too short to understand what these parameters are. here is more from the article page 13 (if the package fully reproduces the methods in the article, but maybe something else added/subtracted)

Overfit statistics
The framework introduced in Section 2 allows us to characterise the relia-
bility ofa strategy's backtest in terms of four complementary analyses:
1. Probability of Backtest Overfitting (PBO): The probability that the
model configuration selected as optimal IS will underperform the me-
dian of the N model configurations OOS.
2. Performance degradation: This determines to what extent greater per-
formance IS leads to lower performance OOS, an occurrence associated
with the memory effects discussed in Bailey et al. [1].
3. Probability of loss: The probability that the model selected as optimal
IS will deliver a loss OOS.
4. Stochastic dominance: This analysis determines whether the proce-
dureused to select a strategy IS is preferable to randomly choosing
one model configuration among the N alternatives

Each item is discussed in more detail below.

Programming tutorials Principles of working with Machine Learning and Neural

mytarmailS 2023.06.03 09:06 #30903

Forester #:

It's too short to understand what these parameters are. here's more from the article page 13 (if the package fully reproduces the methods in the article, but maybe something else was added/subtracted).

the package is just awful, never seen such a partak in years

code is terrible

the documentation is practically useless

I don't understand how it got into CRAN.

I still can't understand, is there one trading system is investigated divided into batches or is it several TS (in this library)?

Error with script Features of the mql5 Any questions from newcomers

Forester 2023.06.03 09:22 #30904

mytarmailS #:

I still can not understand, there one trading system is studied divided into batches or it is several TS (in this library).

Selection of the best model among a set of models obtained with different parameters/hyperparameters. The input is a matrix, where each column is a forecast of one of the models.

Or maybe not. I haven't figured it out yet either

Is there a pattern [Archive!] Pure mathematics, physics, Any questions from newcomers

mytarmailS 2023.06.03 09:31 #30905

Forester #:

Selection of the best model among the set of models obtained at different parameters/hyperparameters. The input is a matrix where each column is a prediction of one of the models.

I've already figured this out.

I don't understand how to work with the result

I give one column (one TS)

result

summary(my_pbo)
Performance function Omega with threshold 1

  p_bo  slope   ar^2 p_loss 
0.0000 2.2673 0.9700 0.3710

I feed 5 columns (five TCs)

I also get one row.

summary(my_pbo)
Performance function Omega with threshold 1

     p_bo     slope      ar^2    p_loss 
0.3428571 1.9081000 0.0440000 0.2860000

There should be 5 rows, or if it is the result of the best TS, there should be a mndex of the best one...

I would kill this author

How to program that The NSP and the Charles Dow's theory

mytarmailS 2023.06.03 09:51 #30906

Forester #:

Selection of the best model among the set of models obtained at different parameters/hyperparameters. The input is a matrix where each column is a prediction of one of the models.

Or maybe not. I haven't figured that out yet either

It can be interpreted as taking TS profit returns from different market sections ( parameters/hyperparameters ) ????

different market sections == parameters/hyperparameters?

Forester 2023.06.03 10:05 #30907

mytarmailS #:

It can be interpreted as taking returns of TC profit from different parts of the market ( parameters/hyperparameters ) ????

Exactly profit retournals.

mytarmailS #:

different parts of the market == parameters/hyperparameters?

As I understood exactly settings: different periods of MA, SL, etc.

Forester 2023.06.03 10:08 #30908

mytarmailS #:

I also get one line

There should be 5 lines, or if it's the best TC, there should be a mndex of the best...

As a result, you get the overall evaluation of the model (and probably of the predictor and target data)
A bad model gives such outcomes (only 17% of OOS outcomes above 0).

Good model - 95% of OOS outcomes above 0

mytarmailS 2023.06.03 10:09 #30909

Forester #:

It is the returnees who have arrived.

You know, gains and losses, right?

So we take the retournals of the states when the position is open.

Forester #:

As I understand it, it is the settings: different periods of MA, SL, etc.

Instead of different settings of the TS, I'll just take trading on different areas, I think it can be equated.

If I trade manually, Weekend evening The market is a

Forester 2023.06.03 10:13 #30910

mytarmailS #:

you know, profit and loss, right?

So we take retournals from those states when the position is open.

yeah.

mytarmailS #:

instead of different TS settings I'll just take trading on different sections, I think it can be equated.

I'm not sure.

And in general. read the article to understand what you are doing, there are limitations. For example, it is necessary to give obviously successful settings, not -1000000 to +1000000. If you give everything in a row, the average OOS will be at the bottom and there is no point in comparing with it. A very narrow range of 0,95...,0,98 is also bad from the DR side - the results will be very close.

Forecasting Get Close Price of Move Stop to Break

Machine learning in trading: theory, models, practice and algo-trading - page 3091