Discussion of article "Evaluation and selection of variables for machine learning models" - page 4
You are missing trading opportunities:
- Free trading apps
- Over 8,000 signals for copying
- Economic news for exploring financial markets
Registration
Log in
You agree to website policy and terms of use
If you do not have an account, please register
Hi Vlad,
I'm trying to rerun your example step by step.
In the section Input data , The In(p=16) function deals with a price object. What is its R- format or class ( zoo, xts or dataframe ) and how does it look like ( its column names, etc..). Without these information, it's impossible to run the command x <- In(p = 16) ...
Best regards.
Julien
Hi Julien,
[1] "matrix"
> colnames(price)
[1] "Open" "High" "Low" "Close" "Med" "CO"
Я приложил снимок сессии. Откройте его в Rstudio и проводите эксперименты.
Удачи
Владимир
Thank you very much to the author of the article. I've just started and I'm having a problem. I installed RStudio, not Revolution R Open 3.2.1 as suggested by the author. The "RandomUniformForests" package andthe "RoughSets" package are loaded, but the nearZeroVar() function and the findLinearCombos() function are not called correctly. The "RandomUniformForests" and "RoughSets" packages have been loaded, butthe nearZeroVar() function andthe findLinearCombos() function don't work properly, are these functions specific to Revolution R Open?
Revolution R Open (now maintained by Microsoft and renamed MRO) is an enhanced version of R. RStudio is just an IDE, not comparable to R. The two functions mentioned are in the author's original article. The two functions mentioned are clearly labelled as functions of the caret package in the author's original article. In addition, the author of the original article used Russian, perhaps English can still communicate, but in Chinese seems to communicate.
Смотрите caret: :nearZeroVar () // caret::findLinearCombos ()
Удачи
Hi Julien,
[1] "matrix"
> colnames(price)
[1] "Open" "High" "Low" "Close" "Med" "CO"
Я приложил снимок сессии. Откройте его в Rstudio и проводите эксперименты.
Удачи
Владимир
Dear all,
Can someone tell me what the --Dig-- defined in ZZ function variable means. Is it a constant? if yes what should the value be of this constant?
Dear all,
Can someone tell me what the --Dig-- defined in ZZ function variable means. Is it a constant? if yes what should the value be of this constant?
Read this post can be useful:
Can Math Beat Financial Markets?
https://www.mql5.com/en/users/mgsystem/blog
Hi Vladimir,
Forgive me this silly question but I'm currently trying to construct my own ( very simple ) model starting from your nice example and I'm wondering why you're shifting the ZZ's differences forward in the ZZ function :
...I mean, after all, we want to train a model to predict the FUTURE value of the Zigzag, so what's the point in training a model using predictors ( technical indicators ) that summarize the market quotes at the end of day N with a target value which is the sign of the difference between the Zigzag value of day N against its N-1 value ( this is what you're doing after shifting ) ? Shouldn't we use the the sign of the difference between the Zigzag's value at day ( N+1 ) and the Zigzag's value at day N instead ( i.e we wouldn't have to shift ) ?
I know I must have missed something obvious in your methodology but if you could take 5 mns to make this cleat to me, I'd be very pleased.
Best regards.
Julien
Hi Vladimir,
Forgive me this silly question but I'm currently trying to construct my own ( very simple ) model starting from your nice example and I'm wondering why you're shifting the ZZ's differences forward in the ZZ function :
...I mean, after all, we want to train a model to predict the FUTURE value of the Zigzag, so what's the point in training a model using predictors ( technical indicators ) that summarize the market quotes at the end of day N with a target value which is the sign of the difference between the Zigzag value of day N against its N-1 value ( this is what you're doing after shifting ) ? Shouldn't we use the the sign of the difference between the Zigzag's value at day ( N+1 ) and the Zigzag's value at day N instead ( i.e we wouldn't have to shift ) ?
I know I must have missed something obvious in your methodology but if you could take 5 mns to make this cleat to me, I'd be very pleased.
Best regards.
Julien
The question is correct. There is a typo in the article. It should be like this:
1. calculate the inputs
x <- In(p = 16 )2. calculate the target
3. Combine x and out in data . Wherein:
Further on the text.
Good luck
NEW ARTICLE Variable Evaluation and Selection for Machine Learning Models has been published:
By Vladimir Perervenko
There is a big problem in the application of zigzag signal as a target variable.
The basis of all models is based on a priori already zigzag points (-1, 1), other points with condition = 0 are excluded.
In practice, you don't know if the time point is a zigzag point (-1, 1) or not, and there is a high probability that it is a point with condition = 0, because it is not possible to distinguish between the two states (-1, 1) and (0).
So the same calculation and judgement is needed for the point at 0. This time the training model and the actual model will have a big deviation;
zigzag signal as a target variable is very problematic to apply.
The basis of all models is based on a priori already zigzag point (-1, 1), other points with condition = 0 are excluded.
In practice, you don't know if the time point is a zigzag point (-1, 1) or not, and there is a high probability that it is a point with condition = 0, because it is not possible to distinguish between the two states (-1, 1) and (0).
So the same calculation and judgement is needed for the point at 0. At this point, the training model and the actual model will have a big deviation;
Drawing a simple plot illustrating the numbers (-1, 1 (0) ????
Please read the article carefully? And next to it? And don't know how to use ZZ?
Maybe the translation is not good?
Specify more precisely your comments, please can improve English?