I'm surprised that the forest predicts new data with almost 100% accuracy - General

Dr. Trader 2016.08.09 10:11 #951

Iris petal data is not a signal, so this table is not suitable for testing foreCA at all. The package only suits time series, where new values arrive at regular intervals and you combine them into a vector. For this reason you can't change the order of the rows in the data table for foreCA. And you can't randomly pull out some rows for validation; everything must be in strict order: first the training data, then the validation data. No random sampling.
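For illustration, the strict ordering described above can be sketched as a split that never shuffles. This is only a minimal sketch in Python, not the R code used in the thread; the function name `chronological_split` and the toy series are hypothetical.

```python
def chronological_split(rows, train_fraction=0.75):
    """Split time-ordered rows into (train, validation) without shuffling.

    The first part of the series is used for training and the remainder
    for validation, so no future row leaks into the training set.
    """
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    cut = int(len(rows) * train_fraction)
    return rows[:cut], rows[cut:]


# Hypothetical series: 8 observations, already sorted by time.
series = list(range(8))
train, validation = chronological_split(series, train_fraction=0.75)

print("train:     ", train)
print("validation:", validation)
```

Every validation row comes strictly after every training row, which is the property a random sample would break.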

The best you can do with iris is to use the maximum number of components, min(dim(forec.dt)) = 14, but I think the accuracy will still be below 100%.

mytarmailS 2016.08.09 10:22 #952

Dr.Trader:

The best you can do with iris is to use the maximum number of components, min(dim(forec.dt)) = 14, but I think the accuracy will still be below 100%.

I tried it both ways: accuracy was about 85%, while the plain forest showed 95%.

СанСаныч Фоменко 2016.08.09 10:23 #953

Dr.Trader:

Iris petal data is not a signal, so this table is not suitable for testing foreCA at all. The package only suits time series, where new values arrive at regular intervals and you combine them into a vector. For this reason you can't change the order of the rows in the data table for foreCA. And you can't randomly pull out some rows for validation; everything must be in strict order: first the training data, then the validation data. No random sampling.

The best you can do with iris is to use the maximum number of components, min(dim(forec.dt)) = 14, but I think the accuracy will still be below 100%.

I think the post about irises is very important.

The point is that RF is phenomenally prone to overfitting.

And here it turns out that foreCA has no such tendency. So it's a very useful package.

mytarmailS 2016.08.09 10:34 #954

Dr.Trader:

What are your results with BP there?

Dr. Trader 2016.08.09 10:53 #955

SanSanych Fomenko:

I think the post about irises is very important.

The point is that RF is phenomenally prone to overfitting.

And here it turns out that foreCA has no such tendency. So it's a very useful package.

Even though the forest overfits, if you add 10 more columns of random values to the 4 iris predictors, it still predicts new data with almost 100% accuracy. I'm surprised, and glad that the forest did so well. I had never run this experiment myself before; I'll keep it in mind for the future.
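For anyone who wants to repeat this, here is a rough sketch of the experiment in Python with scikit-learn. It is not the R code used in the thread: the noise distribution, the 70/30 split, the number of trees and the seeds are all assumptions, and the exact accuracy will depend on them. Iris is not a time series, so a random stratified split is acceptable here.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)  # assumed seed, for repeatability only

iris = load_iris()
X_real = iris.data                                  # 4 real predictors
noise = rng.normal(size=(X_real.shape[0], 10))      # 10 columns of pure noise (assumed Gaussian)
X = np.hstack([X_real, noise])                      # 150 rows x 14 columns
y = iris.target

# Hold out 30% of the rows as "new data" (assumed proportion).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

forest = RandomForestClassifier(n_estimators=500, random_state=0)
forest.fit(X_train, y_train)

test_accuracy = forest.score(X_test, y_test)

# Impurity-based importances sum to 1 over all 14 columns.
importances = forest.feature_importances_
real_importance = importances[:4].sum()
noise_importance = importances[4:].sum()

print(f"accuracy on held-out rows: {test_accuracy:.3f}")
print(f"importance of 4 real predictors: {real_importance:.3f}")
print(f"importance of 10 noise columns:  {noise_importance:.3f}")
```

The same run also prints how the importance is shared between the real predictors and the noise columns, which is the other thing discussed in this thread.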

foreCA, for its part, rated all the predictors as noise with predictability of about 1% (both the petal lengths and the random-value predictors), and then tried to extract some signal from all of it. Extracting a signal from where there shouldn't be one is pointless in my opinion, so this experiment says nothing about foreCA.

mytarmailS:
What are your results with BP there?

The model is still training. I probably fed it too much data, but I don't want to cancel it, so I'll let it run to the end. I'll write about the results later, when it is done.

Mihail Marchukajtes 2016.08.09 14:45 #956

Of course, I don't want to get ahead of myself, but Reshetov has made such a cool thing in the new release... He'll figure out your problems in no time. I gave him the idea, but he was already thinking about it himself, so fools think alike, and the result is a powerful thing. You shouldn't be criticizing him...

СанСаныч Фоменко 2016.08.09 15:33 #957

Mihail Marchukajtes:
I certainly don't want to get ahead of myself, but Reshetov made such a cool thing in the new release... You shouldn't be criticizing him...

Cool talk about cool stuff...

And will we see at least one comparison with generally accepted, well-known and recognized methods?

Mihail Marchukajtes 2016.08.09 15:39 #958

SanSanych Fomenko:

Cool talk about cool stuff...

Will we see at least one comparison with generally accepted, well-known and recognized methods?

Someday you will, why not.....

mytarmailS 2016.08.09 17:13 #959

Dr.Trader:

Even though the forest overfits, if we add 10 more columns of random values to the 4 iris predictors, it still predicts new data with almost 100% accuracy. I'm surprised, and glad that the forest did so well. I had never run this experiment myself before; I'll keep it in mind for the future.

Yes, I'm surprised myself that it ignored the noise so brilliantly and told it apart from the real predictors. I had never tried that either, so I was curious myself...

Until today I had absolutely no confidence in the importance function, but this made me believe.

Dr. Trader 2016.08.09 18:34 #960

Keep not trusting importance when you use it for forex. Iris is very simple data, with direct patterns between the available data and the classes. It is enough for RF to find a minimal set of predictors that determine the iris classes, and the job is done.

Machine learning in trading: theory, models, practice and algo-trading - page 96