Discussion of article "Advanced resampling and selection of CatBoost models by brute-force method" - page 5
You are missing trading opportunities:
- Free trading apps
- Over 8,000 signals for copying
- Economic news for exploring financial markets
Registration
Log in
You agree to website policy and terms of use
If you do not have an account, please register
Then it is necessary to average exactly. Otherwise it'll be "different" on the new data.
you don't need to average exactly. The sampler already has averaging built in.
GMM sampler can create a bad sample, with skewed classes, etc., sampling is random. Does it make sense to take it into account?
definitely don't need to average
GMM sampler can create a bad sample, with skewed classes etc, sampling is random. Does it make sense to take it into account?
A random forest similarly creates a set of successful and not-so-successful trees. Averaging all models shows a better result on new data than a single best tree.
A random forest similarly creates a set of good and not so good trees. Averaging all models shows a better result on new data than a single best tree.
and if you compose several forests, there will be approximately zero transactions, the signals will overlap.
and if you compose several scaffolds, the trades will be about zero, the signals will overlap.
Several (e.g. 10) forests of 100 is the same as one forest of 1000 trees. It gives a lot of signals.
A few, (e.g. 10) forests of 100 is the same as one forest of 1000 trees. It gives a lot of signals.
Any practice? I've done it before. Signals become few.
If you have a 0.5 indentation set, you just need to reduce it.
I agree with that, it wasn't getting enough anyway. And I don't quite understand why you would add randomly bad models. Compose cool ones that improve each other - another conversation
It is averaging of everything that is needed. The basic descriptions of the scaffolding principle say so. Like the crowd knows better than one expert.
I did this with timber about 2 years ago, trained 1000, took the best 10-50. It did not work, apparently the result on new data was not very good.
It is the averaging of everything in a row that is needed. The basic descriptions of the scaffolding principle say so. Like the crowd knows better than one expert.