文章 "机器学习模型的变量评估和选择"

MetaQuotes 2016.02.23 11:36

本文重点介绍机器学习模型中输入变量（预测因子）的选择，预处理以及评估的相关细节。多种规范化的方法以及它们的特点都将在此被讨论。在处理过程中将极大影响模型训练最终结果的环节也将在此探讨。我们将仔细看看，并评估新的及鲜为人知的用于确定输入数据信息量和可视化的方法。

我们将使用“RandomUniformForests”包，计算并分析变量不同值和各种组合的重要概念，预测因子和目标的对应关系，预测因子之间的相互作用，以及将所有重要的方面都考虑在内后预测因子集的最优选取。

我们将使用“RoughSets”包，换一个角度和基于其他概念来探讨预测因子的选择问题。我们将揭示不仅是预测因子集可以被优化，用于训练的样本集也可以被优化。

所有的计算和实验将用R语言来实现，准确的说是用 — Revolution R Open 3.2.1。

OOB error

图 2. 训练误差取决于树的数量

作者：Vladimir Perervenko

Zhi Long Yang 2017.01.03 03:52 #1

非常感谢文章作者。我刚刚开始就遇到一个问题。我安装的是RStudio，不是作者建议的Revolution R Open 3.2.1。“RandomUniformForests”包和“RoughSets”包已经加载，但是nearZeroVar()函数和findLinearCombos() 函数无法正常调用，这些函数是Revolution R Open特有的么？

Microsoft R Open: The Enhanced R Distribution · MRAN

Microsoft Corporation
mran.revolutionanalytics.com

Microsoft R Open, formerly known as Revolution R Open (RRO), is the enhanced distribution of R from Microsoft Corporation. It is a complete open source platform for statistical analysis and data science. The current version, Microsoft R Open 3.3.2, is based on (and 100% compatible with) R-3.3.2, the most widely used statistics software in the...

Zhang Zhang 2017.04.06 05:44 #2

Zhi Long Yang:
非常感谢文章作者。我刚刚开始就遇到一个问题。我安装的是RStudio，不是作者建议的Revolution R Open 3.2.1。“RandomUniformForests”包和“RoughSets”包已经加载，但是nearZeroVar()函数和findLinearCombos() 函数无法正常调用，这些函数是Revolution R Open特有的么？

Revolution R Open（现在由微软维护，改名为MRO）相当于R的一个加强版。RStudio只是一个IDE，不能和R相提并论。所提两个函数在作者原文中很清晰标明是caret包的函数。另，原文作者使用俄语，或许英文尚且能交流，用中文似乎沟通不上。

Vladimir Perervenko 2017.04.07 16:17 #3

Смотрите caret::nearZeroVar () // caret::findLinearCombos ()

Удачи

freewalk 2017.10.19 08:36 #4

MetaQuotes Software Corp.:

新文章机器学习模型的变量评估和选择已发布：

作者：Vladimir Perervenko

zigzag信号做为目标变量应用上存在很大问题，

所有模型的基础是基于先验的已经zigzag点（-1，1），其他的condition＝0的点是排除在外的，

实际应用中，你不知道改时点是否是zigzag点（-1，1），很大的可能是condition＝0的点，因为区分不了（-1，1）和（0）这两种状态，

所以在0的时点同样需要计算和判断。这个时候训练模型和实际模型就会存在很大的偏差；

Vladimir Perervenko 2017.10.19 15:06 #5

freewalk :

zigzag信号做为目标变量应用上存在很大问题，

所有模型的基础是基于先验的已经zigzag点（-1，1），其他的condition＝0的点是排除在外的，

实际应用中，你不知道改时点是否是zigzag点（-1，1），很大的可能是condition＝0的点，因为区分不了（-1，1）和（0）这两种状态，

所以在0的时点同样需要计算和判断。这个时候训练模型和实际模型就会存在很大的偏差；

绘制一个简单的绘图说明的数字（-1，1（0）????

请您仔细阅读的文章？而旁边呢？而且不知道如何使用ZZ？

也许翻译不好？

指定更精确您的意见，请能提高英语水平？

freewalk 2017.10.20 02:02 #6

Vladimir Perervenko:

绘制一个简单的绘图说明的数字（-1，1（0）????

请您仔细阅读的文章？而旁边呢？而且不知道如何使用ZZ？

也许翻译不好？

指定更精确您的意见，请能提高英语水平？

ZigZag 这个序列根本就不需要预测，if you know the lag(ZigZag)=-1 ,then the ZigZag must be 1;the lag(ZigZag)=1 ,then the zigzag=-1;

all of the lag(ZigZag) is occured at the past time,it can predict the zigzag 100% accurately。so if you know the time is zigzag point ,you can 100% accurately

predict the zigzag is -1 or 1.

But in the realtime you cannot know the time is zigzag point,so you must caculate the third status (0)， so how can it work?

You can reference this right articl https://www.mql5.com/zh/articles/2773.

运用人工智能实现的 Thomas DeMark 次序 (TD SEQUENTIAL)

2017.06.19
Mihail Marchukajtes
www.mql5.com

首先, 我们设定一个神经网络不能自行完成交易。这便是, 如果有一个神经网络, 并为它提供无限量的价格数据, 指标和其它他美味佳肴 — 获得永无终止的结果, 所以这个想法可以立即被丢弃。神经网络只能从侧面为策略 "服务": 协助制订决策, 过滤, 预测。能体现出一个完整策略的神经网络是无稽之谈 (至少我个人从未见过)。首先, 用几句话来概括策略。次序是逆势策略。其中出现的信号不依赖于彼此。换言之, 可以在一行里收到买卖信号, 这令次序的使用极大地复杂化。就像任何其它策略一样, 它会产生假信号, 我们将要找出来。作者本人很好地描述了基于次序生成信号的原理。他的解释在这里稍作修改。仅应用了策略的第一部分, 使用设置和交汇信号。选择它们出于两个原因: 一是这些信号位于顶部和底部...

Requests & Ideas, the 3_Level_ZZ_Semafor (indicator for MT4) We CAN NOT predict

2935071411 2017.10.20 06:22 #7

好的

Vladimir Perervenko 2017.10.20 17:01 #8

2935071411 :
好的

Good afternoon.

ZZ is really not defined on the last bars (the last vertex). But they do not need me. Neural network training is conducted on ZZ values without the last 300 bars !! On those bars where ZZ is defined.

You carefully look at the scripts and do not rush to conclusions. You can look stupid.

Good luck

Help Needed PLZ !!! ZigZags Shepherds Machine learning in trading:

freewalk 2017.10.21 01:07 #9

in the realtime ,do you use 'without the last 300 bars '!?

You looks so stupid ,can you use it in the realtime.?

All of your article are wrong ,because your target define is wrong.All canot work in the realtime,

as follow your point ,you singal will happened after 300 bars later.

Time disappearance! How do I close EA freeze

Zhang Zhang 2017.10.26 00:24 #10

freewalk:

in the realtime ,do you use 'without the last 300 bars '!?

You looks so stupid ,can you use it in the realtime.?

All of your article are wrong ,because your target define is wrong.All canot work in the realtime,

as follow your point ,you singal will happened after 300 bars later.

你并不真正了解作者的设想，自己想象的过于幼稚了，说出的stupid只是映射在你自己身上而已，请不要再给国人丢脸，不但stupid，而且丑陋。

to Vladimir Perervenko: thanks again for those wonderful articles, you did and doing really good research! And I must apologize for this stupid thing from "freewalk", not all chinese like him.

Using my EA live Pass Values from ZigZag Machine learning in trading:

新评论