评估CPU内核的优化 - 页 4

 
Georgiy Merts:

以上 -树_Brut_TestPL_F

这是第二张,配置相同。

运行12个虚拟核心时也是如此(没有F)。

当运行12个虚拟核心时,专家 "与F"。

我现在要关闭超级交易......

专家 "没有F",没有超额交易,有6个通道。

专家 "与F "和没有超额交易,有6个通道。

非常奇怪的结果 - 原来Tree_Brut_TestPL在没有禁用bios 6代理的情况下,平均通过0:00:33.712,而禁用后大约1:10 - 没有混淆?

而对于Tree_Brut_TestPL_F 专家顾问,在没有禁用超传统的情况下,在bios中有6个代理" 平均通过0:01:10.931",而在禁用时 "平均通过0:01:10.621",也就是说,没有区别。

 

下一主题 Phenom II X4 960T 3000Mhz

2019.08.11 14:08:20.656 Terminal        Windows 7 Service Pack 1 (build 7601) x64, IE 11, AMD Phenom II X4 960 T Processor, Memory: 3182 / 4063 Mb, Disk: 57 / 92 Gb, GMT+3

Tree_Brut_TestPL - 4个代理商

14:11:32.289    Core 3  pass 1 returned result 1001000.00 in 0:02:12.098
14:11:32.813    Core 2  pass 3 returned result 1001000.00 in 0:02:12.530
14:11:33.038    Core 1  pass 0 returned result 1001000.00 in 0:02:12.847
14:11:34.409    Core 4  pass 2 returned result 1001000.00 in 0:02:14.138
14:11:34.409    Tester  optimization finished, total passes 4
14:11:34.419    Statistics      optimization done in 2 minutes 15 seconds
14:11:34.419    Statistics      shortest pass 0:02:12.098, longest pass 0:02:14.138, average pass 0:02:12.903


Tree_Brut_TestPL_F - 4个代理商

14:18:27.493    Core 4  pass 3 returned result 1001000.00 in 0:03:32.381
14:18:28.165    Core 1  pass 0 returned result 1001000.00 in 0:03:33.080
14:18:28.328    Core 2  pass 1 returned result 1001000.00 in 0:03:33.223
14:18:33.135    Core 3  pass 2 returned result 1001000.00 in 0:03:38.035
14:18:33.135    Tester  optimization finished, total passes 4
14:18:33.145    Statistics      optimization done in 3 minutes 40 seconds
14:18:33.145    Statistics      shortest pass 0:03:32.381, longest pass 0:03:38.035, average pass 0:03:34.179

 

再次测试FX-8350,但它运行在4200Mhz和一个内存通道中,另一个母亲在760G芯片上运行。

2019.08.11 08:54:57.931 Terminal        Windows 7 Service Pack 1 (build 7601) x64, IE 11, AMD FX-8350 Eight-Core Processor , Memory: 14341 / 16126 Mb, Disk: 106 / 272 Gb, GMT+3

Tree_Brut_TestPL - 4个代理商

RN      0       09:02:08.810    Core 3  pass 2 returned result 1001000.00 in 0:01:08.306
EG      0       09:02:08.894    Core 2  pass 6 returned result 1001000.00 in 0:01:08.337
LL      0       09:02:08.987    Core 4  pass 4 returned result 1001000.00 in 0:01:08.427
LR      0       09:02:09.478    Core 1  pass 0 returned result 1001000.00 in 0:01:09.074
PK      0       09:03:16.943    Core 2  pass 7 returned result 1001000.00 in 0:01:08.057
NP      0       09:03:17.684    Core 3  pass 3 returned result 1001000.00 in 0:01:08.882
PF      0       09:03:17.794    Core 4  pass 5 returned result 1001000.00 in 0:01:08.816
NO      0       09:03:18.476    Core 1  pass 1 returned result 1001000.00 in 0:01:09.007
EJ      0       09:03:18.476    Tester  optimization finished, total passes 8
KN      0       09:03:18.486    Statistics      optimization done in 2 minutes 18 seconds
EQ      0       09:03:18.486    Statistics      shortest pass 0:01:08.057, longest pass 0:01:09.074, average pass 0:01:08.613

Tree_Brut_TestPL - 8个代理商

KP      0       09:06:39.083    Core 1  pass 3 returned result 1001000.00 in 0:01:58.552
CI      0       09:06:40.856    Core 4  pass 0 returned result 1001000.00 in 0:02:00.327
FN      0       09:06:42.195    Core 3  pass 2 returned result 1001000.00 in 0:02:01.663
GD      0       09:06:43.023    Core 5  pass 6 returned result 1001000.00 in 0:02:01.890
RM      0       09:06:43.279    Core 2  pass 1 returned result 1001000.00 in 0:02:02.749
MS      0       09:06:43.675    Core 6  pass 7 returned result 1001000.00 in 0:02:02.395
KH      0       09:06:43.894    Core 8  pass 4 returned result 1001000.00 in 0:02:02.896
EQ      0       09:06:44.356    Core 7  pass 5 returned result 1001000.00 in 0:02:03.280
KH      0       09:06:44.357    Tester  optimization finished, total passes 8
FL      0       09:06:44.367    Statistics      optimization done in 2 minutes 04 seconds
CG      0       09:06:44.367    Statistics      shortest pass 0:01:58.552, longest pass 0:02:03.280, average pass 0:02:01.719

Tree_Brut_TestPL_F - 4个代理商

PJ      0       09:20:56.349    Core 4  pass 2 returned result 1001000.00 in 0:01:47.739
PP      0       09:20:56.591    Core 2  pass 4 returned result 1001000.00 in 0:01:47.958
HI      0       09:20:57.103    Core 3  pass 0 returned result 1001000.00 in 0:01:48.680
HN      0       09:21:01.851    Core 1  pass 6 returned result 1001000.00 in 0:01:53.204
FD      0       09:22:44.340    Core 2  pass 5 returned result 1001000.00 in 0:01:47.763
HM      0       09:22:44.949    Core 4  pass 3 returned result 1001000.00 in 0:01:48.615
CR      0       09:22:45.221    Core 3  pass 1 returned result 1001000.00 in 0:01:48.132
FH      0       09:22:53.944    Core 1  pass 7 returned result 1001000.00 in 0:01:52.108
KN      0       09:22:53.944    Tester  optimization finished, total passes 8
IJ      0       09:22:53.954    Statistics      optimization done in 3 minutes 46 seconds
KM      0       09:22:53.954    Statistics      shortest pass 0:01:47.739, longest pass 0:01:53.204, average pass 0:01:49.274

Tree_Brut_TestPL_F - 8家代理商

JO      0       09:11:41.826    Core 6  pass 0 returned result 1001000.00 in 0:03:23.474
QE      0       09:11:42.819    Core 5  pass 4 returned result 1001000.00 in 0:03:24.528
HJ      0       09:11:45.883    Core 2  pass 6 returned result 1001000.00 in 0:03:27.480
HS      0       09:11:45.900    Core 3  pass 7 returned result 1001000.00 in 0:03:27.437
FI      0       09:11:45.935    Core 4  pass 3 returned result 1001000.00 in 0:03:27.867
CN      0       09:11:47.761    Core 7  pass 1 returned result 1001000.00 in 0:03:29.327
ED      0       09:11:48.789    Core 8  pass 5 returned result 1001000.00 in 0:03:30.352
HM      0       09:11:51.431    Core 1  pass 2 returned result 1001000.00 in 0:03:33.033
CM      0       09:11:51.432    Tester  optimization finished, total passes 8
PI      0       09:11:51.442    Statistics      optimization done in 3 minutes 34 seconds
KP      0       09:11:51.442    Statistics      shortest pass 0:03:23.474, longest pass 0:03:33.033, average pass 0:03:27.937
 
Aleksey Vyazmikin:

非常奇怪的结果 - 原来Tree_Brut_TestPL在没有关闭bios 6的hypertrading的情况下代理 平均通过0:00:33.712,而关闭后大约1:10 - 没有混淆?

Tree_Brut_TestPL_F 专家顾问在没有禁用超传统的情况下,在bios中有6个代理" 平均通过0:01:10.931",而在禁用 传统的情况下,"平均通过0:01:10.621",也就是说,没有区别。

我还认为,当任务是纯粹的计算时,超交易不应该有什么区别。 正如我之前所说,在优化中,虚拟核心的主要好处正是我们不访问内存,一切都在我们的高速缓存中。当我在测试不同的变体时--我清楚地记得,在 "真实点位 "上,两年来没有从超级交易中获得任何利润。而相反,如果我与1MOHLC合作,我马上就能获得10-20%的利润。也就是说,收益是由于缓存造成的。

另一方面,第一个结果让我感到惊讶......

 

下一个测试对象是一个有两个工作站/服务器CPU的母亲,E5-2670 v1处理器,频率为3000Mhz(母亲强行提升了它们的频率)


2019.08.11 11:32:37.213 Terminal        Windows 7 Service Pack 1 (build 7601) x64, IE 11, Intel Xeon  E5-2670 0 @ 2.60 GHz, Memory: 60534 / 65483 Mb, Disk: 317 / 465 Gb, GMT+3

Tree_Brut_TestPL - 16个代理商

RI      0       11:37:17.969    Core 001        pass 10 returned result 1001000.00 in 0:03:36.032
MP      0       11:37:25.419    Core 004        pass 4 returned result 1001000.00 in 0:03:43.584
KI      0       11:37:25.493    Core 007        pass 8 returned result 1001000.00 in 0:03:43.590
DL      0       11:37:28.716    Core 010        pass 16 returned result 1001000.00 in 0:03:46.730
JF      0       11:37:30.696    Core 011        pass 14 returned result 1001000.00 in 0:03:48.759
FI      0       11:37:34.878    Core 016        pass 26 returned result 1001000.00 in 0:03:52.774
QP      0       11:37:35.231    Core 006        pass 6 returned result 1001000.00 in 0:03:53.328
IE      0       11:37:37.138    Core 012        pass 24 returned result 1001000.00 in 0:03:55.033
IL      0       11:37:39.501    Core 015        pass 30 returned result 1001000.00 in 0:03:57.396
DE      0       11:37:40.320    Core 005        pass 2 returned result 1001000.00 in 0:03:58.658
GH      0       11:37:40.488    Core 014        pass 20 returned result 1001000.00 in 0:03:58.467
HQ      0       11:37:40.722    Core 003        pass 0 returned result 1001000.00 in 0:03:59.134
NE      0       11:37:41.021    Core 002        pass 28 returned result 1001000.00 in 0:03:58.915
FO      0       11:37:41.565    Core 008        pass 22 returned result 1001000.00 in 0:03:59.519
EQ      0       11:37:42.991    Core 009        pass 12 returned result 1001000.00 in 0:04:01.004
PH      0       11:37:45.825    Core 013        pass 18 returned result 1001000.00 in 0:04:03.805
FR      0       11:41:10.537    Core 001        pass 11 returned result 1001000.00 in 0:03:52.573
LG      0       11:41:17.444    Core 007        pass 9 returned result 1001000.00 in 0:03:51.955
MN      0       11:41:24.007    Core 011        pass 15 returned result 1001000.00 in 0:03:53.315
PR      0       11:41:30.690    Core 004        pass 5 returned result 1001000.00 in 0:04:05.275
GK      0       11:41:31.628    Core 010        pass 17 returned result 1001000.00 in 0:04:02.918
PM      0       11:41:33.720    Core 016        pass 27 returned result 1001000.00 in 0:03:58.847
DD      0       11:41:34.517    Core 015        pass 31 returned result 1001000.00 in 0:03:55.021
FN      0       11:41:35.195    Core 012        pass 25 returned result 1001000.00 in 0:03:58.063
IS      0       11:41:35.758    Core 006        pass 7 returned result 1001000.00 in 0:04:00.532
HJ      0       11:41:36.532    Core 002        pass 29 returned result 1001000.00 in 0:03:55.517
HM      0       11:41:38.036    Core 013        pass 19 returned result 1001000.00 in 0:03:52.217
PD      0       11:41:40.323    Core 003        pass 1 returned result 1001000.00 in 0:03:59.605
MI      0       11:41:41.865    Core 008        pass 23 returned result 1001000.00 in 0:04:00.305
QP      0       11:41:42.899    Core 014        pass 21 returned result 1001000.00 in 0:04:02.416
GI      0       11:41:42.992    Core 005        pass 3 returned result 1001000.00 in 0:04:02.676
IL      0       11:41:49.992    Core 009        pass 13 returned result 1001000.00 in 0:04:07.008
JI      0       11:41:49.992    Tester  optimization finished, total passes 32
NN      0       11:41:50.002    Statistics      optimization done in 8 minutes 09 seconds
DQ      0       11:41:50.002    Statistics      shortest pass 0:03:36.032, longest pass 0:04:07.008, average pass 0:03:56.092

Tree_Brut_TestPL - 32个代理商

OO      0       12:21:37.757    Tester  optimization finished, total passes 32
KH      0       12:21:37.767    Statistics      optimization done in 5 minutes 16 seconds
CS      0       12:21:37.767    Statistics      shortest pass 0:05:07.092, longest pass 0:05:15.503, average pass 0:05:11.508

Tree_Brut_TestPL_F - 16个代理商

MO      0       11:57:24.764    Tester  optimization finished, total passes 32
IK      0       11:57:24.774    Statistics      optimization done in 13 minutes 53 seconds
IS      0       11:57:24.774    Statistics      shortest pass 0:06:40.315, longest pass 0:07:04.229, average pass 0:06:50.431

Tree_Brut_TestPL_F - 32家代理商

HJ      0       12:30:29.924    Tester  optimization finished, total passes 32
MM      0       12:30:29.934    Statistics      optimization done in 8 minutes 30 seconds
QD      0       12:30:29.934    Statistics      shortest pass 0:08:00.054, longest pass 0:08:28.112, average pass 0:08:13.088

我对所有变体中代理的分散性感到困扰--最好的--最差的--长达25秒--也许这是对双处理器架构的一种支付,但我不确定--如果有人有类似的设置,请测试。

 
Georgiy Merts:

我还认为,当任务是纯粹的计算时,超交易不应该有什么区别。 我已经说过,在优化中,虚拟核心的主要好处是,我们不访问内存,我们在缓存中拥有一切。当我在测试不同的变体时--我清楚地记得,在 "真实点位 "上,两年来没有从超级交易中获得任何利润。但如果我与1MOHLC合作,我立即得到10-20%的利润。这就是利润是由于缓存的原因。

但第一个结果--我自己也很惊讶......

我并不是在争论这个理论,也许是这样的:所有在缓存中带有附属物的代码都会装在里面,一切都会很好很快速,而在相反的情况下,我们会用内存工作。

关于Tree_Brut_TestPL 测试--我认为你在BIOS中关闭了hypertrading,一定是犯了一个错误--再试一下,第一个专家顾问比第二个快,所以我认为这是一个错误

 

到目前为止,排名是基于以秒为单位的平均运行时间--倒数第二列,最后一列显示每小时的CPU运行次数。

该表由最后一列作为计算资源 消耗方面最重的EA选项进行过滤。



到目前为止,我对E5-2670处理器的糟糕结果感到惊讶,虽然它(其中2个)在Cinabench r15中得到了2000分,但i7-8700只得到了1191分!我认为这是不可能的。这里有些不对劲...

 
2019.08.11 18:16:13.866 Terminal        Windows 7 Service Pack 1 (build 7601) x64, IE 11, UAC, Intel Celeron  G3900 @ 2.80 GHz, Memory: 2473 / 3958 Mb, Disk: 63 / 111 Gb, GMT+3

Tree_Brut_TestPL - 2个代理商

19:46.634	Core 1	pass 1 returned result 1001000.00 in 0:00:50.197
JL	0	18:19:46.701	Core 2	pass 0 returned result 1001000.00 in 0:00:50.267
RM	0	18:19:46.701	Tester	optimization finished, total passes 2
RI	0	18:19:46.711	Statistics	optimization done in 0 minutes 50 seconds
MM	0	18:19:46.711	Statistics	shortest pass 0:00:50.197, longest pass 0:00:50.267

Tree_Brut_TestPL_F - 2个代理商

22:31.231       Core 1  pass 0 returned result 1001000.00 in 0:01:46.866
RO      0       18:22:38.982    Core 2  pass 1 returned result 1001000.00 in 0:01:54.603
LK      0       18:22:38.982    Tester  optimization finished, total passes 2
PO      0       18:22:38.992    Statistics      optimization done in 1 minutes 55 seconds
QK      0       18:22:38.992    Statistics      shortest pass 0:01:46.866, longest pass 0:01:54.603
 

2990WX测试结果

Tree_Brut_TestPL - 32个代理商

2019.08.11 22:52:38.668 Tester  optimization finished, total passes 32
2019.08.11 22:52:38.678 Statistics      optimization done in 3 minutes 06 seconds
2019.08.11 22:52:38.678 Statistics      shortest pass 0:01:04.372, longest pass 0:02:09.139, average pass 0:01:42.144

Tree_Brut_TestPL - 64个代理商

2019.08.11 22:46:49.456 Tester  optimization finished, total passes 64
2019.08.11 22:46:49.467 Statistics      optimization done in 3 minutes 16 seconds
2019.08.11 22:46:49.467 Statistics      shortest pass 0:02:41.837, longest pass 0:03:06.668, average pass 0:03:00.527

Tree_Brut_TestPL_F - 32家代理商

2019.08.11 23:14:07.830 Tester  optimization finished, total passes 32
2019.08.11 23:14:07.840 Statistics      optimization done in 4 minutes 13 seconds
2019.08.11 23:14:07.840 Statistics      shortest pass 0:02:54.945, longest pass 0:04:12.540, average pass 0:03:37.639

Tree_Brut_TestPL_F - 64个代理商

2019.08.11 23:07:06.525 Tester  optimization finished, total passes 64
2019.08.11 23:07:06.535 Statistics      optimization done in 5 minutes 52 seconds
2019.08.11 23:07:06.535 Statistics      shortest pass 0:05:31.519, longest pass 0:05:49.750, average pass 0:05:42.464

结果非常令人震惊,我无法理解哪里出了问题--编译器不了解这个架构,还是其他什么原因--单次运行的结果极其平庸,而且分布非常大。

是的,总体结果是该处理器的最快速度,但我预期的结果要好得多!"。谁有Ryzen--让我们来测试一下!

 
Aleksey Vyazmikin:

2990WX测试结果

Tree_Brut_TestPL - 32个代理商

Tree_Brut_TestPL - 64个代理商

Tree_Brut_TestPL_F - 32个代理商

Tree_Brut_TestPL_F - 64个代理商

结果非常令人震惊,我无法理解哪里出了问题--编译器不了解这个架构,还是其他什么原因--单次运行的结果极其平庸,而且分布非常大。

是的,总体结果是该处理器的最快速度,但我预期的结果要好得多!"。谁有Ryzen--让我们来测试一下!

2990WX则不同。它由四个Zeppelin晶体组成,有32个处理核心。在X399平台上,AMD对这款处理器施加了一些限制,以便不损害EPYC服务器芯片的销售。

这些限制中最主要的是只有四个内存控制器。虽然还有两个Zeppelin芯片,但AMD称它们为计算芯片。这意味着他们不能访问本地PCIe或DRAM,为此他们必须通过Infinity Fabric来处理I/O组件。由于晶体的数量是原来的两倍,因此Infinity结构的带宽是原来的两倍,如果使用DDR4-3200内存,带宽约为25Gb/s。

原因: