Machine learning in trading: theory, models, practice and algo-trading - page 3517
The convolution layer and the LSTM layer. The stride is the step by which the filter moves in the convolution.
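Not from the post itself, just a minimal numpy sketch of what the stride means: a larger stride makes the filter jump further per step and yields fewer outputs (the input series, filter and sizes here are made up).

```python
import numpy as np

def conv1d(x, kernel, stride=1):
    """Slide `kernel` over `x`, moving `stride` positions per step."""
    k = len(kernel)
    return np.array([float(np.dot(x[i:i + k], kernel))
                     for i in range(0, len(x) - k + 1, stride)])

x = np.arange(10, dtype=float)           # toy input series
kernel = np.array([0.25, 0.5, 0.25])     # toy 3-tap filter

print(conv1d(x, kernel, stride=1))       # 8 outputs: the window moves 1 step at a time
print(conv1d(x, kernel, stride=2))       # 4 outputs: the window jumps 2 steps
```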
On convolutions...
If the second function in the convolution is rectangular, we get an SMA.
If that function has some other shape (triangular, exponential, etc.), we get something else (perhaps an EMA); only the coefficients applied to the different rows will differ.
But in any case, just as with SMA and EMA, there will be a lag because of the averaging.
So they seem to be of as little use as MAs. The one plus is a reduction in the number of predictors.
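For illustration only (the price series and window length are made up): convolving a series with a rectangular kernel reproduces the SMA exactly, and any other kernel shape is just a differently weighted average of the same window, so the same lag applies.

```python
import numpy as np
import pandas as pd

# Hypothetical price series, purely for illustration.
prices = pd.Series([1.10, 1.12, 1.11, 1.15, 1.14, 1.13, 1.16, 1.18, 1.17, 1.19])

n = 3
rect_kernel = np.ones(n) / n                             # rectangular kernel
conv_out = np.convolve(prices.values, rect_kernel, mode="valid")
sma = prices.rolling(n).mean().dropna().values           # ordinary SMA

print(np.allclose(conv_out, sma))                        # True: rectangular convolution == SMA

# A triangular (or exponentially decaying) kernel is just another weighting
# of the same window - different coefficients, same averaging lag.
tri_kernel = np.array([1.0, 2.0, 1.0]) / 4.0
weighted = np.convolve(prices.values, tri_kernel, mode="valid")
```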
That is somewhat different, and much more primitive.
In a convolutional network the convolution kernels are "invented" by the network itself during training, and there are thousands of such kernels/filters...
So thinking that a convolutional layer can be replaced by a single rectangular kernel is absurd.
It fits coefficients to the different rows? Then it will create some more suitable function instead of the rectangular one.
Fits coefficients to the different rows?
No, it fits the convolution kernels.
Then it will create some more suitable function instead of the rectangular one.
Yes.
Well, the coefficients of the convolution kernel are chosen so as to minimise the error, as I recall.
A matrix or vector of these coefficients slides over the matrix or vector of input data with a given step, the values are multiplied, and, for example, the window with the largest summed values is taken as the output.
The outputs are then passed to the next layer of the network, in this case the LSTM, as shown in the picture.
The LSTM then tries to remember the sequences of these values, acting as a kind of memory.
So the distinctive feature of the convolutional layer is that it can pick out the most important elements from the whole matrix, while the LSTM remembers their sequences.
The result is a selection of the important features of each row plus memorisation of their order: what comes after what. Approximation. Usually a fully connected layer with softmax is put at the end to distribute all this goodness across the classes.
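Roughly what that pipeline looks like as a minimal Keras sketch; the layer sizes, number of filters, sequence length and class count are assumptions, not taken from the post or the picture.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

# Assumed shapes: 64 bars per sample, 8 predictors per bar, 3 classes.
n_steps, n_features, n_classes = 64, 8, 3

model = keras.Sequential([
    layers.Input(shape=(n_steps, n_features)),
    layers.Conv1D(32, kernel_size=5, strides=1, activation="relu"),  # learned kernels slide over the series
    layers.MaxPooling1D(pool_size=2),                                # keep the strongest responses
    layers.LSTM(16),                                                 # remember the order of the pooled features
    layers.Dense(n_classes, activation="softmax"),                   # distribute over the classes
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])

# Random data, purely to show the call signature.
X = np.random.randn(256, n_steps, n_features).astype("float32")
y = np.random.randint(0, n_classes, size=256)
model.fit(X, y, epochs=2, batch_size=32, verbose=0)
```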
Does LSTM work with radically unbalanced classes - say, two classes with a ten-to-one ratio?
LSTM basically works shitty ) it is very hard to train properly.
There are constant gradient explosions because of the repeated multiplication of the weights.
Maybe it has been fixed somehow by now, but I doubt it, because everyone has switched to transformers.
A transformer is some combination of convolution and LSTM - unofficially you can probably draw that analogy.
It's not like random forests; with an LSTM you have to jump through hoops to train it properly.
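For reference, not from the post: the usual workarounds are gradient clipping against the exploding gradients mentioned above and class weights for the 10:1 imbalance from the question. A self-contained sketch with made-up data and arbitrary parameter values:

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

# Tiny LSTM classifier on made-up data, just to show the two knobs.
X = np.random.randn(256, 64, 8).astype("float32")
y = (np.random.rand(256) < 0.1).astype(int)            # roughly 10:1 class imbalance

model = keras.Sequential([
    layers.Input(shape=(64, 8)),
    layers.LSTM(16),
    layers.Dense(1, activation="sigmoid"),
])

# clipnorm rescales any gradient whose L2 norm exceeds 1.0 - the standard
# guard against exploding LSTM gradients.
opt = keras.optimizers.Adam(learning_rate=1e-3, clipnorm=1.0)
model.compile(optimizer=opt, loss="binary_crossentropy")

# class_weight makes the rare class count about ten times more in the loss.
model.fit(X, y, epochs=2, batch_size=32, class_weight={0: 1.0, 1: 10.0}, verbose=0)
```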
I made a visualisation to better understand how the model's results degrade.
Below is a graph showing the ZZ on the balance of weighted quantum segment errors on the train sample.
You can see that overall the trend is good, a positive probability shift, but what about the next two samples, test and exam?
Below are the graphs for these samples.
We observe a flat period and, only at the end of it, a trend towards a positive probability shift. Most likely we would have stopped a model using this quantum segment without ever waiting for the rise.
And what comes next?
On the exam sample the rise continued at the beginning, but then everything fell into a flat again. How can there be a long trend on train while later we only see occasional bursts - has something changed? One can only guess, or make an additional split on another quantum segment; but how to do that on train, where everything already looks fine, is a mystery.
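Not the author's code (the quantum segment selection itself is not shown here), just a minimal sketch of how such balance curves over chronological train/test/exam samples can be drawn, with made-up per-trade outcomes and arbitrary split sizes.

```python
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical per-trade outcomes of the model, in chronological order.
outcomes = np.random.randn(3000) * 0.5 + 0.02

# Chronological split into train / test / exam, as in the post.
splits = {"train": outcomes[:1500], "test": outcomes[1500:2250], "exam": outcomes[2250:]}

fig, axes = plt.subplots(1, 3, figsize=(12, 3))
for ax, (name, part) in zip(axes, splits.items()):
    ax.plot(np.cumsum(part))            # balance curve = cumulative sum of outcomes
    ax.set_title(name)
    ax.set_xlabel("trade #")
axes[0].set_ylabel("balance")
plt.tight_layout()
plt.show()
```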
Still, it happens that quite acceptable variants can be found - here, for example, are three samples below.
Thoughts, ideas, considerations are welcome!
If it is overfitted on train, then it should be trained less.