Discussion of article "Data Science and Machine Learning (Part 08): K-Means Clustering in plain MQL5"

MetaQuotes 2022.10.25 13:10

New article Data Science and Machine Learning (Part 08): K-Means Clustering in plain MQL5 has been published:

Data mining is crucial to a data scientist and a trader because very often, the data isn't as straightforward as we think it is, The human eye can not understand the minor underlying pattern and relationships in the dataset, maybe the K-means algorithm can help us with that. Let's find out...

Clustering analysis is a task of grouping a set of objects in such a way that objects with the same attributes are placed within the same groups (clusters).

If you go to the mall, you will find similar items kept together right? Someone did the process of grouping them, When the dataset isn't grouped the clustering analysis will do just like that, group the data values that are more similar(in some sense) to each other than the rest of the groups(clusters).

Clustering analysis itself is not a specific algorithm. The general task can be solved through various algorithms that differ significantly in terms of their understanding of what constitutes a cluster.

Img src: wikipedia

There are three types of clustering widely known;

Exclusive clustering
Overlapping clustering
Hierachial clustering

Author: Omega J Msigwa

Zhiqiang Zhu 2023.02.11 06:08 #1

First of all, I would like to thank the author for sharing this article. I hope that the author, in addition to explaining these theories, can give examples of how K-mean clustering is used in real trading, if there is no corresponding example, this or other articles by the author are practically indistinguishable from textbooks. Machine learning is used in many fields.

It would be nice if the author could better illustrate these machine learning theories with examples of MT5 trading mechanisms. Thanks again for sharing.

A trend-following strategy. [ARCHIVE] Any rookie question, How to use Shark

Mahdi Ebrahimzadeh 2024.01.21 15:43 #2

MetaQuotes:

New article Data Science and Machine Learning (Part 08): K-Means Clustering in plain MQL5 has been published:

Author: Omega J Msigwa

m_cols = Matrix.Cols();
      n = Matrix.Rows(); //number of elements | Matrix Rows
      
      InitialCentroids.Resize(m_clusters,m_cols);     
      vector cluster_comb_v = {};
      matrix cluster_comb_m = {};      
      vector rand_v = {};
      
      for (ulong i=0; i<m_clusters; i++) 
        {
          rand_v = Matrix.Row(i * m_clusters); 
          InitialCentroids.Row(rand_v,i);
        }     
     Print("Initial Centroids matrix\n",InitialCentroids);

hi Omega J Msigwa, thanks your very useful article.

am I missing something or in above code you mean DMatrix?

Omega J Msigwa 2024.01.22 07:35 #3

Mahdi Ebrahimzadeh #:

hi Omega J Msigwa, thanks your very useful article.

am I missing something or in above code you mean DMatrix?

I mean Matrix as explained in the article, since this code is found under the function

void CKMeans::KMeansClustering(const matrix &Matrix, matrix &clustered_matrix,int iterations = 10)
 { 
      m_cols = Matrix.Cols();
      n = Matrix.Rows(); //number of elements | Matrix Rows
      
      InitialCentroids.Resize(m_clusters,m_cols);
      cluster_assign.Resize(n);
            
      clustered_matrix.Resize(m_clusters, m_clusters*n);
      clustered_matrix.Fill(NULL);
      
      vector cluster_comb_v = {};
      matrix cluster_comb_m = {};      
      vector rand_v = {};      
      for (ulong i=0; i<m_clusters; i++) 
        {
          rand_v = Matrix.Row(i * m_clusters); 
          InitialCentroids.Row(rand_v,i);
        }     
     Print("Initial Centroids matrix\n",InitialCentroids);    
.... rest of the code

New comment