Niflheim World

Programming Data Mining Algorithms in C++: Data Patterns and Algorithms for Modern Applications. Timothy Masters


Redman

Discover hidden relationships among the variables in your data, and learn how to exploit these relationships. This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications. All algorithms include an intuitive explanation of operation, essential equations, references to more rigorous theory, and commented C++ source code. Many of these techniques are recent developments, still not in widespread use. Others are standard algorithms given a fresh look. In every case, the focus is on practical applicability, with all code written in such a way that it can easily be included into any program. The Windows-based DATAMINE program lets you experiment with the techniques before incorporating them into your own work.

What you'll learn
  • Monte-Carlo permutation tests provide statistically sound assessment of relationships present in your data.
  • Combinatorially symmetric cross validation reveals whether your model has true power or has just learned noise by overfitting the data.
  • Feature weighting as regularized energy-based learning ranks variables according to their predictive power when there is too little data for traditional methods.
  • The eigenstructure of a dataset enables clustering of variables into groups that exist only within meaningful subspaces of the data.
  • Plotting regions of the variable space where there is disagreement between marginal and actual densities, or where contribution to mutual information is high, provides visual insight into anomalous relationships.
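
To give a rough idea of what the first item in the list means in practice, here is a minimal sketch of a Monte-Carlo permutation test in C++. It is not taken from the book or from DATAMINE. The choice of Pearson correlation as the test statistic and the names pearson and permutation_p_value are assumptions made purely for illustration. The idea: shuffle one variable many times, recompute the statistic each time, and see how often chance alone matches the value observed on the real pairing.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <numeric>
#include <random>
#include <vector>

// Pearson correlation of two equal-length series.
// Returns 0 when either series is constant (correlation undefined).
// Hypothetical helper, chosen here only as an example test statistic.
double pearson(const std::vector<double>& x, const std::vector<double>& y) {
    assert(x.size() == y.size() && !x.empty());
    const double n = static_cast<double>(x.size());
    const double mx = std::accumulate(x.begin(), x.end(), 0.0) / n;
    const double my = std::accumulate(y.begin(), y.end(), 0.0) / n;
    double sxy = 0.0, sxx = 0.0, syy = 0.0;
    for (std::size_t i = 0; i < x.size(); ++i) {
        const double dx = x[i] - mx;
        const double dy = y[i] - my;
        sxy += dx * dy;
        sxx += dx * dx;
        syy += dy * dy;
    }
    const double denom = std::sqrt(sxx * syy);
    return denom > 0.0 ? sxy / denom : 0.0;
}

// Monte-Carlo permutation test (illustrative sketch, not the book's code).
// Null hypothesis: the pairing between x and y is random.
// y is taken by value so the caller's data is left untouched.
double permutation_p_value(const std::vector<double>& x,
                           std::vector<double> y,
                           int n_perm,
                           unsigned seed) {
    assert(n_perm > 0);
    const double observed = std::fabs(pearson(x, y));
    std::mt19937 rng(seed);
    int at_least_as_extreme = 0;
    for (int r = 0; r < n_perm; ++r) {
        // Destroy any real relationship by shuffling one variable.
        std::shuffle(y.begin(), y.end(), rng);
        if (std::fabs(pearson(x, y)) >= observed) {
            ++at_least_as_extreme;
        }
    }
    // Count the original, unpermuted arrangement as one of the
    // permutations, so the estimate can never be exactly zero.
    return (at_least_as_extreme + 1.0) / (n_perm + 1.0);
}
```

A call such as permutation_p_value(x, y, 999, 42u) returns an estimated p-value. A small value suggests the observed relationship is unlikely to arise from random pairing alone. The more permutations you run, the finer the resolution of the estimate.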

Who this book is for
The techniques presented in this book and in the DATAMINE program will be useful to anyone interested in discovering and exploiting relationships among variables. Although all code examples are written in C++, the algorithms are described in sufficient detail that they can easily be programmed in any language.

 