Data mining regression analysis

regression analysis Regression analysis is a widely used quantitative analysis method. It is used to analyze the statistical relationship between things, focus on the quantitative change law between variables, and describe and reflect this relationship in the form of regression equation, so as to help people accurately grasp the degree of vari ...

Posted on Sun, 03 Oct 2021 21:13:38 -0400 by JasperBosch

Common feature selection methods

conclusion Filtering methods are faster but coarser. Packaging and embedding methods are more precise and more suitable for adjustment to algorithms, but they are computationally intensive and take longer to run. When there is a large amount of data, differential filtering and mutual information methods are preferred before other feature ...

Posted on Sun, 03 Oct 2021 13:10:42 -0400 by FireyIce01

Data mining training camp data mining: game problem understanding learning notes

1, Summary of learning points Information obtained from competition informationRead dataEvaluation and calculation of classification indexOn parity calculation of regression indexUnderstanding of some nouns 2, Learning content: 1. New knowledge learned from the competition a. Desensitization: process some private information, such as 186 ...

Posted on Fri, 01 Oct 2021 19:35:04 -0400 by davidguz

The bottom layer implements the K-means + + algorithm and is used to find data outliers

preface In this article, we solve the problem of outlier screening using the data of the overall dimension based on our own defined methods rather than calling ready-made modules, and finally visually display the results. Years are like clouds, bandits I want to save, and writing is not easy. I hope friends passing by will praise, collect ...

Posted on Sat, 25 Sep 2021 07:42:00 -0400 by dgudema

Data analysis and mining 3 - Feature Engineering

Data and features determine the upper limit of machine learning, and models and algorithms only approximate this upper limit 1. Data preprocessing data acquisitionData cleaning: remove dirty dataData sampling: it can be used when the data is unbalanced, including up sampling and down sampling; Positive sample > negative sample, and the amo ...

Posted on Tue, 21 Sep 2021 18:16:19 -0400 by little_webspinner