[hands on data analysis] Task05 - model establishment and evaluation

Basic process of modeling and evaluation: Zero, characteristic Engineering Import data: import pandas as pd import numpy as np import seaborn as sns import matplotlib.pyplot as plt from IPython.display import Image plt.rcParams['font.sans-serif'] = ['SimHei'] # Used to display Chinese labels normally plt.rcParams['axes.unicode_minus'] ...

Posted on Thu, 23 Sep 2021 08:10:50 -0400 by jOE :D

Data analysis and mining 3 - Feature Engineering

Data and features determine the upper limit of machine learning, and models and algorithms only approximate this upper limit 1. Data preprocessing data acquisitionData cleaning: remove dirty dataData sampling: it can be used when the data is unbalanced, including up sampling and down sampling; Positive sample > negative sample, and the amo ...

Posted on Tue, 21 Sep 2021 18:16:19 -0400 by little_webspinner

Jingdong platform small household appliance user portrait Analysis Report

1, Project background As the number of orders, product browsing, search and other indicators of small household appliances have decreased recently, a promotion is planned. Before the activity, I hope to give some suggestions according to the user characteristics of small household appliances. Data: there are two tables, user_info user informa ...

Posted on Mon, 20 Sep 2021 13:02:15 -0400 by kmemis

Datawhale September team learning - hands on data analysis task2_ Learning records

Data cleaning and feature processing Usually, the original data is not clean, and there may be outliers, missing values and other problems. Therefore, it is generally necessary to clean the data before data analysis. Read a file first #Load the required libraries import numpy as np import pandas as pd #Load data train.csv df = pd.read_csv(' ...

Posted on Mon, 20 Sep 2021 10:59:36 -0400 by gerbs987

❤️ 20000 words, 50 pandas, high frequency operation [pictures and texts, worth collecting] ❤️

Point, knock on the blackboard First of all, this paper follows the traditional teaching, point to point! Only some functions or processing methods that are frequently used by individuals are introduced.The examples in this article are only used for demonstration. Generally, the examples do not modify the original data. If the code will modif ...

Posted on Sun, 19 Sep 2021 17:00:54 -0400 by GESmithPhoto

Taobao user behavior analysis

1, Data source         The data comes from Alibaba Tianchi. The source data has about 100 million records. Due to hardware reasons, only the data from November 25, 2017 to December 4, 2017 are intercepted for analysis. 2, Data structure Column nameexplainuser_idUser id, integer type, serialized user iditem_idCommodity id ...

Posted on Sun, 05 Sep 2021 22:34:54 -0400 by LanceEh