The text and pictures of this article are from the Internet, only for learning and communication, not for any commercial purpose. The copyright belongs to the original author. If you have any questions, please contact us in time for handling.
Generally speaking, the basic process of data analysis includes the following steps:
1. Ask ...
Posted on Tue, 12 May 2020 03:52:20 -0400 by BIOSTALL
In the last blog, we used word bag model, including word frequency matrix, TF IDF matrix, LSA and n-gram to construct text features, and did the movie comment emotion classification on Kaggle.
This blog is still about text feature engineering, using word embedding to construct text features, that is, using word2vec, glove and fasttext word vec ...
Posted on Tue, 21 Apr 2020 06:58:19 -0400 by chamal
Data types in data processing
When processing data with pandas, we often encounter the problem of data type. When we get the data, we first need to make sure that we get the correct type of data. Generally, through the conversion of data type, this article introduces the data type inside pandas (data types are commonly used dtyps), and the da ...
Posted on Sun, 19 Apr 2020 02:10:28 -0400 by mouse02
The video instructions are as follows:
Complete the Theory and Practice of Machine Learning Algorithms--Logistic Regression
Video address: https://www.bilibili.com/video/av95806420/
How to use Logical Regression for Biclassificatio ...
Posted on Sat, 14 Mar 2020 20:17:16 -0400 by todd-imc
When using SQLAlchemy ORM to query data, if the required records are filtered according to the selected conditions, you can refer to the relevant methods described in this article. There's no skill, just familiarity. Use the data source I often use Sample Data . It is recommended to use jupyter noteboo ...
Posted on Sat, 07 Mar 2020 08:28:57 -0500 by ams007
Click AI Channel above to select Top Public Number
Heavy dry goods, first delivery
Author: Yingxiang Chen & Zihan Yang
Edit: Red Stone
The importance of Feature Engineering in machine learni ...
Posted on Sun, 23 Feb 2020 20:06:13 -0500 by jac38
The example of this article comes from Delta Lake Official course . Because the official tutorial is based on the commercial software Databricks Community Edition. Although the software features used in the tutorial are all possessed by the open-source Delta Lake version, considering the domestic network environment, the threshold for register ...
Posted on Sun, 23 Feb 2020 00:35:26 -0500 by GroundZeroStudios
Record the second time of punch in team learning of "manual learning deep learning"
Linear regression code implementation (based on Python)
Part of linear regression theory can be referred to Last blog
Realization of linear regression model f ...
Posted on Tue, 18 Feb 2020 07:18:55 -0500 by gotry
matplotlib is a common drawing library, which supports python and Jupyter Notebook, as well as the latest jupyterab environment. This paper introduces the font setting method of matplotlib and the setting of drawing linetype, symbol and color.
1. Chinese font
As for the problem of Chinese character scrambling, https://www.linuxidc.com/Linux ...
Posted on Mon, 10 Feb 2020 10:19:06 -0500 by youneek
Dask parallel task scheduling
Introduction to Dask
Dask Is a flexible library for parallel computing in Python.
Darth consists of two parts:
Dynamic task scheduling is optimized for computing. This is similar to Airflow, Luigi, gallery, or Make, but has been optimized for interactive computing work ...
Posted on Sat, 01 Feb 2020 09:02:05 -0500 by flashmonkey