### [TensorFlow Learning Notes -05] Batch Normalization (BN)

### scikit-learn

scikit-learn data preprocessing: normalization, standardization, and regularization.

1. Z-score standardization, also described as removing the mean and scaling to unit variance. The formula is (X - mean) / std, computed separately for each attribute (column): subtract the column mean from each value, then divide by the column standard deviation. As a result, the values of each attribute are centered on 0 with a variance of 1. It can be implemented in two different ways. The first is the `sklearn.preprocessing.scale()` function, which standardizes the given data directly.
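As a rough illustration of the formula, the sketch below standardizes a small made-up array with `sklearn.preprocessing.scale()` and compares it with the manual computation. The excerpt is cut off before it names the second way, so the use of `StandardScaler` here is an assumption about a common alternative, not necessarily what the original article describes. The array `X` is hypothetical toy data.

```python
import numpy as np
from sklearn.preprocessing import StandardScaler, scale

# Hypothetical toy data: 3 samples (rows), 3 attributes (columns).
X = np.array([[1.0, -1.0, 2.0],
              [2.0, 0.0, 0.0],
              [0.0, 1.0, -1.0]])

# Standardize directly with the scale() function.
X_scaled = scale(X)

# Assumed alternative: fit a StandardScaler so the same mean and std
# can be reused on new data later.
scaler = StandardScaler().fit(X)
X_scaled_2 = scaler.transform(X)

# Manual z-score for comparison: (X - mean) / std, column by column.
X_manual = (X - X.mean(axis=0)) / X.std(axis=0)

print(X_scaled.mean(axis=0))  # close to 0 for every column
print(X_scaled.std(axis=0))   # close to 1 for every column
```

Keeping a fitted scaler object is usually preferable when the same transformation has to be applied to a held-out test set.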

### scipy.special.boxcox1p

```python
# `skewness` and `all_data` are defined earlier in the original article.
skewness = skewness[abs(skewness) > 0.75]
print("There are {} skewed numerical features to Box Cox transform".format(skewness.shape[0]))

from scipy.special import boxcox1p

skewed_features = skewness.index
lam = 0.15
for feat in skewed_features:
    #all_data[feat] += 1
    all_data[feat] = boxcox1p(all_data[feat], lam)

help(boxcox1p)
```

```text
Help on ufunc object:

boxcox1p = class ufunc(builtins.object)
 |  Functions that operate element by element on whole arrays.
 |
 |  To see the documentation for a specific ufunc, use info.
```
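The excerpt relies on `skewness` and `all_data` defined elsewhere in the original article. The sketch below is a self-contained variant under stated assumptions: the data frame, its column names, and the random data are all hypothetical, and skewness is measured with `scipy.stats.skew`, which may differ from what the article used. For a nonzero `lmbda`, `boxcox1p(x, lmbda)` computes `((1 + x)**lmbda - 1) / lmbda`, and for `lmbda == 0` it computes `log(1 + x)`.

```python
import numpy as np
import pandas as pd
from scipy.special import boxcox1p
from scipy.stats import skew

# Hypothetical data: one right-skewed column and one roughly symmetric column.
rng = np.random.default_rng(0)
all_data = pd.DataFrame({
    "skewed": rng.exponential(scale=2.0, size=1000),
    "symmetric": rng.uniform(low=0.0, high=10.0, size=1000),
})
original = all_data.copy()  # kept only to compare before and after

# Per-column skewness; keep only the strongly skewed features.
skewness = all_data.apply(skew)
skewness = skewness[abs(skewness) > 0.75]
skewed_features = skewness.index

# Apply the Box-Cox transform of (1 + x) to each skewed feature.
lam = 0.15
for feat in skewed_features:
    all_data[feat] = boxcox1p(all_data[feat], lam)

print("Transformed features:", list(skewed_features))
print("Skew before:", skew(original["skewed"]))
print("Skew after: ", skew(all_data["skewed"]))
```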

### sklearn naive bayes class library usage summary

scikit-learn provides three naive Bayes classification algorithms: `GaussianNB` (Gaussian naive Bayes), `MultinomialNB` (multinomial naive Bayes), and `BernoulliNB` (Bernoulli naive Bayes).

1. Gaussian naive Bayes: `sklearn.naive_bayes.GaussianNB(priors=None)`

① Using the `GaussianNB` class to build a simple model

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

X = np.array([[-1, -1], [-2, -2], [-3, -3], [-4, -4], [-5, -5], [1, 1], [2, 2], [3, 3]])
y = np.array([1,
```
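The excerpt above is cut off before the labels are given, so the following is only a minimal sketch of how such a model might be completed. The label vector `y` and the two query points are hypothetical choices made for illustration and are not taken from the original article.

```python
import numpy as np
from sklearn.naive_bayes import GaussianNB

X = np.array([[-1, -1], [-2, -2], [-3, -3], [-4, -4], [-5, -5],
              [1, 1], [2, 2], [3, 3]])
# Hypothetical labels: class 1 for the negative points, class 2 for the positive ones.
y = np.array([1, 1, 1, 1, 1, 2, 2, 2])

# With priors=None, the class priors are estimated from the training labels.
clf = GaussianNB(priors=None)
clf.fit(X, y)

# Hypothetical query points, one near each cluster.
pred = clf.predict(np.array([[-6, -6], [4, 5]]))
print(pred)              # predicted class for each query point
print(clf.class_prior_)  # estimated priors: the fraction of samples in each class
```

Passing an explicit `priors` array instead of `None` overrides the class frequencies estimated from the data.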

### Bayesian Classification for Machine Learning

Maximum likelihood estimation. Training a probabilistic model amounts to parameter estimation. The Bayesian school holds that the parameters are unobserved random variables that may have their own distribution, so one assumes a prior distribution over the parameters and then computes the posterior distribution from the observed data. The frequentist school holds that although the parameters are unknown, they have objectively fixed values, so they can be determined by optimizing the likelihood function.
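To make the contrast concrete, here is a small worked example for a coin with unknown heads probability. The observed counts and the Beta(2, 2) prior are assumptions chosen purely for illustration. The frequentist estimate maximizes the Bernoulli likelihood, which gives heads / tosses; the Bayesian treatment updates the Beta prior to a Beta posterior and can report its mean or mode.

```python
# Hypothetical observations: 7 heads in 10 tosses.
heads, tosses = 7, 10
tails = tosses - heads

# Frequentist view: theta is a fixed unknown value.
# Maximizing heads*log(theta) + tails*log(1 - theta) gives theta = heads / tosses.
theta_mle = heads / tosses

# Bayesian view: theta is a random variable with an assumed Beta(2, 2) prior.
alpha_prior, beta_prior = 2.0, 2.0

# A Beta prior is conjugate to the Bernoulli likelihood, so the posterior
# is Beta(alpha_prior + heads, beta_prior + tails).
alpha_post = alpha_prior + heads
beta_post = beta_prior + tails

# Two common point summaries of the posterior distribution.
theta_post_mean = alpha_post / (alpha_post + beta_post)
theta_map = (alpha_post - 1.0) / (alpha_post + beta_post - 2.0)

print("MLE:           ", theta_mle)
print("Posterior mean:", theta_post_mean)
print("MAP estimate:  ", theta_map)
```

With this prior, both Bayesian summaries are pulled from the MLE toward 0.5, and the pull shrinks as more tosses are observed.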

### Definition of Text Categorization Problem

The text classification problem (the words "text" and "document" are used interchangeably below) is to assign a document to one or more of several predefined categories; automatic text classification uses a computer program to perform this assignment. To put it more bluntly, it's like taking an article and asking the computer whether the article is about sports, the economy, or education. If the computer can't answer, it gets a spanking (…).
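A minimal sketch of this setup, assuming a bag-of-words representation and a multinomial naive Bayes classifier from scikit-learn, which is one possible choice and not necessarily the method the original article goes on to describe. The tiny corpus, its labels, and the query sentences are all made up for illustration.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Hypothetical training documents with predefined categories.
train_docs = [
    "the team won the football match",
    "the striker scored a goal in the match",
    "the central bank raised interest rates",
    "stock market and inflation rates fell",
    "the school hired new teachers",
    "students passed the university exam",
]
train_labels = ["sports", "sports", "economy", "economy", "education", "education"]

# Word counts feed a multinomial naive Bayes classifier.
model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(train_docs, train_labels)

# Ask the classifier which category each new document belongs to.
new_docs = [
    "the football team scored a goal",
    "inflation and interest rates",
    "teachers and students at the school",
]
predicted = model.predict(new_docs)
for doc, label in zip(new_docs, predicted):
    print(f"{label}: {doc}")
```

A corpus this small only shows the mechanics; a realistic classifier needs far more labeled documents per category.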

### Implementing the Naive Bayes Classifier from Machine Learning in Action in Python 3

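All of the code in this article is stored in the `bayes.py` file, which makes it convenient to test and run.

```python
from numpy import *

def loadDataSet():
    """
    Purpose: function for converting a word list to vectors
    Output: 1. The collection of documents after tokenization. 2. The collection of class labels; these …
```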