Given two kernel matrices of the feature representations, $K_{ij}=k(x_i,x_j)$ and $L_{ij}=l(y_i,y_j)$, HSIC is defined as $$\operatorname{HSIC}(K,L)=\frac{1}{(n-1)^2}\operatorname{tr}(KHLH),$$ where $H=I-\frac{1}{n}\mathbf{1}\mathbf{1}^\top$ is the centering matrix.
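The definition above translates directly into a few lines of NumPy; this is a minimal sketch of the empirical HSIC estimator, with a toy linear kernel as input:

```python
import numpy as np

def hsic(K, L):
    """Empirical HSIC between two kernel matrices K and L."""
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

# Toy example: a linear kernel on random features, compared with itself.
rng = np.random.default_rng(0)
x = rng.normal(size=(10, 3))
K = x @ x.T   # linear kernel on x
L = K.copy()  # identical representation, so HSIC is large
print(hsic(K, L))
```

Comparing a representation with itself gives a strictly positive value, while independent representations give values near zero.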
Using different learning rates for different layers of a neural network.
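A minimal sketch of the idea (layer names and shapes are illustrative): one plain gradient-descent step where each layer gets its own learning rate. In PyTorch the same effect is achieved with optimizer parameter groups.

```python
import numpy as np

rng = np.random.default_rng(0)
params = {
    "layer1/W": rng.normal(size=(4, 8)),
    "layer2/W": rng.normal(size=(8, 2)),
}
# Dummy gradients of ones, so the update size is easy to inspect.
grads = {name: np.ones_like(p) for name, p in params.items()}

# Smaller learning rate for the early layer, larger for the head.
lrs = {"layer1/W": 1e-3, "layer2/W": 1e-2}

before = {name: p.copy() for name, p in params.items()}
for name in params:
    params[name] -= lrs[name] * grads[name]
```

Each parameter moves by exactly its own `lr` times its gradient, so the early layer changes ten times less than the head here.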
For numerical stability, we can use the log-sum-exp trick when computing losses such as cross entropy.
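The trick replaces $\log\sum_i e^{z_i}$ with $m + \log\sum_i e^{z_i - m}$ where $m = \max_i z_i$, so no exponential ever overflows. A small sketch for cross entropy from logits:

```python
import numpy as np

def cross_entropy_from_logits(logits, target):
    """Numerically stable cross entropy via the log-sum-exp trick."""
    m = np.max(logits)
    # log(sum(exp(z))) == m + log(sum(exp(z - m))); the shifted
    # exponentials are all <= 1, so they cannot overflow.
    log_sum_exp = m + np.log(np.sum(np.exp(logits - m)))
    return log_sum_exp - logits[target]

logits = np.array([1000.0, 2.0, -5.0])  # naive exp(1000) would overflow
print(cross_entropy_from_logits(logits, target=0))
```

With the dominant logit as the target, the loss comes out close to zero instead of `nan`.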
The evidence lower bound (ELBO) is a very important concept in variational methods.
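Concretely, in the standard derivation the ELBO arises from decomposing the log evidence with a variational distribution $q(z)$:

```latex
\log p(x)
  = \underbrace{\mathbb{E}_{q(z)}\!\left[\log p(x,z)\right]
      - \mathbb{E}_{q(z)}\!\left[\log q(z)\right]}_{\text{ELBO}}
  + \mathrm{KL}\!\left(q(z)\,\|\,p(z\mid x)\right)
```

Since the KL term is nonnegative, the ELBO is a lower bound on $\log p(x)$, and maximizing it tightens the bound while pushing $q(z)$ toward the true posterior.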
Ask for valid confidence: “valid” can mean validated on the test data, the training data, or the generating …
Hierarchical Classification Problem: hierarchical classification involves hierarchical class …
Classifier chains are a method for predicting hierarchical class labels.
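A minimal sketch of the chaining idea: each stage predicts one label, and later stages see the earlier predictions as extra input. The per-stage "classifiers" here are hypothetical threshold rules purely for illustration; in practice each stage is a trained model (e.g. scikit-learn's `ClassifierChain` wraps this pattern).

```python
def stage_animal(x):
    # First link in the chain: predict the top-level label.
    return 1 if x["legs"] > 0 else 0

def stage_mammal(x, is_animal):
    # Second link: predict the child label, conditioned on the
    # prediction of the previous stage.
    return 1 if is_animal and x["fur"] else 0

def predict_chain(x):
    y1 = stage_animal(x)
    y2 = stage_mammal(x, y1)
    return [y1, y2]

print(predict_chain({"legs": 4, "fur": True}))   # [1, 1]
print(predict_chain({"legs": 0, "fur": True}))   # [0, 0]
```

Because later stages consume earlier outputs, the chain can respect the hierarchy: a child label is never predicted positive when its parent is negative.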
An artificial neuron separates the state space with a hyperplane.
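A tiny sketch of a single neuron with illustrative weights: it splits the input plane along the hyperplane $w \cdot x + b = 0$ and reports which side a point falls on.

```python
import numpy as np

w = np.array([1.0, -1.0])  # illustrative weights: the line x1 = x2
b = 0.0

def neuron(x):
    # Threshold activation: output which side of the hyperplane x is on.
    return 1 if np.dot(w, x) + b > 0 else 0

print(neuron(np.array([2.0, 1.0])))  # x1 > x2, one side of the line
print(neuron(np.array([1.0, 2.0])))  # x1 < x2, the other side
```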
Connected perceptrons
The loss calculated on all the data points
The loss calculated over the whole population.
Latent variable models bring new insights into identifying patterns in sample data.
We can set the parameters in a for loop. We take some of the initialization methods from Lippe. To …
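A sketch of what such a loop can look like (layer names, shapes, and the dispatch rules are illustrative): iterate over the parameter shapes and pick an initialization method per parameter.

```python
import numpy as np

rng = np.random.default_rng(0)
shapes = {"conv/W": (3, 3, 16), "linear/W": (16, 10), "linear/b": (10,)}

params = {}
for name, shape in shapes.items():
    if name.endswith("/b"):
        # Biases start at zero.
        params[name] = np.zeros(shape)
    elif name.startswith("conv"):
        # He-style normal init scaled by the fan-in of the kernel.
        fan_in = shape[0] * shape[1]
        params[name] = rng.normal(0.0, np.sqrt(2.0 / fan_in), shape)
    else:
        # Uniform init in [-1/sqrt(fan_in), 1/sqrt(fan_in)].
        fan_in = shape[0]
        bound = np.sqrt(1.0 / fan_in)
        params[name] = rng.uniform(-bound, bound, shape)
```

Dispatching on the parameter name keeps the initialization choices in one place instead of scattering them across layer definitions.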
Pandas groupby does not guarantee unique content in the groupby columns; it also considers the …
The Gini impurity measures how mixed the class labels in a set are.
Entropy measures the information content (uncertainty) of the dataset.
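Both impurity measures reduce to a few lines over the class proportions: Gini is $1 - \sum_i p_i^2$ and Shannon entropy is $-\sum_i p_i \log_2 p_i$. A small sketch:

```python
import numpy as np

def gini(labels):
    """Gini impurity: 1 - sum of squared class proportions."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def entropy(labels):
    """Shannon entropy in bits: -sum p_i log2 p_i."""
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

print(gini(["a", "a", "b", "b"]))     # 0.5: maximally mixed two classes
print(entropy(["a", "a", "b", "b"]))  # 1.0 bit
print(gini(["a", "a", "a", "a"]))     # 0.0: a pure set
```

Both are zero for a pure set and maximal when the classes are evenly mixed, which is why either can serve as the split criterion in a decision tree.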
During feature engineering, we have to deal with missing values.
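Two of the most common strategies can be sketched on a numeric feature column: drop the rows with missing values, or impute them with the mean of the observed values (the data here is illustrative).

```python
import numpy as np

feature = np.array([1.0, np.nan, 3.0, np.nan, 5.0])

# Strategy 1: drop the missing entries.
dropped = feature[~np.isnan(feature)]

# Strategy 2: impute with the mean of the observed values.
mean = np.nanmean(feature)  # ignores the NaNs
imputed = np.where(np.isnan(feature), mean, feature)

print(dropped)   # [1. 3. 5.]
print(imputed)   # [1. 3. 3. 3. 5.]
```

Dropping loses rows (and any other features they carry), while mean imputation keeps the rows but shrinks the feature's variance; which trade-off is acceptable depends on how much data is missing and why.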