Random/incomplete thoughts on various ML topics A Few Notes on attention mechanisms A Few Notes on Fisher Information (WIP) Some Notes on Kolmogorov Complexity, the Algorithmic Markov Condition, and Causality Causal Graphs and Sources of Bias Likelihood Ratio Policy Gradients for Reinforcement Learning Notes on Policy Gradients (Trust Region Policy Optimization, under construction) Notes on Policy Gradients (under construction) Notes on Noise Contrastive Estimation (NCE) Notes on Maximization of Inner Products over Norm Balls Notes on Adversarial Examples Basic probability stuff that everyone likely should know Notes on Variational Autoencoders Notes on policy gradient and the log derivative trick for reinforcement learning (under construction) LDA Intro/overview (also under construction) Brief intro to auto-encoders (I would explain this differently/use different notation these days) Pretty old random idea How exactly does word2vec work? Why is minimizing error the same thing as maximizing likelihood is the same thing as finding a low energy state... A bit on sequence to sequence learning (e.g Google Inbox smart reply) Code Snippets VariationalAutoencoder (VAE) and other examples written in tensorflow. An example reconstruction of MNIST digits by the VAE is shown below. . lr.py: Very simple linear regression in tensorflow (made up dataset) k-prototypes.py: Categorical Clustering for Netflow data Softmax regression on MNIST in tensorflow PCA + K-Means (Spark/MLlib/Scala) on the KDDCUP99 data set Last Update: 05.02.2021 by dmm@1-4-5.net