Table of Contents



Resources:

Graphical Models

  1. Motivation:
    Machine learning algorithms often involve probability distributions over a very large number of random variables. Often, these probability distributions involve direct interactions between relatively few variables. Using a single function to describe the entire joint probability distribution can be very inefficient (both computationally and statistically).

    A description of a probability distribution is exponential in the number of variables it models.

  2. Graphical Model:
    A graphical model or probabilistic graphical model (PGM) or structured probabilistic model is a probabilistic model for which a graph expresses the conditional dependence structure (factorization of a probability distribution) between random variables.

    Generally, this is one of the most common statistical models

    Properties:

    • Factorization
    • Independence

    Graph Structure:
    A PGM uses a graph \(\mathcal{G}\) in which each node in the graph corresponds to a random variable, and an edge connecting two r.vs means that the probability distribution is able to represent interactions between those two r.v.s.

    Types:

    • Directed:
      Directed models use graphs with directed edges, and they represent factorizations into conditional probability distributions.
      They contain one factor for every random variable \(x_i\) in the distribution, and that factor consists of the conditional distribution over \(x_i\) given the parents of \(x_i\).
    • Undirected:
      Undirected models use graphs with undirected edges, and they represent factorizations into a set of functions; unlike in the directed case, these functions are usually not probability distributions of any kind.

    Core Idea of Graphical Models:
    The probability distribution factorizes according to the cliques in the graph, with the potentials usually being of the exponential family (and a graph expresses the conditional dependence structure between random variables).

  3. Neural Networks and Graphical Models:
    Deep NNs as PGMs:
    You can view a deep neural network as a graphical model, but here, the CPDs are not probabilistic but are deterministic. Consider for example that the input to a neuron is \(\vec{x}\) and the output of the neuron is \(y .\) In the CPD for this neuron we have, \(p(\vec{x}, y)=1,\) and \(p(\vec{x}, \hat{y})=0\) for \(\hat{y} \neq y .\) Refer to the section 10.2 .3 of Deep Learning Book for more details.


Bayesian Network



Random Field Techniques

  1. Random Field:
    A Random Field is a random function over an arbitrary domain (usually a multi-dimensional space such as \(\mathbb{R}^{n}\) ). That is, it is a function \(f(x)\) that takes on a random value at each point \(x \in \mathbb{R}^{n}\) (or some other domain). It is also sometimes thought of as a synonym for a stochastic process with some restriction on its index set. That is, by modern definitions, a random field is a generalization of a stochastic process where the underlying parameter need no longer be real or integer valued “but can instead take values that are multidimensional vectors on some manifold.

    Formally
    Given a probability space \((\Omega, \mathcal{F}, P),\) an \(X\) -valued random field is a collection of \(X\) -valued random variables indexed by elements in a topological space \(T\). That is, a random field \(F\) is a collection

    $$\left\{F_{t} : t \in T\right\}$$

    where each \(F_{t}\) is an \(X\)-valued random variable.

    Notes:

    • Random Field (wiki)
    • Generative vs Discriminative Models for Sequence Labeling Tasks:
      Generative model makes more restrictive assumption about the distribution of \(x\).
      “Unlike traditional generative random fields, CRFs only model the conditional distribution \(p(t | x)\) and do not explicitly model the marginal \(p(x)\). Note that the labels \(t i\) are globally conditioned on the whole observation \(x\) in CRF. Thus, we do not assume that the observed data \(x\) are conditionally independent as in a generative random field.” - Minka