- From the course Probabilistic Models and Inference Algorithms for Machine Learning, Prof. Dahua Lin
- All content here is from the course, together with my own understanding.
Basic Concepts
- The key idea behind graphical models is factorization.
- A graphical model generally refers to a family of joint distributions over multiple variables that factorize according to the structure of the underlying graph.
- Graphical models can be understood in two ways:
  - As a data structure that describes a joint distribution in a factorized manner.
  - As a compact representation of a family of distributions characterized by a set of conditional independencies.
- These two views are in fact equivalent.
Categories of Graphical Models:
- Bayesian Networks (Directed Acyclic Graphs)
- Markov Random Fields (Undirected Graphs)
- Chain Graphs (Directed acyclic graphs over undirected components)
- Factor Graphs
Directed Acyclic Graph
- A directed graph G is called a directed acyclic graph (DAG) if it has no directed cycles (i.e., no vertex has a directed path back to itself).
- Because edges in a directed graph are oriented, each directed edge distinguishes a parent (the source) from a child (the target).
- A vertex s is called an ancestor of t, and t a descendant of s, denoted s ≺ t, if there exists a directed path from s to t.
- Topological Ordering: A topological ordering of a directed graph G = (V, E) is a linear ordering of the vertices such that for each edge (s, t) ∈ E, s comes before t.
- A finite directed graph is acyclic if and only if it has a topological ordering.
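The "acyclic iff a topological ordering exists" characterization can be sketched with Kahn's algorithm: repeatedly emit a vertex with no remaining parents; if the graph has a directed cycle, some vertices are never emitted. The graph below is a made-up example.

```python
from collections import deque

def topological_order(vertices, edges):
    """Kahn's algorithm: return a topological ordering of a directed graph,
    or None if the graph contains a directed cycle (i.e., it is not a DAG)."""
    indegree = {v: 0 for v in vertices}
    children = {v: [] for v in vertices}
    for s, t in edges:
        children[s].append(t)
        indegree[t] += 1
    # Start from vertices with no parents.
    queue = deque(v for v in vertices if indegree[v] == 0)
    order = []
    while queue:
        s = queue.popleft()
        order.append(s)
        for t in children[s]:
            indegree[t] -= 1
            if indegree[t] == 0:
                queue.append(t)
    # If some vertex was never emitted, a directed cycle exists.
    return order if len(order) == len(vertices) else None

# A small DAG: a -> b, a -> c, b -> d, c -> d
print(topological_order("abcd", [("a", "b"), ("a", "c"), ("b", "d"), ("c", "d")]))
# -> ['a', 'b', 'c', 'd']
# A graph with a directed cycle has no topological ordering.
print(topological_order("ab", [("a", "b"), ("b", "a")]))  # -> None
```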
Bayesian Networks
- Given a DAG G = (V, E), we say a joint distribution over X_V factorizes according to G if its density p can be expressed as
  p(x_V) = ∏_{s ∈ V} p_s(x_s | x_{π(s)})
- Such a model is called a Bayesian Network over G.
- π(s) is the set of s's parents, which can be empty.
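The factorization can be illustrated with a tiny hypothetical two-node network Rain → WetGrass (the CPT numbers below are made up): the joint is just the product of one local conditional per vertex, and no normalizing constant is needed.

```python
# Hypothetical 2-node Bayesian network: Rain -> WetGrass (made-up CPTs).
# Joint factorizes as p(rain, wet) = p(rain) * p(wet | rain).
p_rain = {True: 0.2, False: 0.8}
p_wet_given_rain = {
    True:  {True: 0.9, False: 0.1},
    False: {True: 0.1, False: 0.9},
}

def joint(rain, wet):
    # Product of one local conditional per vertex: p_s(x_s | x_{pi(s)}).
    return p_rain[rain] * p_wet_given_rain[rain][wet]

print(joint(True, True))  # 0.2 * 0.9 = 0.18

# The joint sums to 1 without any explicit normalization,
# because each local factor is itself a (conditional) distribution.
total = sum(joint(r, w) for r in (True, False) for w in (True, False))
print(total)  # 1.0
```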
Markov Random Fields
- Consider an undirected graph G = (V, E).
- Clique: a fully connected subset of vertices.
- A clique is called maximal if it is not properly contained in another clique (i.e., adding any other vertex would make it no longer a clique).
- C(G) denotes the set of all maximal cliques.
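On a tiny graph, maximal cliques can be found by brute force: enumerate every fully connected vertex subset, then keep those not properly contained in another clique. A minimal sketch on a hypothetical 4-vertex graph:

```python
from itertools import combinations

def is_clique(S, edges):
    # A subset is a clique if every pair of its vertices is an edge.
    return all(frozenset(p) in edges for p in combinations(S, 2))

def maximal_cliques(vertices, edges):
    """Brute-force maximal-clique enumeration (only sensible for tiny graphs)."""
    cliques = [set(S) for r in range(1, len(vertices) + 1)
               for S in combinations(vertices, r)
               if is_clique(S, edges)]
    # Keep only cliques not properly contained in another clique.
    maximal = [C for C in cliques if not any(C < D for D in cliques)]
    return sorted(tuple(sorted(C)) for C in maximal)

# Triangle a-b-c plus a pendant edge c-d.
V = "abcd"
E = {frozenset(p) for p in [("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")]}
print(maximal_cliques(V, E))  # [('a', 'b', 'c'), ('c', 'd')]
```

Note that the edge {a, b} is a clique but not a maximal one, since it is properly contained in {a, b, c}.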
- Consider an undirected graph G = (V, E). We say a joint distribution of X_V factorizes according to G if its density p can be expressed as
  p(x_V) = (1/Z) ∏_{C ∈ C(G)} ψ_C(x_C)
- This is called a Markov Random Field over G.
- ψ_C : X_C → R_+ are called factors.
- The normalizing constant Z is usually needed to ensure the distribution is properly normalized:
  Z = ∫ ∏_{C ∈ C(G)} ψ_C(x_C) μ(dx)
- The ψ_C are also called compatibility functions; they need not correspond to marginal or conditional distributions.
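For discrete variables the integral becomes a sum. A minimal sketch on a hypothetical binary chain x1 - x2 - x3, whose maximal cliques are the two edges; the factor values are made up and reward agreement (they are not themselves distributions):

```python
from itertools import product

# Compatibility function for an edge clique: rewards agreeing neighbors.
def psi(a, b):
    return 2.0 if a == b else 1.0  # not a marginal or conditional distribution

states = [0, 1]

# Z sums the product of all clique factors over every configuration.
Z = sum(psi(x1, x2) * psi(x2, x3) for x1, x2, x3 in product(states, repeat=3))
print(Z)  # 18.0

def p(x1, x2, x3):
    # p(x_V) = (1/Z) * prod_{C in C(G)} psi_C(x_C)
    return psi(x1, x2) * psi(x2, x3) / Z

print(p(0, 0, 0))  # 4/18, the weight of an all-agreeing configuration
```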
Analysis of Conditional Independence
- The graphical structure also encodes a set of conditional independencies among the variables.
- Consider a joint distribution over (X, Y, Z). X and Y are called conditionally independent given Z, denoted by X ⊥ Y | Z, if
  Pr(X ∈ A, Y ∈ B | Z) = Pr(X ∈ A | Z) Pr(Y ∈ B | Z)
  or, more generally,
  E_{X,Y|Z}[f(X) g(Y)] = E_{X|Z}[f(X)] E_{Y|Z}[g(Y)]
- Suppose the conditional distributions X|Z and Y|Z have densities p_{X|z} and p_{Y|z}. Then X ⊥ Y | Z if the following equality holds almost surely:
  p_{(X,Y)|z}(x, y) = p_{X|z}(x) p_{Y|z}(y)
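For discrete variables the density condition can be checked directly from the joint table. A minimal sketch with a hypothetical joint over binary (X, Y, Z), built so that X ⊥ Y | Z holds by construction (all numbers made up):

```python
from itertools import product

# Joint constructed as p(x, y, z) = p(z) p(x|z) p(y|z), so X and Y are
# conditionally independent given Z by construction.
p_z = {0: 0.5, 1: 0.5}
p_x_given_z = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.3, 1: 0.7}}
p_y_given_z = {0: {0: 0.8, 1: 0.2}, 1: {0: 0.4, 1: 0.6}}

def joint(x, y, z):
    return p_z[z] * p_x_given_z[z][x] * p_y_given_z[z][y]

def is_ci(joint, tol=1e-12):
    """Check p(x, y | z) == p(x | z) p(y | z) for every configuration."""
    for z in (0, 1):
        pz = sum(joint(x, y, z) for x, y in product((0, 1), repeat=2))
        for x, y in product((0, 1), repeat=2):
            p_xy = joint(x, y, z) / pz                       # p(x, y | z)
            p_x = sum(joint(x, yy, z) for yy in (0, 1)) / pz  # p(x | z)
            p_y = sum(joint(xx, y, z) for xx in (0, 1)) / pz  # p(y | z)
            if abs(p_xy - p_x * p_y) > tol:
                return False
    return True

print(is_ci(joint))  # True
```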
Factor Graphs
- An MRF does not always fully reveal the factorized structure of a distribution.
- A factor graph can sometimes give a more accurate characterization of a family of distributions.
- A factor graph is a bipartite graph with links between two types of nodes: variables and factors.
- A variable x and a factor f are linked in a factor graph if the factor involves x as an argument.
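A minimal sketch of this bipartite structure (all names hypothetical): variables and factors are the two node types, and a factor is linked to exactly the variables it takes as arguments. With the two distinct edge factors below, the factor graph records which factor touches which variables, information an MRF edge set alone would not distinguish.

```python
# Each factor: (linked variable nodes, compatibility function). Made-up values.
factors = {
    "f_a": (("x1", "x2"), lambda x1, x2: 2.0 if x1 == x2 else 1.0),
    "f_b": (("x2", "x3"), lambda x2, x3: 2.0 if x2 == x3 else 1.0),
}

# Bipartite adjacency: factor node -> the variable nodes it is linked to.
links = {name: args for name, (args, _) in factors.items()}
print(links)  # {'f_a': ('x1', 'x2'), 'f_b': ('x2', 'x3')}

def unnormalized(assignment):
    # Product of all factors, each evaluated at its linked variables only.
    result = 1.0
    for args, f in factors.values():
        result *= f(*(assignment[v] for v in args))
    return result

print(unnormalized({"x1": 0, "x2": 0, "x3": 1}))  # 2.0 * 1.0 = 2.0
```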