Function Repository Resource:

BlockEntropy


Calculate the joint information entropy of a data matrix

Contributed by: Bradley Klee

ResourceFunction["BlockEntropy"][data]

gives the joint information entropy of data.

ResourceFunction["BlockEntropy"][list,blocksize]

computes the entropy after partitioning list by blocksize.

ResourceFunction["BlockEntropy"][ , macrofun]

groups matrix rows by distinctness of the values macrofun[rowᵢ].

ResourceFunction["BlockEntropy"][ , macrofun,probfun]

allows custom conditional probabilities to be supplied through probfun.

ResourceFunction["BlockEntropy"][k,]

gives the base k joint information entropy.

Details

ResourceFunction["BlockEntropy"] is similar to Entropy in that it also computes values from a sum of the form -∑piLog[pi].
The entropy measurement starts with grouping rows by Total.
Typically, pi is a frequentist probability of obtaining the ith distinct element by randomly sampling the input list.
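For example, the frequentist sum can be written out directly. The following is a minimal sketch (with a hypothetical helper name, not the resource function's actual source) that reproduces Entropy when each matrix row is treated as one element:

frequentistEntropy[data_] := With[
  {p = Last /@ Tally[data]/Length[data]},
  (* p lists the frequentist probabilities pᵢ of the distinct elements *)
  -Total[p Log[p]]]

N[frequentistEntropy[{{0, 1}, {0, 1}, {1, 0}}]] == N[Entropy[{{0, 1}, {0, 1}, {1, 0}}]]
(* True *)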
ResourceFunction["BlockEntropy"] expects a matrix structure, either of the form data={row1,row2,}, or implicitly as Partition[list,blocksize] ={row1,row2,}.
Additionally, BlockEntropy allows for coarse-graining of rows to macrostates using the function macrofun (default: Total).
Two rows rowⱼ and rowₖ, with macrostates mⱼ = macrofun[rowⱼ] and mₖ = macrofun[rowₖ], are considered distinct if mⱼ ≠ mₖ.
Likewise, two atomistic states d and d′, taken from the xth position of rowⱼ and the yth position of rowₖ, are considered distinct if d ≠ d′.
Let 𝒟 be the set of unique atomistic states and ℳ the set of distinct values in the range of macrofun. The joint entropy is then calculated by the double sum S = -∑ᵢ,ⱼ ℙ(dⱼ|mᵢ) ℙ(mᵢ) Log[ℙ(dⱼ|mᵢ) ℙ(mᵢ)], where the indices i and j range over the elements of ℳ and 𝒟, respectively.
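Schematically, the double sum can be evaluated from any pair of probability tables. In the sketch below, pM and pCond are hypothetical Associations holding ℙ(mᵢ) and the ℙ(dⱼ|mᵢ) for each mᵢ; zero-probability terms are dropped, following the usual 0 Log[0] = 0 convention:

jointEntropy[pM_Association, pCond_Association] := -Total[Flatten[
    KeyValueMap[
     Function[{m, pm},
      (* terms ℙ(dⱼ|mᵢ) ℙ(mᵢ) Log[ℙ(dⱼ|mᵢ) ℙ(mᵢ)] for one macrostate *)
      With[{p = Select[pm*Values[pCond[m]], Positive]}, p Log[p]]],
     pM]]]

jointEntropy[<|1 -> 1/2, 2 -> 1/2|>,
 <|1 -> <|a -> 1|>, 2 -> <|a -> 1/2, b -> 1/2|>|>]
(* equal to (3/2) Log[2] *)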
The frequentist probability ℙ(mᵢ), mᵢ ∈ ℳ, equals the count of rows rowⱼ satisfying mᵢ = macrofun[rowⱼ], divided by the total number of rows.
The conditional probability ℙ(dⱼ|mᵢ), mᵢ ∈ ℳ, dⱼ ∈ 𝒟, is not necessarily frequentist, but is often assumed or constructed to be so.
The optional function probfun takes mᵢ ∈ ℳ as its first argument and blocksize as its second argument. It should return an Association or List of conditional probabilities ℙ(dⱼ|mᵢ).
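For instance, a probfun that ignores both arguments and declares a single certain conditional outcome collapses the double sum to the macrostate entropy -∑ᵢ ℙ(mᵢ) Log[ℙ(mᵢ)]; the same construction appears in the Scope examples below:

ResourceFunction["BlockEntropy"][{0, 1, 1, 0, 1, 0}, 2, Total,
 Function[{m, blocksize}, {1}]]

As the comparison under Scope shows, this agrees with Entropy[Total /@ Partition[{0, 1, 1, 0, 1, 0}, 2]].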
When probfun is not set, either "Micro" or "Macro" conditional probabilities can be specified by setting the "Statistics" option.
The default "Micro" statistics obtains 𝒟 by taking a Union over row elements. The conditional probabilities are then calculated as ℙ(dj❘mi)=∑ℙ(djrowk)ℙ(rowk)=∑ℙ(djrowk) /N, where the sum includes every possible rowk written over elements 𝒟 and satisfying mi=macrofun[rowk]. The factor 1/ℙ(rowk)=N equals the Count of such rows, all assumed equiprobable.
Traditional "Macro" statistics require that 𝒟 contains all possible rows of the correct length whose elements are drawn from the complete set of row elements using Tuples. The conditional probabilities are then calculated as ℙ(dj❘mi)=0 if mimacrofun[dj] or if mi=macrofun[dj] as ℙ(dj❘mi)=1 /N, with N equal to the count of atomistic row states dk satisfying mi=macrofun[dk].

Examples

Basic Examples (4) 

Calculate the BlockEntropy of a binary matrix:

In[1]:=
ResourceFunction["BlockEntropy"][{{0, 1}, {0, 1}, {0, 1}}]
Out[1]=

The BlockEntropy value does not change when elements are permuted within a block:

In[2]:=
ResourceFunction["BlockEntropy"][{{0, 1}, {0, 1}, {1, 0}}]
Out[2]=

Calculate the ternary BlockEntropy of a binary list:

In[3]:=
ResourceFunction["BlockEntropy"][{0, 0, 1, 0, 1, 1, 1, 1, 1}, 3]
Out[3]=

Calculate the same value from a matrix input:

In[4]:=
ResourceFunction["BlockEntropy"][{{0, 0, 1}, {0, 1, 1}, {1, 1, 1}}]
Out[4]=

The BlockEntropy value also does not change when whole blocks are permuted:

In[5]:=
ResourceFunction["BlockEntropy"][{{0, 1, 1}, {1, 1, 1}, {0, 0, 1}}]
Out[5]=

Calculate the ternary entropy of a binary list assuming isentropic macrostates:

In[6]:=
ResourceFunction["BlockEntropy"][{0, 1, 1, 1, 1, 1, 0, 0, 1}, 3, Entropy@# &]
Out[6]=

Changing the aggregation function can change the BlockEntropy value:

In[7]:=
ResourceFunction["BlockEntropy"][{0, 1, 1, 1, 1, 1, 0, 0, 1}, 3]
Out[7]=

Two different aggregation functions can have the same asymptotics:

In[8]:=
ListPlot[Transpose[Abs[List[
      ResourceFunction["BlockEntropy"][SeedRandom[Floor[10^6 Pi^#]];
       RandomInteger[1, {#, 2}], Total],
      ResourceFunction["BlockEntropy"][SeedRandom[Floor[10^6 Pi^#]];
       RandomInteger[1, {#, 2}], Entropy]
      ]] & /@ Range[100]], PlotRange -> All]
Out[8]=

Scope (2) 

BlockEntropy accepts lists of symbols from an arbitrary alphabet:

In[9]:=
ResourceFunction["BlockEntropy"][{x, y, a, x, a, b, c, x}, 4]
Out[9]=

Treat each macrostate as equiprobable:

In[10]:=
ResourceFunction["BlockEntropy"][CompoundExpression[
  SeedRandom[144234], RandomInteger[5, 100]
  ], 5, Total, Function[{any}, {1}]]
Out[10]=

Compare with a simpler computation:

In[11]:=
With[{rand = CompoundExpression[
    SeedRandom[144234],
    RandomInteger[5, 100]]},
 Equal[ResourceFunction["BlockEntropy"][rand, 5, Total,
   Function[{any}, {1}]],
  Entropy[Total /@ Partition[rand, 5]]]]
Out[11]=

Options (4) 

BlockEntropy provides different built-in statistics:

In[12]:=
Map[# -> ResourceFunction["BlockEntropy"][SeedRandom[324313];
     RandomInteger[2, 12], 3,
     "Statistics" -> #] &,
  {"Micro", "Macro"}] // Column
Out[12]=

Setting the "Statistics" option to "Macro" can make the entropy value easier to predict than with normal Entropy:

In[13]:=
N[Subtract[ResourceFunction["BlockEntropy"][
   RandomInteger[2, 10^6 + 2], 3,
   "Statistics" -> "Macro"], Log[3^3]]]
Out[13]=

Alternative statistics can sometimes have opposite behaviors:

In[14]:=
ListLinePlot[Outer[N@ResourceFunction["BlockEntropy"][
     Table[PadLeft[{1}, #2], 10],
     "Statistics" -> #1] &,
  {"Micro", "Macro"}, Range[2, 10], 1]]
Out[14]=

Compare with the theoretical result:

In[15]:=
ListLinePlot[Transpose[{
     -1/# Log[1/#] - (# - 1)/# Log[(# - 1)/#],
     -Log[1/#]} & /@ Range[2, 10]]]
Out[15]=

Applications (1) 

Measure the entropy time series of a cellular automaton:

In[16]:=
ImageRotate[GraphicsColumn[{ListLinePlot[{
       0 & /@ #,
       Entropy[Partition[#, 5]] & /@ #,
       ResourceFunction["BlockEntropy"][#, 5, "Statistics" -> "Micro"] & /@ #,
       ResourceFunction["BlockEntropy"][#, 5, "Statistics" -> "Macro"] & /@ #
       }, PlotStyle -> {
        Directive[Thickness[0.005], Black],
        Directive[Thickness[0.0025], Gray],
        Automatic, Automatic},
      Ticks -> False, Axes -> False,
      ImageSize -> {400, Automatic},
      AspectRatio -> 1/3], ArrayPlot[
      Transpose[#], Frame -> None,
      ImageSize -> {400, Automatic}]
     } &@CellularAutomaton[30,
    CenterArray[{1, 1, 1, 1, 1}, 100], 100]], -Pi/2]
Out[16]=

Properties and Relations (3) 

The Entropy of a list equals the BlockEntropy of a column matrix with the same elements:

In[17]:=
Equal[ResourceFunction["BlockEntropy"][List /@ #], Entropy[#]] &@
 RandomInteger[10, 100]
Out[17]=

BlockEntropy can return the same value as a naive combination of Partition and Entropy:

In[18]:=
With[{rand = RandomInteger[2, 25]},
 Equal[Entropy[Partition[rand, 5]],
  ResourceFunction["BlockEntropy"][rand, 5, Identity,
   "Statistics" -> "Macro"]]]
Out[18]=

The BlockEntropy of a constant density binary matrix equals the Entropy of any row:

In[19]:=
Equal[ResourceFunction["BlockEntropy"][RandomSample /@ Table[#, 5]], Entropy[#]] &@RandomInteger[1, 10]
Out[19]=

But this relation does not hold in general:

In[20]:=
SeedRandom[1234];
Equal[ResourceFunction["BlockEntropy"][RandomSample /@ Table[#, 5]],
   Entropy[#]] &@RandomInteger[2, 5]
Out[20]=

Possible Issues (3) 

If an incommensurate block length is chosen, some values will be dropped:

In[21]:=
ResourceFunction["BlockEntropy"][{12, 3, 4, 5, 6, 6, 5, 7}, 3]
Out[21]=

The message refers to the underlying behavior of Partition:

In[22]:=
Partition[{12, 3, 4, 5, 6, 6, 5, 7}, 3]
Out[22]=

Thus the same BlockEntropy value may be computed as:

In[23]:=
ResourceFunction["BlockEntropy"][{{12, 3, 4}, {5, 6, 6}}]
Out[23]=

Neat Examples (2) 

Classify all length 4 ternary lists according to their binary BlockEntropy:

In[24]:=
Grid[KeyValueMap[{#1, Row[#2, Spacer[1]]} &,
  Map[ArrayPlot[Partition[#, 2], Mesh -> True, ImageSize -> 30,
     ColorRules -> {a -> Red, b -> Green, c -> Blue}] &,
   KeySort[
    GroupBy[Tuples[{a, b, c}, 4], ResourceFunction["BlockEntropy"][#, 2, Entropy] &],
    NumericalOrder], {2}]], Frame -> All, FrameStyle -> LightGray,
 Alignment -> Left, Spacings -> {3, 1}]
Out[24]=

Test the randomness of the binary digits of π:

In[25]:=
N[Subtract[
    ResourceFunction["BlockEntropy"][
     First[RealDigits[Pi, 2, #*100000]],
     #, Total, "Statistics" -> "Macro"], Log[2^#]]] & /@ Range[2, 5]
Out[25]=

Publisher

Brad Klee

Version History

  • 1.0.0 – 28 September 2022
