claude shannon, entrophy, id3 alogrithm, information gain.

ID3 Algorithm:
ID3 invented by Ross Quinlan used to generate a tree from a dataset. its precursor of C4.5 algorithm, used in machine learning and NLP
domains.
it begins with the original set S  as the root node. IT calculates the entrophy  H(S) (or information gain IG(S)) before, it iterates through
every unused attributes of the set S on each iteration of algorithm. It n selects the smallest entrophy (or largest information gain)value.
the set S is then spit by the selected attribute (e.g age is less than 50, age is between 50 and 100, age is greater than 100 ) to produce
subsets of the data. The algorithm continues to recurse on each subset, considering only attributes never selected before. 
INformation Gain in decision tree:
it is used as mutual information, which is the expected value of the kullback leibler divergance of the univariate probability distribution
of one variable from the conditional distribution of its variable given the other one.
In general terms, the expected  information gain is the change in the information entrophy H from a prior state that takes some information as given:
IG(T,a)=H(T)-H(T|a).
Entrophy:
it is defined as the average amount of data produced by the stocastic(random) source of data.
Claude Shannon:(april 30 1916 ---- 2001)
american mathematician, electrical engineer and cryprtographer known as father of information theory. noted for having founded information theory with a
landmark paper, A Mathematical Theory of communication, published on 1948, he is equally known for founding 
 digital circuit thoey in 1937.

Comments

Popular posts from this blog

Aspect Ratio

Why People attend Higher education institutions

Quarantine affecting different people