Artificial Intelligence Blog

“Church: a language for generative models”

November 15, 2012 in Deep Belief Networks, General ML, Graphical Models by hundalhh | Permalink

In “Church: a language for generative models“, Goodman, Mansinghka, Roy, Bonawitz, and Tenenbaum introduce the probabilistic computer language “Church, a universal language for describing stochastic generative processes. Church is based on the Lisp model of lambda calculus, containing a pure Lisp as its deterministic subset.” There will be a workshop on probabilistic programming at NIPS (which I first read about at the blog Statistical Modeling, Causal Inference, and Social Science). Here is a cool tutorial.

Interior Point Methods for Large Scale SVMs

November 14, 2012 in Optimization, Support Vector Machines by hundalhh | Permalink

Jacek Gondzio has some nice slides (2009) on interior point methods for large scale support vector machines. He focuses on the primal dual logarithmic barrier methods (see e.g. Wright 1987) for softer classification. Great explanations, diagrams, and numerical results are provided. Kristian Woodsend wrote his 2009 Ph.D. thesis on the same subject. Woodsend applies the interior point methods and low rank approximations of the SVM kernel to reduce the computational cost to order $n$ where $n$ is the number of data points. He compares this approach to active set methods, gradient projection algorithms, and cutting-plane algorithms and concludes with numerical results.

“Active Learning Ranking from Pairwise Preferences with Almost Optimal Query Complexity”

November 12, 2012 in Reinforcement Learning by hundalhh | Permalink

In “Active Learning Ranking from Pairwise Preferences with Almost Optimal Query Complexity“, Ailon (2011) presents a method for ordering a set based on noisy comparisons of elements. He proves that his adaptive method approximates the NP Hard optimal solution (Alon 2006) to within $(1+\epsilon)$ times the minimum error after $f(\epsilon^{-1}, n)$ comparisons where $f$ is a polynomial. Ailon builds on the work of Kenyon-Mathieu and Schudy (2007) who also developed a polynomial time algorithm for approximate ranking. The Kenyon-Mathieu and Schudy algorithm was not query efficient.

“Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data”

November 10, 2012 in Graphical Models by hundalhh | Permalink

In the seminal paper “Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data“, Lafferty, McCallum, Pereira (2001) introduce a very popular type of Markov random field for segmentation. Conditional Random Fields (CRFs) are used in many fields including machine translation, parsing, genetics, and transmission codes. They are a non-directed version of Hidden Markov Networks. The paper describes conditional random fields, provides an iterative method to estimate the parameters of the CRF, and reports experimental comparisons between CRFs, hidden Markov models, and maximum entropy Markov models.

Magic is Turing Complete

November 8, 2012 in Complexity, Games by hundalhh | Permalink

I don’t play Magic, but if I did, this would be a cool thing to read.

Clifford Algebras, Neural Nets, and the Brain

November 6, 2012 in Neural Nets by hundalhh | Permalink

In “Back Propagation in a Clifford Algebra“, Pearson and Bisset (1992) discuss the interesting problem of replacing real numbers in a neural net by elements from a Clifford Algebra. They replace the sigmoid activation function with

$$f(x) = x / ( c + |x|/r)$$

where $c$ and $r$ are real positive constants and $|x|$ is the norm of the element in the Clifford algebra. Deriving the back propagation algorithm is straight forward otherwise.

In a later article “Neural Networks in the Clifford Domain“, the same authors explain how complex numbers, quaternions, or Clifford algebras can convey electrical phase information between neurons which might be necessary for more accurate representation of how the brain actually works. Also, it is possible that signal processing and image processing applications may benefit. They write,

“It is conjectured that complex valued feed-forward networks will be able to achieve better representations of problems that map into the complex domain naturally (such as phase and frequency information) than if the components of the signal were split up and presented to a real valued feed forward network.”

Some links from Nuit Blanche

November 4, 2012 in Uncategorized by hundalhh | Permalink

I was reading “Around the Blogs in 80 Summer Hours” at Nuit Blanche and these two links caught my eye:

Topological Data Analysis

Implementation: BiLinear Modelling Via Augmented Lagrange Multipliers (BLAM)

“Algorithms for Inﬁnitely Many-Armed Bandits”

November 2, 2012 in Multi-Armed Bandit Problem by hundalhh | Permalink

In “Algorithms for Infinitely Many-Armed Bandits”, Wang, Audibert, and Munos (2008) describe some algorithms for the multi-armed bandit problem when a large number or infinitely many arms are available. Their algorithms are designed for the situation where all rewards are contained in $[0,1]$ and “the probability that a new arm is $\epsilon$-optimal is of order $\epsilon^\beta$”. More precisely, there exist real numbers $c, \mu^*,$ and $\beta$ such that the expected value of an unexplored arm $\mu$ obeys

$$P(\mu^* – \mu < \epsilon) < c \epsilon^\beta.$$

They prove that the total regret is at most of order $n^{\beta/(\beta+1)}\log^2(n)$ if $\beta > 1$ and $\log^2(n)\sqrt{n}$ otherwise. Additionally, they prove a lower bound of order $n^{\beta / (\beta + 1)}$ for any algorithm. Their algorithm applies UCB to the first $n^{\beta/(\beta+1)}$ arms. (The case where $\beta = 1$ was explored in “Bandit problems with infinitely many arms” by Berry, Chen, Zame, Heath, and Shepp (1997).)

K medians & K-medoids

October 30, 2012 in Clustering by hundalhh | Permalink

K-medians and K-medoids are a variants of K-means clustering algorithm. The both minimize the sum of the distances from the centroids to the points, but the K-modoids algorithm requires that the center of each cluster be a sample point. Both problems can be solved using EM type methods.

“Active Learning Literature Survey”

October 28, 2012 in Reinforcement Learning by hundalhh | Permalink

In “Active Learning Literature Survey“, Burr Settles (2010) reviews uncertainty sampling (Lewis and Gale, 1994), margin sampling (Scheffer et al., 2001), entropy sampling, optimal experimental design, query-by-committee (Seung et al., 1992), query-by-boosting, query-by-bagging, expected model change, expected error reduction, expected information gain, variance reduction, and density weighted methods. He then comments on theoretical and empirical performance of these methods, practical considerations, and related areas of machine learning including: semi-supervised learning, reinforcement learning, and compression.

« Older entries § Newer entries »