Abstract :
Discriminative learning methods have achieved many successes in speech and language processing during the past decades. Discriminative learning of generative models is a typical optimization problem, where efficient optimization methods play a critical role. For many widely used statistical models, discriminative learning normally leads to nonconvex optimization problems. In this article we used three representative examples to showcase how to use a proper convex relaxation method to convert discriminative learning of HMMs and MMMs into standard convex optimization problem so that it can be solved effectively and efficiently even for large-scale statistical models. We believe convex optimization will continue to play important role in discriminative learning of other statistical models in other application domains, such as statistical machine translation, computer vision, biometrics, and informatics.
Keywords :
hidden Markov models; learning (artificial intelligence); optimisation; parameter estimation; HMM; MMM; biometrics; computer vision; convex relaxation method; discriminative learning methods; large-scale statistical models; nonconvex optimization problems; parameter estimation; speech-language processing; statistical machine translation; Application software; Hidden Markov models; Large-scale systems; Learning systems; Machine learning; Natural languages; Optimization methods; Parameter estimation; Relaxation methods; Speech processing;