• DocumentCode
    2705259
  • Title

    An Evaluation of Lattice Scoring using a Smoothed Estimate of Word Accuracy

  • Author

    Omar, Mohamed K. ; Mangu, Lidia

  • Author_Institution
    IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    4
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    This paper describes a novel approach for estimating the best hypothesis of a given word lattice, the hypothesis lattice, using another word lattice, the reference lattice, and its application to large vocabulary automatic speech recognition. This approach selects the word sequence in the hypothesis lattice which maximizes a smoothed estimate of the word accuracy with respect to the reference lattice. It is shown in the paper that two algorithms similar to the Viterbi and the forward-backward algorithms can be used to estimate the hypothesis which approximately maximizes this objective function. We present in this paper two setups to test the performance of our approach. In the first setup, only one lattice is used as both the reference and the hypothesis lattices. In the second setup, two lattices produced by different systems are used to calculate the best hypothesis. In each setup, we test our approach on two Arabic broadcast news speech recognition tasks. Compared to the baseline results, up to 2.1% relative improvement in the word error rate (WER) is obtained by using our approach.
  • Keywords
    error statistics; natural language processing; speech recognition; Arabic broadcast news speech recognition tasks; Viterbi algorithm; forward-backward algorithms; given word lattice; hypothesis lattice; large vocabulary automatic speech recognition; lattice scoring; objective function; reference lattice; smoothed estimation; word accuracy; word error rate; word sequence; Automatic speech recognition; Broadcasting; Decoding; Error analysis; Lattices; Minimization methods; Speech recognition; Testing; Viterbi algorithm; Vocabulary; ASR decoding; Lattice scoring; confusion network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.367278
  • Filename
    4218309