In section 4.1 (Normalized Stupid Backoff) of "One Billion Word Benchmark ..." by Chelba, Mikolov, et al., it states:
... the Stupid Backoff model does not generate normalized probabilities. For the purpose of computing perplexity ... values output by the model were normalized over the entire LM vocabulary.
Assuming a bigram LM, the obvious way to interpret this is: score all single words using the unigram MLE, score pairs of words using the standard backoff formula, sum the unigram and bigram scores to obtain a total Σ, and then divide each (unigram or bigram) score by Σ. Is this the correct interpretation of the quote?
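To make my interpretation concrete, here is a small sketch of what I have in mind for a fixed history: unseen bigrams back off to the unigram MLE (weighted by the usual backoff factor, which I'm assuming to be 0.4), and the normalizer Σ is the sum of all scores over the vocabulary. The corpus, function names, and the choice of α here are my own illustrative assumptions, not taken from the paper.

```python
from collections import Counter

# Toy corpus; counts and vocabulary are purely illustrative.
tokens = "the cat sat on the mat the cat ran".split()
unigrams = Counter(tokens)
bigrams = Counter(zip(tokens, tokens[1:]))
N = len(tokens)
vocab = set(tokens)
ALPHA = 0.4  # assumed backoff weight

def sb_score(prev, word):
    """Unnormalized Stupid Backoff score for a bigram model."""
    if (prev, word) in bigrams:
        # Seen bigram: relative frequency given the history.
        return bigrams[(prev, word)] / unigrams[prev]
    # Unseen bigram: back off to the weighted unigram MLE.
    return ALPHA * unigrams[word] / N

def normalized_score(prev, word):
    """Divide the raw score by Σ, the sum of scores over the vocabulary."""
    sigma = sum(sb_score(prev, w) for w in vocab)
    return sb_score(prev, word) / sigma
```

By construction, for a fixed history the normalized scores sum to 1 over the vocabulary, which is what I understand "normalized over the entire LM vocabulary" to require for computing perplexity.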