Speeding up Viterbi execution

Question

Speeding up Viterbi execution

1.2k Views Asked by dev_nut At 16 August 2025 at 00:11

I have implemented a naive Viterbi algorithm for a HMM based signal that I observe. The execution time of the decoder seems to be too slow for my requirement. I'm now trying to understand how to speed up the execution. When I'm to determine the computational complexity of the algorithm, I see that it's mentioned as having complexity of t * s^2, where t is number of observations and s is the number of states. I have roughly, 3500 states, and 100 observations. Each state has 729 emission probabilities.

I also see that it's mentioned in this paper, that Viterbi decoding is exponential in this paper (2^k, where k is the constraint length). I'm not understanding this explanation that well. But, I believe if Viterbi is exponential with regarding to states, then surely the algorithm would be very slow, even though I parallelize it.

My questions are:

What is the complexity of the Viterbi algorithm/decoding? Are they the same in both instances?
How do I make modifications to the Viterbi algorithm to speed it up?

EDITS: I'm implementing it in C++, hoping to modify it and parallelize it in the future.

Original Q&A

There are 2 best solutions below

Stand with Gaza On 05 April 2019 at 20:40

The complexity of the Viterbi algorithm is O(t|S|^{n+1}), where n is the order of the Markov model (1 in your case), t the length of the observation sequence and |S| the number of hidden states. So in your case you have a O(t) with an enormous constant factor of 3500^2 = 12 250 000. You would be best advised to either try and reduce the number of hidden states in your model or investigate using stochastic algorithms which can run much faster but aren't guaranteed to always return the absolutely best result.

**Peter de Rivaz** · Accepted Answer

To answer the first question:

If you have t observations, s states, and each state has e emission probabilities, then the trellis will have t*s nodes, and to evaluate each node will cost e operations, so the overall complexity of a naive implementation will be O(t*s*e).

Viterbi decoding can be used to decode sequences of bits. If the observation depends on the previous k binary bits, then the number of different sequences of k bits is 2^k. This represents the number s of states you would need to do a stream decoding (each state represents one configuration of previous bits). However, this is unlikely to be relevant to you.

The paper you link to describes an approach which reduces the number of nodes which need to be expanded. This will not improve the worst case complexity, but may well give significant improvements in typical use depending on the nature of your specific problem.

Speeding up Viterbi execution

There are 2 best solutions below

Related Questions in ALGORITHM

Related Questions in PERFORMANCE

Related Questions in MACHINE-LEARNING

Trending Questions

Popular # Hahtags

Popular Questions