Why is this stratified Bayesian logit so slow in Turing.jl?

77 Views Asked by Emiliano Isaza Villamizar At 03 September 2023 at 15:28

I'm trying to build a Bayesian logistic regression that gives me insight into how the number of payments made by a person relates to the probability of default. I created synthetic data to check that the model can be fit before using real data:

using Turing

is_bad_pay(x) = x > 70 ? 1 : 0

function simulate_payment_frequency(Pₙ, N)
    P = rand(DiscreteUniform(1, Pₙ), N)
    avg_delay = rand(LogNormal(2, 2), N)
    payers = [is_bad_pay(x) for x in avg_delay]
    payers, P, avg_delay
end

I have three variables:

A binary variable that is 1 if the person is a bad payer (average delay above 70 days) and 0 otherwise (payers)
The number of payments (P)
The average number of days between payments (avg_delay)

The model is the following:

# fit simulated data
@model function freq_pay(prob_pay, number_payments, avg_delay)
    Num_payments = length(unique(number_payments))
    # hierarchical by quantile of number of payments
    αₛ ~ filldist(Normal(60, 10), Num_payments)
    βₛ ~ filldist(Normal(0, 1), Num_payments)
    v = @. logistic(αₛ[number_payments] + βₛ[number_payments] * avg_delay)
    # logistic regression
    for i ∈ eachindex(v)
        prob_pay[i] ~ Bernoulli(v[i])
    end
end

I first tried simulating people with at most two payments, and it works well:

synthetic_payers = simulate_payment_frequency(2, 100)
s1_1 = sample(freq_pay(synthetic_payers[1], synthetic_payers[2], synthetic_payers[3]), NUTS(), 100)

However, when I allow more than 3 payments, sampling never finishes:

synthetic_payers = simulate_payment_frequency(4, 100)
s1_2 = sample(freq_pay(synthetic_payers[1], synthetic_payers[2], synthetic_payers[3]), NUTS(), 100)

What am I doing wrong?


Emiliano Isaza Villamizar On 07 September 2023 at 23:33

OK, so ~ Bernoulli with a probability computed through logistic is numerically unstable, so convergence isn't assured. Turing provides a distribution that is parameterized directly by the logit, BernoulliLogit. This is the answer:

    # v[i] must be the linear predictor (log-odds), without logistic applied
    for i ∈ eachindex(v)
        prob_pay[i] ~ BernoulliLogit(v[i])
    end
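For context, here is a minimal sketch of how that loop might sit inside the full model. It assumes the priors and stratification from the question are kept unchanged; the names freq_pay_logit, n_strata and η are made up for this illustration. The one substantive difference from the question's model is that logistic is no longer applied, because BernoulliLogit expects log-odds rather than a probability.

```julia
using Turing

# Sketch only: same priors and strata as the question's model, but the linear
# predictor is handed to BernoulliLogit directly (no logistic call).
@model function freq_pay_logit(prob_pay, number_payments, avg_delay)
    # Hypothetical name; assumes every payment count 1..n_strata occurs in the data
    n_strata = length(unique(number_payments))
    αₛ ~ filldist(Normal(60, 10), n_strata)
    βₛ ~ filldist(Normal(0, 1), n_strata)
    # η lives on the log-odds scale; BernoulliLogit applies the link itself
    η = αₛ[number_payments] .+ βₛ[number_payments] .* avg_delay
    for i in eachindex(η)
        prob_pay[i] ~ BernoulliLogit(η[i])
    end
end
```

It would be sampled the same way as in the question, for example sample(freq_pay_logit(synthetic_payers[1], synthetic_payers[2], synthetic_payers[3]), NUTS(), 100).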

Just stay calm and use BernoulliLogit. (By the way, the Turing.jl documentation is wrong: it uses Bernoulli for the logit example.)
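To make the instability concrete, here is a small plain-Julia sketch (no packages needed) of what can happen at a log-odds value of 60, which is the prior mean of αₛ in the question. The helper names are hypothetical, and this only illustrates the general floating-point issue; it is not a description of how Turing implements BernoulliLogit internally.

```julia
# Log-likelihood of observing y = 0 at log-odds η, computed two ways.
# All helper names are hypothetical, written for this illustration only.
naive_logistic(η) = 1 / (1 + exp(-η))
naive_loglik0(η) = log(1 - naive_logistic(η))            # log(1 - p) on the probability scale
stable_loglik0(η) = -(max(η, 0) + log1p(exp(-abs(η))))   # -log(1 + exp(η)) on the logit scale

η = 60.0                      # prior mean of αₛ in the question
println(naive_logistic(η))    # the probability rounds to exactly 1.0 in Float64
println(naive_loglik0(η))     # so log(1 - p) becomes -Inf
println(stable_loglik0(η))    # the logit-scale form stays finite, about -60
```

Once the probability rounds to 1, a single observation with y = 0 drives the log density to -Inf and the gradient carries no useful information, which is a plausible reason for the sampler stalling. Working on the logit scale avoids that rounding step.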
