Benchmark

non-incremental/QF_NRA/20240407-pPDA-Chiari-Pontiggia-Winkler/certificate_hints/schelling_POMC_hints.smt2

Benchmarks generated by the tool POMC to analyze the termination
probability of a probabilistic Pushdown Automaton (pPDA). The benchmark
contains the system of polynomial equations encoding the probability,
and constrains the solution to be in the interval [l, l + eps], where
l is a lower bound computed by value iteration, and eps is a small
tolerance.
The pPDA has been automatically generated from the following
probabilistic program, adapted from [1]:
u4 p;
main() {
  bool res;
  p = 0 {2 : 6} 1 {1 : 6} 2 {1 : 6} 3 {1 : 6} 4;
  query alice(res);
  // res is which cafè they have gone to
}

alice(bool &x) {
    bool prior_alice, bob_choice;
    // sample according to the prior (0.55)
    prior_alice = 1 {11 : 20} 0;
    query bob(bob_choice);
    observe prior_alice == bob_choice;
    x = prior_alice;
}

bob(bool &y) {
    bool prior_bob, recurse, alice_choice;
    // sample according to the prior (0.55)
    prior_bob = 1 {11 : 20} 0;
    recurse = 1 {p : 10} 0;
    if (recurse) {
        query alice(alice_choice);
        observe prior_bob == alice_choice;
    } else {}
    y = prior_bob;
}
Publications:
Stuhlmueller, Andreas, and Noah D. Goodman. "Reasoning about reasoning
by nested conditioning: Modeling theory of mind with probabilistic
programs." Cognitive Systems Research 28 (2014): 80-99.
Benchmark
Size63414
Compressed Size9836
License Creative Commons Attribution 4.0 International (CC-BY-4.0)
Categoryindustrial
First Occurrence2024-07-22
Generated ByMichele Chiari, Francesco Pontiggia
Generated On2024-02-02 00:00:00
GeneratorPOMC
Dolmen OK1
strict Dolmen OK1
check-sat calls1
Query 1
Status unknown
Inferred Status None
Size 63406
Compressed Size9812
Max. Term Depth7
Asserts 812
Declared Functions0
Declared Constants424
Declared Sorts 0
Defined Functions0
Defined Recursive Functions 0
Defined Sorts0
Constants0
Declared Datatypes0

Symbols

=140 let4 /260 +174
*502 <=52 >=620

Evaluations

Evaluation Rating Solver Variant Result Wallclock CPU Time
SMT-COMP 2024 1.00 (0/5) cvc5 cvc5 unknown ❌ 1201.72154 1200.63716
SMTInterpol SMTInterpol unknown ❌ 0.82244 1.89787
SMT-RAT SMT-RAT unknown ❌ 1201.71749 1201.13087
Yices2 Yices2 unknown ❌ 1201.22655 1201.00462
Z3alpha Z3-alpha unknown ❌ 1201.72011 1201.11978
SMT-COMP 2025 0.83 (1/6) cvc5 cvc5 unknown ❌ 1201.77543 1201.05633
SMTInterpol SMTInterpol unknown ❌ 0.81558 1.64628
SMT-RAT SMT-RAT unknown ❌ 1201.26327 1200.95909
Yices2 Yices2 sat ✅ 0.63529 0.51222
Z3alpha Z3-alpha unknown ❌ 1201.00475 4802.12970
Z3 Z3-alpha-base unknown ❌ 1201.27322 1201.05968
z3siri-base unknown ❌ 1201.29388 1201.05076