[Decider] Backward reasoning

cosmo · March 27, 2022, 1:52pm

Backward reasoning: principle

This decider is not related to a specific family of the zoology.

We take the same idea of “Backward reasoning” as in Heiner Marxen - Attacking the Busy Beaver 5.

We can afford to run the decider at depth 300 (instead of 5 in Heiner Marxen - Attacking the Busy Beaver 5) thanks to having more computing power and the relatively small subset of machines on which it was run.

More at https://github.com/bbchallenge/bbchallenge-proofs/blob/build-latex-pdf/deciders/correctness-deciders.pdf.

Decider examples and counterexamples

Examples: #4,843,748; #58,360,621; #2,009,846. More in the tests.
Counterexamples: See tests.

Decider code

https://github.com/bbchallenge/bbchallenge-deciders/tree/main/decider-backward-reasoning

Decider tests

https://github.com/bbchallenge/bbchallenge-deciders/blob/main/decider-backward-reasoning/main_test.go

Results

09/10/22: the bug found by @TonyG was fixed and the results of the two implementations (@TonyG's and @cosmo’s) agree, hence the decider has been re-applied officially: it decided 2,035,598 machines. To this date, there are 1,538,624 machines left to decide!
02/10/22: new bug found by @TonyG. The results of backward reasoning have been unapplied; the decider must now be debugged and is required to pass the new [Debate & Vote] Deciders’ validation process in order to be applied. We are back to 3,574,222 machines to decide.
27/06/22 ~~04/06/22~~: a bug fix was proposed for the bug outlined by @sligocki, https://github.com/bbchallenge/bbchallenge-deciders/pull/7. A bug fix was proposed for the bug outlined by @Iijil, https://github.com/bbchallenge/bbchallenge-deciders/pull/8.
The decider was run up to depth 50 and decided 2,035,610 machines (among the remaining ones after [Decider] Cyclers and [Decider] Translated cyclers) of which IDs in the seed DB are available at http://docs.bbchallenge.org/bb5_decided_indexes/backward-reasoning-run-78af853d8968-depth-50-minIndex-0-maxIndex-88664064.
13/05/22: the bug found by @modderme123 and @atticuscull has been corrected. The decider was applied on the index of undecided machines and it decided 2,243,340 machines available at http://docs.bbchallenge.org/bb5_decided_indexes/backward-reasoning-run-370b72380a8d-depth-300-minIndex-0-maxIndex-88664064. An implementation bug was outlined by @sligocki, see Skelet #10 Backtracking?.
09/02/22: 1,253,418 / 3,577,204 machines were decided thanks to backward reasoning (only applied on undecided machines after cyclers and translated cyclers deciders were applied). Indices of decided machines available here: TODO See Backtracking Bug · Issue #1 · bbchallenge/bbchallenge-deciders · GitHub

Decider correctness

Proof available at https://github.com/bbchallenge/bbchallenge-proofs/blob/build-latex-pdf/deciders/correctness-deciders.pdf.

cosmo · April 1, 2022, 8:51am

@atticuscull, @pg132, @DunDunDunDone, and @modderme123 have found a bug in the current implementation of this decider: Backtracking Bug · Issue #1 · bbchallenge/bbchallenge-deciders · GitHub

Hence it is back to work in progress! The machines have been removed from the undecided index.

atticuscull · April 2, 2022, 8:46am

I wrote some python code that implements backward reasoning. It uses a breadth first approach instead of the depth first approach that the current code uses. I’m not sure if there is an advantage either way - perhaps the depth first approach is better optimized for space by limiting the tree depth as opposed to the number of steps you run it for, which is what the current code does.

Anyway, here is my code:

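from collections import deque


# Minimal FIFO queue providing the interface used by the search below
class Queue:
    def __init__(self):
        self._items = deque()

    def enqueue(self, item):
        self._items.append(item)

    def dequeue(self):
        return self._items.popleft()

    def is_empty(self):
        return not self._items


# Single instance of a state transition, e.g. 1LB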
class Transition:
    def __init__(self, write=None, direction=None, state=None):
        if direction is not None and direction != "?":
            self.direction = direction
            self.write = write
            self.state = state
        else:
            self.direction, self.write, self.state = (None,)*3


# Just a wrapper class for a state table essentially
class TuringMachine:
    def __init__(self, states, state_table):
        # list of states
        self.states = states
        # A dictionary containing transitions
        self.state_table = state_table
        # What state + bit combinations directly precede that state
        self.predecessors = {s: set() for s in states}
        # None is the halting state
        self.predecessors[None] = set()
        for s in states:
            zero, one = state_table[s]
            self.predecessors[zero.state].add("0" + s)
            self.predecessors[one.state].add("1" + s)

    def get_predecessors(self, state):
        return self.predecessors[state]


# Storing a tape along with the head and current state
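class Configuration:
    def __init__(self, tape, state, head):
        self.state = state
        self.tape = tape
        self.head = head

    # Compare by value so that set membership detects repeated configurations
    def __eq__(self, other):
        return (self.tape, self.state, self.head) == \
            (other.tape, other.state, other.head)

    def __hash__(self):
        return hash((self.tape, self.state, self.head))

    def __str__(self):
        tape = list(" " + "  ".join(list(self.tape)) + " ")
        tape[3 * self.head] = "["
        tape[3 * self.head + 2] = "]"
        return self.state + ":  " + "".join(tape)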


def breadth_first_backtrack(turing_machine, step_limit):
    # Which configuration should we backtrack from first
    process_order = Queue()
    # Which configurations have we seen already
    discovered_configs = set()

    # Add all the halting configurations to the process,
    # seeding the search process
    for halt_pred in turing_machine.get_predecessors(None):
        config = Configuration(halt_pred[0], halt_pred[1:], 0)
        discovered_configs.add(config)
        process_order.enqueue(config)

    steps = 0
    # While there are still branches that aren't contradictions,
    # and while we haven't reached the step limit,
    # process the next leaf
    while (not process_order.is_empty()) and steps < step_limit:
        steps += 1
        config = process_order.dequeue()
        possible_preds = turing_machine.get_predecessors(config.state)
        for pred in possible_preds:
            transition = turing_machine.state_table[pred[1:]][int(pred[0])]
            write = transition.write
            direction = transition.direction
            # If we're at the left of the tape, add to the left
            # R corresponds to left because we are going backwards
            if config.head == 0 and direction == "R":
                new_config = Configuration(pred[0] + config.tape, pred[1:], 0)
            # If we're at the right of the tape, add to the right
            elif config.head == len(config.tape) - 1 and direction == "L":
                new_config = Configuration(config.tape + pred[0], pred[1:],
                                           config.head + 1)
            # Checking that config.tape[config.head - 1] == write makes sure
            # that after transitioning from that state, the bit we write
            # agrees with the tape that we currently have
            elif direction == "R" and config.tape[config.head - 1] == write:
                new_tape = list(config.tape)
                new_tape[config.head - 1] = pred[0]
                new_tape = "".join(new_tape)
                new_config = Configuration(new_tape, pred[1:], config.head - 1)
            elif direction == "L" and config.tape[config.head + 1] == write:
                new_tape = list(config.tape)
                new_tape[config.head + 1] = pred[0]
                new_tape = "".join(new_tape)
                new_config = Configuration(new_tape, pred[1:], config.head + 1)
            else:
                # If the transition does not agree with the proposed path
                # towards halting, move on to the next possible predecessor
                continue
            # If we have not seen this configuration before, add it to the
            # set of open branches to search
            if new_config not in discovered_configs:
                discovered_configs.add(new_config)
                process_order.enqueue(new_config)

    # If there is nothing more to process then we know the while loop broke
    # because all of the branches lead to contradictions.
    if process_order.is_empty():
        return "contradiction"
    # Otherwise, the while loop broke because of the step limit
    return "no contradiction found in " + str(step_limit) + " steps"


STATES = ["A", "B", "C", "D", "E"]


def check_from_data(data_string, step_limit=1000):
    # Convert the string into a state table
    ST = {STATES[i//6]: [Transition(*data_string[i + j: i + j + 3])
          for j in [0, 3]] for i in range(0, 30, 6)}

    TM = TuringMachine(STATES, ST)

    print(breadth_first_backtrack(TM, step_limit))


check_from_data("1RB0LD1LC0RE???1LD1LA1LD1RA0RA")   # 55897188
check_from_data("1RB???1LC???0LD0LC1RD0RE1LB0RE")   # 27879939
check_from_data("1RB???1LC0RB1LD0LC0LE1RE0RA1RB")   # 2713328
check_from_data("1RB1LC1RC1RB1RD0LE1LA1LD???0LA")   # BB5 Champion

cosmo · April 2, 2022, 10:12am

Thank you very much and welcome to the forum!

I will dive into your code as soon as I get a chance. If you have time, would you mind writing, in plain English, a small argument of why your code is correct?

On a side note, I like your compact strings such as 1RB0LD1LC0RE???1LD1LA1LD1RA0RA for machine representations. It’s probably better and more straightforward than the current base-64 representation.

The only annoyance is that ? is not URL safe… We could consider getting rid of base-64 representation for something like the strings you use. I opened a discussion on that point on github: Get rid of base-64 representations for something simpler? · Issue #6 · bbchallenge/bbchallenge · GitHub

modderme123 · April 3, 2022, 7:09pm

I believe this algorithm has a flaw…

First, it will return invalid results for machines that use the starting state of A with tape head 0. (I realize that any machine that halts within 1000 steps would’ve already been ruled out by the seed, but this might be worth noting.) For example, consider ??????1LC0RB1LD0LC0LE1RE0RA1RB which immediately halts but this function returns a contradiction.
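The first point can be illustrated with a small check. This is only a sketch: the function name `matches_initial` is made up for illustration, and it assumes the convention used in this thread that cells outside the local window are unconstrained, so any window in the start state that shows only zeroes is compatible with the initial configuration.

```python
def matches_initial(state, tape, start_state="A"):
    """True if a local configuration is compatible with the initial one.

    tape is the exposed window as a string of '0'/'1'. Cells outside the
    window are unconstrained, so an all-zero window in the start state
    can always be extended to the blank initial tape.
    """
    return state == start_state and set(tape) <= {"0"}


# The seed configuration of a machine whose (A, 0) transition is undefined
# is state A reading a single 0, which is compatible with the start.
print(matches_initial("A", "0"))    # True
print(matches_initial("A", "010"))  # False
```

A backward search that meets such a configuration cannot conclude that the machine never halts.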

Additionally, and probably more importantly, I believe the algorithm should need to remove items from the discovered_configs set or else two branches that both point towards the same end state may both find a contradiction with the opposite branch (however, I have not confirmed this theory)

modderme123 · April 4, 2022, 8:43pm

Update: @atticuscull explained that my second critique was wrong, we are just backtracking forever while there are still possible previous states (but if there are no possible previous states to all states that could reach the halting state, then the machine runs forever)

cosmo · April 4, 2022, 9:40pm

Thanks to both of you for looking at this in more detail.

I need to wrap my head around it soon.

atticuscull · April 6, 2022, 4:05am

Here is an argument of why the algorithm works. This has in mind the go implementation, as opposed to the above python, seeing as the go code is more relevant. Not much is changed besides that a stack is used instead of a queue, making it depth first, and some of the variable names have changed. Let me know if this is not what you asked for; I’m happy to try and shorten it, or fill in other details that I breezed over.

The algorithm works backwards from all halting states to see if there is no possible way to reach a halting state. Abstractly, we are exploring a digraph where each node is a local configuration, that is a small section of the tape where no assumptions are made about the rest of the tape (put another way, each node represents all possible extensions of the local section of the tape). This differs from the usual notation wherein the rest of the unseen tape is assumed to be zeroes. The goal of the algorithm is to show that this digraph of predecessor configurations is self-contained and finite by exploring it entirely. Technically we also need to show that the digraph does not contain a local configuration of state A and only zeroes on the tape; however, I do not imagine that we will run the decider for 47 million steps on every undecided machine, and we already know that the digraph doesn’t contain a local configuration of the starting state if it has size less than 47 million.

Concretely, the algorithm implements a depth first search on the digraph (limited by a search depth). stack contains the current leaves of our search. seenStates stores the configurations we’ve seen already so that we don’t repeat unnecessary computation. That is, once we’ve found all of the predecessors of a configuration, we don’t need to look at it again.

To start the search, we add all of the local configurations consisting of a state s along with a single cell of exposed tape with bit b, where (s b) is an undefined transition. Next, as long as there are possible local configurations that we haven’t checked yet, we take the next one from the stack, and find all of the predecessor configurations. We do that by looking at all state + bit combinations (s’, b’) that transition to the current state s, and for each one checking that the place where that transition moved from (i.e. if (s’ b’) moves right then we look at the cell to the left of the current head), has the same bit as what the transition (s’ b’) writes. If the current head is at the far left (respectively right) and (s’ b’) moves right (respectively left) then the local configuration expands in scope; this is always acceptable as we made no assumptions about the tape outside of the scope of the local configuration.

For example, if we are in state A with tape 000[1]101, then we want any transition rules that transition to A and either
1. move right and write 0, or
2. move left and write 1
Then, for each pair (s’ b’) satisfying this, we add
1. s’: 00[b’]1101 or
2. s’: 0001[b’]01
to stack respectively, if it is not in seenStates.
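The example above can be sketched in Python. This is a minimal illustration and not the Go decider: the function name `predecessors`, the dictionary layout `(state, read_bit) -> (write_bit, move, next_state)` and the toy transitions are all assumptions made for the sketch.

```python
def predecessors(machine, state, tape, head):
    """Return all local configurations that reach (state, tape, head) in one step.

    machine maps (state, read_bit) -> (write_bit, move, next_state), with
    undefined transitions simply absent. tape is a string of '0'/'1' and
    head an index into it; cells outside tape are unconstrained.
    """
    result = []
    for (s, b), (write, move, nxt) in machine.items():
        if nxt != state:
            continue
        # Going backwards, the head came from the side opposite to the move.
        prev = head - 1 if move == "R" else head + 1
        if prev < 0:
            # Outside the window on the left: the window grows by one cell.
            result.append((s, b + tape, 0))
        elif prev >= len(tape):
            # Outside the window on the right: the window grows by one cell.
            result.append((s, tape + b, len(tape)))
        elif tape[prev] == write:
            # The written bit agrees with the tape: undo the write.
            result.append((s, tape[:prev] + b + tape[prev + 1:], prev))
        # Otherwise the written bit contradicts the tape: no predecessor.
    return result


# Hypothetical transitions into state A, checked against A: 000[1]101
toy = {
    ("B", "1"): ("0", "R", "A"),  # moves right and writes 0: accepted
    ("C", "0"): ("1", "L", "A"),  # moves left and writes 1: accepted
    ("D", "0"): ("1", "R", "A"),  # moves right but writes 1: rejected
}
for config in predecessors(toy, "A", "0001101", 3):
    print(config)
```

With these toy transitions the two accepted predecessors are B: 00[1]1101 and C: 0001[0]01, matching cases 1 and 2 of the example.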

If this process terminates in 47 million steps or fewer, we know that the halting states are in a different connected component in the configuration graph than the starting configuration (A with all 0s), and so it is not possible for the machine to halt.

cosmo · May 13, 2022, 10:58am

Thanks to the work of @modderme123 and @atticuscull, the decider was debugged and successfully applied!

~~It decided 2,243,340 machines~~

~~The index file of undecided machines has been updated: https://github.com/bbchallenge/bbchallenge-undecided-index~~

cosmo · June 4, 2022, 9:54pm

A new bug has been found and hopefully fixed: Debuggin Shawn's bug by tcosmo · Pull Request #7 · bbchallenge/bbchallenge-deciders · GitHub

Don’t hesitate to review the code!

This voids the results claimed in the previous message: the new result is 2,035,610 machines decided!

The index file of undecided machines has been updated: https://github.com/bbchallenge/bbchallenge-undecided-index

TonyG · October 2, 2022, 3:33pm

I have been getting into this project by attempting to reproduce existing results. I have succeeded with Cyclers and Translated Cyclers (in fact I found several hundred new Translated Cyclers at runtimes of up to 3,037,316 steps, but that is for another post). But I have been unable to reproduce your Backward Reasoning results. Specifically, I have been unable to use Backward Reasoning to prove that the following machines don’t halt:

3102389, 4864409, 5367930, 5367934, 5725043, 5852433,
5852438, 6932470, 7998506, 7998514, 8660490, 9430801

So I was able to decide only 2,035,598 machines instead of 2,035,610. All the above machines are flagged in the database as having been decided by Backward Reasoning. But for each of them, I can exhibit an explicit starting configuration that halts after 500 steps. This surely means that they can’t be decided by Backward Reasoning at a depth less than 500?

For instance, if machine 4864409 starts in state A with tape 001100001101, then it runs for 500 steps before halting. You can check this for yourself at https://bbchallenge.org/4864409. I can supply starting configurations for the other eleven machines if you like.
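A claim of this kind can be checked with a small forward simulator. The sketch below is not TonyG's code: it assumes the compact transition strings used earlier in this thread, treats every cell outside the supplied tape as 0, and takes the head position as a parameter since the post does not state it. The transition table of machine 4864409 is not reproduced in this thread, so the usage lines run a hypothetical two-state machine instead.

```python
def steps_until_halt(machine_code, tape, head, state="A", max_steps=1000):
    """Run forward from an arbitrary configuration.

    Returns the number of transitions executed before an undefined
    transition ('???') is reached, or None if the machine is still
    running after max_steps transitions.
    """
    cells = {i: int(c) for i, c in enumerate(tape)}
    for step in range(max_steps):
        # Each state owns 6 characters: 3 for reading 0, 3 for reading 1.
        i = 6 * "ABCDE".index(state) + 3 * cells.get(head, 0)
        write, move, nxt = machine_code[i:i + 3]
        if move == "?":
            return step
        cells[head] = int(write)
        head += 1 if move == "R" else -1
        state = nxt
    return None


toy = "1RB???0LA1RB"  # hypothetical machine: A0=1RB, A1=???, B0=0LA, B1=1RB
print(steps_until_halt(toy, "01", head=0))  # 3
print(steps_until_halt("1RA1RA", "0", head=0, max_steps=50))  # None
```

Step-counting conventions differ on whether the halting transition itself counts as a step, so a result may be off by one against other tools.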

Am I missing something obvious? Or has the database perhaps miscategorised these machines?

cosmo · October 2, 2022, 5:40pm

Wow. Thank you very much, it seems that you have uncovered yet another bug in my implementation of [Decider] Backward reasoning. We are going to un-apply the results, solve this bug and then require that the decider goes through the new verification process which was not in place at the time of this decider.

Thank you very much for this contribution.

Concerning Translated Cyclers, we are aware that there are a few thousand machines left in the DB when you run the decider with higher parameters (for instance see Skelet machines that are Translated Cyclers) but we did not apply the results officially because it concerned so few machines in proportion to the number of undecided machines.

Update: backward reasoning results have been un-applied on the official github repository of the undecided index, we are back to 3,574,222 machines on https://bbchallenge.org/.

TonyG · October 2, 2022, 8:21pm

I wonder if it might have something to do with the detection of already-visited configs. I don’t do this in my code. What do you see if you disable this feature?

cosmo · October 3, 2022, 12:20pm

Yes I believe you are right, in the function that transforms a configuration to a string, I forgot to put a separator between the tape, the state and the head position:

func (c ConfigurationAndDepth) toString() string {
	return c.tapeToString() + string(c.State) + strconv.Itoa(c.Head)
}

This makes an ambiguous encoding instead of 1-to-1… This feature was there only for optimization but it seems to be at the root of the bug (thanks to @Iijil for finding out the details).

EDIT: another, more important reason for the bug is that the function that encodes the tape does not include a marker for the origin:

func (c ConfigurationAndDepth) tapeToString() string {
	tapeString := ""

	// Assuming that Configurations are correctly
	// propagating the invariant that any pos
	// between min and max are well defined
	for pos := c.minTapePos; pos <= c.maxTapePos; pos += 1 {
		tapeString += string('0' + c.Tape[pos])
	}
	return tapeString
}

Hence another source of ambiguity…
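To make the missing-origin problem concrete, here is a small Python re-creation of the key function. It is an illustration and not the Go code itself: the dictionary-based tape, the function names, and the assumption that the head is stored as an absolute position are mine, and `fixed_key` is just one possible unambiguous encoding.

```python
def buggy_key(tape, state, head):
    """Mimics the encoding above: window contents, state, head, no origin."""
    lo, hi = min(tape), max(tape)
    tape_string = "".join(str(tape[pos]) for pos in range(lo, hi + 1))
    return tape_string + state + str(head)


def fixed_key(tape, state, head):
    """One possible unambiguous key: record the window origin and use separators."""
    lo, hi = min(tape), max(tape)
    tape_string = "".join(str(tape[pos]) for pos in range(lo, hi + 1))
    return f"{lo}|{tape_string}|{state}|{head}"


# Two different local configurations, both in state C with the head at 1:
a = {0: 1, 1: 0, 2: 1}  # cells 0..2 hold 101, head over the middle cell
b = {1: 1, 2: 0, 3: 1}  # cells 1..3 hold 101, head over the leftmost cell

print(buggy_key(a, "C", 1) == buggy_key(b, "C", 1))  # True: collision
print(fixed_key(a, "C", 1) == fixed_key(b, "C", 1))  # False
```

With the colliding key, the second configuration would be treated as already visited and its branch silently dropped.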

EDIT2: The irony is that it's been noted that this mechanism of configuration detection is useless in practice because a well-formed backward reasoning tree cannot have any loops: there can't be a loop within the same branch by construction, and there can't be a loop across different branches, otherwise the same configuration would have two different successors, which is not possible (thanks @Iijil again for outlining this).

So the piece of buggy code was trying to solve a non-problem and should be safely removed in order to fix the bug.

TonyG · October 3, 2022, 6:28pm

My BackwardReasoning source code is now available on GitHub - TonyGuil/bbchallenge: Busy Beaver Challenge code and resultss, along with Deciders and Verifiers for Cyclers and TranslatedCyclers. I have several worries:

Being new to github, I don’t even know if this repository is accessible by the general public. Please let me know if you can’t access it.
github won’t let me upload files bigger than 100Mb. So interested parties will have to generate various files (e.g. TranslatedCyclers.dvf) themselves.
I use g++ for MinGW, which might already cause problems for many of you. But also I use the boost thread library, and I don’t know whether I am free to upload boost header files and libraries to github. What is common practice here?

TonyG · October 3, 2022, 7:01pm

I have this great simplifying idea: that instead of trying to prove correctness of Deciders, the Deciders emit Verification Data for each machine that they decide. For many deciders, this Verification Data can be verified much more easily, making the acceptance process much more straightforward. For instance, it takes 12 hours to find all the new Translated Cyclers up to 4,000,000 steps, but only 0.4 seconds to verify the resulting 568 machines. See the README file GitHub - TonyGuil/bbchallenge: Busy Beaver Challenge code and resultss for more info.

My problem is: where do I post this, so that everybody sees it? This site is not easy to navigate! I did look at your Discord link, but the traffic is so high, anything posted there is likely to be lost in the noise.

cosmo · October 3, 2022, 9:28pm

Thank you very much for sharing your work!

Being new to github, I don’t even know if this repository is accessible by the general public. Please let me know if you can’t access it.

Yes, thank you we can access it well!

github won’t let me upload files bigger than 100Mb. So interested parties will have to generate various files (e.g. TranslatedCyclers.dvf) themselves.

Yes, we tend to avoid storing very large files on github as it would be inefficient for version control. For instance, we store the official files of decided indices here: http://docs.bbchallenge.org/bb5_decided_indexes/.

I use g++ for MinGW, which might already cause problems for many of you. But also I use the boost thread library, and I don’t know whether I am free to upload boost header files and libraries to github. What is common practice here?

Thank you for letting us know! Hopefully, we should be able to compile the code by downloading boost independently. We have little experience with cpp deciders because it's not the language that has been most commonly used around here so far (but we are language agnostic so any language is welcome) and I personally don’t know how cpp projects handle their boost dependency on github. I believe it should be ok as is.

I have this great simplifying idea: that instead of trying to prove correctness of Deciders, the Deciders emit Verification Data for each machine that they decide.

Yes! I really like this idea. Pushed to the extreme, a similar idea I had was that deciders would output parameters for templated formal proofs written in Coq or Lean, which are languages to verify mathematical proofs. That way, we would not even have to trust the code of the decider but just use Coq/Lean to verify the outputted proofs.

My problem is: where do I post this, so that everybody sees it? This site is not easy to navigate!

I am not sure I understand your question, what would you like to publish? Any file < 4Mb can be published on the forum (use the upload button), what is the size of your file?

I did look at your Discord link, but the traffic is so high, anything posted there is likely to be lost in the noise.

Yes, discord is better suited for live conversation and we use this forum for keeping records.

TonyG · October 3, 2022, 9:51pm

I appreciate your positive feedback very much. Thank you! Here is my idea in a bit more detail:

Many of the (present and future) Deciders expend a lot of time and energy in finding non-halting machines, but it would be a simple matter for them to output the results of their search in a way that is easily verifiable. This means that we don’t have to prove that the deciders do their job of finding every possible candidate; we just have to verify that their proposed candidates are valid. Then we can remove them from the list of undecided machines.

As an easy example, the Cyclers decider can output the number of steps to the initial match, and the number of steps to the final match. Then it is easy for a Verifier to take this data and verify that the machine in question really does generate the same configuration at those two points.
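A verifier of this kind might look like the following sketch. It is not the verifier from the repository mentioned above: it assumes the compact transition strings used earlier in this thread, a start in state A on a blank tape, and a plain `(first_step, second_step)` pair rather than the actual .dvf layout. The function names and the toy machine are made up for illustration.

```python
def configuration_after(machine_code, steps):
    """Return (state, head, positions of the 1s) after `steps` transitions
    from the blank tape, or None if the machine halts before that."""
    cells, head, state = {}, 0, "A"
    for _ in range(steps):
        # Each state owns 6 characters: 3 for reading 0, 3 for reading 1.
        i = 6 * "ABCDE".index(state) + 3 * cells.get(head, 0)
        write, move, nxt = machine_code[i:i + 3]
        if move == "?":
            return None
        cells[head] = int(write)
        head += 1 if move == "R" else -1
        state = nxt
    ones = frozenset(pos for pos, bit in cells.items() if bit)
    return state, head, ones


def verify_cycler(machine_code, first_step, second_step):
    """Check that the machine is in the same configuration at both step counts."""
    first = configuration_after(machine_code, first_step)
    second = configuration_after(machine_code, second_step)
    return first_step < second_step and first is not None and first == second


toy = "0RB???0LA???"  # hypothetical machine bouncing between two cells
print(verify_cycler(toy, 0, 2))  # True
print(verify_cycler(toy, 0, 1))  # False
```

Since the machine is deterministic, an exact repetition of state, head position and tape contents means it repeats forever, which is why the check only needs the two step counts.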

Only slightly more complicated is the TranslatedCyclers verification data: as well as the above, it must specify the tape head positions at the initial and final points, and the length of the matched tape.

I have implemented both of these Verifiers. You can find the details in the README files of the various sub-directories in my repository. As a vindication of my approach, I ran the TranslatedCyclers Decider up to 4,000,000 steps, and found 566 new Translated Cyclers. This took about twelve hours on my laptop. But verifying these 566 machines takes just 0.4 seconds. And anybody can write a Verifier – it’s easy! So there is no reason not to accept these 566 new Translated Cyclers as having been decided. The relevant dvf (Decider Verification File) is TC_4000000.dvf in the TranslatedCyclers sub-directory.

This idea is not applicable to the BackwardReasoning or HaltingSegment Deciders, at least not as far as I can see. But it will come in useful for the various Bouncers. It will mean that there is no reason to reject a Decider just because it decides less than 10% of the undecided machines. As long as the Verifier can be trusted, we don’t need to know anything at all about the Decider, so we can just accept anything that can be verified.

The reason I expressed uncertainty about where to post all this was that it seems too general to post under [Decider] Backward Reasoning, but I couldn’t find a more suitable discussion branch for it.

cosmo · October 4, 2022, 9:11am

Thank you very much for the details on your idea. I think it is very promising, especially once we can plug this verification data into formal Lean/Coq proofs to get an absolute certificate of non-halting for the machine.

At the moment, we have put in place a Deciders’ validation process which requires that a decider decides at least 10% of the currently undecided machines. This is why, under this “rule”, we would not officially apply the new 566 Translated Cyclers or other Translated Cyclers found in Skelet machines that are Translated Cyclers: it's such a tiny fraction of the millions of machines that are left. Note that we have been consistent with this rule since the 5,647 machines of Closed state/transition cluster were not applied because they do not go above the 10% threshold.

The incentive behind this rule is to focus on new deciders, hopefully conceptually simple, that will slash the number of machines by a lot at once.

I believe that we could shift to your idea (i.e. immediately considering that a machine is decided given its verification data) once we get somewhere under 100k machines left.

Long story short: I think your idea is very certainly the way to go once we have reached a “small” set of undecided machines (in my mind, around 100,000).

The reason I expressed uncertainty about where to post all this was that it seems too general to post under [Decider] Backward Reasoning, but I couldn’t find a more suitable discussion branch for it.

I believe that you could post it here if it is about Backward Reasoning or in [Decider] Translated cyclers if it is about translated cyclers.

Thank you again!

cosmo · October 9, 2022, 6:32pm

@TonyG I have debugged my implementation of backward reasoning and it decides the same 2,035,598 machines as yours.

Hence, we satisfy the requirements of [Debate & Vote] Deciders’ validation process and I have re-applied the decider officially: there remain 1,538,624 machines to decide!

When time permits I will check that my decider can output the exact same dvf as you did for extra safety.

Thank you again for this contribution!