LMCompletionCondition

`dendron.conditions.lm_completion_condition.LMCompletionCondition`

Bases: ConditionNode

A completion condition node uses a causal language model to evaluate the relative likelihood of several different completions of a prompt, returning SUCCESS or FAILURE using a user-provided function that selects a status based on the most likely completion.

This node tends to run quickly and give useful answers, but be aware of the perils of "surface form competition": probability mass is split among different strings that express the same answer, so the most likely completion is not always the best one. See the paper by Holtzman et al. (https://arxiv.org/abs/2104.08315).
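To make the caveat concrete, the sketch below compares picking the highest raw log probability with a rescoring in the spirit of the domain-conditional PMI proposed by Holtzman et al., which subtracts the log probability each completion receives from a generic prefix alone. The function names and all numbers are hypothetical and chosen only for illustration. The node's `tick` takes the raw argmax and does not apply this correction, so a correction like this would have to be done outside the node.

```python
def raw_choice(cond_logprobs):
    """Pick the completion with the highest conditional log probability."""
    return max(cond_logprobs, key=cond_logprobs.get)


def pmi_choice(cond_logprobs, prior_logprobs):
    """Pick the completion whose log probability rises most over its prior.

    score(c) = log P(c | prompt) - log P(c | generic prefix)
    """
    scores = {c: cond_logprobs[c] - prior_logprobs[c] for c in cond_logprobs}
    return max(scores, key=scores.get)


# Hypothetical scores: "no" is simply a very common string, so it wins on
# raw probability even though the prompt raised the probability of "yes".
cond = {"yes": -1.6, "no": -1.2}    # log P(completion | prompt)
prior = {"yes": -2.5, "no": -0.9}   # log P(completion | generic prefix)

raw = raw_choice(cond)
corrected = pmi_choice(cond, prior)
```

Here the raw choice is "no", while the rescored choice is "yes", because the prompt made "yes" more likely than its baseline and made "no" less likely.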

This node is based on the HFLM library, and will download the model that you specify by name. This can take a long time and/or use a lot of storage, depending on the model you name.

There are enough configuration options for this type of node that the options have been placed in two dataclass config objects: `HFLMConfig` for the model and `LMCompletionConfig` for the node. See the documentation for those objects to learn about the many options available to you.

Parameters:

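Name	Type	Description	Default
`model_cfg`	`HFLMConfig`	The configuration for the language model this node uses. Its `model_name` identifies the model to load.	required
`node_cfg`	`LMCompletionConfig`	The configuration for this node, including its name and the blackboard keys it reads and writes.	required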
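As a rough illustration of what the node expects to find on the blackboard, the sketch below uses stand-in types rather than Dendron's own classes. `SketchNodeConfig` mirrors only the attributes that `__init__` reads from `node_cfg`; its default key names, the `Status` enum, and `is_affirmative` are assumptions made for this example, not part of the library.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    """Stand-in for dendron's NodeStatus (assumption for this sketch)."""
    SUCCESS = "success"
    FAILURE = "failure"


@dataclass
class SketchNodeConfig:
    """Stand-in holding the attributes the constructor reads from node_cfg.

    The default key names are hypothetical.
    """
    node_name: str
    input_key: str = "in"
    completions_key: str = "completions_in"
    success_fn_key: str = "success_fn"
    logprobs_out_key: str = "probs_out"


def is_affirmative(completion: str) -> Status:
    """Success predicate: map the most likely completion to a status."""
    answer = completion.strip().lower()
    return Status.SUCCESS if answer == "yes" else Status.FAILURE


cfg = SketchNodeConfig(node_name="is_greeting")

# A plain dict stands in for the tree's blackboard. The node reads these
# three entries on every tick and writes its scores under logprobs_out_key.
blackboard = {
    cfg.input_key: "Is 'hello there' a greeting? Answer:",
    cfg.completions_key: [" yes", " no"],
    cfg.success_fn_key: is_affirmative,
}
```

The success predicate receives only the winning completion string, so any normalization (whitespace, case) has to happen inside it.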

Source code in src/dendron/conditions/lm_completion_condition.py

class LMCompletionCondition(ConditionNode):
    """
    A completion condition node uses a causal language model to evaluate
    the relative likelihood of several different completions of a prompt,
    returning `SUCCESS` or `FAILURE` using a user-provided function that
    selects a status based on the most likely completion.

    This node tends to run quickly and gives useful answers, but if you
    use this node you should be aware of the perils of "surface form
    competition", documented in the paper by Holtzman et al. (see 
    https://arxiv.org/abs/2104.08315).

    This node is based on the HFLM library, and will
    download the model that you specify by name. This can take a long 
    time and/or use a lot of storage, depending on the model you name.

    There are enough configuration options for this type of node that
    the options have been placed in two dataclass config objects, one
    for the model and one for the node. See the documentation for those
    objects to learn about the many options available to you.

    Args:
        model_cfg (`HFLMConfig`):
            The configuration for the language model this node uses.
        node_cfg (`LMCompletionConfig`):
            The configuration for this node, including its name and
            the blackboard keys it reads and writes.
    """
    def __init__(self, model_cfg : HFLMConfig, node_cfg : LMCompletionConfig) -> None:
        super().__init__(node_cfg.node_name)
        self.input_key = node_cfg.input_key
        self.completions_key = node_cfg.completions_key
        self.success_fn_key = node_cfg.success_fn_key
        self.logprobs_out_key = node_cfg.logprobs_out_key

        self.node_config = node_cfg
        self.model_config = model_cfg

    def set_model(self, new_model) -> None:
        """
        Set a new model to use for scoring completions.
        """
        self.model = new_model

    def set_tree(self, tree : BehaviorTree) -> None:
        """
        Set the behavior tree for this node, which includes setting up the blackboard
        and registering the model configuration with the tree.

        Args:
            tree (BehaviorTree):
                The behavior tree this node belongs to.
        """
        self.tree = tree
        self.set_blackboard(tree.blackboard)
        tree.add_model(self.model_config)

    def tick(self) -> NodeStatus:
        """
        Execute a tick, consisting of the following steps:

        - Retrieve the input prefix from the blackboard.
        - Retrieve the list of completion options from the blackboard.
        - Retrieve the success predicate from the blackboard.
        - Compute the log probabilities of each completion.
        - Apply the success predicate to the completion with the highest
          log probability.
        - Return the status computed by the success predicate.

        If any of the above steps raises an exception, the traceback is
        printed and the node returns `FAILURE`. Otherwise the node returns
        whatever status the success predicate computed.
        """
        try:
            input_prefix = self.blackboard[self.input_key]
            completions = self.blackboard[self.completions_key]
            success_fn = self.blackboard[self.success_fn_key]

            log_probs = self.tree.get_model(self.model_config.model_name).loglikelihood(
                ((input_prefix, s) for s in completions), 
                disable_tqdm=True
            )

            self.blackboard[self.logprobs_out_key] = {completions[i] : log_probs[i] for i in range(len(log_probs))}

            best_completion = completions[argmax(log_probs)]

            return success_fn(best_completion)

        except Exception as ex:
            print(f"Exception in node {self.name}:")
            print(traceback.format_exc())
            return NodeStatus.FAILURE

`set_model(new_model)`

Set a new model to use for scoring completions.

Source code in src/dendron/conditions/lm_completion_condition.py

def set_model(self, new_model) -> None:
    """
    Set a new model to use for scoring completions.
    """
    self.model = new_model

`set_tree(tree)`

Set the behavior tree for this node, which includes setting up the blackboard and registering the model configuration with the tree.

Parameters:

Name	Type	Description	Default
`tree`	`BehaviorTree`	The behavior tree this node belongs to.	required

Source code in src/dendron/conditions/lm_completion_condition.py

def set_tree(self, tree : BehaviorTree) -> None:
    """
    Set the behavior tree for this node, which includes setting up the blackboard
    and registering the model configuration with the tree.

    Args:
        tree (BehaviorTree):
            The behavior tree this node belongs to.
    """
    self.tree = tree
    self.set_blackboard(tree.blackboard)
    tree.add_model(self.model_config)

`tick()`

Execute a tick, consisting of the following steps:

- Retrieve the input prefix from the blackboard.
- Retrieve the list of completion options from the blackboard.
- Retrieve the success predicate from the blackboard.
- Compute the log probabilities of each completion.
- Apply the success predicate to the completion with the highest log probability.
- Return the status computed by the success predicate.

If any of the above steps raises an exception, the traceback is printed and the node returns a status of FAILURE. Otherwise the node returns whatever status the success predicate computed, which may be SUCCESS or FAILURE.
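The tick steps above can be sketched without loading a real model. In this minimal sketch, `StubModel`, `sketch_tick`, the `Status` enum, and the blackboard key names are all hypothetical stand-ins. The stub returns fixed log probabilities so the control flow, including the failure path, can be followed end to end.

```python
import traceback
from enum import Enum


class Status(Enum):
    """Stand-in for dendron's NodeStatus (assumption for this sketch)."""
    SUCCESS = "success"
    FAILURE = "failure"


class StubModel:
    """Hypothetical scorer that returns fixed log probabilities."""

    def __init__(self, scores):
        self.scores = scores

    def loglikelihood(self, pairs):
        # One score per (prefix, completion) pair; the prefix is ignored.
        return [self.scores[completion] for _prefix, completion in pairs]


def sketch_tick(blackboard, model):
    """Follow the tick steps using a plain dict as the blackboard."""
    try:
        prefix = blackboard["in"]
        completions = blackboard["completions"]
        success_fn = blackboard["success_fn"]

        log_probs = model.loglikelihood([(prefix, c) for c in completions])
        blackboard["logprobs"] = dict(zip(completions, log_probs))

        # Index of the highest log probability (a pure-Python argmax).
        best = max(range(len(log_probs)), key=log_probs.__getitem__)
        return success_fn(completions[best])
    except Exception:
        # Any missing key or scoring error is reported and becomes FAILURE.
        traceback.print_exc()
        return Status.FAILURE


model = StubModel({"yes": -0.4, "no": -1.7})
board = {
    "in": "Is the sky blue? Answer:",
    "completions": ["yes", "no"],
    "success_fn": lambda c: Status.SUCCESS if c == "yes" else Status.FAILURE,
}

status = sketch_tick(board, model)   # predicate is applied to "yes"
missing = sketch_tick({}, model)     # KeyError is caught inside the tick
```

The returned status comes from the predicate, so a tick that runs without error can still yield FAILURE when the most likely completion is one the predicate rejects.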

Source code in src/dendron/conditions/lm_completion_condition.py

def tick(self) -> NodeStatus:
    """
    Execute a tick, consisting of the following steps:

    - Retrieve the input prefix from the blackboard.
    - Retrieve the list of completion options from the blackboard.
    - Retrieve the success predicate from the blackboard.
    - Compute the log probabilities of each completion.
    - Apply the success predicate to the completion with the highest
      log probability.
    - Return the status computed by the success predicate.

    If any of the above steps raises an exception, the traceback is
    printed and the node returns `FAILURE`. Otherwise the node returns
    whatever status the success predicate computed.
    """
    try:
        input_prefix = self.blackboard[self.input_key]
        completions = self.blackboard[self.completions_key]
        success_fn = self.blackboard[self.success_fn_key]

        log_probs = self.tree.get_model(self.model_config.model_name).loglikelihood(
            ((input_prefix, s) for s in completions), 
            disable_tqdm=True
        )

        self.blackboard[self.logprobs_out_key] = {completions[i] : log_probs[i] for i in range(len(log_probs))}

        best_completion = completions[argmax(log_probs)]

        return success_fn(best_completion)

    except Exception as ex:
        print(f"Exception in node {self.name}:")
        print(traceback.format_exc())
        return NodeStatus.FAILURE