Setup. In some languages, such as Chinese, sentences are written without spaces between words, and an important first step in language processing is segmenting sentences into words. Formally, we have a set of characters (in English, these would be letters). A sentence is a sequence of characters. We also have a dictionary $D$, which is the set of words (each word is itself a sequence of characters). Given a sentence, the goal is to split it into words from the dictionary. For example, if $D = \{i, cat, dog, see, sleep, the\}$, then given the sentence iseethecat, one possible segmentation is [i, see, the, cat].
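To make the search space concrete, here is a minimal sketch in Python; the index-based state representation and the name `successors` are illustrative choices, not part of the problem statement:

```python
# Illustrative sketch: a state is an index i into the sentence, and an action
# consumes a dictionary word matching the characters starting at i.  A
# segmentation exists iff some path of actions reaches i == len(sentence).
D = {"i", "cat", "dog", "see", "sleep", "the"}

def successors(sentence, i):
    """Yield (word, next_index) for every dictionary word that fits at position i."""
    for w in D:
        if sentence.startswith(w, i):
            yield w, i + len(w)

print(list(successors("iseethecat", 0)))  # [('i', 1)]
print(list(successors("iseethecat", 1)))  # [('see', 4)]
```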
  1. [3 points] Suppose maximizing our utility corresponds to minimizing the number of words in the output segmentation. Construct a deterministic state space model for this task. Which of the following search algorithms (DAG search, BFS, DFS, UCS, A*, Bellman-Ford) would produce a minimum-cost path for your model? (See the first sketch after this list.)
  2. [2 points] If our goal is instead to maximize the number of words in the segmentation, revise the state space model from above. Which of the search algorithms work now? (See the second sketch after this list.)
  3. [3 points] Instead of minimizing the number of words in the segmentation, suppose we had at our disposal a function $\text{Fluency}(w_1, w_2)$ that returns a number (positive or negative) representing how compatible $w_1$ and $w_2$ are as adjacent words (for example, $\text{Fluency}(an, cat)$ would be low and $\text{Fluency}(a, cat)$ would be high). Suppose our utility is the sum of the fluencies of adjacent words; formally, if the segmentation produces words $w_1, \dots, w_n$, then the utility is $\sum_{i=2}^n \text{Fluency}(w_{i-1}, w_i)$. Modify the state space model from above to find the most fluent segmentation. (See the third sketch after this list.)
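For problem 1, a small uniform cost search where every word costs 1 can serve as a sanity check; this is one possible model built on the successor function above, not the only correct one:

```python
import heapq

def min_words_segmentation(sentence, D):
    """UCS sketch: every edge (word) costs 1, so the cheapest path to the end
    of the sentence is a segmentation with the fewest words."""
    frontier = [(0, 0, [])]            # (cost so far, index, words so far)
    explored = {}                      # best cost at which each index was popped
    while frontier:
        cost, i, words = heapq.heappop(frontier)
        if i == len(sentence):
            return words               # first goal popped is minimum cost
        if explored.get(i, float("inf")) <= cost:
            continue
        explored[i] = cost
        for w in D:
            if sentence.startswith(w, i):
                heapq.heappush(frontier, (cost + 1, i + len(w), words + [w]))
    return None                        # no segmentation exists

print(min_words_segmentation("iseethecat", {"i", "cat", "see", "the"}))
# ['i', 'see', 'the', 'cat']
```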
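For problem 2, maximizing the word count amounts to minimizing with edge cost $-1$; since every action moves the index strictly forward, the state graph is a DAG, so a dynamic program over indices (equivalently, DAG search) handles the negative costs. A sketch under the same assumptions:

```python
def max_words_segmentation(sentence, D):
    """DP over the DAG of indices: best[i] holds a segmentation of
    sentence[i:] with the most words, or None if none exists."""
    n = len(sentence)
    best = [None] * (n + 1)
    best[n] = []                       # empty suffix: empty segmentation
    for i in range(n - 1, -1, -1):
        for w in D:
            if sentence.startswith(w, i) and best[i + len(w)] is not None:
                cand = [w] + best[i + len(w)]
                if best[i] is None or len(cand) > len(best[i]):
                    best[i] = cand
    return best[0]
```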
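For problem 3, the weight of the next edge depends on the previous word chosen, so the state must carry that word; the sketch below memoizes over (index, previous word). The `fluency` argument is a hypothetical stand-in for the problem's Fluency function, and letting the first word contribute nothing matches the sum starting at $i = 2$:

```python
from functools import lru_cache

def most_fluent(sentence, D, fluency):
    """Return (utility, words) for the most fluent segmentation, or None."""
    @lru_cache(maxsize=None)
    def best(i, prev):
        if i == len(sentence):
            return 0.0, ()             # goal: no words left, zero utility ahead
        result = None
        for w in D:
            if sentence.startswith(w, i):
                sub = best(i + len(w), w)
                if sub is None:
                    continue           # dead end past this word
                score = sub[0] + (fluency(prev, w) if prev is not None else 0.0)
                if result is None or score > result[0]:
                    result = (score, (w,) + sub[1])
        return result
    return best(0, None)
```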