* with unroll, the action space goes from 161 -> 127 * more reliable instrumentation * beam search is so op * beam bugfix