Temp

From Toward AGI

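Value Iteration, for estimating $\pi \approx \pi_*$

Parameter: a small threshold $\theta > 0$ determining accuracy of estimation

Initialize: $V(s) = 0$ for all $s \in \mathcal{S}$
Loop for $k = 0, 1, 2, \dots$:
    $\Delta \leftarrow 0$
    Loop for each $s \in \mathcal{S}$:
        $v \leftarrow V(s)$
        $V(s) \leftarrow \max_a \sum_{s', r} p(s', r \mid s, a)\,[r + \gamma V(s')]$
        $\Delta \leftarrow \max(\Delta, |v - V(s)|)$
Until $\Delta < \theta$

Output: a deterministic policy, $\pi \approx \pi_*$, such that $\pi(s) = \arg\max_a \sum_{s', r} p(s', r \mid s, a)\,[r + \gamma V(s')]$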
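
The procedure above can be sketched in Python as follows. This is a minimal illustration, not a reference implementation. The layout of the transition table `P` (a dict mapping state -> action -> list of `(prob, next_state, reward)` triples), the names `gamma` and `theta`, the convention that a terminal state has no actions, and the three-state toy problem are all assumptions made for the example.

```python
def value_iteration(P, gamma=0.9, theta=1e-8):
    """Value iteration with in-place sweeps.

    P[s][a] is a list of (prob, next_state, reward) triples.
    This layout is hypothetical, chosen only for this sketch.
    A state with an empty action dict is treated as terminal.
    """
    V = {s: 0.0 for s in P}  # initialize V(s) = 0 for every state

    def q(s, a):
        # Expected one-step return of action a in state s under the current V.
        return sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])

    while True:
        delta = 0.0
        for s in P:
            if not P[s]:  # terminal state: value stays 0
                continue
            v = V[s]
            V[s] = max(q(s, a) for a in P[s])  # Bellman optimality backup
            delta = max(delta, abs(v - V[s]))
        if delta < theta:  # largest change in this sweep is below the threshold
            break

    # Greedy deterministic policy with respect to the final V.
    policy = {s: max(P[s], key=lambda a: q(s, a)) for s in P if P[s]}
    return V, policy


# Toy deterministic MDP (assumed for illustration): A -> B -> T, reward 1 on exit.
mdp = {
    "A": {"stay": [(1.0, "A", 0.0)], "go": [(1.0, "B", 0.0)]},
    "B": {"stay": [(1.0, "B", 0.0)], "exit": [(1.0, "T", 1.0)]},
    "T": {},
}

V, policy = value_iteration(mdp, gamma=0.9)
print(policy)  # {'A': 'go', 'B': 'exit'}
```

Updating the values in place, rather than keeping a separate copy from the previous sweep, mirrors the single-array form of the pseudocode. The greedy policy is extracted only once, after the values have stopped changing by more than the threshold.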