Policy Gradient Methods for Reinforcement Learning with Function Approximation with Yishai Mansaur, Rich Sutton, and Satinder Singh, NIPS, 1999.
Here is another RL paper from my time at ATT:Approximate planning for factored POMDPs using belief state simplification with Satinder Singh, UAI, 1996.