Satinder Singh | Mdu Rohtak (original) (raw)
Uploads
Papers by Satinder Singh
Artificial Intelligence, 1998
Computational Intelligence, 2005
The TAC 2003 supply-chain game presented automated trading agents with a challenging strategic pr... more The TAC 2003 supply-chain game presented automated trading agents with a challenging strategic problem. Embedded within a high-dimensional stochastic environment was a pivotal strategic decision about initial procurement of components. Early evidence suggested that the entrant field was headed toward a self-destructive, mutually unprofitable equilibrium. Our agent, Deep Maize, introduced a preemptive strategy designed to neutralize aggressive procurement, perturbing the field to a more profitable equilibrium; it worked. Not only did preemption improve Deep Maize's profitability, it improved profitability for the whole field. Whereas it is perhaps counterintuitive that action designed to prevent others from achieving their goals actually helps them, strategic analysis employing an empirical game-theoretic methodology verifies and provides insight about this outcome.
Journal of Artificial Intelligence Research, 2002
Journal of Artificial Intelligence Research, 2002
1 INTRODUCTION Reinforcement learning provides a sound framework for credit assignment in un- kno... more 1 INTRODUCTION Reinforcement learning provides a sound framework for credit assignment in un- known stochastic dynamic environments. For Markov environments a variety of dierent reinforcement learning algorithmshave been devised to predict and control the ...
1 INTRODUCTION Reinforcement learning provides a sound framework for credit assignment in un- kno... more 1 INTRODUCTION Reinforcement learning provides a sound framework for credit assignment in un- known stochastic dynamic environments. For Markov environments a variety of dierent reinforcement learning algorithmshave been devised to predict and control the ...
Abstract. We present new algorithms for reinforcement learning and prove that they have polynomia... more Abstract. We present new algorithms for reinforcement learning and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decision processes. After observing that the number of actions required to approach the optimal ...
Abstract. We present new algorithms for reinforcement learning and prove that they have polynomia... more Abstract. We present new algorithms for reinforcement learning and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decision processes. After observing that the number of actions required to approach the optimal ...
Artificial Intelligence, 1998
Computational Intelligence, 2005
The TAC 2003 supply-chain game presented automated trading agents with a challenging strategic pr... more The TAC 2003 supply-chain game presented automated trading agents with a challenging strategic problem. Embedded within a high-dimensional stochastic environment was a pivotal strategic decision about initial procurement of components. Early evidence suggested that the entrant field was headed toward a self-destructive, mutually unprofitable equilibrium. Our agent, Deep Maize, introduced a preemptive strategy designed to neutralize aggressive procurement, perturbing the field to a more profitable equilibrium; it worked. Not only did preemption improve Deep Maize's profitability, it improved profitability for the whole field. Whereas it is perhaps counterintuitive that action designed to prevent others from achieving their goals actually helps them, strategic analysis employing an empirical game-theoretic methodology verifies and provides insight about this outcome.
Journal of Artificial Intelligence Research, 2002
Journal of Artificial Intelligence Research, 2002
1 INTRODUCTION Reinforcement learning provides a sound framework for credit assignment in un- kno... more 1 INTRODUCTION Reinforcement learning provides a sound framework for credit assignment in un- known stochastic dynamic environments. For Markov environments a variety of dierent reinforcement learning algorithmshave been devised to predict and control the ...
1 INTRODUCTION Reinforcement learning provides a sound framework for credit assignment in un- kno... more 1 INTRODUCTION Reinforcement learning provides a sound framework for credit assignment in un- known stochastic dynamic environments. For Markov environments a variety of dierent reinforcement learning algorithmshave been devised to predict and control the ...
Abstract. We present new algorithms for reinforcement learning and prove that they have polynomia... more Abstract. We present new algorithms for reinforcement learning and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decision processes. After observing that the number of actions required to approach the optimal ...
Abstract. We present new algorithms for reinforcement learning and prove that they have polynomia... more Abstract. We present new algorithms for reinforcement learning and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decision processes. After observing that the number of actions required to approach the optimal ...