sagar verma - Academia.edu (original) (raw)

sagar verma

Uploads

Papers by sagar verma

Research paper thumbnail of Boltzmann Machine for Non-Intrusive Load Monitoring

Increasing population indicates that energy demands need to be managed in the residential sector.... more Increasing population indicates that energy demands need to be managed in the residential sector. Prior studies have reflected that the customers tend to reduce a significant amount of energy consumption if they are provided with appliancelevel feedback. This observation has increased the relevance of load monitoring in today’s tech-savvy world. Most of the previously proposed solutions claim to perform load monitoring without intrusion, but they are not completely nonintrusive. These methods require historical appliance-level data for training the model for each of the devices. This data is gathered by putting a sensor on each of the appliances present in the home which causes intrusion in the building. Some recent studies have proposed that if we frame NonIntrusive Load Monitoring (NILM) as a multi-label classification problem, the need for appliance-level data can be avoided. In this paper, we propose Multi-label Restricted Boltzmann Machine(ML-RBM) for NILM and report an experim...

Research paper thumbnail of MAPEL: Multi-Agent Pursuer-Evader Learning using Situation Report

2019 International Joint Conference on Neural Networks (IJCNN), 2019

In this paper, we consider a territory guarding game involving pursuers, evaders and a target in ... more In this paper, we consider a territory guarding game involving pursuers, evaders and a target in an environment that contains obstacles. The goal of the evaders is to capture the target, while that of the pursuers is to capture the evaders before they reach the target. All the agents have limited sensing range and can only detect each other when they are in their observation space. We focus on the challenge of effective cooperation between agents of a team. Finding exact solutions for such multi-agent systems is difficult because of the inherent complexity. We present Multi-Agent Pursuer-Evader Learning (MAPEL), a class of algorithms that use spatio-temporal graph representation to learn structured cooperation. The key concept is that the learning takes place in a decentralized manner and agents use situation report updates to learn about the whole environment from each others' partial observations. We use Recurrent Neural Networks (RNNs) to parameterize the spatio-temporal graph. An agent in MAPEL only updates all the other agents if an opponent or the target is inside its observation space by using situation report. We present two methods for cooperation via situation report update: a) Peer-to-Peer Situation Report (P2PSR) and b) Ring Situation Report (RSR). We present a detailed analysis of how these two cooperation methods perform when the number of agents in the game are increased. We provide empirical results to show how agents cooperate under these two methods. Index Terms-multi-agent learning; deep reinforcement learning; recurrent neural network

Research paper thumbnail of Boltzmann Machine for Non-Intrusive Load Monitoring

Increasing population indicates that energy demands need to be managed in the residential sector.... more Increasing population indicates that energy demands need to be managed in the residential sector. Prior studies have reflected that the customers tend to reduce a significant amount of energy consumption if they are provided with appliancelevel feedback. This observation has increased the relevance of load monitoring in today’s tech-savvy world. Most of the previously proposed solutions claim to perform load monitoring without intrusion, but they are not completely nonintrusive. These methods require historical appliance-level data for training the model for each of the devices. This data is gathered by putting a sensor on each of the appliances present in the home which causes intrusion in the building. Some recent studies have proposed that if we frame NonIntrusive Load Monitoring (NILM) as a multi-label classification problem, the need for appliance-level data can be avoided. In this paper, we propose Multi-label Restricted Boltzmann Machine(ML-RBM) for NILM and report an experim...

Research paper thumbnail of MAPEL: Multi-Agent Pursuer-Evader Learning using Situation Report

2019 International Joint Conference on Neural Networks (IJCNN), 2019

In this paper, we consider a territory guarding game involving pursuers, evaders and a target in ... more In this paper, we consider a territory guarding game involving pursuers, evaders and a target in an environment that contains obstacles. The goal of the evaders is to capture the target, while that of the pursuers is to capture the evaders before they reach the target. All the agents have limited sensing range and can only detect each other when they are in their observation space. We focus on the challenge of effective cooperation between agents of a team. Finding exact solutions for such multi-agent systems is difficult because of the inherent complexity. We present Multi-Agent Pursuer-Evader Learning (MAPEL), a class of algorithms that use spatio-temporal graph representation to learn structured cooperation. The key concept is that the learning takes place in a decentralized manner and agents use situation report updates to learn about the whole environment from each others' partial observations. We use Recurrent Neural Networks (RNNs) to parameterize the spatio-temporal graph. An agent in MAPEL only updates all the other agents if an opponent or the target is inside its observation space by using situation report. We present two methods for cooperation via situation report update: a) Peer-to-Peer Situation Report (P2PSR) and b) Ring Situation Report (RSR). We present a detailed analysis of how these two cooperation methods perform when the number of agents in the game are increased. We provide empirical results to show how agents cooperate under these two methods. Index Terms-multi-agent learning; deep reinforcement learning; recurrent neural network

Log In