AdamGleave - Overview (original) (raw)
Skip to content
Provide feedback
Saved searches
Use saved searches to filter your results more quickly
Sign up
Adam Gleave AdamGleave
Organizations
Block or report AdamGleave
Pinned Loading
- Forked from openai/baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Python 4.3k 723
- Find best-response to a fixed policy in multi-agent RL
Python 283 47
- Clean PyTorch implementations of imitation and reward learning algorithms
Python 1.5k 269
- Benchmark environments for reward modelling and imitation learning algorithms.
Python 46 6
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
Python 28 2
- The Firmament cluster scheduling platform
C++ 412 77