AdamGleave - Overview (original) (raw)

Skip to content

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sign up

View AdamGleave's full-sized avatar

Adam Gleave AdamGleave

Organizations

@HumanCompatibleAI

Block or report AdamGleave

Pinned Loading

  1. Forked from openai/baselines
    A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
    Python 4.3k 723
  2. Find best-response to a fixed policy in multi-agent RL
    Python 283 47
  3. Clean PyTorch implementations of imitation and reward learning algorithms
    Python 1.5k 269
  4. Benchmark environments for reward modelling and imitation learning algorithms.
    Python 46 6
  5. (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
    Python 28 2
  6. The Firmament cluster scheduling platform
    C++ 412 77