Benjamin-eecs - Overview (original) (raw)

Skip to content

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sign up

Appearance settings

Hi there 👋

Pinned Loading

  1. SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
    Python 196 22
  2. DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
    5k 543
  3. DeepSeek-VL: Towards Real-World Vision-Language Understanding
    Python 4.1k 593
  4. TorchOpt is an efficient library for differentiable optimization built upon PyTorch.
    Python 631 44
  5. C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
    C++ 1.5k 141
  6. Natural Language Reinforcement Learning
    Python 101 7