v0.5.2

@pan-x-c released this 08 Apr 07:25 · 4 commits to main since this release · eedef4b

Overview

  1. Added support for training Qwen3.5 series models.
  2. Upgraded core dependencies: transformers to v5.3.0, verl to v0.7.1, and vLLM to v0.19.0.
  3. Enabled the use of auxiliary models within the experience pipeline.
  4. Added support for stream mode in the rollout API.
  5. Enabled integration with external APIs (e.g., OpenAI) during training and benchmarking.
  6. Added experience data visualization capabilities.
  7. Optimized serialization and deserialization of experience data.

What's Changed

Full Changelog: v0.5.1...v0.5.2