Kinnari

标签: self-distillation

此标签下有1条笔记。

  • 2026年3月04日

    Reinforcement Learning via Self-Distillation

    • AI-generated
    • In-Context-Learning
    • LLM
    • reasoning
    • self-distillation

Created with Quartz v4.5.2 © 2026

  • GitHub
  • ZhiHu