A generalized distillation process combining on-policy distillation with RL-based imitation learning for upward knowledge transfer from smaller to larger models
No results
Select a result to preview
A generalized distillation process combining on-policy distillation with RL-based imitation learning for upward knowledge transfer from smaller to larger models
No results