Spaces:

iteratehack
/

MentorFlow

Paused

MentorFlow / teacher_agent_dev /RUN_LM_COMPARISON.md

Cornelius

Deploy MentorFlow with GPU support

a52f96d 13 days ago

1.25 kB

A newer version of the Gradio SDK is available: 6.1.0

Upgrade

Running Comparison with LM Student

Updated compare_strategies.py to use LM Student (DistilBERT) instead of MockStudentAgent for all three strategies:

cd teacher_agent_dev
python compare_strategies.py --iterations 500 --deterministic

LM Student is slower - Each iteration involves DistilBERT inference/fine-tuning
Uses DistilBERT for multiple choice questions
Online learning (fine-tunes on 1 task at a time)
Memory decay using Ebbinghaus forgetting curve
Per-topic skill tracking

With LM Student:

Total: ~15-30 minutes for full comparison

If LM Student cannot be imported (e.g., transformers library missing), it will automatically fall back to MockStudentAgent.