[update 2021/06] The system summaries are uploaded as output.zip.
train_full_rl.py: Main file. Call training.py -> rl.py -> model/rl.py.
training.py: training loop
rl.py: training epoch (rewards, official evaluation)
model/rl.py: core functions, where MMR is injected
ConfManager.py: Parameters in addition to argparse
data_info.py: store all data paths, data follows the format of fast_abs_rl
ScoreAgent.py: MMR
Part of the code is adapted from fast_abs_rl.