1. Pretraining roberta through running the script below. Detailed parameter descriptions can be found in the arguments.py. `data_path_prefix` is absolute path specifies output of preprocessing. **You have to modify the *hostfile* according to your cluster.**
```bash
bash run_pretrain.sh
```
*`--hostfile`: servers' host name from /etc/hosts
*`--include`: servers which will be used
*`--nproc_per_node`: number of process(GPU) from each server
*`--data_path_prefix`: absolute location of train data, e.g., /h5/0.h5
*`--eval_data_path_prefix`: absolute location of eval data