I try and then do git clone https://github.com/Qsingle/verl.git && cd verl, but verl already exists (when I cd, I get all of the folders in my server), so I create a env folder, cd into that, and then run git clone https://github.com/Qsingle/verl.git && cd verl. pip3 install --no-deps -e .
This image is huge. Even the compressed version is almost 20GB, so I didn’t try it, but please, show us how you confirmed that the file is where you expect it to be.
BAsed on your described steps, you mounted the local folder to /workspace/verl in the container. Since the workdir is /workspace (based on the layer history on Docker Hub), when you started a shell in the container, you were in that workspace folder. Then you entered the verl folder and ran git clone which created a new verl folder inside the existing one. If there is any file you need that would be in /workspace/verl/verl.Let’s say it is in your local folder that you mounted. Then it is in /workspace/verl/PAPO_ViRL39K_train not /workers/verl/PAPO_ViRL39K_train.
I don’t know how /workspace/env/verl. is relevant at step 4.
Hello,
The FileNotFound error is occurring because the Docker container’s environment is separate from your host machine’s file system, despite the volume mount. The command python3 -m verl.trainer.main_ppo ... data.train_files="PAPO_ViRL39K_train" is looking for the dataset at a path relative to the container’s working directory (/workers/verl), but the dataset is located at /workspace/env/verl/PAPO_ViRL39K_train within the container. The simple fix is to either move the datasets to the expected location or, more robustly, update the data.train_files and data.val_files arguments in your python command to provide the full, correct path to the datasets within the container’s file system, which would be /workspace/env/verl/PAPO_ViRL39K_train and /workspace/env/verl/PAPO_MMK12_test respectively.
I just saw that maybe that could be the problem as you were typing; however, when I tried and change train and test to: data.train_files=/workspace/env/verl/PAPO_ViRL39K_train data.val_files=/workspace/env/verl/PAPO_MMK12_test
I am still getting FileNotFoundError: Unable to find '/workspace/env/verl/PAPO_ViRL39K_train'
and I know that the file exists at this location because I can cd into that folder just fine: root:/workspace/env/verl/PAPO_ViRL39K_train#