I then try git clone https://github.com/Qsingle/verl.git && cd verl, but verl already exists (when I cd into it, I see all of the folders from my server), so I create an env folder, cd into that, and then run git clone https://github.com/Qsingle/verl.git && cd verl, followed by pip3 install --no-deps -e .
This image is huge. Even the compressed version is almost 20GB, so I didn't try it, but please show us how you confirmed that the file is where you expect it to be.
Based on the steps you described, you mounted the local folder to /workspace/verl in the container. Since the workdir is /workspace (based on the layer history on Docker Hub), when you started a shell in the container, you were in that workspace folder. Then you entered the verl folder and ran git clone, which created a new verl folder inside the existing one. If there is any file you need from that clone, it would be in /workspace/verl/verl. Let's say the dataset is in your local folder that you mounted. Then it is in /workspace/verl/PAPO_ViRL39K_train, not /workers/verl/PAPO_ViRL39K_train.
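If it helps, here is a small Python sketch of how a dataset name given on the command line would map to a full path, depending on the directory the process is started from. The helper name `resolve_from_cwd` is made up for illustration, and I am assuming the application resolves relative names against its working directory, which I have not verified for verl.

```python
from pathlib import Path
from typing import Optional


def resolve_from_cwd(name: str, cwd: Optional[str] = None) -> Path:
    """Join `name` onto the working directory, the way a relative path is usually resolved.

    `cwd` defaults to the current working directory of this process.
    An absolute `name` is returned unchanged, because joining an absolute
    path onto a base discards the base.
    """
    base = Path(cwd) if cwd is not None else Path.cwd()
    return base / name


if __name__ == "__main__":
    # Directories taken from the thread; the dataset name is the one passed to data.train_files.
    for start_dir in ("/workspace", "/workspace/verl", "/workspace/verl/verl"):
        candidate = resolve_from_cwd("PAPO_ViRL39K_train", start_dir)
        print(f"started in {start_dir}: {candidate} (exists: {candidate.exists()})")
```

Running it inside the container shows which of the candidate locations actually exists, so you can compare that with the path in the error message.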
I don't know how /workspace/env/verl is relevant at step 4.
Hello,
The FileNotFound error is occurring because the Docker container's environment is separate from your host machine's file system, despite the volume mount. The command python3 -m verl.trainer.main_ppo ... data.train_files="PAPO_ViRL39K_train" is looking for the dataset at a path relative to the container's working directory (/workers/verl), but the dataset is located at /workspace/env/verl/PAPO_ViRL39K_train within the container. The simple fix is to either move the datasets to the expected location or, more robustly, update the data.train_files and data.val_files arguments in your python command to provide the full, correct path to the datasets within the container's file system, which would be /workspace/env/verl/PAPO_ViRL39K_train and /workspace/env/verl/PAPO_MMK12_test respectively.
I just saw that this might be the problem as you were typing; however, when I tried changing train and test to: data.train_files=/workspace/env/verl/PAPO_ViRL39K_train data.val_files=/workspace/env/verl/PAPO_MMK12_test
I am still getting FileNotFoundError: Unable to find '/workspace/env/verl/PAPO_ViRL39K_train'
and I know that the file exists at this location because I can cd into that folder just fine: root:/workspace/env/verl/PAPO_ViRL39K_train#
The post you reacted to is most likely AI-generated, so if the files are really where they should be, I would spend more time on figuring out why the application says the opposite. I don't know anything about the app, but FileNotFound could also mean that it expects a file, not a directory. Or the file path contains a character that only looks like the one you intended, or there is a non-printable character in the path or in the reference to it. Maybe the user that the process runs as has no access to one of the folders in the path. So if the files are where you expect them to be, it seems to be more like an application issue, a permission issue, or an invalid character issue.
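To narrow down which of these it is, something like the following Python sketch could be run inside the container as the same user that starts the training. The function name `diagnose_path` is hypothetical, and it only checks the three possibilities listed above; it does not know what the application itself expects.

```python
import os
from pathlib import Path


def diagnose_path(raw: str) -> list:
    """Return a list of findings that could explain a FileNotFoundError for `raw`."""
    findings = []

    # 1. Characters that are invisible or easy to confuse in a terminal.
    suspicious = [c for c in raw if not c.isprintable() or ord(c) > 127]
    if suspicious:
        findings.append(f"suspicious characters: {[hex(ord(c)) for c in suspicious]}")
    if raw != raw.strip():
        findings.append("leading or trailing whitespace")

    # 2. Walk from the root down to the target and stop at the first problem.
    path = Path(raw)
    for part in [*reversed(path.parents), path]:
        if not part.exists():
            findings.append(f"missing: {part}")
            break
        if part.is_dir() and not os.access(part, os.X_OK):
            findings.append(f"no traverse permission: {part}")
            break
    else:
        # 3. Every component exists and can be traversed; check the target itself.
        if path.is_dir():
            findings.append("path is a directory, not a file")
        elif not os.access(path, os.R_OK):
            findings.append("path is not readable by this user")

    return findings


if __name__ == "__main__":
    for finding in diagnose_path("/workspace/env/verl/PAPO_ViRL39K_train"):
        print(finding)
```

If the only finding is "path is a directory, not a file", the application probably wants the path of a data file inside that folder rather than the folder itself. Keep in mind that `os.access` reports almost everything as accessible when the script runs as root, so the permission checks are only meaningful for a non-root user.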