I’m new to programming and unfamiliar with Docker. I wanted to use Llama 3.1 8B, but it runs very slowly on my laptop, whereas Mistral 7B runs quickly. It seems that Llama 3.1 8B is not using my GPU. How can I configure Docker to recognize and use my GPU (GeForce RTX 4050 Laptop GPU)?
Thank you for your assistance. As I’m new to programming, I’ll do my best to follow the forum’s structure and culture. I appreciate your patience as I learn, and I apologize for any inadvertent mistakes I might make along the way.
Thank you for your concern about security. I’ve reviewed the information and did not find any sensitive details, such as IP addresses, although I can’t be completely sure that nothing is exposed. I understand that it’s always better to err on the side of caution regarding data security. With that in mind, I’m comfortable sharing the following information with you:
Below are the details of the Docker Desktop installation on my Windows 11 laptop:
I’m feeling a bit stuck and unsure about what to do next. I’ve tried everything step by step, but the problem might be related to the limits of my RTX 4050 graphics card, which has 6 GB of VRAM. From what I understand, models like Llama 3.1 8B or Gemma2:9B require a minimum of 8 GB of VRAM. This might be the issue, even though both models were running smoothly and quickly yesterday!
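To sanity-check the VRAM question, here is a rough back-of-the-envelope sketch in Python. It only estimates the memory needed for the model weights, and it rests on assumptions of mine: the nominal parameter counts (7, 8, and 9 billion) and the quantization levels (4, 8, and 16 bits per weight) are illustrative, and I don't know which quantization my downloaded models actually use. Real usage is higher because of the context cache and runtime overhead, so this is a lower bound rather than a precise requirement.

```python
def weight_vram_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB.

    Ignores the KV cache, activations, and runtime overhead,
    so the real requirement is somewhat higher.
    """
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


GPU_VRAM_GIB = 6.0  # RTX 4050 Laptop GPU, as stated in my post

# Nominal parameter counts (assumed, rounded from the model names).
models = [("Mistral 7B", 7.0), ("Llama 3.1 8B", 8.0), ("Gemma 2 9B", 9.0)]

for name, params in models:
    for bits in (4, 8, 16):
        need = weight_vram_gib(params, bits)
        verdict = "weights fit" if need < GPU_VRAM_GIB else "weights do NOT fit"
        print(f"{name} @ {bits}-bit: ~{need:.1f} GiB ({verdict} in {GPU_VRAM_GIB:.0f} GiB)")
```

If this estimate is roughly right, the weights of a 4-bit 8B model should fit in 6 GB, while 8-bit or 16-bit weights would not, so the quantization of the model I pulled may matter more than the model name.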