Questions tagged [nvidia]
For programming questions specifically related to Nvidia hardware. N.B. Questions about system configuration are usually off-topic here!
nvidia
3,736
questions
0
votes
0
answers
9
views
CUDA Out of Memory Error Despite Having Multiple GPUs
I'm encountering a CUDA out-of-memory error while trying to run a PyTorch model, even though my system has multiple NVIDIA GPUs.
# Load the tokenizer and model
tokenizer = AutoTokenizer....
0
votes
0
answers
18
views
Integrating Custom Avatars with LLM-Powered Responses in a Web Application
We're developing a web application with the following requirements:
Create and display multiple custom avatars in the UI
Allow users to speak to any avatar
Generate responses using LLMs on the backend
...
0
votes
0
answers
48
views
Inconsistent global memory access between blocks despite use of volatile, threadfence and disabling L1 cache
In the following minimal reproducible example for the construction of a tree, where bodies are inserted based on their position (so a 1D version of a Quad/Octree) when multiple blocks are used, some ...
-4
votes
0
answers
34
views
Problems for installing NVIDIA driver in Ubuntu 22.04.2 LTS [closed]
NVIDIA driver installing failed. I tried to install NVIDIA driver by the command below.
sudo apt install nvidia-driver-545-open
but I got the following log:
Building for 6.5.0-44-generic 6.5.1-...
-2
votes
1
answer
24
views
TensorFlow using wrong GPU in system [closed]
I'm training models using TensorFlow using Cuda toolkit in which I'm trying to utilise my GPU for the training. I have one Dedicated GPU, Nvidia GeForce GTX 1650 Ti(4GB), and an Integrated GPU, AMD ...
-4
votes
1
answer
47
views
continuously getting the error: 'nvidia-smi' is not recognized as an internal or external command, operable program or batch file [closed]
disclaimer: I am not super experienced with python
I have been trying to set up SAM (segment anything model by meta), but have been running into issues with installing pytorch. I have followed ...
-1
votes
0
answers
10
views
Debian : how to fix my graphics drivers installation?
I just installed debian with kde on a machine with a nvidia graphic card and an amd cpu who have integrated graphics.
When i install the package nvidia-drivers and reboot, when i login, the screen get ...
0
votes
0
answers
15
views
Nvidia GPU not found when running ACE microservices Docker containers under WSL
I am currently exploring the 1st ACE Workflow with Docker Containers, following the documentation provided by Nvidia. I am using WSL2 (with Ubuntu) on Windows 11 with NVIDIA GeForce RTX 3070. I went ...
-1
votes
0
answers
26
views
How to Estimate GPU Memory for training and inference, Data Requirements, and Training Time for Large Language Models?
This is a very concrete and well-defined computer engineering question. I don't understand why someone would want to close it.
Today, I faced this question during an interview for an ML Engineer ...
-1
votes
0
answers
40
views
Contradict specs on tensor cores on my GPU [duplicate]
My GPU is Quadro T1000 Mobile (SM_75). I've received the contrary device specs on tensor cores.
The GPU has 14 SMs and the chapter compute capacity 7.x lists 8 tensor cores per SM straightly. If so, ...
0
votes
0
answers
44
views
Rapids Cudf library not fast enough
I have a server with 1xNvidia T4 and a 32 Cores CPU. Trying to migrate from pandas to cudf.
I am performing multiple row wise transformations on my data but I could find the CPU computing faster than ...
1
vote
1
answer
69
views
Weird behaviour of CUDA recursion
In the following minimal reproducible example, when the recursion in device_func is active, the __synchthreads() barrier is ignored, and when debugged, breakpoint 2 occurs before breakpoint 1. If the ...
0
votes
0
answers
46
views
I want to use 11.7 version of cuda (but my driver wants 12.2) [closed]
I am a beginner in artificial intelligence. In order to test specific artificial intelligence, version 11.7 of CUDA is required.
The recommended CUDA version of the driver is 12.2, but I want to use ...
-2
votes
0
answers
61
views
CUDA Kernel Taking: Large Amount of Process Memory
I have total 811 images with each RGB image dimension of 1440x1080. Therefore total 3.52 GB. I'm using CUDA v12.2, NVIDIA RTX 3090 GPU, and Visual Studio 2019. I'm facing two issues
After readImage() ...
0
votes
1
answer
53
views
Questions about mma instruction with Nvidia ptx
Hi my understanding about mma instruction with ptx is (please tell me if I'm wrong):
it is a per warp instruction
it need to load specific element into register of each thread within the target warp
...