Questions tagged [nvidia]

For programming questions specifically related to Nvidia hardware. N.B. Questions about system configuration are usually off-topic here!

0 votes
0 answers
9 views

CUDA Out of Memory Error Despite Having Multiple GPUs

I'm encountering a CUDA out-of-memory error while trying to run a PyTorch model, even though my system has multiple NVIDIA GPUs. # Load the tokenizer and model tokenizer = AutoTokenizer....
Flying-Meta's user avatar
0 votes
0 answers
18 views

Integrating Custom Avatars with LLM-Powered Responses in a Web Application

We're developing a web application with the following requirements: create and display multiple custom avatars in the UI; allow users to speak to any avatar; generate responses using LLMs on the backend ...
Mkp's user avatar
  • 11
0 votes
0 answers
48 views

Inconsistent global memory access between blocks despite use of volatile, threadfence and disabling L1 cache

In the following minimal reproducible example for the construction of a tree, where bodies are inserted based on their position (so a 1D version of a Quad/Octree), when multiple blocks are used, some ...
larrycaverga's user avatar
-4 votes
0 answers
34 views

Problems installing the NVIDIA driver on Ubuntu 22.04.2 LTS [closed]

The NVIDIA driver installation failed. I tried to install the driver with the command sudo apt install nvidia-driver-545-open, but I got the following log: Building for 6.5.0-44-generic 6.5.1-...
Mason Wong's user avatar
-2 votes
1 answer
24 views

TensorFlow using wrong GPU in system [closed]

I'm training models with TensorFlow and the CUDA toolkit, and I'm trying to use my GPU for the training. I have one dedicated GPU, an Nvidia GeForce GTX 1650 Ti (4 GB), and an integrated GPU, AMD ...
Niranjan Official's user avatar
-4 votes
1 answer
47 views

Continuously getting the error: 'nvidia-smi' is not recognized as an internal or external command, operable program or batch file [closed]

Disclaimer: I am not very experienced with Python. I have been trying to set up SAM (Segment Anything Model by Meta), but have been running into issues installing PyTorch. I have followed ...
F.O.'s user avatar
  • 1
-1 votes
0 answers
10 views

Debian: how to fix my graphics driver installation?

I just installed Debian with KDE on a machine with an Nvidia graphics card and an AMD CPU that has integrated graphics. When I install the package nvidia-drivers and reboot, then log in, the screen gets ...
nico's user avatar
  • 9
0 votes
0 answers
15 views

Nvidia GPU not found when running ACE microservices Docker containers under WSL

I am currently exploring the 1st ACE Workflow with Docker containers, following the documentation provided by Nvidia. I am using WSL2 (with Ubuntu) on Windows 11 with an NVIDIA GeForce RTX 3070. I went ...
hoosengold's user avatar
-1 votes
0 answers
26 views

How to Estimate GPU Memory for training and inference, Data Requirements, and Training Time for Large Language Models?

This is a very concrete and well-defined computer engineering question. I don't understand why someone would want to close it. Today, I faced this question during an interview for an ML Engineer ...
maplemaple's user avatar
  • 1,435
-1 votes
0 answers
40 views

Contradictory specs on tensor cores for my GPU [duplicate]

My GPU is a Quadro T1000 Mobile (SM_75). I've found contradictory device specs on tensor cores. The GPU has 14 SMs, and the chapter on compute capability 7.x plainly lists 8 tensor cores per SM. If so, ...
sof's user avatar
  • 9,519
0 votes
0 answers
44 views

RAPIDS cuDF library not fast enough

I have a server with one Nvidia T4 and a 32-core CPU, and I am trying to migrate from pandas to cuDF. I am performing multiple row-wise transformations on my data, but I found the CPU computing faster than ...
R_M's user avatar
  • 1
1 vote
1 answer
69 views

Weird behaviour of CUDA recursion

In the following minimal reproducible example, when the recursion in device_func is active, the __syncthreads() barrier is ignored, and when debugged, breakpoint 2 occurs before breakpoint 1. If the ...
larrycaverga's user avatar
0 votes
0 answers
46 views

I want to use CUDA version 11.7 (but my driver wants 12.2) [closed]

I am a beginner in artificial intelligence. To test a specific AI application, CUDA version 11.7 is required. The CUDA version recommended for the driver is 12.2, but I want to use ...
황수현's user avatar
-2 votes
0 answers
61 views

CUDA Kernel Taking a Large Amount of Process Memory

I have a total of 811 RGB images, each 1440x1080, for a total of 3.52 GB. I'm using CUDA v12.2, an NVIDIA RTX 3090 GPU, and Visual Studio 2019. I'm facing two issues: After readImage() ...
Md. Rokonuzzaman Reza's user avatar
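
The 3.52 GB figure in the excerpt above can be checked with a quick calculation. This is only a rough sketch, and it assumes uncompressed 8-bit RGB (3 bytes per pixel), which the excerpt does not state. Under that assumption the figure matches when read as GiB (2^30 bytes); in decimal GB it comes out closer to 3.78.

```python
# Back-of-the-envelope size of the image set described in the excerpt.
# Assumption (not stated in the excerpt): 3 channels, 1 byte per channel,
# stored uncompressed.
NUM_IMAGES = 811
WIDTH, HEIGHT = 1440, 1080
CHANNELS = 3           # RGB
BYTES_PER_CHANNEL = 1  # assumed 8-bit samples


def dataset_bytes(num_images, width, height, channels, bytes_per_channel):
    """Return the raw, uncompressed size of the image set in bytes."""
    return num_images * width * height * channels * bytes_per_channel


total = dataset_bytes(NUM_IMAGES, WIDTH, HEIGHT, CHANNELS, BYTES_PER_CHANNEL)
gib = total / 2**30   # binary gibibytes
gb = total / 10**9    # decimal gigabytes

print(f"{total} bytes")
print(f"{gib:.2f} GiB")
print(f"{gb:.2f} GB")
```

If the images are held as float or with an alpha channel, the footprint scales accordingly (for example, 4x for 32-bit floats per channel), which is one way host or device memory can end up well above the raw file total.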
0 votes
1 answer
53 views

Questions about the mma instruction in Nvidia PTX

Hi, my understanding of the mma instruction in PTX is (please tell me if I'm wrong): it is a per-warp instruction; it needs specific elements loaded into the registers of each thread within the target warp ...
Shore's user avatar
  • 959
