Newest 'pytorch' Questions

0 votes

1 answer

97 views

PyTorch not recognizing RTX 5090 (sm_120) on Windows 11 – CUDA error: no kernel image available

I'm trying to use PyTorch with an NVIDIA GeForce RTX 5090 (Blackwell architecture, CUDA Compute Capability sm_120) on Windows 11, and I keep running into compatibility issues. PyTorch detects CUDA, ...

sajjadesmaili

41

asked yesterday

0 votes

0 answers

58 views

How do I visualize the latent representation produced by the Stable Diffusion VAE?

I am trying to visualize the latent representation produced by the VAE inside a Stable Diffusion pipeline from diffusers import StableDiffusionPipeline import torch # A CUDA ordinal is simply the ...

Yilmaz

51k

asked yesterday

0 votes

0 answers

29 views

AWS SageMaker PyTorch Model Deployment - is entry_point needed?

I'm trying to deploy a pre-trained PyTorch model to SageMaker using the Python SDK. I have a model.tar.gz file that is uploaded to S3, with the following structure: code/ code/requirements.txt code/...

RefresherM

1

asked Nov 28 at 15:14

Tooling

0 votes

0 replies

49 views

Good packages for bounded Linear Quantile Regression?

I'm looking for a good package to train a linear quantile regression model, i.e. $\hat y = \sum_{i=1}^n w_i \cdot X_i$. With $x_i$ are the input features, and $w_i$ are the bounded trainable weights. ...

student13

13

asked Nov 28 at 14:50

0 votes

0 answers

29 views

Attribution Error when using Huggingface transformers Trainer with FSDP

I am now trying to use FSDP in Huggingface transformers Trainer. The training script is something like train_dataset = Mydataset(...) args = TrainingArguments(...) model = LlamaForCausalLM....

MR_Xhao

11

asked Nov 28 at 4:11

0 votes

0 answers

51 views

Optimization Challenge in Hugging Face: Effcienntly Serving Muliple, Differently Sized LLMs on a Single Gpu with PyTorch [closed]

I am currently working on a Python based Gen AI project that requires the efficient deployment and serving of multiple LLMs specifically models with different parameter counts ( Llama-2 7B and Mistral ...

Amira Yassin

1

asked Nov 27 at 23:11

2 votes

1 answer

77 views

Having trouble with R's torch and tensor dimensions

I am trying to follow along with this webpage: https://jtr13.github.io/cc21fall2/tutorial-on-r-torch-package.html I am trying to understand R's implementation of PyTorch. I am having some trouble with ...

Huy Pham

173

asked Nov 27 at 9:28

0 votes

0 answers

47 views

How to force NCCL build to embed PTX for all kernels (prevent linker from stripping ncclDevKernel PTX)?

I am compiling NCCL 2.27.5-1 (I tried also 2.28.9-1) from source for a V100 GPU (sm_70). My goal is to have libnccl.so contain compute_70 PTX for every kernel. Despite passing explicit -gencode=arch=...

CiZ

10

asked Nov 26 at 17:05

-1 votes

1 answer

46 views

YOLOv8 custom training loop using v8DetectionLoss fails to converge on custom dataset (7 classes) [closed]

I am trying to implement a custom training loop for object detection using YOLOv8 (Ultralytics) and PyTorch. My goal is to fine-tune a pre-trained yolov8n.pt model on the Aquarium dataset, which ...

Quốc Tiến Trần

7

asked Nov 26 at 13:42

1 vote

0 answers

71 views

PyTorch installed via uv project shows CPU-only version on Windows with CUDA specification in pyproject.toml

I'm trying to set up a Python project using uv and pyproject.toml on Windows. I want to install the CUDA-enabled PyTorch, but after installing, when I check the version, it shows CPU-only. Here’s my ...

wonone11

11

asked Nov 25 at 9:01

Advice

0 votes

0 replies

29 views

When using TensorDictPrioritizedReplayBuffer, should I apply the priority weight manually or not?

With Prioritized Experience Replay (PER), we use Beta parameter, so we can find weight that will be used to offset the bias introduced by PER. Now, with PyTorch's TensorDictPrioritizedReplayBuffer, I ...

Bejo

13

asked Nov 25 at 6:43

1 vote

2 answers

126 views

pytorch Module B=A, A.to('cpu'), but the tensor in B is still in GPU, why?

After converting module A to CPU, the origin parameter tensor still stays on the GPU? When it is released? Is it wrong if I reuse the parameter? My code: import torch.nn as nn class A(nn.Module): ...

jiwei zhang

11

asked Nov 21 at 10:11

2 votes

1 answer

25 views

PyTorch .view() operation to manipulate tensor dimensions vis a vis using torch.unbind followed by torch.cat

In Torch, .view() reshapes the tensor. However, there are multiple ways to reshape a multi-dimensional tensor to a target shape. How does it decide between those different ways? For example, in Torch, ...

Sanchit

21

asked Nov 20 at 21:47

2 votes

1 answer

604 views

PyTorch fails on Windows Server 2019: “Error loading c10.dll” (works fine on Windows 10)

I'm trying to deploy a Python project on Windows Server 2019, but PyTorch fails to import with a DLL loading error. On my local machine (Windows 10, same Python version), everything works perfectly. ...

Rael Clariana

21

asked Nov 20 at 17:59

1 vote

1 answer

61 views

.so file built on same CPU but different EC2 instances lead to missing symbols

I am building a wheel of PyTorch from source, based on their https://github.com/pytorch/pytorch/blob/v2.6.0/.ci/manywheel/build_common.sh CI build script. I tested on a "local" instance of a ...

Corneau

193

asked Nov 18 at 21:40

Collectives™ on Stack Overflow

PyTorch not recognizing RTX 5090 (sm_120) on Windows 11 – CUDA error: no kernel image available

How do I visualize the latent representation produced by the Stable Diffusion VAE?

AWS SageMaker PyTorch Model Deployment - is entry_point needed?

Good packages for bounded Linear Quantile Regression?

Attribution Error when using Huggingface transformers Trainer with FSDP

Optimization Challenge in Hugging Face: Effcienntly Serving Muliple, Differently Sized LLMs on a Single Gpu with PyTorch [closed]

Having trouble with R's torch and tensor dimensions

How to force NCCL build to embed PTX for all kernels (prevent linker from stripping ncclDevKernel PTX)?

YOLOv8 custom training loop using v8DetectionLoss fails to converge on custom dataset (7 classes) [closed]

PyTorch installed via uv project shows CPU-only version on Windows with CUDA specification in pyproject.toml

When using TensorDictPrioritizedReplayBuffer, should I apply the priority weight manually or not?

pytorch Module B=A, A.to('cpu'), but the tensor in B is still in GPU, why?

PyTorch .view() operation to manipulate tensor dimensions vis a vis using torch.unbind followed by torch.cat

PyTorch fails on Windows Server 2019: “Error loading c10.dll” (works fine on Windows 10)

.so file built on same CPU but different EC2 instances lead to missing symbols

Hot Network Questions