#cuda - kbin.social

governa, 1 month ago to proxmox

How to Passthrough NVIDIA GPU to #Proxmox VE 8 Containers for #CUDA / #AI Acceleration and Media Transcoding

https://linuxhint.com/passthrough-nvidia-gpu-proxmox-ve-8-cuda-ai-media-transcoding/

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jay, 1 month ago to ai

🚩 Entire Team Lived in Bubble for Years - Proclaim Never Heard of OpenCL

https://www.theregister.com/2024/03/26/uxl_foundation_cuda_alternative

#UXLFoundation #UXL #CUDA #OpenCL #Khronos #AI #GPU #Heterogenous #Acceleration #oneAPI

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

KathyReid, 2 months ago to linux

Sure, faster, better #GPUs and humanoid robots are cool, but have you ever installed #CUDA on #Linux properly the first time around?

Yeah, me neither.

Focusing on technologies without making those technologies easier to obtain or easier to develop reinforces digital divides.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

stib, 2 months ago to NixOS

Has anyone got #CUDA to work on #Nixos? I can get my cards recognised by nvidia-smi, but cuda doesn't seem to be installed.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ KathyReid

sos, 2 months ago to programming

Wanted to use CUDA?

Now you're stuck with Nvidia hardware because it's illegal to run it on anything else.

DISGUSTING!

https://www.tomshardware.com/pc-components/gpus/nvidia-bans-using-translation-layers-for-cuda-software-to-run-on-other-chips-new-restriction-apparently-targets-zluda-and-some-chinese-gpu-makers

#programming #pytorch #tensorflow #cuda #ml #ai #machinelearning

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ wonziu

wagesj45, 2 months ago to weirdgirlmemes

I hate when your run into an issue in your program, you google it, and zero results show up. :pepe_g:

#relatable #llama #huggingface #transformers #cuda #config

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 3 months ago to python

Going Further with CUDA for Python Programmers 🚀

The second part of Jeremy Howard's lecture on #CUDA for #Python programmers is now available 👇🏼

📽️: https://www.youtube.com/watch?v=eUuGdh3nBGo

This lecture focuses on the following topics:
✅ Optimized Matrix Multiplication
✅ Shared Memory Techniques for CUDA
✅ Implementing Shared Memory Optimization
✅ Translating Python to CUDA and Performance Considerations
✅ Numba: Bringing Python and CUDA Together

Notebook: https://github.com/cuda-mode/lectures/blob/main/lecture5/matmul_l5.ipynb

#DataScience #deeplearning

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ daridrea

Methylzero, 3 months ago to hpc

#HPC #CUDA #OpenCL #LAPACK
If you had to do a lot of linear least square solves, with potentially rank-deficient matrices, what would you use on a GPU? On CPUs, LAPACK's DGELSY does work, but most GPU libraries seem to not implement routines for rank-deficient matrices.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ oblomov

governa, 3 months ago to Amd

#AMD Quietly Funded A Drop-In #CUDA Implementation Built On ROCm: It's Now #OpenSource

https://www.phoronix.com/review/radeon-cuda-zluda

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ devSJR, MagicLike

denzilferreira, 3 months ago to Amd

ZLUDA, funded by AMD is bringing CUDA to a Radeon near you. ML/AI rejoice!

#cuda #AMD #AI #ML
https://www.phoronix.com/review/radeon-cuda-zluda

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

el0j, 3 months ago to random

Nvidias #CUDA moat under attack?

"AMD Quietly Funded A Drop-In CUDA Implementation Built On ROCm" -- https://www.phoronix.com/review/radeon-cuda-zluda

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ bitinn

ramikrispin, 3 months ago to python

(1/2) Getting started with CUDA! 👇🏼

A new crash course for getting started with #CUDA with #Python by Jeremy Howard 🚀. CUDA is NVIDIA's programming model for parallel computing on GPUs. CUDE is being used by tools such as #PyTorch #tensorflow and other #deeplearning and LLMs frameworks to speed up calculations. The course covers the following topics:
✅ Setting up CUDA
✅ CUDA foundation
✅ Working with Kernel
✅ CUDA with PyTorch

Course 📽️: https://www.youtube.com/watch?v=nOxKexn3iBo

#datascience #machinelearning

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

ramikrispin, 3 months ago

(2/2) Colab notebook : https://colab.research.google.com/drive/180uk6frvMBeT4tywhhYXmz3PJaCIA_uk?usp=sharing
Source code: https://github.com/cuda-mode/lecture2

#datascience #deeplearning #machinelearning #python #cuda

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jannem, 4 months ago to GraphicsProgramming

@VileLasagna Has a blog post on the relative speed of different #GPU compute frameworks on the same hardware and driver.

Tl;dr: on an #Nvidia card, with Nvidia drivers, #CUDA is the slowest, by far. Fastest is our old stalwart #OpenCL - almost twice as fast when used only for compute. #Vulcan is good, and the least affected by using the card for your desktop at the same time. Read it - it's good.

#HPC #gpgpu #compute

https://vilelasagna.ddns.net/coding/if-you-want-performance-maybe-you-should-drop-cuda/

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

slashtechno, 4 months ago to poetry

I've been facing many issues with using #Poetry (#pythonpoetry) with my #Python based #objectdetection project. I love Poetry for publishing packages, but think that #conda would be better since I have to deal with #CUDA and whatnot. Anyone familiar with a way to use pyproject.toml for publishing and building packages, even if Poetry isn't being used for dependency management?

For context, here's the project I'm working on: https://github.com/slashtechno/wyzely-detect

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

harish, 5 months ago to Amd

So I bought a fancy #AMD graphics card because I didn‘t want to support the #Nvidia #CUDA hegemony. I also had high hopes for their supposedly more open drivers.

I am not sure if this was a great idea, because while it‘s been super good for my kids and their games, it‘s been a steep uphill climb (both ways) to get #ROCm and #HIP to do anything.

And the core bits are distributed as these precompiled packages that only work on a handful of specific versions of Linux distributions.

reply

expand (2)

collapse (2)

report

activity

copy /kbin url

copy original url

open original url

Loading...

giuseppebilotta, 6 months ago to random

OK so I'm ready for today's #GPGPU lesson with the new laptop. My only gripe for the lesson will be that #Rusticl in #Mesa 23.2 doesn't support #profiling information. Apparently the feature was merged at a later commit
https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24101
and I even tried upgrading to my distro's experimental 23.3-rc1 packages, but trying to use rusticl on those packages segfaults. So either I've messed up something with this mixed upgrade, or I've hit an actual bug.

reply

expand (14)

collapse (14)

report

activity

copy /kbin url

copy original url

open original url

Loading...

giuseppebilotta, 6 months ago

I'm still moderately annoyed by the fact that there's no single #OpenCL platform to drive all computer devices on this machine. #PoCL comes close because it supports both the CPU and the #NVIDIA dGPU through #CUDA, but the not the #AMD iGPU (there's an #HSA device, but). #Rusticl supports the iGP (radeonsi) and the CPU (llvmpipe), but not the dGPU (partly because I'm running that on proprietary drivers for CUDA). Everything else has at best one supported device out of three available.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Brett_E_Carlock, 7 months ago to random

Do I have anyone in my wider network with skills in programming CUDA, SYCL, and OpenCL?

We want to determine feasibility of migrating CUDA-only code to SYCL (via SYCLomatic?): OpenCV feature detection/extraction modules (SIFT, HAGOG, ORB, AKAZE).

The intent is to upstream all feasible work.

This, hopefully, should stand to benefit everyone instead of being limited to NVIDIA.

Currently in info gathering/people connecting phase, not yet funded & ready to go.

#CUDA #SYCL #OpenCL #OpenCV

reply

expand (11)

collapse (11)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ mwfc, giuseppebilotta, oblomov

sri, 8 months ago to random

What an amazing talk by @airlied on the state of vendors, compute and community feedback. Please take the 45 minutes to watch - worth every minute! https://youtu.be/HzzLY5TdnZo

#oneapi #sycl #compute #cuda

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ kernellogger

chrxh, 8 months ago to genart

After one more year of intensive work and numerous test runs, a new major update for https://github.com/chrxh/alien is finally polished and ready. It offers possibilities I had only dreamed of before. 🪐

YouTube: https://youtu.be/dSkxvi9igqQ

#ArtificialLife #generativeart #Cuda

video/mp4

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ jonikorpi, mcc, aebrer, scdollins

schenklklopfer, 9 months ago to foss German

Hat jemensch Erfahrung mit #Upscailing von #DVDs auf etwas, das besser ist als #Kartoffelqualität?

Habe #CUDA, suche #FOSS.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ MagicLike

pekka, 10 months ago to random

chipStar 1.0 released! It's a tool for compiling and running CUDA/HIP applications on SPIR-V-supported OpenCL or LevelZero platforms. v1.0 can already run various HPC applications correctly. See: https://github.com/CHIP-SPV/chipStar/releases/tag/v1.0
#opencl #levelzero #spirv #cuda #hip

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ giuseppebilotta, oblomov

blaise, 10 months ago to homelab

#homelab question:

Should I add NVIDIA Tesla K40m 12GB GDDR5 Passive CUDA GPU accelerator to my #kubernetes server?
(Cisco UCS 220 m3, 128G )

Will it help with virtual terminal sessions?
Will it help with work loads that access the #cuda API?

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

giuseppebilotta, 10 months ago (edited 10 months ago) to Amd

Anyway, as I mentioned recently, I have a new workstation that finally allows me to test our code using all three backends (#CUDA, #ROCm/‌#HIP and #CPU w/ #OpeMP) thanks to having an #AMD #Ryzen processor with an integrated #GPU in addition to a discrete #NVIDIA‌ GPU.
Of course the iGPU is massively underpowered compared to the high-end dGPU workhorse, but I would expect it to outperform the CPU on most workloads.
And this is where things get interesting.

reply

expand (17)

collapse (17)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ giuseppebilotta

giuseppebilotta, 10 months ago

So, one of the reasons why we could implement the #HIP backend easily in #GPUSPH is that #AMD provides #ROCm drop-in replacement for much of the #NVIDIA #CUDA‌ libraries, including #rocThrust, which (as I mentioned in the other thread) is a fork of #Thrust with a #HIP/‌#ROCm backend.
This is good as it reduces porting effort, but it also means you have to trust the quality of the provided implementation.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

giuseppebilotta, 10 months ago (edited 10 months ago)

Turns out, the #AMD #HIP ecosystem is less mature than the #NVIDIA‌ #CUDA one it emulates (unsurprising, giving how much more recent it is), and has obviously been tested much less in more exotic hardware configurations and with the wide variety of software and developers the CUDA libraries have had interactions with.
In the few days in which I've had the opportunity to play with it, I've already discovered two issues with it:

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...