popey, to ubuntu
@popey@mastodon.social avatar

Work it, baby!
An older Entroware Athena laptop is breaking more than a sweat. Poor little nVidia GTX 980M is struggling a touch! 😅

A screenshot of btop (bottom) with the cpu a little busy running some python. Getting a bit warm!

governa, to proxmox
@governa@fosstodon.org avatar

How to Passthrough NVIDIA GPU to VE 8 Containers for / Acceleration and Media Transcoding

https://linuxhint.com/passthrough-nvidia-gpu-proxmox-ve-8-cuda-ai-media-transcoding/

jay, to ai
infektor,
@infektor@mastodon.gamedev.place avatar

@giuseppebilotta @jay SYCL and OpenCL are central to this effort, neither are on their own are enough to be competitive on every front with the CUDA ecosystem.

giuseppebilotta,

@infektor @jay
I appreciate the spirit, but I can't say I see it reflected in the results I've seen so far.

KathyReid, to linux
@KathyReid@aus.social avatar

Sure, faster, better and humanoid robots are cool, but have you ever installed on properly the first time around?

Yeah, me neither.

Focusing on technologies without making those technologies easier to obtain or easier to develop reinforces digital divides.

stib, to NixOS
@stib@aus.social avatar

Has anyone got to work on ? I can get my cards recognised by nvidia-smi, but cuda doesn't seem to be installed.

sos, to programming
@sos@mastodon.gamedev.place avatar
wagesj45, to weirdgirlmemes
@wagesj45@mastodon.jordanwages.com avatar

I hate when your run into an issue in your program, you google it, and zero results show up. :pepe_g:

ramikrispin, to python
@ramikrispin@mstdn.social avatar

Going Further with CUDA for Python Programmers 🚀

The second part of Jeremy Howard's lecture on for programmers is now available 👇🏼

📽️: https://www.youtube.com/watch?v=eUuGdh3nBGo

This lecture focuses on the following topics:
✅ Optimized Matrix Multiplication
✅ Shared Memory Techniques for CUDA
✅ Implementing Shared Memory Optimization
✅ Translating Python to CUDA and Performance Considerations
✅ Numba: Bringing Python and CUDA Together

Notebook: https://github.com/cuda-mode/lectures/blob/main/lecture5/matmul_l5.ipynb

Methylzero, to hpc


If you had to do a lot of linear least square solves, with potentially rank-deficient matrices, what would you use on a GPU? On CPUs, LAPACK's DGELSY does work, but most GPU libraries seem to not implement routines for rank-deficient matrices.

governa, to Amd
@governa@fosstodon.org avatar

Quietly Funded A Drop-In Implementation Built On ROCm: It's Now

https://www.phoronix.com/review/radeon-cuda-zluda

denzilferreira, to Amd
@denzilferreira@techhub.social avatar

ZLUDA, funded by AMD is bringing CUDA to a Radeon near you. ML/AI rejoice!


https://www.phoronix.com/review/radeon-cuda-zluda

el0j, to random
@el0j@mastodon.gamedev.place avatar

Nvidias moat under attack?

"AMD Quietly Funded A Drop-In CUDA Implementation Built On ROCm" -- https://www.phoronix.com/review/radeon-cuda-zluda

ramikrispin, to python
@ramikrispin@mstdn.social avatar

(1/2) Getting started with CUDA! 👇🏼

A new crash course for getting started with #CUDA with #Python by Jeremy Howard 🚀. CUDA is NVIDIA's programming model for parallel computing on GPUs. CUDE is being used by tools such as #PyTorch #tensorflow and other #deeplearning and LLMs frameworks to speed up calculations. The course covers the following topics:
✅ Setting up CUDA
✅ CUDA foundation
✅ Working with Kernel
✅ CUDA with PyTorch

Course 📽️: https://www.youtube.com/watch?v=nOxKexn3iBo

#datascience #machinelearning

ramikrispin,
@ramikrispin@mstdn.social avatar
jannem, to GraphicsProgramming
@jannem@fosstodon.org avatar

@VileLasagna Has a blog post on the relative speed of different compute frameworks on the same hardware and driver.

Tl;dr: on an card, with Nvidia drivers, is the slowest, by far. Fastest is our old stalwart - almost twice as fast when used only for compute. is good, and the least affected by using the card for your desktop at the same time. Read it - it's good.

https://vilelasagna.ddns.net/coding/if-you-want-performance-maybe-you-should-drop-cuda/

jannem,
@jannem@fosstodon.org avatar

@hyc @VileLasagna
It's a simple computing test. Results are of course going to differ by the task - and ultimately the only benchmark that matters is your production code.

With that said, there's nothing odd going on with the source that I can see; this kind of simple structure isn't very rare in "real" code; and the difference is quite striking.

It's not the only data point I've seen - there's a recent CFD code that also shows impressive results with opencl.

hyc,
@hyc@mastodon.social avatar

@jannem @VileLasagna are you able to compare the compiler outputs of each to see how similar or different they are?

slashtechno, to poetry
@slashtechno@fosstodon.org avatar

I've been facing many issues with using #Poetry (#pythonpoetry) with my #Python based #objectdetection project. I love Poetry for publishing packages, but think that #conda would be better since I have to deal with #CUDA and whatnot. Anyone familiar with a way to use pyproject.toml for publishing and building packages, even if Poetry isn't being used for dependency management?

For context, here's the project I'm working on: https://github.com/slashtechno/wyzely-detect

slashtechno,
@slashtechno@fosstodon.org avatar

@kytta Interesting - what do you recommend I migrate to, especially if I want to use conda? I imagine if using any build tool with conda, the build tool would not be able to package the program if dependencies are installed with conda and not directly with pip - correct?

kytta,
@kytta@fosstodon.org avatar

@slashtechno yeah, that's where my expertise ends, as I do not know anything about conda. In Python packaging, you don't package the dependencies with the app, so you don't need to install them. Instead, the wheels just have the package names, so pip can resolve the dependency tree and pull the needed packages by itself

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • mdbf
  • ngwrru68w68
  • cubers
  • magazineikmin
  • thenastyranch
  • rosin
  • khanakhh
  • InstantRegret
  • Youngstown
  • slotface
  • Durango
  • kavyap
  • DreamBathrooms
  • megavids
  • tacticalgear
  • osvaldo12
  • normalnudes
  • tester
  • cisconetworking
  • everett
  • GTA5RPClips
  • ethstaker
  • anitta
  • Leos
  • provamag3
  • modclub
  • lostlight
  • All magazines