For inference, the best option right now is llama.cpp with a quantized LLM in GGUF format. There are several high-level wrappers around llama.cpp that make it easy to use: ollama, vllama...
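llama.cpp handles the GGUF parsing itself, but as a minimal stdlib-only sketch you can sanity-check that a downloaded file really is a GGUF model before pointing a runtime at it. The `gguf_version` helper below is illustrative, not part of any library; it relies only on the documented GGUF header layout (4-byte magic `GGUF`, then a little-endian uint32 version):

```python
import struct

# GGUF files start with the 4-byte magic "GGUF",
# immediately followed by a little-endian uint32 format version.
GGUF_MAGIC = b"GGUF"

def gguf_version(path):
    """Return the GGUF version number, or None if the file is not GGUF."""
    with open(path, "rb") as f:
        header = f.read(8)
    if len(header) < 8 or header[:4] != GGUF_MAGIC:
        return None
    return struct.unpack("<I", header[4:8])[0]
```

Handy for failing fast on a truncated or mislabeled download instead of getting a cryptic loader error later.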
For inference with a very big LLM and very little RAM, the only option is AirLLM: it's slow, but you can run llama3-70b
For finetuning a quantized LLM with LoRA, the only option afaik is also llama.cpp (look for "finetune"). It's a work in progress, but usable and promising!
@Jigsaw_You It's definitely a relative notion... Transformers are for sure much better at generalizing than pre-2017 NLP methods, and they very likely fall short of current expectations, and of future methods... 🙂
Create a Large Language Model from Scratch with Python Tutorial 👇🏼
Another fun tutorial from freeCodeCamp, focusing on building an LLM from scratch with Python. It covers topics such as:
✅ Handling and processing text
✅ Core PyTorch functions for text
✅ Basic language models
✅ Advanced methods
✅ Working with GPUs