thibaultamartin, to writing
@thibaultamartin@mamot.fr avatar

I am not a native English speaker. I make grammar mistakes, and they can make my writing difficult to understand. Instead of nagging colleagues to proofread my writing, I decided to find a tool to correct my mistakes and help me learn along the way.

I evaluated four popular tools to make an informed decision. Here is a tale of Privacy Policies, slapping OpenAI on products to catch up with competition, and grammar checkers.

https://ergaster.org/posts/2024/02/26-writing-is-hard/

Colarusso, to ai
@Colarusso@mastodon.social avatar

I find this project interesting: based on the LegalBench dataset,¹ it aims to benchmark LLMs on legal reasoning tasks.

https://www.vals.ai/

TL;DR: Curious about which language models perform best on legal reasoning tasks? The latest evaluation reveals that OpenAI's GPT-4 takes the lead, followed closely by Google's Gemini Pro.


¹ https://hazyresearch.stanford.edu/legalbench/
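
For the curious, here is a minimal sketch of the per-task scoring such a leaderboard implies. The Hugging Face dataset ID, the task name, and the column names below are assumptions, as is the ask_model function you would supply; none of them is confirmed by the post above.

from datasets import load_dataset

def task_accuracy(ask_model, task="abercrombie"):
    # Dataset ID and column names are assumptions, not confirmed above.
    data = load_dataset("nguha/legalbench", task, split="test")
    hits = 0
    for ex in data:
        prediction = ask_model(ex["text"])  # your LLM call goes here
        hits += prediction.strip().lower() == ex["answer"].strip().lower()
    return hits / len(data)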

bwinbwin, to ai

Instead of calling them AIs, can we call them what they really are: LMs (language models)?

AI just over-inflates their capabilities and leads to poor understanding among non-experts.

ocramz, (edited ) to random
@ocramz@sigmoid.social avatar

I've been thinking about program understanding, and about how to encourage LMs to do compositional/verifiable reasoning on program text ("statically").

This is my latest work around this: https://aclanthology.org/2023.findings-emnlp.601/

If, as some recent literature suggests, transformer-based LMs are no more expressive than regexps, this line of thinking is doomed, but at least it could be a valuable heuristic, complementary to rigorous verification.
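
For intuition on that expressiveness point: balanced parentheses are a classic non-regular property of program text, so no regexp can check them exactly, while a tiny compositional checker with a single counter can. A toy sketch:

def balanced(src: str) -> bool:
    # One counter tracks nesting depth; no finite-state recognizer
    # (and hence no regexp) can simulate unbounded counting.
    depth = 0
    for ch in src:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
            if depth < 0:
                return False
    return depth == 0

assert balanced("f(g(x), y)") and not balanced("f(g(x)")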

Everybody’s talking about Mistral, an upstart French challenger to OpenAI (arstechnica.com)

On Monday, Mistral AI announced a new AI language model called Mixtral 8x7B, a "mixture of experts" (MoE) model with open weights that reportedly truly matches OpenAI's GPT-3.5 in performance—an achievement that has been claimed by others in the past but is being taken seriously by AI heavyweights such as OpenAI's Andrej...
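
For readers unfamiliar with the term: in a mixture-of-experts layer, a small gating network scores all experts for each input, only the top-k experts actually run, and their outputs are mixed with softmax weights (Mixtral reportedly uses 8 experts with top-2 routing). A toy numpy sketch of the routing idea, illustrative only and not Mistral's code:

import numpy as np

rng = np.random.default_rng(0)
d_model, n_experts, top_k = 16, 8, 2

# Each "expert" is reduced to a single linear map for illustration.
experts = [rng.standard_normal((d_model, d_model)) * 0.1 for _ in range(n_experts)]
gate_w = rng.standard_normal((d_model, n_experts)) * 0.1

def moe_layer(x):
    logits = x @ gate_w                # one gating score per expert
    top = np.argsort(logits)[-top_k:]  # keep only the top-k experts
    w = np.exp(logits[top])
    w /= w.sum()                       # softmax over the chosen experts only
    # Only the chosen experts run; their outputs are mixed by weight.
    return sum(wi * (x @ experts[i]) for wi, i in zip(w, top))

y = moe_layer(rng.standard_normal(d_model))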

lampinen, to ai

Very excited to share a substantially updated version of our preprint “Language models show human-like content effects on reasoning tasks”! TL;DR: LMs and humans show strikingly similar patterns in how the content of a logic problem affects their answers. Thread: 1/10

nextcloud, to ai
@nextcloud@mastodon.xyz avatar

😲 Did you know compliments can influence large language models?

They're even open to Argentinian Spanish & Swiss German slang.

Discover how Nextcloud AI is tackling ethical concerns differently:

https://nextcloud.com/blog/nextcloud-ethical-ai-rating/

lysander07, to llm

Many new and interesting topics in our upcoming LLM Foundations and Applications online lecture.

lysander07, to fediverse

Finally, The Time Traveler’s Guide to Research: Analyzing Fictitious Research Themes in the ESWC „Next 20 Years“ Track has been published by Heiko Paulheim (still not present in the fediverse) and Irene Celino: https://arxiv.org/abs/2309.13939

mjgardner, to ArtificialIntelligence

I keep thinking of this bit from Idiocracy as people keep adding “AI” to stuff: https://www.youtube.com/watch?v=kAqIJZeeXEc

“AI has what minds crave! It has electrolytes!”

lysander07, to ArtificialIntelligence

The next step in our brief timeline of (large) language models from our lecture was statistical language modeling with n-grams based on large text corpora, as introduced and popularized by Frederick Jelinek and Stanley F. Chen, using statistical tools like Bayes' theorem, the Markov assumption, and maximum likelihood estimation.
Slides: https://drive.google.com/file/d/1atNvMYNkeKDwXP3olHXzloa09S5pzjXb/view?usp=drive_link
@fizise
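
A minimal sketch of those ideas on a toy corpus: a bigram model under the Markov assumption, with conditional probabilities estimated by MLE (real systems add smoothing on top, e.g. Chen and Goodman's work):

from collections import Counter

corpus = "the cat sat on the mat the cat ran".split()

bigrams = Counter(zip(corpus, corpus[1:]))
unigrams = Counter(corpus)

def p(word, prev):
    # MLE: P(word | prev) = count(prev, word) / count(prev)
    return bigrams[(prev, word)] / unigrams[prev]

# Markov assumption: P(w1..wn) is approximated by the product of P(w_i | w_{i-1})
print(p("cat", "the"))  # 2/3: "the" is followed by "cat" twice, "mat" once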

jd7h, to LLMs
@jd7h@fosstodon.org avatar

I'm taking some time today to test a few new libraries/tools.
These CLI tools for working with LLMs by @simon work like a charm! And they support Unix pipes. <3

More info here: https://llm.datasette.io/en/latest/index.html
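
The same toolkit also exposes a Python API. A minimal sketch, assuming the llm package is installed and an API key is configured; the model name is just an example:

import llm

model = llm.get_model("gpt-3.5-turbo")  # any model llm knows about
response = model.prompt("Say hello in one short sentence.")
print(response.text())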

rysiek, to ai
@rysiek@mstdn.social avatar

Dear fediverse, there's been some buzz recently about language models that are not gigantic black boxes and, in general, about AI developed as free and open source software.

There's this Google internal document, for example, that points out the FLOSS community is close to eating Google's and OpenAI's cake:
https://www.semianalysis.com/p/google-we-have-no-moat-and-neither

So here is my question to you:

What are the best examples of useful, small, on-device models already out there?

:boost_requested:
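
For a concrete starting point, here is one way to run a small model fully on-device with the Hugging Face transformers library; the stack and the model choice are illustrative assumptions, not the poster's:

from transformers import pipeline

# distilgpt2 is ~82M parameters and runs fine on CPU, fully offline
# once the weights are cached locally.
generator = pipeline("text-generation", model="distilgpt2")
print(generator("Small on-device language models are", max_new_tokens=30)[0]["generated_text"])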
