BenjaminHan

@BenjaminHan@sigmoid.social

Working on natural language, knowledge, reasoning, machine learning, and AI at a fruity company.

Husband, father, runner, German learner, piano player. A curious soul living in #PacificNorthwest (WA US).

👟 05/25/18-05/24/24 (dist # time pace/m date src):

5K 647 21:34 6'56" 4/20/24 Strava
10K 100 48:59 7'52" 5/16/24 Strava
15K 4 1:16:05 8'10" 5/19/24 Strava
HM 25 1:48:25 8'16" 5/19/24 Strava
M 7 3:44:58 8'35" 3/24/24 AppleW

#nlp #nlproc #knowledgeGraphs #ai #running #classicalMusic

lebelge, 6 months ago to classicalmusic

@classicalmusic

Lars-Erik Larsson (1908 – 1986)
https://en.wikipedia.org/wiki/Lars-Erik_Larsson

"String Quartets"
[String Quartet No. 1 in D minor, Op. 31;
String Quartet No. 2 “Quartetto Alla Serenata”, Op. 44;
String Quartet No. 3, Op. 65;
Intima Miniatyrer, Op. 20]
Helsingborgs Stråkkvartett
(Big Ben Phonogram 1987)
https://songwhip.com/helsingborgsstrakkvartett/larsson-string-quartets

#NowListening #ClassicalMusic #music #StringQuartet #ChamberMusic #SwedishComposers
#LarsErikLarsson

BenjaminHan, 6 months ago

@lebelge @classicalmusic https://classical.music.apple.com/us/recording/lars-erik-larsson-1908-pp59-571330059

BenjaminHan, 6 months ago to Fonts

Monaspace - An innovative superfamily of #fonts for #code https://monaspace.githubnext.com/

#programming

jbigham, 6 months ago to random

yesterday at UIST, we received the "Lasting Impact Award" for the original VizWiz paper -- it was super cool to receive the award, along with Robin Miller who 14 years ago was an undergraduate researcher on the project!

VizWiz was an iPhone app, released shortly after iOS added VoiceOver -- users could take a photo, ask a question, and get an answer back in a few tens of seconds from MTurk workers -- several novel things stitched together.

https://www.cs.cmu.edu/~jbigham/pubs/pdfs/2010/vizwiz.pdf

BenjaminHan, 6 months ago

@jbigham Jeez it’s already been 14 years? Feeling my age…

Congratulations!

BenjaminHan, 6 months ago to science

“…the experiment demonstrated that plants can grow on the moon despite the intense radiation, low gravity, and prolonged intense light.”

Good to know Matt Damon can really #science the sh*t out of this.

China set up a tiny farm on the moon in 2019. How did it do?
https://phys.org/news/2023-10-china-tiny-farm-moon.html

#space #moon #lunarFarm

BenjaminHan, 7 months ago to apple

Our team (#Apple #KnowledgeGraph) is looking for an #intern working on data quality!

#KnowledgeGraph quality assurance is a highly important task that requires deep insights into how we integrate #reasoning, #NLP, human-in-the-loop, system tooling and #engineering into a cohesive solution!

Please respond to the hiring manager via this post: https://www.linkedin.com/posts/benjaminhan_our-team-is-looking-for-talented-interns-activity-7122271517077368832-GCcl

#hiring #jobs #NLProc

GreatDismal, 7 months ago to random

Back here courtesy of the Mona app, or rather of the friend who wisely recommended it to me.

BenjaminHan, 7 months ago

@GreatDismal Yes, @MonaApp is the only app I know that syncs across iOS, iPadOS, and macOS. Love it.

BenjaminHan, 7 months ago to ArtificialIntelligence

1/

Our paper on automatic knowledge-graph-aligned dataset construction is out! The main points [1]:

We showed cyclic evaluation — the process of training GTG (graph-to-text-to-graph) or TGT using a graph-aligned dataset (screenshot 1) — reflects faithfully the same trend a unidirectional evaluation does (screenshot 2-3). It is therefore a better way to assess data quality because it does not rely on knowing ground truth matches!

#Paper #KnowledgeGraphs #NLP #NLProc #GenerativeAI

jbigham, 7 months ago to random

an advantage machines have over humans in reading is that humans generally have to perceive words, whereas machines just have the words sitting there already in their memory.

BenjaminHan, 7 months ago

@jbigham Words are also already sitting in humans' memory?

jbigham, 7 months ago to random

a challenge i have as a runner is that all the races are early in the morning and i do not like to wake up early. probably folks should drop everything and work on this important problem.

BenjaminHan, 7 months ago

@jbigham Completely agreed. I can understand if the race is held in summer (to avoid sun). Another reason might be traffic control/road closure.

jbigham, 7 months ago to random

i'm going to say something that i think isn't controversial -- if a NYTimes reporter has to spend 3 days trying every possible way to "trick" your LLM into saying the thing they're desperately asking it to say… that's not a safety concern.

BenjaminHan, 7 months ago

@jbigham 3 days by a single person.

BenjaminHan, 7 months ago to llm

1/

If a powerful #LLM is told that “Daphne Barrington is the director of A Journey Through Time”, it would surely be able to answer the question “Who is the director of A Journey Through Time?”, right? Well, according to a recent paper [1], not quite (screenshot).

#NLProc #NLP #KnowledgeGraph #Reasoning #Papers

BenjaminHan, 7 months ago

@kellogh Thank you — I enjoy writing them too!

BenjaminHan, 7 months ago

@indieterminacy That’s a great excerpt! Read the book many many years ago and don’t remember this. Thank you for bringing this up!

BenjaminHan, 7 months ago to Health

Time to train!

How to Get Strong https://www.nytimes.com/article/how-to-build-muscle-strength.html

#Health #resistanceTraining #workout

jbigham, 7 months ago to random

ran the pittsburgh great race this weekend! largest 10k in pennsylvania! it was okay, i ran slower than last year (35:15), but it's always a great time!

jen (wife) won, again, and is quoted in the article below… lol :)

https://www.post-gazette.com/local/city/2023/09/24/pittsburgh-great-race-runners/stories/202309240148

BenjaminHan, 7 months ago

@jbigham Pace 5’40”?!!! That’s freaking unbelievable!
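
For anyone checking the arithmetic, here is a minimal sketch of the pace calculation, assuming the course is exactly 10 km and the pace is quoted per mile with seconds truncated. The function name `pace_per_mile` is made up for illustration.

```python
KM_PER_MILE = 1.609344  # exact definition of the international mile


def pace_per_mile(hours: int, minutes: int, seconds: int, distance_km: float) -> str:
    """Return the average pace per mile as M'SS", truncating fractional seconds."""
    total_seconds = hours * 3600 + minutes * 60 + seconds
    miles = distance_km / KM_PER_MILE
    pace_seconds = total_seconds / miles
    return f"{int(pace_seconds // 60)}'{int(pace_seconds % 60):02d}\""


# A 35:15 finish over 10 km works out to roughly 340 seconds per mile.
print(pace_per_mile(0, 35, 15, 10.0))  # 5'40"
```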

jbigham, 8 months ago to random

20 years ago I started as a PhD student at UW CSE

10 years ago I started as a professor at CMU

5 years ago I started at Apple

It's been quite a ride! 🚘

BenjaminHan, 8 months ago

@jbigham Congratulations on the multi-anniversaries!

BenjaminHan, 8 months ago to Weather

Is it #causation, or #correlation?

Extreme #ElNiño #weather switched off South American's #carbon sink | Corporate | University of Leeds https://www.leeds.ac.uk/news-environment/news/article/5391/extreme-el-ni-o-weather-switched-off-south-american-s-carbon-sink

#climatecrisis #climateChange

BenjaminHan, 8 months ago to math

Is #math just symbol pushing?

When Computers Write Proofs, What's the Point of Mathematicians? https://youtu.be/3l1RMiGeTfU?si=sQMFAK7tzkS4ODZp

#ai #reasoning #mathematics #proofs #generativeAI

BenjaminHan, 8 months ago

@dhinojosa Me too (via this video)! Very intriguing! https://leanprover.github.io

#Lean

BenjaminHan, 8 months ago to generativeAI

1/ In this age of LLMs and generative AI, do we still need knowledge graphs (KGs) as a way to collect and organize domain and world knowledge, or should we just switch to language models and rely on their abilities to absorb knowledge from massive training datasets?

#KnowledgeGraphs #GenerativeAI #LLMs #NLP #NLProc #Paper

BenjaminHan, 8 months ago

4/ But is that all? A recent paper revisits this question and offers a different take [2]. The authors believe just testing isolated fact retrieval is not sufficient to demonstrate the power of KGs.

#KnowledgeGraphs #GenerativeAI #LLMs #NLP #NLProc #Paper

BenjaminHan, 8 months ago

5/ Instead, they focus on more intricate topological and semantic attributes of facts, and propose 9 benchmarks testing modern LLMs’ capability in retrieving facts with the following attributes: symmetry, asymmetry, hierarchy, bidirectionality, compositionality, paths, entity-centricity, bias and ambiguity (screenshots).

#KnowledgeGraphs #GenerativeAI #LLMs #NLP #NLProc #Paper

BenjaminHan, 8 months ago

6/ In each benchmark, rather than only asking LLMs to retrieve masked words from a cloze statement, it also asks the LLMs to retrieve all of the implied facts and computes scores accordingly (screenshot).
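
A rough sketch of what scoring over implied facts could look like for the symmetry case. This is an illustration under my own assumptions, not the paper's scoring code: the relation names, the `implied_facts` helper, and the recall-style score are all hypothetical.

```python
# Hypothetical set of relations treated as symmetric for this toy example.
SYMMETRIC_RELATIONS = {"spouse_of", "sibling_of"}


def implied_facts(fact):
    """Return the stated fact plus any fact it implies (symmetry only here)."""
    head, relation, tail = fact
    implied = {fact}
    if relation in SYMMETRIC_RELATIONS:
        implied.add((tail, relation, head))
    return implied


def implied_fact_score(fact, retrieved):
    """Fraction of the stated and implied facts that the model retrieved."""
    expected = implied_facts(fact)
    return len(expected & set(retrieved)) / len(expected)


# The model recalls the stated direction but misses the reverse one.
stated = ("Alice", "sibling_of", "Bob")
print(implied_fact_score(stated, [("Alice", "sibling_of", "Bob")]))  # 0.5
```

A model that only answers the direction it was prompted with gets half credit here, which is the kind of gap a single cloze query would not reveal.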

#KnowledgeGraphs #GenerativeAI #LLMs #NLP #NLProc #Paper

BenjaminHan, 8 months ago

7/ Their results show that even #GPT4 achieves only 23.7% hit@1 on average, even though it scores up to 50% precision@1 on the earlier proposed LAMA benchmark (screenshot). Interestingly, smaller models like BERT can outperform GPT4 on the bidirectional, compositional, and ambiguity benchmarks, indicating bigger is not necessarily better.
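
For readers unfamiliar with the metric, hit@1 is commonly computed as the share of queries whose top-ranked prediction is a correct answer. A minimal sketch under that common definition follows; the paper's exact protocol may differ, and the queries and answers below are toy data invented for illustration.

```python
def hit_at_1(ranked_predictions, gold_answers):
    """Share of queries whose first-ranked prediction is among the gold answers."""
    hits = 0
    for query, ranked in ranked_predictions.items():
        if ranked and ranked[0] in gold_answers[query]:
            hits += 1
    return hits / len(ranked_predictions)


# Toy data: each query maps to the model's candidates, best first.
predictions = {
    "capital_of(France)": ["Paris", "Lyon"],
    "capital_of(Australia)": ["Sydney", "Canberra"],
    "capital_of(Japan)": ["Tokyo"],
    "capital_of(Canada)": ["Toronto", "Ottawa"],
}
gold = {
    "capital_of(France)": {"Paris"},
    "capital_of(Australia)": {"Canberra"},
    "capital_of(Japan)": {"Tokyo"},
    "capital_of(Canada)": {"Ottawa"},
}

print(hit_at_1(predictions, gold))  # 0.5
```

Note that a correct answer ranked second earns nothing under hit@1, which is one reason scores can look low even when the model "knows" the fact.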

#KnowledgeGraphs #GenerativeAI #LLMs #NLP #NLProc #Paper

BenjaminHan, 8 months ago

8/ There are surely other benefits of using KGs to collect and organize knowledge. They do not require costly retraining to update, and can therefore be updated more frequently to remove obsolete or incorrect facts. They allow more traceable reasoning and can offer better explanations. They make fact editing more straightforward and accountable (think of GDPR) compared to model editing [3].

#KnowledgeGraphs #GenerativeAI #LLMs #NLP #NLProc #Paper

BenjaminHan, 8 months ago

9/ But LLMs can certainly help in bringing in domain-specific or commonsense knowledge in a data-driven way. In conclusion: why not both [4]? :-)

#KnowledgeGraphs #GenerativeAI #LLMs #NLP #NLProc #Paper
