persagen, to random
@persagen@mastodon.social avatar

Emergent properties of Large Language Models (LLM)

The Unpredictable Abilities Emerging From Large AI Models
Large language models like ChatGPT are big enough that they're displing startling, unpredictable behaviors
https://www.quantamagazine.org/the-unpredictable-abilities-emerging-from-large-ai-models-20230316
Discussion: https://news.ycombinator.com/item?id=35195106

Researchers at Google Brain showed how a model prompted to explain itself (a capacity called chain-of-thought reasoning) could correctly solve a math word problem, while the same model without that prompt could not.
...

persagen,
@persagen@mastodon.social avatar

Addendum 2

Comments: article placed here due to

  1. use of prompt engineering + chain of thought (mentioned above)
  2. application to long documents (here, applied to legal domain, but broadly applicable)
  3. novelty

Large Language Model Prompt Chaining for Long Legal Document Classification
https://arxiv.org/abs/2308.04138

persagen,
@persagen@mastodon.social avatar

Addendum 10

When Do Program-of-Thoughts Work for Reasoning?
https://arxiv.org/abs/2308.15452
https://github.com/zjunlp/EasyInstruct

  • reasoning capabilities of large language models pivotal in embodied AI
  • program-of-thought prompting for LLM uses programming language to tackle complex reasoning
  • e.g. mathematical reasoning; code data filtering
  • specific impact of code data on improvement of reasoning capabilities underexplored

davidak, to ai
@davidak@chaos.social avatar
  • All
  • Subscribed
  • Moderated
  • Favorites
  • megavids
  • thenastyranch
  • magazineikmin
  • tester
  • Leos
  • InstantRegret
  • rosin
  • Youngstown
  • mdbf
  • slotface
  • everett
  • osvaldo12
  • kavyap
  • DreamBathrooms
  • JUstTest
  • ethstaker
  • khanakhh
  • anitta
  • GTA5RPClips
  • tacticalgear
  • Durango
  • cubers
  • ngwrru68w68
  • cisconetworking
  • provamag3
  • normalnudes
  • modclub
  • lostlight
  • All magazines