LLMs don’t do formal reasoning - and that is a HUGE problem
Important new study from Apple
Gary Marcus (Marcus on AI)
Adam Tooze: Bidenomics is Maga for thinking people.
Facing war in the Middle East and Ukraine, the US looks feeble. But is it just an act?
The idea that all Biden is doing is trying to avoid a third world war isn’t convincing. Look closely and his foreign policy has been as radical as Trump’s, says history professor Adam Tooze
Adam Tooze (The Guardian)
Until then, you know, you just have to be true to yourself because, if you're not being true to yourself, you'll be living a lie.
In the end you've just got to remember that it is what it is and you've got to do what you've got to do. You gotta do your thing, you know?
So just be you but a you that's true to yourself while going with the flow and bossing it your way, all the way.
Most of all, be lucky.
Future antiquities researchers
Despite Chrystia Freeland’s denials, her grandfather was complicit in the Nazi genocide
A new book provides the most authoritative study of Mykhailo Chomiak and the history of Ukrainian Nazis in Canada
Peter McFarlane (The Breach)
3 New Wayland Protocols about to drop (commit & presentation timing) - Needed for 3rd Protocol (FIFO)
wayland/wayland-protocols!248
Needs 2 Acks + Review
wayland/wayland-protocols!320
Just got Completed:
wayland/wayland-protocols!256
FIFO Just got completed too.
A ‘dark period’ of repression: Jordanian authorities arrest thousands in year since October 7
Jordan has witnessed increasing popular protests expressing solidarity with Gaza and demanding an end to normalization with Israel. The Jordanian government has responded with an unprecedented crackdown on protests and free expression.
Synne Furnes Bjerkestrand (Mondoweiss)
New Age Weekly No 40. October 06–12, 2024
Japan’s new prime minister: Dreaming of an Asian version of NATO?
Japan's new prime minister envisions further militarizing the country and the region through a military alliance aimed at China and Russia.
Midori Ogasawara (rabble)
c-pipes: draw pipes in terminal window
gitlab.com/christosangel/c-pip…
This program, written in C, renders random coloured zigzag lines in the
terminal; the font, speed, density and number of lines are fully
customizable.
Each line stops once it reaches the edge of the window, only for
a new line to begin.
This program was inspired by this bash script:
github.com/pipeseroni/pipes.sh
Feel free to discover the endless possibilities of customization.
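If you are curious about the core trick rather than the full project, a minimal sketch of the idea with ncurses looks something like this. This is not c-pipes' actual code; the turn probability, delay and glyph are made-up values for illustration, and it only draws a single line.

```c
/* Minimal sketch (not the real c-pipes source): a random walk that
 * turns at random intervals and respawns when it leaves the window.
 * Build with: cc sketch.c -lncurses -o sketch   (press 'q' to quit) */
#include <curses.h>
#include <stdlib.h>
#include <time.h>
#include <unistd.h>

int main(void) {
    srand((unsigned)time(NULL));
    initscr();
    curs_set(0);            /* hide the cursor */
    nodelay(stdscr, TRUE);  /* make getch() non-blocking so 'q' quits */

    int y = LINES / 2, x = COLS / 2;
    int dy = 0, dx = 1;     /* current direction: start moving right */

    while (getch() != 'q') {
        mvaddch(y, x, ACS_BLOCK);
        refresh();
        usleep(20000);      /* drawing speed (illustrative value) */

        /* occasionally turn 90 degrees to get the zigzag effect */
        if (rand() % 5 == 0) {
            if (dx != 0) { dx = 0; dy = (rand() % 2) ? 1 : -1; }
            else         { dy = 0; dx = (rand() % 2) ? 1 : -1; }
        }
        y += dy;
        x += dx;

        /* the line stops at the window edge; a new one begins elsewhere */
        if (y < 0 || y >= LINES || x < 0 || x >= COLS) {
            y = rand() % LINES;
            x = rand() % COLS;
            dy = 0;
            dx = (rand() % 2) ? 1 : -1;
        }
    }
    endwin();
    return 0;
}
```

c-pipes adds colours, multiple simultaneous lines, and the configuration options mentioned above.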
On 8 October, the European Data Protection Board (EDPB) issued guidelines on the processing of personal data on the basis of Article 6(1)(f) of the EU General Data Protection Regulation (GDPR). This Note is a quick immediate response to the EDPB comments in that document relating to the processing of certain special categories of personal data that enjoy special protection under the GDPR, commonly referred to as “sensitive data”. Specifically, the EDPB appears to suggest that such data can be processed on the basis of the “legitimate interest” legal basis set out in Article 6(1)(f) of the GDPR, provided certain “additional conditions” for processing of sensitive data contained in Article 9(2) GDPR are met. In this note, I explain why this is not clear enough.
KORFF – GDPR – sensitive data and the legitimate interest legal basis – 241011
Estimated Russian army spending is between $85 and $105 billion USD. (This has likely skyrocketed since that estimate was made, as Russia has transitioned to a wartime economy.)
Chinese? ~$212-$230 billion USD.
Military spending is better put in the context of GDP, and actual spending is going to be very different from published or even estimated numbers. (It's likely much more, is what I am implying.)
I actually agree that this money is better spent on social welfare. It's a stupid situation across the board and many countries are guilty of this disparity.
For better or for worse, much of that money goes back into the overall economy of the country supplying the aid. Not all, but most. (This can get complicated due to the lifespan of specific types of munitions.)
What I am saying is that there is a ton of blame to pass around and poking at one country or another is an agenda, not a solution.
Brain’s waste-clearance pathways revealed for the first time
OHSU study uses imaging in neurosurgery patients to show how brain’s glymphatic system clears waste; lifestyle measures can keep system sharp.
OHSU News
Ukraine’s Best Fighting Vehicles Attacked Past Veseloe In Western Russia—And Got Caught In A Brutal Ambush
The Ukrainians lost at least one CV90, a Marder, an M-2 and a Stryker.
David Axe (Forbes)
Did it not seem strange to them that Russia had set up no defenses along that part of the border, where the land is sparsely populated and not especially strategic militarily? Not even minefields or anti-tank trenches, as elsewhere?
World’s oldest known (representational) artwork in Indonesian cave dated using lasers
Laser-induced imaging of radioactive elements was used to work out the age of an ancient cave painting on the Indonesian island of Sulawesi. The results reveal that the narrative scene is 51,200 years old, making it the earliest known example of representational art. This study challenges previous dating methods and suggests a deeper origin for human image-making and storytelling.
TL;DR or if you don't have access to the article: the researchers invented a faster, less-destructive and more-accurate rock art dating method & applied it to humanity's oldest known rock art in Sulawesi, Indonesia. The art is at least 51,200 years old (authors' lower estimate)!
Edit: contrary to what the news title originally stated, this is the oldest representational art, not the literal oldest human-created art.
The paper itself (open access): doi.org/10.1038/s41586-024-075…
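For anyone wondering how the age is actually obtained: the pigment itself is not dated directly; uranium-series dating is applied to the thin calcium-carbonate crusts that grew over the painting, which gives a minimum age for the art underneath. As background (this is the standard closed-system uranium-thorium relation assuming no initial ²³⁰Th, not necessarily the exact model fitted in the paper), the age t is obtained by solving:

$$\left(\frac{^{230}\mathrm{Th}}{^{238}\mathrm{U}}\right) = 1 - e^{-\lambda_{230} t} + \left[\left(\frac{^{234}\mathrm{U}}{^{238}\mathrm{U}}\right) - 1\right]\frac{\lambda_{230}}{\lambda_{230}-\lambda_{234}}\left(1 - e^{-(\lambda_{230}-\lambda_{234})t}\right)$$

where the isotope ratios are measured activity ratios and λ₂₃₀, λ₂₃₄ are the decay constants of ²³⁰Th and ²³⁴U.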
Cool.
Title might be a bit clickbait though.
It’s the oldest known representational art. Not the oldest known art.
For example, the carvings in the Blombos cave in South Africa are at least 75,000 years old.
Edit: Thank you for editing the title! That’s a pretty weird mistake by Nature; I thought they had high standards. Well, they have peer reviewed and approved some dodgy research in my field recently, so maybe I should be more skeptical.
I love hearing stuff like this. 51,000 years old is already insane. 75,000 years old is 24,000 years older than that. I can't even imagine 24,000 years older than today.
Why can't we get movies about this shit instead of another Marvel sequel? I want some scientifically accurate adventure about life in 73,000 BC.
Well the problem is we know very little. So a movie like that would be complete guesswork.
You might enjoy the YouTube channel "Stefan Milo" though. His videos are well sourced and have a lot of expert interviews. And he focuses on this kind of stuff.
m_f
in reply to ☆ Yσɠƚԋσʂ ☆ • • •
☆ Yσɠƚԋσʂ ☆
in reply to m_f • • •
m_f
in reply to ☆ Yσɠƚԋσʂ ☆ • • •
Gary Marcus should be disregarded because he's emotionally invested in The Bitter Lesson being wrong. He really wants LLMs to not be as good as they already are. He'll find some interesting research about "here's a limitation that we found" and turn that into "LLMS BTFO IT'S SO OVER".
The research is interesting for helping improve LLMs, but that's the extent of it. I would not be worried about the limitations the paper found for a number of reasons:
- There doesn't seem to be any reason to believe that there's a ceiling on scaling up
- LLMs' reasoning abilities improve with scale (notice that in the kiwi example they use, the answers they included came from o1-mini and llama3-8B, which are much smaller models with much more limited capabilities; GPT-4o got the problem correct when I tested it, without any special prompting techniques or anything)
- Techniques such as RAG and Chain of Thought help immensely on many problems
Until we hit a wall and really can't find a way around it for several years, this sort of research falls into the "huh, interesting" territory for anybody that isn't a researcher.
☆ Yσɠƚԋσʂ ☆
in reply to m_f • • •
Actually we do know that there are diminishing returns from scaling already. Furthermore, I would argue that there are inherent limits in simply using correlations in text as the basis for the model. Human reasoning isn't primarily based on language; we create an internal model of the world that acts as a shared context. The language is rooted in that model and that's what allows us to communicate effectively and understand the actual meaning behind words. Skipping that step leads to the problems we're seeing with LLMs.
That said, I agree they are a tool, and they obviously have uses. I just think that they're going to be a part of a bigger tool set going forward. Right now there's an incredible amount of hype associated with LLMs. Once the hype settles we'll know what use cases are most appropriate for them.
m_f
in reply to ☆ Yσɠƚԋσʂ ☆ • • •
The whole "it's just autocomplete" is just a comforting mantra. A sufficiently advanced autocomplete is indistinguishable from intelligence. LLMs provably have a world model, just like humans do. They build that model by experiencing the universe via the medium of human-generated text, which is much more limited than human sensory input, but has allowed for some very surprising behavior already.
We're not seeing diminishing returns yet, and in fact we're going to see some interesting stuff happen as we start hooking up sensors and cameras as direct input, instead of these models building their world model indirectly through purely text. Let's see what happens in 5 years or so before saying that there's any diminishing returns.
☆ Yσɠƚԋσʂ ☆
in reply to m_f • • •
I'm saying that the medium of text is not a good way to create a world model, and the problems LLMs have stem directly from people trying to do that. Just because autocomplete produces results that look fancy doesn't make it actually meaningful. These things are great for scenarios where you just want to produce something aesthetically pleasing like an image or generate some text. However, this quickly falls apart when it comes to problems where there is a specific correct answer.
Furthermore, there is plenty of progress being made with DNNs and CNNs using embodiment which looks to be far more promising than LLMs in actually producing machines that can interact with the world meaningfully. This idea that GPT is some holy grail of AI seems rather misguided to me. It's a useful tool, but there are plenty of other approaches being explored, and it's most likely that future systems will use a combination of these techniques.
JackGreenEarth
in reply to ☆ Yσɠƚԋσʂ ☆ • • •
Hazzard
in reply to JackGreenEarth • • •
I think it is a problem. Maybe not for people like us, who understand the concept and its limitations, but "formal reasoning" is exactly how this technology is being pitched to the masses. "Take a picture of your homework and OpenAI will solve it", "have it reply to your emails", "have it write code for you". All reasoning-heavy tasks.
On top of that, Google/Bing have it answering user questions directly, it's commonly pitched as a "tutor" or an "assistant", the OpenAI API is being shoved everywhere under the sun for anything you can imagine, and nobody is attempting to clarify its weaknesses in their marketing.
As it becomes more and more common, more and more users who don't understand that it's fundamentally incapable of reliably doing these things will crop up.
☆ Yσɠƚԋσʂ ☆
in reply to JackGreenEarth • • •
vrighter
in reply to ☆ Yσɠƚԋσʂ ☆ • • •
☆ Yσɠƚԋσʂ ☆
in reply to vrighter • • •
geekwithsoul
in reply to JackGreenEarth • • •
Letstakealook
in reply to ☆ Yσɠƚԋσʂ ☆ • • •
slacktoid
in reply to ☆ Yσɠƚԋσʂ ☆ • • •