Meter.net News Why can't ChatGPT calculate? The secret of mathematical errors revealed

Why can't ChatGPT calculate? The secret of mathematical errors revealed

Artificial intelligence writes poems but struggles with math. Why can't ChatGPT and other chatbots handle even basic arithmetic? We reveal the causes of AI's mathematical mistakes, from tokenization which breaks numbers into unintelligible fragments, to the statistical learning approach that fails in mathematics.

Why can't ChatGPT calculate? The secret of mathematical errors revealed

Artificial intelligence, including ChatGPT, can write poems, compose music, and translate texts. Yet, it often stumbles on simple mathematical tasks. Why can't a chatbot, that handles complex language tasks, deal with math at an elementary school level?

Tokenization: When numbers break into pieces

One of the key problems is tokenization. This process divides data into smaller parts, called tokens. Imagine it like assembling a puzzle, where words are broken down into syllables. The tokenizer, the AI model responsible for this process, does not understand the meaning of numbers.

It may happen that the number 380 is perceived as one token, while 381 is perceived as two (38 and 1). This disrupts the relationships between digits and complicates the calculation.

A statistical machine that falters with numbers

Another reason for ChatGPT's mathematical difficulties is its statistical nature. The chatbot learns based on a vast amount of examples and looks for patterns in them. For instance, it learns that the phrase "Dear Sir" is often followed by the phrase "we are reaching out to you".

However, this approach faces challenges in mathematics. ChatGPT can guess that the product of numbers ending in 2 will end in 4, but it cannot handle intermediate results. Simply put, the ChatGPT model tries to guess the result based on learned patterns instead of performing a precise calculation.

The challenge of multiplication

A study conducted by Yuntian Deng from the University of Waterloo showed that ChatGPT struggles with multiplying numbers greater than four digits. The reason is that any error in a calculation step shows up in the final result.

Imagine it as a domino effect – one error triggers a chain reaction, and the result is completely off. However, there is hope that ChatGPT will improve in the future. Deng and his colleagues also tested the o1 model from OpenAI, which is characterized by logical reasoning capabilities.

This model achieved significantly better results than the standard GPT-4o and was able to correctly solve multiplications of nine-digit numbers. The o1 model thinks through the problem step by step, allowing for more accurate results.

Artificial intelligence can write articles, analyze data, and generate videos. However, there are skills and professions that this technology cannot (yet) replace. In the article, we will look at where humans still have the upper hand, why AI needs human supervision, and which abilities will become increasingly valuable to develop in the digital world.

Does your video keep buffering or game lag, even though you pay for 'fast internet'? Perhaps you simply don't have the right one for what you do at home. What internet speed do you need for watching TV, streaming movies, working from home, or online gaming? Find a clear comparison in the article.

Phishing attacks often appear credible and convincing today. Thanks to artificial intelligence, they sound personal, mimic well-known companies, and can be hard to detect. In this article, you'll learn how modern phishing works, why you should watch out for it, and how you can effectively protect yourself from it.

Forgotten passwords, recycled combinations, and fear of data leaks. Traditional login methods have their weak points. A new way to verify identity, known as passkeys, aims to change that. Without the need to remember anything and with a focus on security. How do they work, who is already using them today, and why are they becoming the new standard?

Do you know edge computing? It is a method of data processing that shifts computing power closer to the data source. This significantly reduces latency and enables faster real-time decision-making. This technology is becoming key for IoT, autonomous vehicles, or smart cities and is gradually changing the internet as we know it. What does edge computing mean and how will it impact our future?

Digital identity is becoming an integral part of modern society. What does it reveal about us? How does it work today, and what might it look like in the future? From blockchain to virtual IDs – let's look at the technologies changing the way we verify ourselves and protect our privacy in the world.