Meter.net News Why can't ChatGPT calculate? The secret of mathematical errors revealed

Why can't ChatGPT calculate? The secret of mathematical errors revealed

Artificial intelligence writes poems but struggles with math. Why can't ChatGPT and other chatbots handle even basic arithmetic? We reveal the causes of AI's mathematical mistakes, from tokenization which breaks numbers into unintelligible fragments, to the statistical learning approach that fails in mathematics.

Why can't ChatGPT calculate? The secret of mathematical errors revealed

Artificial intelligence, including ChatGPT, can write poems, compose music, and translate texts. Yet, it often stumbles on simple mathematical tasks. Why can't a chatbot, that handles complex language tasks, deal with math at an elementary school level?

Tokenization: When numbers break into pieces

One of the key problems is tokenization. This process divides data into smaller parts, called tokens. Imagine it like assembling a puzzle, where words are broken down into syllables. The tokenizer, the AI model responsible for this process, does not understand the meaning of numbers.

It may happen that the number 380 is perceived as one token, while 381 is perceived as two (38 and 1). This disrupts the relationships between digits and complicates the calculation.

A statistical machine that falters with numbers

Another reason for ChatGPT's mathematical difficulties is its statistical nature. The chatbot learns based on a vast amount of examples and looks for patterns in them. For instance, it learns that the phrase "Dear Sir" is often followed by the phrase "we are reaching out to you".

However, this approach faces challenges in mathematics. ChatGPT can guess that the product of numbers ending in 2 will end in 4, but it cannot handle intermediate results. Simply put, the ChatGPT model tries to guess the result based on learned patterns instead of performing a precise calculation.

The challenge of multiplication

A study conducted by Yuntian Deng from the University of Waterloo showed that ChatGPT struggles with multiplying numbers greater than four digits. The reason is that any error in a calculation step shows up in the final result.

Imagine it as a domino effect – one error triggers a chain reaction, and the result is completely off. However, there is hope that ChatGPT will improve in the future. Deng and his colleagues also tested the o1 model from OpenAI, which is characterized by logical reasoning capabilities.

This model achieved significantly better results than the standard GPT-4o and was able to correctly solve multiplications of nine-digit numbers. The o1 model thinks through the problem step by step, allowing for more accurate results.

Meta introduced Orion, the world's most advanced AR glasses. They combine the appearance of regular glasses with augmented reality capabilities. With a holographic display and integrated AI, they open new possibilities for interaction with the digital world. In addition, they allow hands-free video calls, display messages, and provide real-time contextual information.

Children's safety on the internet is largely the responsibility of parents. Therefore, we have prepared a comprehensive guide to protecting children online. You'll learn how to communicate openly with them about risks, set boundaries, and protect their privacy.

A new study has revealed a worrying fact: Starlink satellites from SpaceX, intended to provide internet, produce disruptive radio emissions that threaten space observations by radio telescopes. Emissions from the second generation of Starlinks are up to 30 times stronger than the previous generation, posing a serious problem for astronomers.

Do you feel like constant scrolling is consuming you? Like you can't get anything done except spend time on your phone? There's no shame in this, but something needs to be done about it. We'll show you how to do an effective digital detox that will help you rediscover the magic of the offline world.

OpenAI has introduced a new series of AI models called o1, which promise a revolution in complex thinking and problem-solving. The o1 models excel in fields such as science, programming, and mathematics, achieving results comparable to those of doctoral students. OpenAI has also focused on safety and developed a new training approach to prevent AI misuse.

Think of the breathtaking vision of the future from the movie Blade Runner – holographic ads, cyberspace, and ubiquitous networks. Can this fiction become reality? Find out what we can expect from the internet of the future and how the boundaries between reality and the virtual world will blur.

Other language versions