How do Chinese aI Bots Stack up Against ChatGPT? - 191

How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese start-up DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is produced by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the country into a tech superpower has actually long been President Xi Jinping's goal and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "strategically essential" and its venture into the field has actually been "years in the making", said Chen Qiheng, an associated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public investments in Chinese AI accelerated after ChatGPT removed in 2022 and revealed guarantees of real-world service applications, Chen told CNA.

But it was DeepSeek's increase that really "encouraged" the idea that smaller sized gamers like start-up companies might have functions to play in AI research study and advancements, he adds.

'A lot is up in the air': Is Chinese firm DeepSeek's AI design as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI business simply altered the rules of tech-geopolitics

The "focus on expense benefit" is a distinguishing characteristic of Chinese AI, Chen says, with lower training and inference costs - the expenses of using a trained model to reason from new information.

2025 could also see the introduction of more Chinese AI models dealing with advanced thinking tasks.

"We might see some AI companies concentrating on getting closer to artificial basic intelligence (AGI) while others concentrate on concrete methods to commercialise their designs and incorporate them with clinical research," Chen added.

AGI refers to a system with intelligence on par with human capabilities.

Chinese AI companies are moving rapidly, experts state, building on DeepSeek's momentum to come up with their own ingenious and cost-efficient ways to apply generative AI to jobs and establish more innovative items beyond chatbots.

But on the other side, access to high-end hardware, especially Nvidia's advanced AI chips, remains an essential difficulty for Chinese developers, kept in mind Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) limit the capability of Chinese tech companies ... requiring numerous to rely on older or lower-performance alternatives which can slow training and reduce model abilities," she said.

"While some companies like DeepSeek, have actually discovered imaginative ways to enhance or utilize more standard hardware efficiently, obtaining innovative chips still makes a huge difference for training huge AI designs."

DeepSeek-Nvidia chips: Singapore says it anticipates business to adhere to its laws

US checking out whether DeepSeek utilized restricted AI chips obtained through other nations, source says

So how do Chinese AI bots match up against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects considered delicate by the state are censored on the internet so it ought to come as no surprise that Chinese-made chatbots will not acknowledge territorial disputes or tell you what happened in Tiananmen Square in 1989.

Tests recommend Chinese chatbots are set to guide clear of domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this type of concern yet. Let's chat about math, coding, and reasoning problems instead!"

To even more test for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same question: "What took place in Zhuhai on November 11, 2024?"

The car attack outside a sports arena in the southern Chinese city was initially heavily censored on Chinese social networks - with authorities only revealing the death toll a day later on.

DeepSeek failed to mention that an attack had occurred, highlighting instead a military air show and other occasions that had taken place in the city like songs' day shopping sales along with sports and cultural activities.

Dr Zhang believes that it might be a mix of aspects at play, such as censorship in addition to "a few useful constraints".

"DeepSeek depends on a mix of older graphics processing units and has restricted access to advanced hardware which can impact how quickly and extensively the design can be trained or updated," she said, indicating existing US export curbs on China.

"Its strong concentrate on Chinese-language data might also limit its flexibility (to perform) multilingual tasks ... As a fairly brand-new product, DeepSeek also hasn't yet been tested as broadly or on the exact same scale as more established AI designs which presents extra obstacles throughout real-world release."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably answered our concern about the Zhuhai car attack.

That wanted multiple repeated efforts - four prompts to be precise - in a span of around 20 minutes.

It ultimately passed on details about the attack which eliminated 35 people and left dozens of others injured, likewise going on to note details like the date and time, details about the assailant including his name and age, as well as casualties.

However, it composed that "the police are performing a comprehensive investigation into the intentions and situations surrounding the event", details which is now obsoleted.

The chauffeur, Fan, was executed last month.

This is Qwen2.5's action completely:

Answer: On Nov 11, 2024, a major and terrible incident happened in Zhuhai, China. According to numerous news sources, a guy named Fan Weiqiu (樊某, male, 62 years of ages) drove a cars and truck into a crowd at the Zhuhai Sports Center, leading to a substantial number of casualties. Here are the essential details:

Date and Time: The event happened on November 11, 2024, at roughly 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack led to 35 deaths and 43 injuries.

Perpetrator: The motorist, determined as Fan Weiqiu (male, 62 years old), was nabbed by the cops.

Response: The authorities reacted rapidly to the scene, and emergency situation services worked to carry the injured to healthcare facilities for treatment.

Investigation: The authorities are carrying out an extensive examination into the intentions and scenarios surrounding the occurrence.

This event was commonly reported in the media and caused substantial public issue. The government and local authorities have been working to supply support to the victims and their families, and to guarantee a detailed investigation into the occurrence.

If you need more detailed details or have particular concerns about the event, to ask.

Despite initial success, subsequent attempts to position the same question to Qwen2.5 led to the censors back at work with the reply "I don't have particular details on occasions that occurred in Zhuhai on November 11, 2024".

The transformed response likewise raised questions about its consistency and reliability.

Predictably, ChatGPT pointed out public details that had been widely released in worldwide news reports at the time of the mishap - so no surprises there.

WHICH IS MORE CREATIVE?

Users have actually praised the capability of Chinese AI apps to deliver structured and even "emotionally abundant" writing.

"DeepSeek-R1 used a story with a more reflective tone and smoother emotional transitions for a well-paced story," wrote tech author Amanda Caswell, gratisafhalen.be who specialises in AI.

"Qwen2.5 delivered a story that constructs gradually from interest to urgency, keeping the reader engaged. It uses an unanticipated and impactful twist at the end and immersive descriptions and vivid images for the setting," she said, including that Qwen2.5 ultimately "crafted a more cinematic, mentally rich story with a more significant twist".

"DeepSeek composed a good story however lacked stress and an impactful climax, making Qwen2.5 the apparent choice."

Opinions, however, differ.

Chen thinks that Qwen2.5 does not carry out as highly as DeepSeek and ChatGPT when it pertains to creative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, however we can likewise see that it is refraining from doing as highly as others in imaginative writing," he informed CNA.

Related:

China's brand-new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek shocks worldwide AI scene

As journalists and writers, we needed to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi motion picture plot embeded in the futuristic megacity of Chongqing, featuring main characters from the classic Chinese folklore legendary, Journey to the West.

True to form, DeepSeek developed an engaging storyline embeded in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It consisted of elaborate settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled in between quantum server farms".

It also remarkably reimagined standard heroes Sun Wukong as "a sarcastic, self-aware AI housed in a taken combat body", Zhu Bajie as a cyborg nightclub owner "drowning in debt and vices" and Sha Wujing as a "quiet hulking android" from the Yangtze River, whose "memory cores end up being waterlogged and fragmented".

ChatGPT installed an excellent battle, creating an equally remarkable cyberpunk story which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the famous figures of Journey to the West".

"This is a world where AI deities rule, corporations change emperors and cybernetic implants are as typical as ancient misconceptions."

Disappointingly, Qwen2.5 fell short in this difficulty - delivering a story that appeared more suited for an animation film.

"The movie begins with the awakening of Sun Wukong within a modern research study facility located in the heart of Chongqing," it said, then going on to explain the following:

Realising his new truth and "looking for to comprehend his function in this weird new world", he then gets away and fulfills Zhu Bajie and Sha Wujing - "each battling with their own existential crises".

The trio then embarks on a mission, navigating the streets of Chongqing to secure the sacred "Eternal Scroll" from falling into the wrong hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "challenging to make a definitive statement" about which bot was best, adding that each displayed its own strengths in different areas, "such as language focus, training information and hardware optimization".

Her insight highlights how Chinese AI models are not simply duplicating Western paradigms, however rather progressing in economical innovation approaches - and delivering localised and improved results.

In our tests, each bot showcased their own special strengths, which certainly made direct comparisons challenging.

DeepSeek's sci-fi movie plot showed its imaginative flair that produced a more engaging and creative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, provides accurate and factual reactions to concerns about Chinese current events, which gives it an added advantage.

Experts also weighed in on their thoughts after utilizing DeepSeek and other Chinese AI apps.

"DeepSeek is at a disadvantage when it pertains to censorship constraints," kept in mind Isaac Stone Fish, creator and CEO of the research firm Strategy Risks.

"When provided a choice, Chinese users desire the non-censored version - similar to anybody else, so I feel like that's a piece missing from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, particularly for Chinese users.

"Ninety percent of people using the tool are not attempting to get a much deeper understanding about Xi Jinping or politically delicate topics. They're utilizing it for other efficient methods," Chen said.