Intellifusion Bets Big on Inference AI Chips as Demand Surges, Eyes 2026 Launch of Next-Gen Architecture

TMTPOST -- Intellifusion, one of China’s earliest AI chip developers, is making a strategic pivot toward AI model inference—betting that the era of training dominance is giving way to inference-led growth in computing demand.

The Shenzhen-based firm, listed on Shanghai’s STAR Market (688343.SH), unveiled its latest suite of inference-focused products on July 25, ahead of the 2025 World Artificial Intelligence Conference. Among them is the DeepQiong X6000 Mesh inference accelerator card, which delivers 256 TOPS of compute and is optimized for high-throughput workloads such as decoding 256 video streams in real time and serving large models with hundreds of billions of parameters.

Intellifusion’s new all-in-one servers—Shenmu 6203 (2U), Tianzhou 6408 (4U), and Tianzhou 680G (8U)—extend this performance into data centers and edge environments, delivering up to 4 PFLOPS of inference capacity. CEO Chen Ning says these products mark a turning point for the company, which is now “fully committed” to inference computing chips after 11 years of neural processing unit (NPU) development.

“2025 will be a defining year for AI. Large models are maturing, costs are falling, and inference is about to outpace training in both growth and application,” Chen told TMTPost.

AI development is typically divided into two stages: training, which demands massive datasets and compute, and inference, where trained models are deployed to solve real-world problems. As AI adoption broadens—from chatbots to autonomous vehicles—cloud-based inference is quickly taking center stage.

According to IDC, cloud-based inference accounted for 58.5% of AI computing power in 2022 and is projected to hit 62.2% by 2026. AMD CEO Lisa Su forecasts AI inference compute demand will grow over 80% annually—potentially surpassing training as the primary driver for data center expansion.

“The inference chip market remains a blue ocean,” Chen said. “While the training chip sector is worth hundreds of billions, inference is just beginning. We believe it will outpace training within five years.”

At the heart of Intellifusion’s new offerings is the DeepQiong X6000 Mesh accelerator card, powered by the firm’s self-developed fourth-generation NPU optimized for Transformer-based models. The card uses a D2D (die-to-die) Chiplet design and C2C (chip-to-chip) mesh architecture—an innovation in China’s AI chip ecosystem. Intellifusion claims it is the first company to mass-produce such chips using fully domestic fabrication and packaging processes.

Complementing the chip, Intellifusion is rolling out inference servers and integrated machines for data centers and smart city deployments. Customers include municipal computing centers, telecom carriers, research institutes, and major Chinese internet firms.

“The DeepSeek all-in-one machines break the ‘last mile’ in closed-loop AI deployment,” Chen said, adding that the cooling of AI hype is not a retreat but a rational shift toward real-world use cases.

Intellifusion’s shift is already showing results. The company reported 2024 revenue of more than 900 million yuan ($124 million), up 81.3% year-on-year. Q1 2025 revenue surged 168.2% to 264 million yuan, a record for the period.
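As a rough check on these growth figures, the prior-period revenue they imply can be backed out by dividing by one plus the growth rate. The sketch below is illustrative only: the function name is hypothetical, and 2024 revenue is treated as exactly 900 million yuan although the company reported "more than" that amount, so the 2023 result is a floor rather than a reported number.

```python
def implied_prior(current: float, growth_pct: float) -> float:
    """Back out the prior-period value from a current value and its growth rate."""
    return current / (1 + growth_pct / 100)


# Figures from the article, in millions of yuan.
# Assumption: 2024 revenue is taken as exactly 900, its stated lower bound.
fy2023_floor = implied_prior(900, 81.3)
q1_2024 = implied_prior(264, 168.2)

print(f"Implied 2023 revenue: at least ~{fy2023_floor:.0f} million yuan")
print(f"Implied Q1 2024 revenue: ~{q1_2024:.1f} million yuan")
```

On these inputs, the implied base is roughly 496 million yuan for 2023 and about 98 million yuan for Q1 2024.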

A deal with Deyuan Fanghui to provide 4,000 PFLOPS in inference compute over three years is expected to contribute 1.6 billion yuan in revenue. Payments began in early 2025, with roughly 200 million yuan booked in the first half.
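The deal figures allow a quick back-of-the-envelope calculation of what the contract implies per unit of compute. The sketch below uses only the numbers quoted above; the variable names are hypothetical, and the even annual split is an assumption for illustration, since the article does not describe the actual payment schedule.

```python
# Figures quoted in the article.
total_revenue_yuan = 1_600_000_000   # expected revenue over the contract
total_pflops = 4_000                 # inference compute to be provided
contract_years = 3
booked_h1_yuan = 200_000_000         # "roughly 200 million yuan" booked so far

# Implied revenue per PFLOPS of contracted compute.
yuan_per_pflops = total_revenue_yuan / total_pflops

# Assumption: revenue spread evenly across the three years (not stated in the article).
avg_annual_yuan = total_revenue_yuan / contract_years

# Share of the expected total already booked.
booked_share = booked_h1_yuan / total_revenue_yuan

print(f"Implied revenue per PFLOPS: {yuan_per_pflops:,.0f} yuan")
print(f"Average annual revenue (even split): {avg_annual_yuan / 1e6:.0f} million yuan")
print(f"Share booked so far: {booked_share:.1%}")
```

Under these assumptions the contract works out to about 400,000 yuan per PFLOPS, with one-eighth of the expected total booked so far.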

On the consumer side, Intellifusion is seeing strong uptake of its Qiancheng AI technologies in wearables, supplying Huawei, Honor, and OPPO, while its “Dr. Luka” hardware line continues to gain traction. The company expects 50%+ growth in its consumer business in H1 2025.

Looking ahead, Intellifusion is preparing to launch its next-generation inference chip architecture—“Computing Power Building Blocks 2.0”—by late 2026, featuring:

  • Nova500 NPU: Native FP8/FP4, custom operators for large models, 5× compute efficiency, 3× energy efficiency.

  • 3D Hybrid Bonded Memory: 10× bandwidth and memory efficiency.

  • NB-Mesh interconnect: Full-mesh, all-reduce, memory semantic access.

  • Advanced packaging: Heterogeneous die, UCIe D2D chiplets.

  • NB-Link: PCIe interface with CPU-NPU shared memory access.

CTO Li Aijun says the upgrades will support embedded, edge, and cloud inference for mixture-of-experts (MoE) models and edge-scale large models.

Founded in 2014, Intellifusion has invested heavily in edge computing chips and has already shipped five generations of NPUs. In 2023, it launched its DeepEdge10 platform, targeting scenarios from IoT to intelligent computing centers.

Now, the company is placing its biggest bet yet on inference.

“Most inventions in the U.S. stay in labs,” said Chen. “But in China, the value is in large-scale implementation. AI inference chips will become the core infrastructure enabling AI to reshape all hardware—from glasses to robots—over the next five years.”

Chen believes that by linking data, algorithms, and chip development through China’s vast application scenarios, Intellifusion can drive a “data flywheel” of continuous innovation. He sees AI inference chips as China’s opportunity to gain a foothold in the Fourth Industrial Revolution.

“Our biggest asset isn’t chips. It’s our team,” he said. “With the right DNA, we’ll overcome challenges—from supply chains to ecosystems—and continue building a globally competitive inference chip company.”

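This article was published on TMTPost with the authorization of its author, zhangxinyue, and edited by TMTPost. When reposting, please credit the source and author and include a link to this article.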