Data Centre Build News & Insights


Industry recognises International Data Centre Day
In celebration of International Data Centre Day (25 March), DCNN has brought together a series of expert perspectives from across the industry to reflect on the opportunities and challenges facing data centre infrastructure today. From soaring energy demands driven by AI workloads to the shifting geography of deployment and the importance of community stewardship, these industry figures offer a broad view of the forces reshaping the sector and what must be done to meet them responsibly: The future of deployment Ivo Ivanov, CEO of internet exchange operator DE-CIX, comments, "The AI boom used to be about building bigger, better data centres. That works for model training, where raw compute power is prioritised over connectivity and latency, but it doesn't work for inference, which is where the real value of AI is realised at edge deployments. "AI inference demands near real-time responsiveness and, as such, it can't tolerate the latency associated with long round-trip delays to remote data centre hubs. From driverless vehicles to real-time fraud detection, the deployment of AI in edge locations demands a far more distributed infrastructure environment than what we're currently used to. "AI depends on data moving constantly between locations, and if that movement slows down, performance drops off quickly, no matter how much compute you add into the equation. It's no longer about building bigger or better data centres, but about where they're deployed and how they're connected. In other words, geography is becoming just as important as power and compute. "In the coming years, that's going to stretch the definition of data centres: from hyperconnected edge deployments and emerging AI-focused data centre hubs to more experimental concepts that sit above the clouds in the Earth's orbit. The data centre map is being redrawn in real time." Sustainability continuing as a vital factor Professor Aoife Foley, IEEE Senior Member and Chair in Net Zero Infrastructure at the University of Manchester, notes, "The International Energy Agency expects data centre electricity demand to more than double by 2030, reaching around 945 terawatt-hours, slightly above Japan's annual use. This not only indicates a greater demand for energy, but also highlights the importance of balancing digital growth with sustainable energy practices. "Infrastructure and operations leaders have a responsibility here and need to consider the unnecessary waste associated with data storage and commit to generating power from more renewable sources. By eliminating unstructured data and improving operational efficiency, organisations can actually lower their maintenance costs and improve regulatory compliance. "AI workloads consume significantly more energy than traditional cloud computing tasks, and although hyperscale operators are investing in renewable energy to soften the impact, this alone is not enough. Cooling innovations such as liquid immersion and direct-to-chip systems add further efficiencies, yet they still address symptoms rather than the deeper inefficiencies in model design and compute intensity. These impacts can be reduced through smarter model optimisation and a closer alignment between data centre strategy and regional renewable generation." Infrastructure integrated into communities Dave Philp, Chief Value Officer at infrastructure engineering software and digital twin platform provider Bentley Systems, concludes, "International Data Centre Day is a moment to recognise the often-unseen critical infrastructure that underpins modern life. Like healthcare, transport, and energy systems, data centres have become essential civic infrastructure in an increasingly connected world. "Today, data centres are no longer isolated technical buildings; they are part of the fabric of local communities. Being a 'good neighbour' is no longer optional; it is fundamental to earning trust, securing consent, and operating sustainably over the long term. That requires a clear understanding of how data centres interact with local water resources, energy networks, transport infrastructure, and surrounding land use. "We need to put digital modelling, analysis, and visualisation at the heart of how we plan, design, deliver, and operate these facilities. This enables better decisions, earlier engagement, and more resilient outcomes, anticipating construction challenges, responding to environmental constraints, and embedding operational resilience from day one. "As global demand for digital services continues to accelerate, the challenge is not simply to build more data centres, but to build them better. Thoughtful design, digital insight, and a commitment to community and environmental stewardship are what turn a data centre from a functional box into a trusted, long‑term asset that genuinely serves society."

Ramon.Space, Foxconn to deliver space DC infrastructure
Ramon.Space, a developer of computing systems for satellites and space missions, has expanded its partnership with cloud infrastructure provider Ingrasys, a subsidiary of Foxconn Technology Group, to develop data centre infrastructure for use in orbit. The collaboration focuses on scaling computing platforms designed for space, which comes as demand reportedly grows for processing data generated by satellites. Traditional Earth-based infrastructure, the companies state, faces constraints including latency, bandwidth, and power availability, particularly when handling large volumes of space-generated data. Space-based data centres, they say, aim to address these challenges by enabling processing and storage directly in orbit, reducing reliance on data transmission back to Earth and supporting real-time applications. Ramon.Space develops computing, storage, and communications systems designed to operate in space environments. These systems are engineered to manage challenges such as radiation exposure, power efficiency, thermal conditions, and autonomous operation. Ingrasys has previously supported the manufacturing of Ramon.Space’s computing platforms. Under the expanded agreement, the companies will work towards a production-ready product line, using Ingrasys’ manufacturing capabilities to support scale and consistency. Scaling infrastructure for space-based data Avi Shabtai, CEO of Ramon.Space, comments, “Expanding our work with Ingrasys to build data centre capabilities in space marks a major milestone in turning space computing from innovation into infrastructure. “We are laying the foundation for in-orbit data centers that enable real-time processing, reduce reliance on downlink, and support the next generation of software-defined satellites and distributed systems.” Benjamin Ting, CEO of Ingrasys, adds, “Ingrasys is proud to extend our collaboration with Ramon.Space into the in-orbit data centre domain. “By combining world-class manufacturing with Ramon.Space’s innovative computing platform, we are helping unlock a new era of space infrastructure.” The initiative is intended to complement terrestrial data centres and cloud infrastructure, while supporting applications where processing in orbit is more efficient. These include Earth observation, communications, and government-led space programmes. The programme will begin with prototype development and testing, with plans to expand towards operational deployments as space-based data infrastructure develops.

Duos Edge AI expands Amarillo data centre footprint
Duos Technologies Group, through its subsidiary, Duos Edge AI, a provider of edge data centre (EDC) systems, has deployed a second EDC in the Amarillo, Texas market. The carrier-neutral facility is located on land in Potter County, adjacent to a major colocation site in the Texas Panhandle. It is designed to support regional demand for low-latency computing, including AI applications, enterprise workloads, and public sector services. The deployment builds on the company’s initial Amarillo site and forms part of a wider expansion across Texas, with additional locations in Lubbock, Waco, Victoria, Abilene, and Corpus Christi. The facility is designed to provide local processing capacity, reducing reliance on data centres in larger metropolitan areas and supporting improved network performance. Regional expansion and edge infrastructure strategy Duos Edge AI says the site will deliver high-density computing, increased bandwidth availability, and secure data processing capabilities for organisations operating in the region. The expansion reflects a broader strategy to develop edge infrastructure in underserved and high-growth markets. Dave Irek, Chief Operations Officer at Duos Edge AI, says, “This expansion enhances capacity and capability in the region. We are creating a robust, carrier-neutral ecosystem designed to support innovation, attract investment, and drive long-term economic growth.” Potter County Judge Nancy Tanner adds, “This collaboration with Duos Edge AI represents a significant investment in our community's future. [It] will attract new businesses, improve connectivity for our residents and schools, and position Potter County as a leader in digital infrastructure.” The new facility is expected to become operational in the coming months. For more from Duos Edge AI, click here.

Schneider, NVIDIA to advance AI data centre design
Global energy technology company Schneider Electric has expanded its collaboration with NVIDIA to develop validated designs and digital tools for large-scale AI data centres. Working alongside AVEVA, the companies outlined new developments in designing, simulating, building, and operating AI infrastructure during NVIDIA GTC in San Jose, USA. These include a reference design for NVIDIA’s latest rack-scale systems, integration of digital twin capabilities, and testing of AI-driven tools for managing data centre alarms. The announcements focus on supporting large-scale AI deployments, sometimes referred to as “AI factories”, with an emphasis on power, cooling, and operational efficiency. Reference design and digital twin integration A new reference design has been developed for NVIDIA’s Vera Rubin NVL72 rack architecture, covering power distribution and cooling requirements. The design supports higher supply voltage, improved thermal efficiency, and clustered rack configurations for AI workloads. It has been validated using electrical system and airflow modelling tools to assess performance before deployment. In parallel, AVEVA has introduced a lifecycle digital twin architecture integrated within the NVIDIA Omniverse environment. This enables simulation of power, cooling, and operational conditions, allowing operators to test and refine designs prior to construction. According to the companies, this approach is intended to reduce design cycles, improve accuracy, and support more efficient deployment of AI infrastructure. Manish Kumar, Executive Vice President, Secure Power & Data Centers at Schneider Electric, comments, “As AI workloads scale in both size and complexity, the margin for error in data centre design becomes incredibly small. “Delivering AI at scale requires tightly integrated electrical, cooling, and digital architectures that can support both unprecedented performance demands while maintaining peak energy efficiency. "By combining advanced software, digital twins, and validated reference designs, operators can simulate and optimise infrastructure before a single rack is deployed. This approach reduces risk, accelerates deployment, and ensures the efficiency and resilience needed to power the next generation of AI factories.” Vladimir Troy, Vice President of AI Infrastructure at NVIDIA, adds, “Gigawatt-scale AI factories demand a fundamentally new class of energy-efficient and highly predictable infrastructure. “Together, NVIDIA and Schneider Electric are providing the power, cooling, and digital twin architectures needed to accelerate time-to-token for our customers worldwide.” AI-based alarm management testing Schneider Electric also confirmed testing of an AI-based alarm management capability using NVIDIA Nemotron models. The system analyses real-time data from multiple sources to identify root causes of issues and recommend corrective actions. The aim is to support data centre operators in resolving incidents more quickly and consistently, while reducing unnecessary maintenance activity. The latest developments build on ongoing collaboration between the companies, including work on digital twin environments, power system modelling, and support for higher-voltage data centre architectures. For more from Schneider Electric, click here.

Nscale, Microsoft partner on large-scale campus in West Virginia
Nscale, a UK developer of AI data centres and cloud infrastructure, has signed a letter of intent with Microsoft to deliver 1.35GW of AI compute capacity at the Monarch AI campus in West Virginia, in collaboration with NVIDIA and Caterpillar. The development will deploy NVIDIA’s next-generation Vera Rubin NVL72 GPU systems, based on the NVIDIA DSX AI Factory reference design, with the undertaking expected to begin in phases from late 2027. In addition to this news, Nscale has also announced the acquisition of American Intelligence & Power Corporation (AIPCorp), which includes the Monarch Compute Campus in Mason County. The site spans up to 2,250 acres (9.1 km²) and is designed as a state-certified AI microgrid, with the potential to scale beyond 8GW of power capacity. Hyperscale AI infrastructure and power integration Under the agreement, Nscale will construct and operate the data centre infrastructure, with Microsoft supporting long-term compute services and lease arrangements. The campus is intended to support large-scale AI training and inference workloads, with high-speed connectivity to major US data centre hubs, including Ashburn and Chicago. As part of the project, Caterpillar will supply G3500 series natural gas generator sets, with plans to deliver up to 2GW of on-site power generation by the first half of 2028. The microgrid design enables the facility to operate independently of the local grid, while also allowing for potential future grid integration. The development reflects increasing demand for AI-driven data centre capacity, with industry forecasts indicating significant growth in global power requirements over the coming years. The Monarch campus is expected to build on Nscale’s existing capacity and support expansion of large-scale AI infrastructure in the US. For more from Nscale, click here.

Why DC-powered lighting matters for modern data centres
In this exclusive article for DCNN, Ton van de Wiel, Global Segment Manager, Intelligent Buildings at Signify, outlines why DC-powered LED lighting is emerging as a key consideration in making data centre infrastructure more efficient and resilient: Building resilience from the ground up The digital services that underpin modern economies – from media streaming to cloud computing – depend on a rapidly expanding global network of data centres. These facilities are not only critical to digital connectivity; they represent significant sources of employment, infrastructure investment, and tax revenue through construction and long-term operation. Today, data centre operators face a convergence of challenges. Capacity requirements are accelerating due to AI-driven workloads, energy prices are rising, and expectations around sustainability and carbon reduction are becoming more stringent. In response, the industry is re-examining its electrical infrastructure. Direct current (DC) power architectures, once limited to niche applications, are gaining traction as a foundation for higher efficiency and greater operational resilience. Within this shift, lighting – often treated as a peripheral system – can play a strategic role. DC-powered LED lighting combines high energy efficiency with relatively low implementation risk, making it an effective starting point for broader DC adoption. Beyond energy savings, lighting can also function as an intelligent layer within next-generation data centre infrastructure. How power architectures are changing Operating a data centre requires tight coordination between IT equipment, networking, cooling, security, and electrical distribution. Historically, alternating current (AC) has been the default for power distribution. However, as facilities' scale and power densities increase, electrical efficiency has become a primary design concern. Early facilities relied on 48V DC for backup systems – safe but capacity-constrained. This gave way to 230/277V AC distribution, followed by 380V DC for internal systems. Today, the extreme power demands of AI servers are driving another transition towards 650V DC and even 800V DC architectures. According to the Open Direct Current Alliance (ODCA), 650V DC represents the optimal level for building-wide distribution, balancing efficiency with safety, while organisations such as NVIDIA and the Open Compute Project are investigating 800V DC. Despite promising high-power IT loads, these higher voltages do not yet deliver the same system-wide efficiency benefits as a facility-level 650V DC approach. Outside the data centre sector, industrial sites are already deploying 650V DC systems to improve energy efficiency and resilience. One key advantage is the ability to capture regenerative energy from motor drives and robotics – energy that would otherwise be dissipated as heat. Because lighting is a continuous base load, it can readily absorb this recovered energy, reducing grid dependency and operating costs. Integrating lighting, motors, renewables, and storage on a shared DC grid reduces conversion losses, cuts copper usage through fewer conductors, and lowers transmission losses compared with 400V AC systems. When paired with solar PV and batteries, DC grids also improve self-consumption, backup capability, and flexible energy management. What’s driving the move? The momentum behind DC power in data centres is rooted in both engineering logic and economics: • Lower conversion losses — Conventional AC systems require multiple conversion steps, resulting in energy losses of up to 18%. • Alignment with IT equipment — Servers and GPUs operate natively on DC power. • Simpler renewable integration — Solar panels and battery systems produce DC, enabling more efficient connections. • Reduced system complexity — Fewer transformers and rectifiers mean simpler installation and improved reliability. • Preparedness for AI growth — Rising AI workloads are accelerating the shift towards DC-based power systems. DC power is therefore not just an alternative distribution method, but a pathway to smarter, more resilient infrastructure. Lighting as the first step Among all building systems, lighting is often the most practical candidate for early DC adoption. Connected LED lighting allows operators to pilot DC distribution with limited risk before extending it to mission-critical IT loads. The benefits are tangible: • Capital expenditure savings — DC lighting cables reduce copper use by 40%. Three-conductor DC cables (L+, L-, PE) can transmit the same power as five-conductor 400V three-phase AC cables. • Operational cost reductions — With only two current-carrying conductors, DC lighting avoids approximately 33% of cable losses compared with three-phase AC at the same current. • Improved resilience — DC lighting can operate directly from on-site solar generation or battery storage, strengthening microgrid performance during outages. DC-compatible luminaires and components are already commercially available. For example, Signify offers a 100W Xitanium LED driver designed for 620–750V DC operation, integrated into the Pacific LED Gen5 and Maxos Fusion luminaire families. These solutions achieve up to 165lm/W efficacy and can be paired with systems such as Signify Interact and Philips Dynalite. Driver-level efficiency can exceed 95%, with future potential to reach 200lm/W through ultra-high-efficiency LED modules. Sustainability and ESG outcomes DC-powered lighting supports measurable sustainability objectives: • Lower carbon emissions through reduced conversion losses and material usage • Support for certifications such as LEED Zero and BREEAM • Energy optimisation with connected lighting systems, cutting lighting energy use by up to 75% For hyperscalers like Amazon Web Services and Microsoft Azure, as well as colocation providers, these outcomes translate directly into stronger ESG reporting and progress towards carbon neutrality. DC lighting can also be implemented incrementally. Some facilities deploy rack-level DC lighting while retaining an AC backbone. Others adopt facility-wide DC grids that integrate lighting, renewables, storage, and IT infrastructure. In larger deployments, centralised emergency lighting connected to the DC backbone ensures continuous illumination during outages, reinforcing safety in mission-critical spaces. A strategic role for lighting As operators prepare for the next phase of digital expansion, DC-powered lighting offers a practical, high-impact entry point into efficient, renewable-ready DC infrastructure. Modern connected lighting systems extend far beyond illumination. With embedded sensors measuring occupancy, daylight, temperature, humidity, and air quality, luminaires form a dense, facility-wide sensing network without the need for additional hardware. Using open protocols such as DALI, BACnet, and MQTT, DC lighting networks integrate with building management systems and DCIM platforms, enabling predictive maintenance, enhanced operational intelligence, and optimised cooling and space utilisation. By simplifying cabling, reducing losses, and enabling intelligent energy management, DC lighting transforms illumination from a passive load into an active contributor to resilient, sustainable data centre operations.

Pure DC, AVK deploy 'Europe’s first' data centre microgrid
Pure Data Centres Group (Pure DC), a designer, developer, and operator of hyperscale data centres, together with AVK, a provider of power systems and electrical infrastructure for data centres, have announced the launch of what they describe as Europe’s first, large-scale, 110MW on-site microgrid, developed to support early‑phase site operational resilience. Located within Pure DC’s Dublin campus, the on‑site energy system provides the opportunity for dispatchable capacity to support data centre operations during initial development phases, prior to full integration with the national electricity system as grid connection capacity becomes available. Over time, the campus is intended to operate as part of a hybrid energy configuration, combining grid‑supplied electricity with on‑site infrastructure designed to enhance flexibility, resilience, and system stability. What AVK describes as a "first-of-its-kind deployment in Europe" showcases the ability to use its microgrid technology for on-site power generation, and the transitional and complementary role it can play in supporting the delivery of strategically important digital infrastructure. This is particularly relevant for regions where grid reinforcement and renewable generation are being delivered on a phased basis under national planning frameworks. A replicable model The microgrid also represents a blueprint for energy generation and showcases how large-scale microgrids can be replicated across Europe - with Germany, the Netherlands, and the UK having been identified as key target markets for the technology. The Mayor of Fingal County Council, Councillor Tom O'Leary, comments, “Fingal wants to remain a champion for breakthrough technologies, but we also understand that progress must be delivered in a way that is climate friendly, resilient, and aligned with Ireland’s energy transition. That’s why this project is so important. "A microgrid that can generate and manage its own power supports future integration into the national grid, integrates renewable energy, enables storage, and trials new low‑carbon fuels like biomethane. This is innovation with purpose.” Gary Wojtaszek, Pure DC’s Executive Chairman and interim CEO, notes, “The biggest barrier to deploying AI infrastructure in Europe today isn’t technology; it’s power. This microgrid proves that even the most constrained markets can unlock new digital capacity, giving Ireland the opportunity to lead Europe’s next chapter of AI infrastructure. "The future of AI infrastructure will be built where energy and compute come together, and that’s exactly what we’re building at Pure.” Speaking about the project, Ben Pritchard, CEO of AVK-SEG, adds, “We are delighted to have worked with Pure DC to deliver this groundbreaking project. While several microgrids are already in operation in the US, until today there were none of these deployments in Europe. This project demonstrates how carefully designed onsite energy infrastructure can complement national energy planning frameworks. “This recognises that power is now the new differentiator for data centres, and that energy has shifted from being a utility to a strategic asset - shaping the location, design, economics, and competitiveness for operators. "The first of many in Europe, this microgrid has the capability to revolutionise the data centre power race as we know it, providing a complementary solution that will ease gridlock and pave the way for greater take-up of AI and cloud.” Powering the digital economy Pure DC’s microgrid is comprised of three, interconnected energy centres, with each building generating up to 30 MW of power. Energy Centre 1 (EC1) and EC2 will be fully operational by the end of 2026 and will be followed by EC3 at a later stage. The design includes combined heat and power (CHP) capability, with infrastructure in place to enable heat recovery and potential future connection to district heating networks, subject to third‑party demand and regulatory approvals. Waste heat recovery systems are also used to improve operational efficiency within the energy centres. Future water management measures include rainwater harvesting and on‑site treatment, reducing reliance on mains water for engine‑related processes. The system is engineered to accommodate incremental changes in fuel composition - including hydrogen blending - supporting future decarbonisation of the gas network in line with national policy developments. Pure DC’s battery energy storage system (BESS) is integrated to manage load fluctuations and enhance operational efficiency, improving response times and enabling more optimal engine operation. The BESS is designed to support future renewable energy integration as part of a broader transition pathway. For more from Pure DC, click here, and for more from AVK, click here.

Legrand's UPS wins Data Centre World award
French multinational infrastructure products manufacturer Legrand’s Keor FLEX modular uninterruptible power supply (UPS) has won the Best Reuse or Recycling of Products, Energy, or Data Centre Infrastructure category at the Data Centre World Awards 2026. The award was presented during Data Centre World London, held on 4–5 March at ExCel London, and recognises projects and technologies that support resource reuse, waste reduction, and improved sustainability across the data centre sector. Keor FLEX was recognised for its modular architecture and design approach aimed at extending the operational lifespan of critical power infrastructure. Modular design focused on lifecycle extension Unlike traditional UPS systems that require replacement of the entire unit at the end of its lifecycle, the Keor FLEX system allows individual power or bypass modules to be replaced or refurbished independently. The system uses a hot-swappable modular design, allowing capacity to be expanded or maintained without taking the entire system offline. According to Legrand, the system achieves 98.6% efficiency in online double conversion mode and more than 99% efficiency in ECO mode. It also has an 85% recyclability rate under IEC/TR 62635, with more than 69% recyclable metal content and packaging that includes 50% recycled material. The UPS integrates silicon carbide technology and a low-impedance internal busbar architecture, designed to reduce thermal stress on components and extend the lifespan of power modules. Keor FLEX also supports a universal battery interface that allows existing VRLA, lithium-ion, or nickel-zinc battery systems to be retained during upgrades. Marc Marazzi, Vice President at Legrand Data Center Solutions Europe, says, “Data centres are under pressure to deliver more compute power while reducing environmental impact. “Keor FLEX proves that sustainability and performance are not mutually exclusive. By designing circularity into the core architecture, we’ve created a UPS platform that extends asset life, reduces waste, lowers energy consumption, and supports evolving AI workloads - all while improving total cost of ownership. This reflects Legrand’s broader sustainability commitments, including being awarded an ‘A’ rating by CDP for the second consecutive year.” The system is designed to scale from 100kW to 1.2MW per frame, with up to 4.8MW available in parallel configurations for larger data centre environments. For more from Legrand, click here.

Socomec launches energy audit initiative for UKI data centres
Socomec, a manufacturer of low voltage power management systems, has launched an energy audit programme for data centres in the UK and Ireland, aimed at helping operators measure energy use and meet reporting requirements under the EU Energy Efficiency Directive (EED). Under EU EED rules, owners and operators of facilities with a capacity above 500kW must disclose their power usage effectiveness (PUE) and other environmental performance indicators each year. The next reporting deadline is 15 May 2026. The directive closely aligns with the UK’s Energy Savings Opportunity Scheme (ESOS) and the ISO 50001 standard, which requires organisations to monitor and report energy consumption and power utilisation accurately. Improving PUE is also becoming an operational priority for data centres as electricity costs increase and workloads linked to artificial intelligence raise power demand. Socomec estimates that improving PUE by 0.1 - from 1.6 to 1.5, for example - can reduce annual energy consumption by around 6–8%. For a 2MW data centre, this could equate to more than £100,000 in yearly energy savings while also extending the lifespan of existing infrastructure. Energy infrastructure assessments for operators Data centre operators in the UK and Ireland can apply for an assessment of their energy infrastructure through the programme. Socomec’s engineers will carry out site inspections covering IT and non-IT loads, including UPS systems, server racks, cooling equipment, lighting, and switchgear. The aim is to determine PUE and identify gaps in existing metering capabilities. Participating facilities receive a report outlining energy efficiency measures, estimated cost savings, and potential return on investment. The findings are intended to support decision-making across sustainability, finance, and engineering teams. The audits are particularly relevant for older colocation data centres seeking to measure PUE at rack level using Measuring Instrument Directive-compliant metering. More detailed measurement can also allow operators to allocate energy costs more accurately between tenants. Colin Dean, Managing Director of Socomec, says, “The EU EED represents a gold standard for sustainable energy management and it’s only a matter of time before other countries follow Germany’s example and start penalising non-compliance. "In addition, there is a fear - particularly among legacy data centre operators - that a rip-and-replace approach is needed to achieve modern energy efficiency. At Socomec, our aim is to plug this gap with proactive and practical guidance, showing that metering can be retrofitted to improve efficiency without infrastructure overhaul or operational downtime. “Our energy audit is designed to help operators of mission-critical data centres take informed action towards sustainability while maximising their investments. With clear, accurate insights into PUE, data centres can turn energy data into action, optimise operational costs, and drive long-term resilience.”

Reshaping data infrastructure to help carriers digitally transform
At MWC Barcelona 2026, Yuan Yuan, President of Huawei Data Storage Product Line, shared Chinese multinational technology company Huawei's key insights and innovations for enabling carriers to plan their data infrastructure, address challenges in AI adoption, and prepare for IT architecture transformation in the AI era. Data preparation for AI: From dormancy to awakening In the age of AI, data is an essential asset. Yuan noted that in the past two years, over 90% of enterprises actively embraced AI for business innovation, but fewer than 10% have successfully mastered and scaled AI technology. There are three primary challenges: persistent data silos that hinder data collaboration across regions and organisations; a lack of quality data supply, especially industry-specific knowledge; and inefficiencies in the data preparation phases like data collection, cleansing, and labelling. This results in AI applications falling short of commercial viability, raising doubts about the return on investment. Yuan predicts, "In the future, cold data will become a thing of the past. Data will shift from 'offline' to 'always online,' and retention policies will move from being compliance-driven to a principle of retaining and never deleting. Consequently, data volumes will expand from petabytes to exabytes, which will drive demand for greener, more efficient data infrastructure." Architectural transformation: From storing data to storing knowledge and memory As AI agents become the primary consumers of data, data infrastructure must evolve to embrace new data paradigms, including vector, graph, and key-value (KV) semantics. To eliminate AI hallucinations and enable continual AI evolution, data infrastructure must be capable of storing knowledge and memory. Yuan discussed Huawei's AI data platform, an innovative solution that integrates knowledge, memory, and inference acceleration services into a single storage system. This consolidated approach significantly reduces system complexity and O&M costs. The platform delivers a massive upgrade in performance. Inference efficiency (measured in tokens generated per second) is multiplied, while latency (time to first token) is reduced by 90%. Furthermore, the continual evolution of data, knowledge, and memory makes AI agents smarter over time. As Yuan explains, "In the future, every carrier will need its own AI data platform to help agents understand business processes, acquire domain-specific expertise, and iterate and upgrade rapidly. Otherwise, AI will remain nothing more than an expensive toy." AI adoption planning: From AI exploration to AI-driven service upgrades Although many carriers have made AI a strategic priority and are beginning to adopt it, significant challenges remain in real-world deployment: inference failure, inference costs, and inference speed. Yuan presented an intelligent computing service platform, jointly developed with a Chinese carrier, that tackles these challenges. The platform uses the KV cache technology to improve storage resource utilisation and supports inference applications of different large models like DeepSeek and Qwen. It optimises cost-effectiveness by innovatively eliminating repeated computing via querying. Through the collaboration of on-chip memory, DRAM, and AI storage, the platform enables PB-scale KV cache storage. This improves the overall throughput by more than 10 times, reduces inference costs by about 50%, and shortens response time to less than one second. In addition, algorithm optimisation addresses challenges like low KV cache hit ratios and inference failure due to long-sequence inputs in research report analysis. Serving as the foundation for AI, the platform has been deployed at scale at the group to enable multidimensional innovation across services, including internal IT systems, B2C services, B2B services, and B2H services. Yuan says, "Planning AI training and inference platforms requires more than focusing on computing power and models; deep collaboration between storage and compute is also essential to improve system-level efficiency and user experience." Yuan highlighted that AI is reshaping data infrastructure. In the AI era, storage systems will evolve into intelligent engines, which will not only store critical data assets, but also serve as the knowledge sources and memory carriers for the continuous evolution of AI agents. He called on carriers to prioritise accumulation and protection of quality data, and to plan and build a unified AI data platform that supports a wide range of large model applications while enabling service innovation for both internal operations and external offerings. Huawei says it will continue to advance technological innovation and architectural upgrades to help carriers digitally transform. For more from Huawei, click here.



Translate »