Advanced Search
Turn off MathJax
Article Contents
WANG Qinding, TAN bin, HUANG Guangping, DUAN Wei, YANG Dong, ZHANG Hongke. Lightweight Incremental Deployment for Computing-Network Converged AI Services[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT250663
Citation: WANG Qinding, TAN bin, HUANG Guangping, DUAN Wei, YANG Dong, ZHANG Hongke. Lightweight Incremental Deployment for Computing-Network Converged AI Services[J]. Journal of Electronics & Information Technology. doi: 10.11999/JEIT250663

Lightweight Incremental Deployment for Computing-Network Converged AI Services

doi: 10.11999/JEIT250663 cstr: 32379.14.JEIT250663
Funds:  The National Key Research and Development Program of China (2022YFB2901302)
  • Received Date: 2025-07-15
  • Rev Recd Date: 2025-09-10
  • Available Online: 2025-09-15
  •   Objective   The rapid expansion of Artificial Intelligence (AI) computing services has heightened the demand for flexible access and efficient utilization of computing resources. Traditional Domain Name System (DNS) and IP-based scheduling mechanisms are constrained in addressing the stringent requirements of low latency and high concurrency, highlighting the need for integrated computing-network resource management. To address these challenges, this study proposes a lightweight deployment framework that enhances network adaptability and resource scheduling efficiency for AI services.  Methods   The AI-oriented Service IDentifier (AISID) is designed to encode service attributes into four dimensions: Object, Function, Method, and Performance. Service requests are decoupled from physical resource locations, enabling dynamic resource matching. AISID is embedded within IPv6 packets (Fig. 5), consisting of a 64-bit prefix for identification and a 64-bit service-specific suffix (Fig. 4). A lightweight incremental deployment scheme is implemented through hierarchical routing, in which stable wide-area routing is managed by ingress gateways, and fine-grained local scheduling is handled by egress gateways (Fig. 6). Ingress and egress gateways are incrementally deployed under the coordination of an intelligent control system to optimize resource allocation. AISID-based paths are encapsulated at ingress gateways using Segment Routing over IPv6 (SRv6), whereas egress gateways select optimal service nodes according to real-time load data using a weighted least-connections strategy (Fig. 8). AISID lifecycle management includes registration, query, migration, and decommissioning phases (Table 2), with global synchronization maintained by the control system. Resource scheduling is dynamically adjusted according to real-time network topology and node utilization metrics (Fig. 7).  Results and Discussions   Experimental results show marked improvements over traditional DNS/IP architectures. The AISID mechanism reduces service request initiation latency by 61.3% compared to DNS resolution (Fig. 9), as it eliminates the need for round-trip DNS queries. Under 500 concurrent requests, network bandwidth utilization variance decreases by 32.8% (Fig. 10), reflecting the ability of AISID-enabled scheduling to alleviate congestion hotspots. Computing resource variance improves by 12.3% (Fig. 11), demonstrating more balanced workload distribution across service nodes. These improvements arise from AISID’s precise semantic matching in combination with the hierarchical routing strategy, which together enhance resource allocation efficiency while maintaining compatibility with existing IPv6/DNS infrastructure (Fig. 23). The incremental deployment approach further reduces disruption to legacy networks, confirming the framework’s practicality and viability for real-world deployment.  Conclusions   This study establishes a computing-network convergence framework for AI services based on semantic-driven AISID and lightweight deployment. The key innovations include AISID’s semantic encoding, which enables dynamic resource scheduling and decoupled service access, together with incremental gateway deployment that optimizes routing without requiring major modifications to legacy networks. Experimental validation demonstrates significant improvements in latency reduction, bandwidth efficiency, and balanced resource utilization. Future research will explore AISID’s scalability across heterogeneous domains and its robustness under dynamic network conditions.
  • loading
  • [1]
    刘强, 崔莉, 陈海明. 物联网关键技术与应用[J]. 计算机科学, 2010, 37(6): 1–4,10. doi: 10.3969/j.issn.1002-137X.2010.06.001.

    LIU Qiang, CUI Li, and CHEN Haiming. Key technologies and applications of internet of things[J]. Computer Science, 2010, 37(6): 1–4,10. doi: 10.3969/j.issn.1002-137X.2010.06.001.
    [2]
    WANG Shuo, ZHANG Xing, ZHANG Yan, et al. A survey on mobile edge networks: Convergence of computing, caching and communications[J]. IEEE Access, 2017, 5: 6757–6779. doi: 10.1109/ACCESS.2017.2685434.
    [3]
    TRIGKA M and DRITSAS E. Edge and cloud computing in smart cities[J]. Future Internet, 2025, 17(3): 118. doi: 10.3390/fi17030118.
    [4]
    SINGH R and GILL S S. Edge AI: A survey[J]. Internet of Things and Cyber-Physical Systems, 2023, 3: 71–92. doi: 10.1016/j.iotcps.2023.02.004.
    [5]
    ALSADIE D. Advancements in heuristic task scheduling for IoT applications in fog-cloud computing: Challenges and prospects[J]. PeerJ Computer Science, 2024, 10: e2128. doi: 10.7717/peerj-cs.2128.
    [6]
    PENG Xiaohui, SUN Yixuan, ZHANG Zhenghui, et al. DSparse: A distributed training method for edge clusters based on sparse update[J]. Journal of Computer Science and Technology, 2025, 40(3): 637–653. doi: 10.1007/s11390-025-4821-5.
    [7]
    SU Weixing, LI Linfeng, LIU Fang, et al. AI on the Edge: A comprehensive review[J]. Artificial Intelligence Review, 2022, 55(8): 6125–6183. doi: 10.1007/s10462-022-10141-4.
    [8]
    张宏科, 于成晓, 权伟, 等. 融算网络体系基础研究[J]. 电子学报, 2022, 50(12): 2928–2934. doi: 10.12263/DZXB.20221140.

    ZHANG Hongke, YU Chengxiao, QUAN Wei, et al. Fundamental research on computing integration networking[J]. Acta Electronica Sinica, 2022, 50(12): 2928–2934. doi: 10.12263/DZXB.20221140.
    [9]
    ZHANG Zhen, CHANG Chaokun, LIN Haibin, et al. Is network the bottleneck of distributed training?[C]. Proceedings of the Workshop on Network Meets AI & ML (NetAI '20), USA, 2020: 8–13. doi: 10.1145/3405671.3405810. (查阅网上资料,未找到本条文献出版地信息,请确认).
    [10]
    ISMAIL A A, KHALIFA N E, and EL-KHORIBI R A. A survey on resource scheduling approaches in multi-access edge computing environment: A deep reinforcement learning study[J]. Cluster Computing, 2025, 28(3): 184. doi: 10.1007/s10586-024-04893-7.
    [11]
    AKTAS F, SHAYEA I, ERGEN M, et al. AI-enabled routing in next generation networks: A survey[J]. Alexandria Engineering Journal, 2025, 120: 449–474. doi: 10.1016/j.aej.2025.01.095.
    [12]
    GAO Tianfu and DONG Qingkuan. DNS-BC: Fast, reliable and secure domain name system caching system based on a consortium blockchain[J]. Sensors, 2023, 23(14): 6366. doi: 10.3390/s23146366.
    [13]
    DAN O, PARIKH V, and DAVISON B D. IP geolocation through reverse DNS[J]. ACM Transactions on Internet Technology, 2022, 22(1): 17. doi: 10.1145/3457611.
    [14]
    DENG Shuiguang, ZHAO Hailiang, FANG Weijia, et al. Edge intelligence: The confluence of edge computing and artificial intelligence[J]. IEEE Internet of Things Journal, 2020, 7(8): 7457–7469. doi: 10.1109/JIOT.2020.2984887.
    [15]
    陈前斌, 谭颀, 贺兰钦, 等. 云雾混合网络下基于多智能体架构的资源分配及卸载决策研究[J]. 电子与信息学报, 2021, 43(9): 2654–2662. doi: 10.11999/JEIT200256.

    CHEN Qianbin, TAN Qi, HE Lanqin, et al. Research on resource allocation and offloading decision based on multi-agent architecture in cloud-fog hybrid network[J]. Journal of Electronics & Information Technology, 2021, 43(9): 2654–2662. doi: 10.11999/JEIT200256.
    [16]
    ZHOU Guangyao, TIAN Wenhong, BUYYA R, et al. Deep reinforcement learning-based methods for resource scheduling in cloud computing: A review and future directions[J]. Artificial Intelligence Review, 2024, 57(5): 124. doi: 10.1007/s10462-024-10756-9.
    [17]
    SHEN Wangbo, LIN Weiwei, WU Wentai, et al. Reinforcement learning-based task scheduling for heterogeneous computing in end-edge-cloud environment[J]. Cluster Computing, 2025, 28(3): 179. doi: 10.1007/s10586-024-04828-2.
    [18]
    BALAKRISHNAN H, BANERJEE S, CIDON I, et al. Revitalizing the public internet by making it extensible[J]. ACM SIGCOMM Computer Communication Review, 2021, 51(2): 18–24. doi: 10.1145/3464994.3464998.
    [19]
    KNIGHT S, NGUYEN H X, FALKNER N, et al. The Internet topology zoo[J]. IEEE Journal on Selected Areas in Communications, 2011, 29(9): 1765–1775. doi: 10.1109/JSAC.2011.111002.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(11)  / Tables(3)

    Article Metrics

    Article views (33) PDF downloads(8) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return